This repository contains a Jupyter notebook with PySpark code, to get started with PySpark in the context of retail. A blog corresponding to this notebook can be found at https://www.rrighart.com/pyspark . The online retail data were used and be found at http://archive.ics.uci.edu/ml/machine-learning-databases/00502/ .
Ruthger Righart Email: rrighart@googlemail.com