sparkify

Here are 12 public repositories matching this topic...

brunowdev / sparkify

This is the final project for the Data Scientist Nanodegree, where our goal is to predict churn for a fictional streaming service called Sparkify.

udacity pyspark pyspark-mllib data-science-capstone sparkify

Updated Jul 6, 2023
HTML

abduygur / churn-prediction-using-spark

Churn Prediction using PySpark

data-science machine-learning pyspark churn-prediction sparkify

Updated Jan 29, 2021
HTML

SimplifyData / Cloud-Data-Warehouse-with-Redshift-AWS

Cloud Data Warehouse of Sparkify Data using Redshift

database data-engineering data-lake redshift data-modeling music-database aws-redshift dimension-tables etl-pipeline staging-tables sparkify data-warehouses analytics-tables redshift-aws

Updated Jun 16, 2020
Python

fpcarneiro / Data-Warehouse

Students will build an ETL pipeline that extracts data from S3, stages it in Redshift, and transforms it into a set of dimensional tables for their analytics team.

udacity redshift data-engineer etl-pipeline sparkify data-warehouses

Updated Jun 4, 2019
Python

alessiococchieri / BDA-project-sparkify

This Git repo showcases my analysis of the Sparkify dataset with PySpark on Apache Spark in cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.

machine-learning big-data spark apache-spark pyspark churn-prediction big-data-analytics big-data-processing churn-analysis sparkify

Updated Feb 15, 2023
Jupyter Notebook

Mcamin / User-Churn-Prediction

Data Analysis in Spark to Identify Customer Churn for a fictional music service.

python pyspark tuning logistic-regression support-vector-machines churn-prediction gradient-boosting sparkify

Updated Nov 25, 2019
Jupyter Notebook

Guli-Y / SparkifyRedshift

An ETL pipeline for extracting data from S3, staging it on Redshift, and transforming it into fact and dimensional tables for song play analysis

etl s3 redshift sparkify

Updated May 21, 2021
Python

cdumen / Sparkify_Churn_Prediction

Sparkify project for predicting customer loyalty.

udacity python3 churn-prediction sparkify

Updated Nov 3, 2019
HTML

pratikwatwani / ETL-pipeline-for-Sparkify

An ETL model designed using PostgreSQL for the Sparkify database 🗄, modeling user activity data to create a database and ETL pipeline 🔀 for a music streaming app 🎼.

database etl postgresql data-modeling datamodel etl-pipeline sparkify