data-engineering-pipeline

Here are 136 public repositories matching this topic...

An end-to-end real-time stock market data pipeline built with Python, AWS EC2, Apache Kafka, and Cassandra. Data is processed on AWS EC2 with Apache Kafka and stored in a local Cassandra database.

  • Updated Jun 7, 2023
  • Python

💜🌈📊 A data engineering project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker. Data comes from Kaggle and the YouTube API 🌺

  • Updated Nov 14, 2024
  • Jupyter Notebook
