aws-glue-crawler

Here are 34 public repositories matching this topic...

aws-samples / aws-glue-crawler-utilities

This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS CDK applications.

aws-glue aws-glue-crawler

Updated Dec 21, 2021
Python

aws-samples / amazon-rds-export-to-s3-automation

This repository contains source code for the AWS Database Blog post "Reduce data archiving costs for compliance by automating RDS snapshot exports to Amazon S3".

aws-lambda aws-kms aws-cloudformation amazon-rds amazon-sns amazon-s3 amazon-athena aws-backup aws-glue amazon-eventbridge aws-glue-crawler

Updated Apr 26, 2023

fermat01 / ETL-Data-Pipeline-using-AWS-EMR-Spark-Glue-Athena

ETL data pipeline using AWS services

aws aws-s3 aws-athena aws-emr-clusters aws-glue-crawler

Updated Aug 23, 2024
Python

GabrielDan92 / AWS_Terraform_PySpark-ETL_Job

Terraform configuration that creates several AWS services, uploads data in S3 and starts the Glue Crawler and Glue Job.

aws terraform s3-bucket pyspark glue-job glue-catalog aws-glue-crawler

Updated Feb 10, 2022
Python

aws-samples / automated-datastore-discovery-with-aws-glue

Automation framework to catalog AWS data sources using Glue

aws typescript aws-s3 dynamodb glue python3 data-catalog rds gdpr pii data-governance aws-cdk aws-glue-workflow aws-glue-crawler aws-glue-data-catalog

Updated May 24, 2024
Python

masood2iq / AWS-Athena-Glue-S3-Bucket-Deployment-Through-AWSConsole

AWS Athena, Glue Database, Glue Crawler and S3 buckets deployment through AWS GUI console.

aws-athena aws-glue simple-query aws-glue-crawler aws-s3-bucket

Updated Dec 13, 2022

Akanksha-tetwar / YouTube-Trending-video-analysis-ETL-using-AWS-Services

In this project I have used the Trending YouTube Video Statistics data from Kaggle to analyze and prepare it for usage.

python aws aws-s3 aws-athena awslambda quicksight aws-glue-crawler awsglue

Updated Nov 7, 2022

subhamay-cloudworks / 0090-deutzia-cft

Creating an audit table for a DynamoDB table using CloudTrail, Kinesis Data Streams, Lambda, S3, Glue, Athena, and CloudFormation

aws-python-lambda aws-iam aws-cloudformation aws-cloudtrail aws-cloudwatch aws-athena aws-cloudwatch-logs aws-kinesis-stream aws-glue-crawler aws-iam-roles aws-iam-policies aws-s3-bucket aws-glue-data-catalog

Updated Jul 6, 2023
Python

SadafAsad / LinkedIn-Jobs-Analysis

Unveiling job market trends with Scrapy and AWS

python aws-s3 scrapy aws-ec2 aws-athena aws-quicksight aws-glue-crawler aws-glue-data-catalog

Updated Apr 5, 2024
Python

sarah-zhan / data_pipeline_amazon_products

An end-to-end data pipeline built with AWS S3, Glue, Crawler, Athena, and Tableau visualization

aws s3-bucket tableau aws-athena aws-glue-crawler

Updated Mar 27, 2024
Jupyter Notebook

Saurabhkhandebharad / BigData-SK

Analyzed a multicategory e-commerce store using big data techniques on a Kaggle dataset with the help of AWS EC2, AWS S3, PySpark, AWS Glue ETL, AWS Athena, AWS CloudFormation, AWS Lambda and Power BI!

aws big-data aws-lambda power-bi pyspark aws-ec2 aws-cloudformation aws-athena kaggle-dataset aws-services end-to-end-pipeline end-to-end-project aws-glue-crawler aws-s3-bucket

Updated Sep 7, 2024
Python

masood2iq / AWS-Athena-Glue-S3-CloudFormation-Deployment-AWSConsole

AWS Athena, Glue Database, Glue Crawler and S3 buckets deployment through CloudFormation stack on AWS console.

aws-cloudformation aws-athena aws-glue simple-query aws-glue-crawler aws-s3-bucket

Updated Dec 14, 2022

dhvani-k / YouTrend_Insights_Analyzing_YouTube_Video_Landscape

An end-to-end solution for managing and analyzing YouTube video data from Kaggle, leveraging AWS services and visualized through Quicksight and Tableau

aws marketing youtube aws-lambda aws-s3 youtube-api aws-iam tableau content-strategy aws-athena aws-lambda-python aws-glue quicksight aws-glue-crawler user-insights

Updated Sep 23, 2023
Python

subhamay-cloudworks / 0052-agapanthus-cft

Working with Glue Data Catalog and Running the Glue Crawler On Demand

aws-cloudformation aws-glue aws-glue-crawler aws-iam-roles aws-iam-policies aws-glue-data-catalog

Updated May 11, 2023

DivineSamOfficial / SmartCityProject

Smart City Realtime Data Engineering Project

python aws kafka aws-s3 pyspark spark-streaming aws-ec2 aws-athena aws-redshift aws-glue aws-quicksight aws-glue-crawler aws-glue-data-catalog

Updated May 24, 2024
Python

VvEK-Hiremath / Airlines-Data-Pipeline-Project-AWS

Implementing data pipeline using AWS services for airlines data

python aws aws-s3 aws-sns aws-redshift step-functions aws-eventbridge aws-glue-workflow aws-glue-crawler

Updated Oct 15, 2024
Python

Shilpaar90 / AWS-Capturing-Schema-Changes-In-S3

A pipeline within AWS to capture schema changes in S3 files and to update them in a DB.

aws crawler aws-lambda dynamodb s3 aws-dynamodb aws-cloudwatch-logs aws-lambda-python aws-glue aws-eventbridge glue-catalog aws-glue-crawler

Updated Nov 30, 2021

KRISHNASAIRAJ / AWS-Driven-Sales-Performance-Outlook

The project aims to establish a robust data pipeline for tracking and analyzing sales performance using various AWS services. The process involves creating a DynamoDB database, implementing Change Data Capture (CDC), using Kinesis streams, and finally storing and querying the data in Amazon Athena.

python aws-lambda dynamodb s3-bucket kinesis kinesis-firehose aws-athena glue-catalog aws-glue-crawler eventbridge-pipes

Updated Feb 11, 2024
Python

Kartik-Banga / Automated-ETL-Pipeline-for-Playstore-Data

Implemented ETL pipeline on AWS for Playstore data using Lambda, Glue Crawlers, and Glue ETL Jobs. Orchestrated workflow with Step Functions and achieved seamless integration, optimal data merging, and enhanced data quality/accessibility.

python sql aws-lambda aws-s3 data-visualization pyspark data-engineering cloud-computing data-analysis powerbi data-cleaning aws-step-functions aws-glue aws-glue-crawler

Updated Jan 4, 2024

mihirkudale / Stock-Market-Real-Time-Data-Engineering-Project

In this project, you will execute an End-To-End Data Engineering Project on Real-Time Stock Market Data using Kafka. We are going to use different technologies such as Python, Amazon Web Services (AWS), Apache Kafka, Glue, Athena, and SQL.

python aws csv kafka aws-s3 jupyter-notebook consumer amazon-ec2 aws-ec2 apache-kafka producer aws-athena stockmarket aws-glue-crawler stockmarketanalysis aws-glue-catalog

Updated May 23, 2024
Jupyter Notebook
