Skip to content
Change the repository type filter

All

    Repositories list

    • airbyte

      Public
      Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
      Python
      Other
      4.1k000Updated Jul 23, 2024Jul 23, 2024
    • created boto3 useful script for aws
      Python
      0000Updated Mar 26, 2024Mar 26, 2024
    • Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
      Python
      Other
      4.1k000Updated Mar 14, 2024Mar 14, 2024
    • Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
      Python
      Other
      4.1k000Updated Oct 9, 2023Oct 9, 2023
    • assumerolespark-s3
      Java
      0000Updated Jul 20, 2022Jul 20, 2022
    • staticpages
      HTML
      0000Updated Nov 24, 2021Nov 24, 2021
    • my-athena-udfs
      Java
      0000Updated Apr 25, 2021Apr 25, 2021
    • restcli

      Public
      Query the rest API
      0000Updated Dec 20, 2020Dec 20, 2020
    • springcacherestapi
      Java
      0000Updated Dec 20, 2020Dec 20, 2020
    • The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.
      Java
      Apache License 2.0
      295000Updated Sep 24, 2020Sep 24, 2020
    • 🌉 Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena
      Python
      Apache License 2.0
      19000Updated Sep 18, 2020Sep 18, 2020
    • Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB
      Java
      Apache License 2.0
      134000Updated Aug 29, 2020Aug 29, 2020
    • awsemr

      Public
      AWS EMR related examples
      Java
      Apache License 2.0
      0000Updated Aug 17, 2020Aug 17, 2020
    • daily learning
      0000Updated Jul 12, 2020Jul 12, 2020
    • jupyter with docker
      Shell
      0000Updated Jul 5, 2020Jul 5, 2020
    • How to merge hdfs file in pyspark and add header
      Python
      0000Updated Jun 1, 2020Jun 1, 2020
    • web app to limit throttle
      Java
      0000Updated Apr 14, 2020Apr 14, 2020
    • awsemrfs

      Public
      awsemrfs
      Java
      0000Updated Oct 7, 2019Oct 7, 2019
    • Spark: The Definitive Guide's Code Repository
      Scala
      Other
      2.8k000Updated Sep 17, 2019Sep 17, 2019
    • how to infer speech to text using opensource model
      Python
      0000Updated Aug 18, 2019Aug 18, 2019
    • how to configure emr zeppelin to git hup
      0000Updated Aug 18, 2019Aug 18, 2019
    • awsflink

      Public
      how to connect kinesis from flink in emr in java
      Java
      0000Updated Jul 19, 2019Jul 19, 2019
    • The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
      Other
      167000Updated May 29, 2019May 29, 2019
    • The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
      Other
      81000Updated May 29, 2019May 29, 2019
    • 148000Updated May 2, 2019May 2, 2019
    • pyspark

      Public
      pyspark sample programs
      Python
      0000Updated Apr 30, 2019Apr 30, 2019
    • A curated list of awesome Deep Learning tutorials, projects and communities.
      6.1k000Updated Mar 7, 2019Mar 7, 2019
    • Gluesample jobs for learning
      0000Updated Mar 4, 2019Mar 4, 2019
    • Stream Processing with Apache Flink - Scala Examples
      Scala
      Apache License 2.0
      203000Updated Feb 11, 2019Feb 11, 2019
    • awsathenalab in python
      Python
      0100Updated Nov 29, 2018Nov 29, 2018