Stars
4
Forks
1
Language
Java
Last Updated
Feb 21, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 39 | Schemas for Mozilla's data ingestion pipeline and data lake outputs | May 18, 2022 | |
None | 2 | This is a repository to demonstrate Data Lake analytics with Starburst Galaxy on AWS | Feb 25, 2023 | |
Jupyter Notebook | 2 | Serverless Data Lake on AWS. Slideshare: https://www.slideshare.net/SmartBizVN/serverless-data-lake-on-aws | May 14, 2020 | |
HCL | 2 | Deploy aws data lake with terraform | Jan 10, 2023 | |
Jupyter Notebook | 2 | Data Deduplication using AWS Lake Formation FindMatches | Mar 09, 2023 | |
Python | 2 | Crontab for data ingestion/processing on AWS Lambda | Feb 28, 2024 | |
None | 11 | Workshop - Using AWS Lake Formation ML Transforms to cleanse the data in a data … | Apr 05, 2022 | |
None | 2 | It describes AWS Fleetwise which is an useful managed service to colect, transform, and transfer … | Nov 20, 2022 | |
Java | 7 | An AMQP connector for data ingestion into Kafka via AMQP protocol | Dec 17, 2018 | |
JavaScript | 5 | AWS Data Lake infrastructure to store and process Learning Data | Jul 16, 2020 | |
Rust | 34 | Library to connect to the NEAR Lake S3 and stream the data | Aug 03, 2022 | |
HCL | 25 | Terraform module which creates AWS MSK (Managed Streaming for Kafka) resources | Mar 15, 2023 | |
Python | 296 | Enterprise-grade, production-hardened, serverless data lake on AWS | Aug 11, 2022 | |
TypeScript | 9 | JS Library to connect to the NEAR Lake S3 and stream the data | Jul 21, 2022 | |
JavaScript | 2 | Code for the AWS ingestion of Rigado DevKit Sensor data | Feb 20, 2020 | |
Java | 2 | RESTful interface to access near real-time data | Sep 13, 2018 | |
CSS | 4 | It pulls the data from the json file and loads it into the page. | Mar 29, 2023 | |
Rust | 142 | A highly efficient daemon for streaming data from Kafka into Delta Lake | Aug 12, 2022 | |
None | 2 | Deploy Dremio Data Lake Engine as an AWS Cloudformation Stack | Apr 01, 2021 | |
Java | 2 | Kafka load testing with real world data. | Dec 07, 2022 | |
TypeScript | 11 | aws greengrass version2 based realtime data ingestion - all resources are created by aws cdk | Jul 21, 2022 | |
Python | 3 | End to End architecture and Implementation of a Data ingestion Pipeline in AWS Cloud for … | Mar 27, 2023 | |
Clojure | 6 | Graph time-based numeric data in near real-time | Aug 29, 2014 | |
Rust | 2 | An example of NEAR Lake Framework usage that prints the raw data from the stream | Jul 20, 2022 | |
HCL | 35 | Terraform modules which create AWS resources for a Segment Data Lake. | Jul 28, 2022 | |
None | 2 | Efficient Data Ingestion with Glue Concurrency: Using a Single Template for Multiple S3 Tables into … | Apr 14, 2023 | |
JavaScript | 9 | Middy middleware that loads secret data from AWS Secrets Manager | Jan 28, 2023 | |
HCL | 2 | Example to sync Kafka Topics on MSK with S3 Bucket for Data Lake creations | Sep 26, 2022 | |
R | 5 | Obtain historical and near real time data from Yahoo Finance | May 01, 2023 | |
Java | 306 | Continuously monitors a set of log files and sends new data to the Amazon Kinesis … | Jul 29, 2022 | |
Java | 6 | Kafka->HDFS pipeline from LInkedIn. It is a mapreduce job that does distributed data loads out … | Apr 26, 2021 | |
Python | 13 | Set up a near-real-time, scalable, serverless data aggregation pipeline in the AWS Cloud with Amazon … | Jun 27, 2022 | |
JavaScript | 10 | Code and instructions for streaming AWS CloudWatch logs to Scalyr in near-real-time. | Dec 06, 2019 | |
Python | 102 | Get market data from Yahoo Finance websocket in near-real time. | Sep 04, 2022 | |
Go | 37 | rtdl makes it easy to build and maintain a real-time data lake | May 14, 2022 | |
Python | 54 | This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. | Aug 07, 2022 | |
Go | 168 | A server that pulls and parses MySQL binlog, pushs change data into different sinks like … | Sep 15, 2022 | |
Swift | 17 | Swift Apacha Kafka (real-time data pipelines and streaming apps) | Jul 08, 2022 | |
Python | 9 | AWS Lambda based microservice built using AWS Glue API to migrate tables off of Amazon … | Aug 08, 2022 | |
Jupyter Notebook | 9 | Code examples described in the blog "Exploring the public AWS COVID-19 data lake" | Nov 03, 2021 | |
None | 98 | This provides the contents for AWS Data Lake Handson in both Japanese and English. | Aug 02, 2022 | |
Go | 2 | ❤️ Cloud-based ingestion & analysis of DJI drone data using AWS, Google Cloud, and Azure. | Dec 12, 2023 | |
TypeScript | 58 | A solutions that automatically configures the AWS services necessary to easily capture, store, process, and … | Aug 27, 2022 | |
Dockerfile | 56 | Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi | Aug 21, 2022 | |
Java | 121 | Kafka Connect connector to stream data in real time from Twitter. | Jul 20, 2022 | |
Java | 3 | Kafka Connect connector to stream data in real time from Twitter. | Apr 15, 2023 | |
JavaScript | 2 | A grafana panel that pulls real time haproxy status data directly from the stats endpoint | Nov 11, 2016 | |
Python | 33 | Stream head-tracking data from the Samsung Galaxy Buds Pro in real-time | Aug 05, 2022 | |
JavaScript | 394 | Data ingestion for Amazon Elasticsearch Service from S3 and Amazon Kinesis, using AWS Lambda: Sample … | Aug 01, 2022 | |
HCL | 9 | Streaming data pipelines for real-time data warehousing. Includes fully managed connectors (PostgreSQL CDC, Snowflake). | Apr 20, 2023 |