|
None |
11 |
Workshop - Using AWS Lake Formation ML Transforms to cleanse the data in a data … |
Apr 05, 2022 |
|
Jupyter Notebook |
2 |
Serverless Data Lake on AWS. Slideshare: https://www.slideshare.net/SmartBizVN/serverless-data-lake-on-aws |
May 14, 2020 |
|
HCL |
2 |
Deploy aws data lake with terraform |
Jan 10, 2023 |
|
C++ |
5 |
Data Deduplication |
Apr 01, 2023 |
|
JavaScript |
5 |
AWS Data Lake infrastructure to store and process Learning Data |
Jul 16, 2020 |
|
None |
3 |
Simple AWS Architecture using the Cloud Formation tool |
Mar 27, 2023 |
|
Python |
54 |
This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines. |
Aug 07, 2022 |
|
None |
25 |
Collection of Cloud Formation Templates, Lambda Scripts and sample code required to provision an AWS … |
Oct 30, 2021 |
|
HCL |
2 |
Big data analysis using a data lake. |
Aug 19, 2022 |
|
Python |
296 |
Enterprise-grade, production-hardened, serverless data lake on AWS |
Aug 11, 2022 |
|
Shell |
4 |
YugabyteDB AWS Cloud Formation Template |
Jul 25, 2022 |
|
Ruby |
3 |
[DEPRECATED] Tsuru AWS Cloud Formation |
Jan 28, 2023 |
|
None |
2 |
Deploy Dremio Data Lake Engine as an AWS Cloudformation Stack |
Apr 01, 2021 |
|
Python |
9 |
AWS Lambda based microservice built using AWS Glue API to migrate tables off of Amazon … |
Aug 08, 2022 |
|
Jupyter Notebook |
3 |
Build Data Lake using Open Source tools |
Mar 10, 2023 |
|
Java |
19 |
Experimental data-lake implemented using Truffle / Graal |
Jun 02, 2022 |
|
Java |
540 |
Scalable entity resolution, data mastering and deduplication using ML |
Jul 06, 2022 |
|
C# |
33 |
Self-contained C# library for data deduplication using Sqlite |
Mar 07, 2023 |
|
None |
2 |
AWS Cloud Formation Templates for Hyrax |
Dec 09, 2021 |
|
Shell |
2 |
Some helpful AWS cloud formation scripts |
Jul 14, 2023 |
|
HCL |
35 |
Terraform modules which create AWS resources for a Segment Data Lake. |
Jul 28, 2022 |
|
Java |
9 |
Data Lake Engine |
Jun 08, 2022 |
|
Scala |
2 |
Data lake fun! |
Aug 15, 2019 |
|
Python |
9 |
Deduplication for cfDNA sequencing data |
Jun 18, 2020 |
|
JavaScript |
2 |
PingOne integration to AWS Security Lake |
Jun 15, 2023 |
|
None |
2 |
Amazon Macie using Machine Learning to discover sensitive data in S3 buckets. Amazon Lake Formation … |
Aug 24, 2022 |
|
Shell |
3 |
Using Snowflake as a data lake for Splunk |
Apr 22, 2020 |
|
Jupyter Notebook |
4 |
Sample Python app for Data Lake Analytics and Data Lake Store, built upon the Data … |
Jan 14, 2021 |
|
Python |
2 |
Easy management of AWS Cloud Formation stacks |
Feb 14, 2018 |
|
C# |
11 |
Sample .NET client library for Data Lake Analytics and Data Lake Store, built upon the … |
Jan 14, 2021 |
|
Java |
6 |
Apache Airavata Data Lake |
Jun 28, 2022 |
|
HTML |
3 |
Data analysis in Data Lake environment |
Jun 26, 2022 |
|
Python |
3 |
API for distributing Data Lake Data |
Jun 12, 2023 |
|
Jupyter Notebook |
14 |
deduplication |
Mar 15, 2023 |
|
Jupyter Notebook |
9 |
Code examples described in the blog "Exploring the public AWS COVID-19 data lake" |
Nov 03, 2021 |
|
None |
98 |
This provides the contents for AWS Data Lake Handson in both Japanese and English. |
Aug 02, 2022 |
|
None |
2 |
This is a repository to demonstrate Data Lake analytics with Starburst Galaxy on AWS |
Feb 25, 2023 |
|
Scala |
6 |
Code for blogpost Navigation in the data lake using Atlas |
Jan 29, 2021 |
|
Python |
9 |
Diving equipment for data lake |
Aug 27, 2022 |
|
Scala |
4 |
Examples for Smart Data Lake |
Mar 28, 2023 |
|
HCL |
3 |
Authorization for Apiary Data Lake |
Apr 18, 2023 |
|
Java |
2 |
Streaming Data Lake with RSocket |
May 05, 2021 |
|
Python |
2 |
Download your data to a data lake. |
Jul 30, 2022 |
|
C# |
4 |
Analyzing StackExchange data with Azure Data Lake |
Jan 28, 2023 |
|
Python |
4 |
Cloud Formation resources for integrating Lacework with an AWS Organization (NOT using Control Tower) |
Feb 23, 2023 |
|
Python |
5 |
A simple photo deduplication archiver, using pHash |
Feb 23, 2014 |
|
Python |
318 |
Btrfs deduplication |
Apr 24, 2023 |
|
Java |
4 |
Reading Dalta Lake data from Beam |
Dec 12, 2021 |
|
Python |
5 |
Data lake metadata / transaction log store |
Jun 28, 2022 |
|
None |
13 |
Data Lake ETL Code for GHCrawler |
Mar 13, 2022 |