Stars
3
Forks
6
Language
Python
Last Updated
Apr 13, 2021
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 2 | Bundle for analyzing data with Apache Spark and Apache Hadoop | Aug 31, 2016 | |
Jupyter Notebook | 60 | Analyzing tweets with Twint, Optimus and Apache Spark. | Jun 29, 2022 | |
Scala | 4 | Apache Spark 2.2.1 for Hadoop 2.2+ | Dec 25, 2019 | |
Python | 2 | Let’s Big Data. Hue is an open source Web interface for analyzing data with Apache … | Nov 20, 2023 | |
Python | 3 | try to analyzing weibo data with Spark | Sep 18, 2019 | |
Jupyter Notebook | 40 | This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark … | Dec 20, 2022 | |
Scala | 2 | Apache Spark with Scala for ML and Data Analytics | Mar 15, 2020 | |
None | 2 | Course work in a team for the course "Data Science Big Data Analysis" with using … | Sep 22, 2022 | |
Shell | 5 | Docker-ized Apache Spark + Hadoop for distributed computing (plus OpenBLAS). | Mar 02, 2022 | |
Jupyter Notebook | 22 | Apache Spark for data engineers | Jun 02, 2022 | |
Java | 106 | Explore, transform, and analyze FHIR data with Apache Spark | Jul 01, 2022 | |
Java | 9 | Java example of analyzing twitter data with hadoop map reduce. | Feb 17, 2023 | |
Java | 2 | Java example of analyzing wikipedia data with hadoop map reduce. | Jun 24, 2018 | |
Scala | 3 | Processing stock exchange data with Apache Spark | Feb 20, 2021 | |
Scala | 7 | A library for parsing and querying shapefile data with Apache Spark, for Spark SQL and … | Nov 21, 2022 | |
Scala | 12 | A library for querying Google AdWords data with Apache Spark, for Spark SQL and DataFrames | Aug 12, 2020 | |
Scala | 29 | PostgreSQL and GreenPlum Data Source for Apache Spark | Mar 25, 2022 | |
Java | 1281 | Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop … | Aug 11, 2022 | |
Java | 2 | Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop … | Jun 07, 2021 | |
Java | 13 | Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop … | Mar 18, 2023 | |
None | 8 | GreenPlum Data Source for Apache SparK | Dec 19, 2021 | |
Scala | 542 | Avro Data Source for Apache Spark | Aug 01, 2022 | |
Scala | 579 | Redshift data source for Apache Spark | Aug 01, 2022 | |
Scala | 153 | Snowflake Data Source for Apache Spark. | Aug 13, 2022 | |
Scala | 8 | An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark. | Sep 29, 2022 | |
Scala | 3 | Tools for working with dirty data in Apache Spark. | Nov 08, 2017 | |
Java | 2 | Projects on Spark and Hadoop | Dec 29, 2021 | |
Java | 102 | Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm | Nov 16, 2022 | |
Dockerfile | 20 | Docker with hadoop spark pig hive | Nov 18, 2023 | |
Java | 32 | This projects provides a NFSv3 connector for Hadoop. Using the connector, Apache Hadoop and Apache … | Aug 26, 2021 | |
None | 3 | Data analysis using apache spark | May 24, 2023 | |
Java | 196 | Connecting Apache Spark with different data stores [DEPRECATED] | Feb 12, 2023 | |
Java | 8 | Demo code for SpringOne2GX 2013 Getting started with Spring Data and Apache Hadoop | Jun 09, 2015 | |
Fantom | 2 | Write Hadoop MapReduce in high-level API. Inspired by Apache Spark | Jun 08, 2020 | |
Shell | 12 | This is a script to deploy a cluster with Apache Hadoop and Apache Spark + … | Apr 30, 2023 | |
Java | 242 | Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache … | Jul 29, 2022 | |
Scala | 110 | Performant Redshift data source for Apache Spark | May 01, 2023 | |
Scala | 18 | Google BigQuery data source for Apache Spark | Dec 07, 2022 | |
Jupyter Notebook | 2 | Apache Spark with python | Nov 01, 2022 | |
Java | 28 | Tidy up Spark and Hadoop tutorials. | Sep 08, 2022 | |
Scala | 6 | The study of hadoop and spark. | Nov 16, 2022 | |
None | 10 | Twitch Stream Analysis with Apache Spark and Apache Zeppelin | Mar 06, 2022 | |
Scala | 2 | Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs | Mar 28, 2022 | |
Scala | 199 | Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs | Jul 28, 2022 | |
Java | 5 | Apache Hadoop for Windows | Feb 13, 2023 | |
Python | 20 | Supercharge your analysis of Cassandra data with Apache Spark | Apr 28, 2021 | |
None | 2 | This repository contains Big data hadoop and spark developer project datasets | Jun 16, 2022 | |
HTML | 82 | R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark | May 03, 2022 | |
None | 14 | Apache Spark 3 for Data Engineering and Analytics with Python , By Packt publishing | Jul 28, 2022 | |
Scala | 9 | This repository contains datasets for the Big Data Hadoop and Spark Developer course. | Sep 19, 2022 |