Stars
18
Forks
2
Language
Jupyter Notebook
Last Updated
Feb 08, 2024
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
Python | 2 | Apache NiFi 1.5+ with Apache Livy, Apache Spark 2, PySpark, HDF 3.1 | Aug 13, 2019 | |
Python | 3 | Apache Spark as a Service with Apache Livy Client | Apr 23, 2023 | |
Scala | 3 | Integration of Apache Livy, Apache NiFi and Apache Spark | Apr 10, 2019 | |
XSLT | 7 | Apache Livy - Apache NiFi - Example Scala Spark Job | Dec 02, 2019 | |
Shell | 2 | Dockerized Livy, REST server for Apache Spark | May 04, 2020 | |
Python | 2 | The dbt-spark-livy adapter allows you to use dbt along with Apache spark-livy and Cloudera Data … | Aug 15, 2022 | |
Shell | 3 | Apache Livy for Apache Spark on Mesosphere DC/OS | Jul 01, 2020 | |
Python | 2 | Apache Spark, PySpark on Ubuntu Debian | Aug 30, 2022 | |
Python | 12 | Kaggle code for Loan Default Prediction competition | Apr 11, 2023 | |
Jupyter Notebook | 258 | Apache Spark (PySpark) Practice on Real Data | Apr 22, 2023 | |
Jupyter Notebook | 9 | PySpark notebooks to learn Apache Spark (WIP) | Jul 18, 2022 | |
Jupyter Notebook | 2 | Apache Spark (PySpark) Practice on Real Data | May 07, 2023 | |
Python | 6 | Example Dash app running against Spark cluster via Apache Livy | May 27, 2021 | |
None | 40 | Docker image of Apache Spark with its Python interface, pyspark. | Jun 11, 2022 | |
Python | 14 | Training models with Apache Spark, PySpark for Titanic Kaggle competition | Jun 05, 2022 | |
HTML | 3 | Churn Prediction using PySpark | Jun 06, 2022 | |
Python | 6 | Using Apache Spark (PySpark) to Analyze the Access Logfiles of Nginx | Oct 04, 2022 | |
Python | 85 | Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow | May 24, 2023 | |
Jupyter Notebook | 5 | A beginner's guide to Apache Spark 3.2 (PySpark) | Sep 26, 2022 | |
HTML | 8 | Twitter Spark Streaming using PySpark | May 08, 2021 | |
Scala | 984 | Livy is an open source REST interface for interacting with Apache Spark from anywhere | Aug 18, 2022 | |
Python | 3 | Churn prediction with PySpark | Aug 13, 2021 | |
Python | 7 | Loan Prediction System using ML | Jan 06, 2023 | |
Jupyter Notebook | 2 | Loan Prediction using Machine Learning | Oct 14, 2023 | |
Python | 2 | Distributed R-Tree implementation for spatial data storage using Apache Spark (PySpark). | Feb 06, 2023 | |
Jupyter Notebook | 70 | Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References | Apr 02, 2023 | |
Python | 38 | Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using … | Mar 23, 2023 | |
Shell | 3 | demo of running apache spark jobs using tekton and s2i workflows | Feb 06, 2021 | |
Jupyter Notebook | 232 | Fundamentals of Spark with Python (using PySpark), code examples | Aug 23, 2022 | |
None | 2 | Fundamentals of Spark with Python (using PySpark), code examples | Sep 23, 2021 | |
Jupyter Notebook | 2 | Lending Club Loan Default Analysis using historic loan applications data. | Nov 20, 2022 | |
Scala | 34 | A connector for Apache Spark and PySpark to Dgraph databases. | Jul 05, 2022 | |
Python | 3 | A visualization of ANAC open database built with Apache Airflow, PySpark, Docker, Terraform and GCP | Oct 09, 2023 | |
R | 22 | R code for Kaggle's Loan Default Prediction - Imperial College London challenge | May 22, 2021 | |
Jupyter Notebook | 354 | SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark. | Aug 18, 2022 | |
None | 2 | SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark. | Apr 19, 2023 | |
None | 3 | SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark. | Jun 30, 2023 | |
Scala | 2 | Bulletproof Apache Spark jobs with fast root cause analysis of failures. | Feb 27, 2023 | |
Scala | 66 | Bulletproof Apache Spark jobs with fast root cause analysis of failures. | Jun 28, 2022 | |
Python | 3 | Analysis pipeline using Apache Airflow | Jul 28, 2022 | |
None | 6 | Monitoring Apache Airflow Using Prometheus | Apr 28, 2023 | |
Jupyter Notebook | 3 | Course of Spark Using Python Language (pyspark). | Dec 08, 2021 | |
Python | 9 | Apache Airflow Provider for creating tasks in Airflow to execute SAS Studio Flows and Jobs. | Apr 24, 2023 | |
Shell | 5 | Apache Livy is a service that enables easy interaction with a Spark cluster over a … | Jun 14, 2022 | |
Scala | 62 | A library that provides useful extensions to Apache Spark and PySpark. | Aug 11, 2022 | |
Python | 7 | Simple Apache Drill alternative using PySpark | Jan 28, 2023 | |
None | 7 | Code Repository for Apache Spark Streaming with Python and PySpark(v), Published by Packt | May 26, 2022 | |
Python | 5 | Sample data pipeline using Kafka, Spark, Airflow | Mar 15, 2023 | |
Python | 3 | Apartments Data Pipeline using Airflow and Spark. | Apr 04, 2022 | |
None | 2 | Experimental Spark master and worker with spark-shell and pyspark | Jun 11, 2023 |