Popular repositories Loading
-
Real_estate_data_engineering_project
Real_estate_data_engineering_project PublicWeb scraping project to collect real estate data from online sources for further data analysis and insights.
Python 1
-
Data_Quality_in_Lakehouse
Data_Quality_in_Lakehouse PublicThis repository contains a complete data lakehouse implementation using Docker. It showcases an end-to-end data pipeline with Apache Spark for ETL, MinIO and Delta Lake for storage, Airflow for orc…
Python 1
-
Clean_Bank_Marketing_Campaign_Data
Clean_Bank_Marketing_Campaign_Data PublicData cleaning and preparation of a bank marketing campaign dataset for exploratory analysis and modeling.
Jupyter Notebook
-
Web_Scraping
Web_Scraping PublicPython scripts for scraping structured data from Amazon, visualizing data and so on.
Jupyter Notebook
-
Web_data_to_postgresql
Web_data_to_postgresql PublicETL pipeline that extracts data from websites and loads it into a PostgreSQL database using Python.
Python
-
Real_time_kafka_spark_data_pipeline
Real_time_kafka_spark_data_pipeline PublicReal-time data pipeline built with Apache Kafka and Spark for streaming data and storing it in Cassandra.
Python
If the problem persists, check the GitHub status page or contact support.