Skip to content
View qdinh18's full-sized avatar

Block or report qdinh18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. Real_estate_data_engineering_project Real_estate_data_engineering_project Public

    Web scraping project to collect real estate data from online sources for further data analysis and insights.

    Python 1

  2. Data_Quality_in_Lakehouse Data_Quality_in_Lakehouse Public

    This repository contains a complete data lakehouse implementation using Docker. It showcases an end-to-end data pipeline with Apache Spark for ETL, MinIO and Delta Lake for storage, Airflow for orc…

    Python 1

  3. Clean_Bank_Marketing_Campaign_Data Clean_Bank_Marketing_Campaign_Data Public

    Data cleaning and preparation of a bank marketing campaign dataset for exploratory analysis and modeling.

    Jupyter Notebook

  4. Web_Scraping Web_Scraping Public

    Python scripts for scraping structured data from Amazon, visualizing data and so on.

    Jupyter Notebook

  5. Web_data_to_postgresql Web_data_to_postgresql Public

    ETL pipeline that extracts data from websites and loads it into a PostgreSQL database using Python.

    Python

  6. Real_time_kafka_spark_data_pipeline Real_time_kafka_spark_data_pipeline Public

    Real-time data pipeline built with Apache Kafka and Spark for streaming data and storing it in Cassandra.

    Python