Skip to content
View AswiN-7's full-sized avatar

Block or report AswiN-7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AswiN-7/README.md

Hi, I'm Aswin πŸš€

Data Engineer | AWS | PySpark | Real-time Pipelines | Cloud Automation | GenAI Enthusiast


πŸš€ About Me

I'm a Data Engineer with over 3 years of experience in building and optimizing cloud-native, scalable data pipelines using modern data stacks. My journey has focused on crafting high-performance ETL/ELT workflows, automating cloud infrastructure, and enabling real-time data access that supports analytics and ML use cases.

I have hands-on experience designing robust systems using:

  • AWS Services like Glue, Lambda, Step Functions, DMS, Kinesis, S3, Redshift, CloudWatch
  • Big Data technologies like Apache Kafka, PySpark, Delta Lake, Airflow, and Ab Initio
  • Languages & Frameworks: Python, SQL, Flask, Shell, LangChain, PyTorch

I am also passionate about AI/ML and LLMs, and actively explore ways to integrate them into data engineering workflows.


πŸ“Š Recent Highlights

  • πŸ† 3rd Place Winner – Barclays GenAI Hackathon (Regional Level)
  • βš™οΈ Built a real-time data streaming pipeline using Kafka, Python, and AWS S3
  • ✨ Contributed to DaFE (Data Forge Engine), a cloud-native, low-code processing platform
  • βœ… Automated AWS DMS, EC2 cost-optimization workflows, and CI/CD config pipelines

πŸ› οΈ Tech Stack

Languages:       Python, SQL, Java, Shell
Cloud & DevOps:  AWS (Glue, Lambda, S3, DMS, DynamoDB, Athena, Step Functions, CloudWatch), Jenkins, GitLab, Docker
Data Engineering: PySpark, Airflow, Kafka, Ab Initio, Delta Lake, ETL/ELT, Streaming, Data Governance
Storage:         PostgreSQL, MongoDB
AI/ML Tools:     PyTorch, LangChain, Hugging Face, LLM, NLP

πŸ’Ό Notable Projects

✨ Real-Time Data Streaming Pipeline

  • Built a real-time ingestion pipeline with Apache Kafka, Python, and AWS S3
  • Automated metadata detection using Glue Crawlers + Athena for serverless querying

🌐 Tamil QA RAG System

  • Developed an open-domain retrieval-augmented generation (RAG) model for Tamil using fine-tuned Roberta + XLM
  • Dense vector indexing with Milvus and deployed APIs with Flask

πŸ“† Education

Bachelor of Engineering (Computer Science)
SSN College of Engineering – Chennai, India (2018–2022)
CGPA: 7.79 / 10


πŸ‘₯ Let's Connect


πŸ“ˆ GitHub Stats


🌟 Featured Badges


"Data isn't just numbersβ€”it's a story waiting to be understood. Let's build systems that tell it better."

Pinned Loading

  1. tamilNLP tamilNLP Public

    Python 2

  2. School-Auth School-Auth Public

    Forked from mohanram123/School-Auth

    An authentication system for school management system using nodejs and mysql

    JavaScript

  3. blog-creating-system blog-creating-system Public

    Java

  4. CountryDatabaseManagement CountryDatabaseManagement Public

    Java

  5. mess-management-system mess-management-system Public

    C

  6. tnau-gpa-calculator tnau-gpa-calculator Public

    Subject Information, Credit scores are taken from TNAU website

    JavaScript 1