theharshkonda/README.md

Hi 👋, I'm Harshvardhan Konda

Software Engineer | Data Engineer | Python | AWS | Big Data


🚀 About Me

  • 🎓 B.Tech in Computer Science & Engineering (CGPA 9.11)
  • 💼 Software Engineer at Anura Infotech (CRIF India) - Big Data, ETL, PySpark
  • 🧠 Currently learning: Advanced Data Engineering, Cloud, MERN Stack
  • 🤝 Open to collaborating on Big Data, Python, Cloud, and Open Source projects
  • 👨‍💻 Check out all my projects: GitHub Repositories
  • 📫 Reach me here: Email

🏢 Professional Experience

🔹 Software Engineer - Anura Infotech Pvt. Ltd (CRIF India) (Aug 2024 – Present)

  • Built & optimized ETL pipelines using PySpark, Hadoop, HBase, Oracle SQL
  • Processed 1M+ loan records across Consumer, Commercial & MFI domains
  • Automated ingestion pipelines using Shell Scripts + Azkaban, improving processing speed by 20%
  • Designed stored procedures for high-accuracy data ingestion into Oracle DWH & HBase
  • Ensured 98%+ data quality through validation, reporting & reconciliation
  • Collaborated with BA / QA / Dev teams for requirement analysis, BRDs, and CRs
  • Performed functional & performance testing of ETL workflows using JIRA & QTest

🔹 Software Engineer Intern - CRIF Solutions (Feb 2024 – Aug 2024)

  • Debugged production issues & worked on API integrations
  • Automated testing workflows, reducing manual effort by 30%
  • Worked with Git, CI/CD, JIRA, and collaborated with QA & production teams

📂 Projects

🚀 Serverless ETL Pipeline (AWS Glue + S3 + Athena)

🔗 GitHub: https://github.com/theharshkonda/aws-etl-pipeline-apache-spark

  • Built a fully serverless ETL pipeline using AWS Glue (PySpark), S3, Athena
  • Automated schema detection using Glue Crawlers
  • Converted CSV → Parquet for high-performance querying
  • Designed to run within the AWS Free Tier while keeping the architecture scalable

🩺 Dhanvantari – Healthcare App

🔗 GitHub: https://github.com/theharshkonda/Dhanvantari

  • Tech: React Native, Node.js, Express, MySQL, AWS
  • Features: Ayurvedic chatbot, mental-health support, doctor booking
  • Managed backend workflows & clinical decision data

🛠️ Tech Stack

Languages

Python • SQL • Java • JavaScript • TypeScript • PySpark

Big Data & ETL

Apache Spark • Hadoop • HBase • Parquet • ETL Pipelines • Data Validation

Cloud

AWS Glue • S3 • Athena • Lambda • EC2 • RDS • IAM • Docker

Frontend & Mobile

React • React Native • HTML • CSS

Backend

Node.js • Express.js • Flask

Tools

Git • Linux • Jenkins • Airflow • JIRA • Confluence


Pinned

  1. BMI_Calculator (Public)

    Java

  2. Harshh (Public)

    CSS

  3. FreshBuck (Public)

    FreshBuck application is a user-friendly mobile platform designed to order organic vegetables, fruits, dairy products, and dry fruits. The application aims to streamline the grocery shopping proces…

    Java

  4. theharshkonda (Public)

    My Repo..