A passionate Data Engineer building robust and scalable data solutions.
Hi there! I'm Abhay Kumar, a detail-oriented, results-driven Data Engineer with 6+ years of experience architecting and delivering big data solutions for FinTech and E-commerce.
My experience spans building scalable real-time and batch data pipelines with Spark, Kafka, NiFi, and Airflow on the Hadoop ecosystem, using AWS services such as S3, EMR, Glue, and Lambda. I work with diverse data sources to support analytics, reporting, and ML initiatives, and I have led projects that turn that data into business insights.
I believe in continuous learning and leveraging technology to solve real-world problems. Let's build something amazing together!
Mar 2022 – Present
Nov 2020 – Feb 2022
Mar 2019 – Oct 2020
Designed and implemented a real-time data ingestion pipeline using Apache Kafka and Spark Streaming to process high-volume sensor data, enabling immediate analytics and anomaly detection.
Built a scalable data lakehouse architecture on AWS using S3, Glue, and Athena, facilitating efficient storage, processing, and querying of structured and unstructured data.
Optimized existing ETL processes by refactoring legacy code, introducing parallel processing, and implementing data validation checks, resulting in a 40% reduction in processing time.
Scraped data from more than 100 websites.
Jan 2022 – Feb 2023
CGPA: 3.49/4.0
Aug 2015 – Apr 2019
CGPA: 8.2/10.0
June, 2025
June, 2018
June, 2018
For leading the successful Cloudera migration at IDFC FIRST Bank.
In my spare time, I enjoy engaging in activities that keep me active and mentally stimulated. Here are some of my passions:
Have a project in mind or just want to say hello? Feel free to reach out!