Senior Data Engineer · Bangalore, India
Scalable Data Platforms · Distributed Systems · SQL Optimization
I build scalable, high-performance data systems that turn complex data into reliable, decision-ready assets at scale.
Who I Am
11+ years of experience as a JAVA, Python & Big Data Developer with deep expertise in scalable systems and cloud infrastructure. I specialize in designing and building robust data solutions that handle complex processing at scale.
8+ years in Big Data & Cloud Development — proficient in Batch and Near Real-Time Processing of clinical data, SSD/HDD test data, and enterprise workloads. Expert in Python, Shell scripting, SQL, and relational databases. Strong knowledge of dependency management tools like Apache Maven and Git for source control.
Hands-on experience with AWS (EMR, EC2, MSK, EKS, MWAA) and GCP/GKE, enabling cloud-native data solutions. Proven track record in designing and leading implementation of scalable applications and tools. Strong collaborator with excellent communication and problem-solving capabilities.
Experienced in mentoring and evaluating team members, conducting peer code reviews, managing releases, and deploying code to production environments. Known for maintaining high coding standards and translating complex requirements into reliable, production-grade solutions.
Career
Highlights
Improved processing times dramatically through advanced SQL optimization — joins, window functions, and CTE restructuring.
Enhanced data correctness by systematically addressing null handling, inconsistencies, and schema evolution edge cases.
Contributed to architectural decisions improving maintainability, extensibility, and long-term platform scalability.
Enabled faster decision-making by delivering clean, well-structured datasets to downstream consumers and business teams.
Toolkit
Let's Connect
I'm always open to interesting conversations about data engineering, distributed systems, or new opportunities. Feel free to reach out.
Prefer a direct message? Send me an email at gaurabkjha@gmail.com and I'll respond promptly.
You can also find my work and open-source contributions on GitHub, and connect professionally on LinkedIn.