Data Engineer with 2.5+ years at ZS Associates building scalable, production-grade PySpark/Apache Spark pipelines on AWS (Redshift, S3, RDS, EKS/Kubernetes) within regulated, compliance-driven data environments. 2x Databricks Certified (Data Engineer Associate; Generative AI Engineer Associate). Experienced across ETL/ELT, dimensional data modeling, data governance and data quality frameworks, Airflow orchestration, and CI/CD automation. Skilled at processing payer, transaction, and clinical data at TB scale and partnering with business teams to deliver reliable, audit-ready data products. Strong foundation in GenAI/LLM application development using LangChain.