About Me

Data engineer with 8+ years of experience designing, leading, and optimizing large-scale data systems. Skilled in architecting cloud-native solutions, leading technical strategy, and driving significant operational efficiencies.

Technical Skills

Experience

Dec 2022 - Present

GoPuff — Senior Data Engineer

  • Led development of a contract-driven, self-service data platform, saving 15+ hours/week in manual work.
  • Migrated legacy data ingestion pipelines to a modern platform, reducing latency by 95%.
  • Migrated enterprise Kafka & Kafka Connect to self-hosted Kubernetes, cutting costs by $130k annually.
  • Developed internal APIs & GitHub Actions to enhance platform interoperability.
  • Implemented data anomaly detection & regression testing using Monte Carlo.

May 2021 - Nov 2022

Stripe — Software Engineer

  • Designed data enrichment & aggregation pipelines using Spark, Iceberg, S3, and Scala for a Financial Data Warehouse.
  • Led data preparation for the Payment Volume report, ensuring dollar-value accuracy.
  • Automated CSV-based journal entry ingestion, saving 10+ engineering hours during month-close.
  • Partnered with TPMs to translate business definitions into scalable data models.

June 2020 - Apr 2021

McGraw-Hill Education — Senior Software Engineer

  • Built data aggregation workflows to produce analytical reports used by millions of students & teachers.
  • Re-modeled eventing data, improving data processing efficiency by ~60% and simplifying downstream ETL.
  • Collaborated with data scientists and product managers to design scalable, ML-ready datasets for personalized learning experiences.

Aug 2016 - June 2020

McGraw-Hill Education — Software Engineer

  • Led migration from Elasticsearch-based microservices to streaming data pipelines, reducing costs by 40%.
  • Optimized Spark pipelines & PostgreSQL queries, improving data processing SLA by 40%.
  • Developed REST APIs in Node.js to ingest eventing data and expose processed analytics for reporting.
  • Integrated custom logging, alerting, and metrics tracking into data pipelines, improving system reliability.

Sept 2015 - May 2016

University of Massachusetts Dartmouth — Research Assistant

  • Researched action-based vs. change-based techniques for capturing data provenance in web applications.
  • Implemented the SIMProv.js framework to capture, replay, and securely share action-based data provenance.
  • Integrated the SIMProv.js framework with web applications and data visualizations for dogfooding.

May 2015 - Aug 2015

Preferred Freezer Services — Data Science Research Intern

  • Contributed to an aquaculture facilities database built by the Global Aquaculture Alliance in partnership with Preferred Freezer Services and UMass Dartmouth. The goal was to provide a comprehensive picture of the aquaculture industry in four key areas: feed production, hatcheries, farming, and processing.
  • Developed a web-based data visualization tool using Ember.js to analyze and compare aquaculture facilities data.
  • Automated the sending of customized emails using PHPMailer.

Education

University of Massachusetts Dartmouth - MS in Computer Science, 2016.

University of Mumbai - BE in Computer Engineering, 2013.

Recent Projects

Research & Technical Writings