CV

Virinderpal Singh Batth

Data Engineer Resume

Experience

Lead Data Engineer | Insurance

evolv ConsultingOct 2025 – Present

  • Leading and growing a team of 3+ data engineers to architect an insurance client’s first unified operational data store in Snowflake using dbt
  • Defining master data standards across 4+ legacy platforms, resolving data overlaps between sub-companies
  • Architecting high-performance data pipelines serving a transactional Kong API layer for real-time business insights
  • Built reusable dbt framework for automated data extracts to AWS S3, enabling self-service reporting

Data Engineer | Financial Credit Risk

USAAApr 2021 – Oct 2025

  • Resolved critical Snowflake performance bottleneck, reducing compute usage by 95% and significantly decreasing query costs
  • Optimized SCD Type 2 queries achieving 90% reduction in running time (3 hours → 15 minutes) and 70% reduction in data scan volume
  • Designed Kafka streaming pipeline for ML model training with staging tables, automated resiliency checks, and idempotent loading
  • Led platform modernization migrating 2+ terabytes from Hadoop to Snowflake using PySpark and parquet format
  • Drove Secured Card to Credit Card transition resulting in 30% increase in member engagement
  • Implemented decoupled-push architecture reducing On-Call overhead by 90%
  • Architected cross-organizational data lake POC with AWS S3, reducing transfer time by 50%
  • Enhanced PII/PCI/PHI security with data masking, tokenization, and RBAC

Technical Skills

Programming: Python (PySpark, Pandas, SQLAlchemy, FastAPI), SQL, Bash, Git
Data Engineering: dbt, Kafka, Flink, NiFi, IBM DataStage
Cloud & Platforms: AWS (S3, EC2, Redshift, Lambda, Athena), Snowflake, Hadoop, DB2
Practices: CI/CD, Data Governance, RBAC, Data Masking, Tokenization

Education

B.S. in Computer Science | Big Data & Analytics Concentration New York Institute of Technology, New York