onsite
Senior Data Engineer - HarmonyCares
Data Engineer
Senior Data Engineer building scalable data pipelines and analytics solutions on AWS, leveraging Python, SQL, and Spark to transform complex healthcare data into actionable insights for value‑based care delivery.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines that ingest, transform, and load large volumes of healthcare data from disparate sources into cloud data warehouses.
- Implement ETL processes using Python, SQL, and Spark, ensuring data quality, lineage, and compliance with healthcare regulations.
- Collaborate with data scientists, analysts, and product teams to define data models, schemas, and performance tuning strategies.
- Monitor pipeline performance, troubleshoot issues, and optimize resource utilization on AWS services such as S3, Redshift, Glue, and EMR.
- Document architecture, code, and best practices to support knowledge transfer and audit readiness.
Requirements
- 5+ years of experience in data engineering within a healthcare or related domain.
- Proficiency in Python, SQL, and Spark for large‑scale data processing.
- Hands‑on experience with AWS data services (S3, Redshift, Glue, EMR).
- Strong understanding of data modeling, ETL design patterns, and performance optimization.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced environment.