remote
Senior Data Engineer - Interra Health
Data Engineer
Lead the design, build, and optimization of scalable data pipelines and lakehouse architecture using Python, SQL, and AWS services, driving real‑time insights for healthcare decision‑making.
About the role
Key Responsibilities
- Architect and maintain end‑to‑end data pipelines from ingestion to analytics, ensuring high quality and reliability.
- Design and implement data models and schemas for large‑scale healthcare datasets, optimizing for performance and cost.
- Leverage AWS services (Glue, Redshift, S3, Athena) and Spark to process and transform data at petabyte scale.
- Collaborate with data scientists and product teams to deliver actionable insights and support real‑time decision making.
- Implement monitoring, alerting, and automated testing to guarantee pipeline uptime and data integrity.
Requirements
- 5+ years of experience in data engineering, preferably in healthcare or related domains.
- Proficiency in Python, SQL, and Spark for data processing and transformation.
- Hands‑on experience with AWS data services (Glue, Redshift, S3, Athena, EMR).
- Strong understanding of data modeling, ETL best practices, and performance tuning.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.