remote
Senior Data Engineer - Zocalo Health
Data Engineer
Lead end‑to‑end data pipeline development for a health tech platform, leveraging Python, SQL, AWS, and Airflow to build scalable, high‑quality data warehouses and analytics solutions.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines that ingest, transform, and load large volumes of healthcare and behavioral data into cloud data warehouses.
- Collaborate with data scientists, product managers, and clinicians to define data models, schemas, and quality metrics that support analytics and reporting.
- Implement and optimize ETL workflows using Airflow, Python, and SQL, ensuring reliability, scalability, and adherence to data governance standards.
- Leverage AWS services (S3, Redshift, Glue, Lambda) and BigQuery to build cost‑effective, high‑performance data storage and processing solutions.
- Monitor pipeline performance, troubleshoot issues, and continuously improve data quality and processing efficiency.
Requirements
- 5+ years of experience as a data engineer in a fast‑paced, cloud‑native environment.
- Proficiency in Python, SQL, and experience building ETL pipelines with Airflow.
- Hands‑on experience with AWS data services (S3, Redshift, Glue, Lambda) and Google BigQuery.
- Strong understanding of data modeling, schema design, and data warehousing best practices.
- Excellent problem‑solving skills and a passion for delivering clean, reliable data solutions.
Skills
pythonsqlawsairflow