Onsite
Senior Data Engineer - Agentic AI - Vytalize Health
Data Engineer
Lead the design and maintenance of intelligent data pipelines and automation systems for healthcare data, leveraging Python, SQL, Airflow, and AWS to scale clinical and claims data processing while ensuring governance and quality.
About the role
Key Responsibilities
- Architect, develop, and maintain scalable data pipelines that ingest, normalize, and govern clinical and claims data using Python, SQL, and Airflow.
- Collaborate with data scientists and product teams to build agentic AI workflows that automate data preparation and feature engineering.
- Implement robust data quality checks, lineage tracking, and governance policies to support compliance and auditability.
- Optimize pipeline performance and cost on AWS, leveraging services such as S3, Redshift, Glue, and Lambda.
- Mentor junior engineers, conduct code reviews, and promote best practices in data engineering and DevOps.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python, SQL, and Airflow.
- Hands‑on experience building data pipelines on AWS, including Glue, Redshift, and Lambda.
- Deep understanding of data modeling, ETL design, and data governance principles.
- Experience working with healthcare data (clinical, claims) and familiarity with HIPAA compliance.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
Python, SQL, Airflow, AWS