remote
Data Engineer - Vivian Health
Data Engineer
Build and maintain scalable data pipelines that power an AI‑driven healthcare recruitment platform, leveraging Python, SQL, and AWS to deliver clean, reliable data for analytics and machine learning.
About the role
Key Responsibilities
- Design, develop, and maintain robust ETL pipelines to ingest, transform, and load large volumes of healthcare workforce data into data warehouses.
- Collaborate with data scientists and product teams to ensure data quality, consistency, and accessibility for AI models and reporting.
- Implement data modeling best practices, including dimensional modeling and schema optimization for performance.
- Monitor pipeline health, troubleshoot issues, and continuously improve reliability and scalability using AWS services.
- Document data flows, architecture decisions, and maintain data dictionaries for internal stakeholders.
Requirements
- 3+ years of experience as a data engineer in a fast‑paced environment.
- Proficiency in Python, SQL, and experience with AWS data services (Redshift, S3, Glue, Lambda).
- Strong understanding of data modeling, ETL design patterns, and performance tuning.
- Experience with version control (Git) and CI/CD pipelines for data workflows.
- Excellent problem‑solving skills and a collaborative mindset.