onsite
Senior Data Engineer - CVS Health
Data Engineer
Lead the design and implementation of scalable data pipelines using Python, SQL, and AWS services, driving data quality and performance for enterprise health analytics.
About the role
Key Responsibilities
- Architect, develop, and maintain large-scale data pipelines and ETL processes using Python, SQL, and Apache Spark on AWS.
- Design and enforce data models, schemas, and metadata management to support analytics and reporting.
- Collaborate with data scientists, analysts, and product teams to understand data requirements and deliver high‑quality datasets.
- Optimize pipeline performance, monitor data quality, and implement automated testing and alerting.
- Mentor junior engineers and promote best practices in coding, documentation, and DevOps.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python and SQL.
- Proficiency with AWS services (S3, Redshift, Glue, EMR, Lambda) and experience building serverless data workflows.
- Hands‑on experience with Apache Spark, Hadoop, or similar big‑data frameworks.
- Solid understanding of data modeling, ETL design, and data governance principles.
- Excellent problem‑solving skills, strong communication, and a collaborative mindset.
Skills
pythonsqlawsapache spark