On-site
Sr. Data Engineer - CVS Health
Data Engineer
Senior Data Engineer responsible for designing, building, and maintaining large‑scale data pipelines and ETL workflows using Python, Spark, and AWS services, ensuring high‑quality, scalable data solutions for health analytics.
About the role
Key Responsibilities
- Design, develop, and optimize large‑scale data pipelines and ETL workflows using Python, Apache Spark, and AWS services.
- Collaborate with data scientists and business stakeholders to translate analytical requirements into robust data models and solutions.
- Implement and maintain data quality, governance, and security best practices across all data assets.
- Monitor, troubleshoot, and improve pipeline performance, ensuring reliability and scalability.
- Document architecture, processes, and data lineage for internal and external audit purposes.
Requirements
- 5+ years of experience in data engineering with a strong focus on ETL and data pipeline development.
- Proficiency in Python, SQL, and Apache Spark for large‑scale data processing.
- Hands‑on experience with AWS data services (Redshift, S3, Glue, EMR) and orchestration tools such as Apache Airflow.
- Solid understanding of data modeling, schema design, and data warehousing concepts.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced environment.
Skills
Python, SQL, Apache Spark, AWS, Airflow