Onsite
Staff Data Engineer - CVS Health
Data Engineer
Lead the design, development, and maintenance of large‑scale data pipelines and ETL workflows using Python, SQL, Spark, Airflow, and AWS services to support enterprise analytics and decision‑making.
About the role
Key Responsibilities
- Architect, build, and optimize scalable data pipelines and ETL processes to ingest, transform, and load data from diverse sources into enterprise data warehouses.
- Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high‑quality, reproducible data assets.
- Implement and maintain data quality, lineage, and governance practices, ensuring compliance with security and privacy standards.
- Leverage AWS services (S3, Redshift, Glue, EMR) and open‑source tools (Spark, Airflow) to deliver robust, cost‑effective solutions.
- Monitor pipeline performance, troubleshoot issues, and continuously improve throughput and reliability.
Requirements
- 10+ years of experience in data engineering, with a strong background in Python, SQL, and large‑scale data processing.
- Proven expertise in designing and operating ETL pipelines using Spark, Airflow, and AWS data services.
- Deep understanding of data modeling, data warehousing, and data governance principles.
- Excellent problem‑solving skills and the ability to work collaboratively in a fast‑paced environment.
- Strong communication skills and a passion for delivering data solutions that drive business impact.
Skills
Python, SQL, Apache Spark, AWS