onsite
Senior Data Analytics Engineer - Providence Global Center
Data Engineer
Lead end‑to‑end data engineering for healthcare analytics, building scalable pipelines on AWS, optimizing SQL and Spark workloads, and designing robust data models to support advanced analytics and reporting.
About the role
Key Responsibilities
- Design, develop, and maintain large‑scale data pipelines using Python, SQL, and Apache Spark on AWS.
- Implement and optimize ETL processes to ingest, transform, and load data from diverse healthcare sources.
- Collaborate with data scientists and business analysts to define data models, schemas, and metadata standards.
- Ensure data quality, governance, and security compliance across all data assets.
- Monitor pipeline performance, troubleshoot issues, and continuously improve system reliability.
Requirements
- 5+ years of experience in data engineering or analytics roles within a healthcare or large enterprise environment.
- Proficiency in Python, SQL, and Spark for data processing and transformation.
- Hands‑on experience with AWS services (S3, Redshift, Glue, EMR, Lambda).
- Strong understanding of data modeling, ETL best practices, and data governance.
- Excellent problem‑solving skills and ability to work collaboratively in a cross‑functional team.
Skills
pythonsqlapache sparkaws