Remote / Onsite
Data Engineer 2 - Providence Global Center
Senior Data Engineer focused on building scalable data pipelines and modeling solutions using Python, SQL, AWS, and Spark to support healthcare analytics and reporting.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines that ingest, transform, and load large volumes of healthcare data from diverse sources.
- Implement data modeling best practices to support analytics, reporting, and machine learning initiatives.
- Leverage AWS services (S3, Redshift, Glue, EMR) to build scalable, secure, and cost‑efficient data infrastructure.
- Collaborate with data scientists, analysts, and product teams to understand requirements and deliver high‑quality data solutions.
- Monitor pipeline performance, troubleshoot issues, and continuously optimize for speed and reliability.
Requirements
- 3+ years of experience as a Data Engineer in a healthcare or large enterprise environment.
- Proficiency in Python, SQL, and Spark for data processing and transformation.
- Hands‑on experience with AWS data services (S3, Redshift, Glue, EMR).
- Strong understanding of data modeling, ETL design, and data quality practices.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced setting.