onsite
Data Engineer III - RELX Group
Data Engineer
Senior Data Engineer driving scalable data pipelines and analytics solutions in healthcare technology, leveraging Python, SQL, AWS, and Spark to deliver high‑quality, privacy‑compliant data services.
About the role
Key Responsibilities
- Design, develop, and maintain large‑scale data pipelines using Python, SQL, and Spark on AWS.
- Implement robust ETL processes to ingest, transform, and load healthcare data while ensuring data quality and compliance.
- Collaborate with data scientists, analysts, and product teams to define data models and optimize query performance.
- Monitor pipeline health, troubleshoot issues, and continuously improve reliability and scalability.
- Document architecture, data flows, and best practices for future maintenance and onboarding.
Requirements
- 5+ years of experience in data engineering with a focus on healthcare or related domains.
- Strong proficiency in Python, SQL, and Spark for data processing.
- Hands‑on experience with AWS services (S3, Redshift, Glue, EMR, Lambda).
- Solid understanding of data modeling, ETL design, and performance tuning.
- Excellent communication skills and a collaborative mindset.