onsite
Senior Research Data Engineer - PointClickCare
Data Engineer
Senior Research Data Engineer driving large‑scale health data pipelines, building robust ETL processes, and enabling analytics on a massive long‑term care dataset using Python, Spark, SQL, and AWS services.
About the role
Key Responsibilities
- Design, develop, and maintain high‑performance data pipelines that ingest, transform, and store massive healthcare datasets.
- Implement scalable ETL workflows using Python, Apache Spark, and SQL on AWS cloud infrastructure.
- Collaborate with data scientists, product managers, and domain experts to translate research requirements into reliable data solutions.
- Optimize data models and storage strategies for query performance and cost efficiency.
- Ensure data quality, governance, and compliance with healthcare regulations.
Requirements
- 5+ years of experience engineering data pipelines in a cloud environment, preferably AWS.
- Strong proficiency in Python, SQL, and Spark for large‑scale data processing.
- Hands‑on experience with data modeling, ETL design, and big‑data storage technologies (e.g., Redshift, S3, Snowflake).
- Background in healthcare or long‑term care data is a plus.
- Excellent problem‑solving skills and ability to work cross‑functionally in an agile setting.
Skills
pythonsqlapache sparkaws