remote
Senior Data Engineer - CMR Surgical
Data Engineer
Lead the design, development, and maintenance of scalable data pipelines and lakehouse architecture using Python, Spark, and AWS services to enable real‑time analytics and machine learning for surgical robotics data.
About the role
Key Responsibilities
- Architect and implement end‑to‑end data pipelines that ingest, transform, and store large volumes of surgical procedure data using Python and Apache Spark on AWS.
- Design and maintain data models, schemas, and metadata catalogs to support analytics, reporting, and ML workloads.
- Collaborate with data scientists, product managers, and clinical teams to define data requirements and deliver high‑quality datasets.
- Optimize query performance and pipeline throughput while ensuring data quality, consistency, and compliance with regulatory standards.
- Mentor junior engineers and promote best practices in data engineering, version control, and CI/CD.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python, SQL, and Spark.
- Hands‑on experience with AWS services such as S3, Redshift, Glue, and EMR.
- Proven ability to design scalable, fault‑tolerant data pipelines and lakehouse architectures.
- Strong analytical skills and a passion for turning complex data into actionable insights.
- Excellent communication skills and a collaborative mindset.
Skills
Python, SQL, Apache Spark, AWS