remote
Associate Director, Data Engineer - MSD
Data Engineer
Lead data engineering initiatives for a global biopharma organization, designing scalable pipelines, data warehouses, and analytics solutions using Python, Spark, SQL, and AWS cloud services.
About the role
Key Responsibilities
- Architect and implement end‑to‑end data pipelines that ingest, transform, and load large‑scale biomedical and operational data.
- Design, build, and maintain cloud‑based data warehouses and data lakes on AWS, ensuring high performance and reliability.
- Lead a team of data engineers, providing technical guidance, code reviews, and mentorship to foster best practices.
- Collaborate with data scientists, analysts, and business stakeholders to translate requirements into robust data models and analytical solutions.
- Establish data governance, security, and quality frameworks aligned with regulatory standards in the pharmaceutical industry.
Requirements
- 5+ years of hands‑on experience in data engineering, with a strong focus on Python, SQL, and Apache Spark.
- Proven expertise designing and operating data warehouses or data lakes on AWS (e.g., Redshift, S3, Glue).
- Deep understanding of ETL/ELT processes, data modeling, and performance optimization for large datasets.
- Experience leading technical teams and delivering complex data solutions in a regulated environment.
- Bachelor’s or higher degree in Computer Science, Engineering, or a related field.
Skills
pythonsqlapache sparkaws