remote
Data Platform Engineer Analyst - Sanofi
Data Engineer
Data Platform Engineer Analyst responsible for building and maintaining scalable data pipelines, optimizing data lake architecture, and ensuring high-quality data for analytics across the organization using Python, SQL, Spark, and AWS services.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines that ingest, transform, and load data into the enterprise data lake.
- Collaborate with data scientists and business analysts to define data models, schemas, and metadata standards.
- Implement performance tuning, monitoring, and troubleshooting for large-scale ETL processes using Spark and AWS Glue.
- Ensure data quality, lineage, and security compliance across all data assets.
- Document architecture, code, and best practices for future maintainability.
Requirements
- 3+ years of experience in data engineering with a focus on cloud-based data platforms.
- Proficiency in Python, SQL, and Spark for data processing.
- Hands‑on experience with AWS services such as S3, Glue, Redshift, and Lake Formation.
- Strong understanding of data lake concepts, data modeling, and ETL best practices.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced environment.