onsite
Data Engineer - Transdev
Data Engineer
Build and maintain scalable data pipelines, optimize data storage, and enable analytics across cloud platforms using Python, SQL, AWS, and Spark.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines to ingest, transform, and store large volumes of structured and unstructured data.
- Implement data models and schemas that support business intelligence, reporting, and advanced analytics.
- Optimize query performance and storage costs on AWS services such as Redshift, S3, and Glue.
- Collaborate with data scientists, analysts, and product teams to understand data requirements and deliver high‑quality datasets.
- Monitor pipeline health, troubleshoot issues, and implement automated alerts and recovery mechanisms.
Requirements
- Proven experience with Python, SQL, and data engineering tools (e.g., Apache Spark, Airflow).
- Hands‑on knowledge of AWS data services (Redshift, S3, Glue, Lambda).
- Strong understanding of data modeling, ETL best practices, and performance tuning.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
pythonsqlawsapache spark