onsite
Data Engineer - Arriva
Design, build, and maintain scalable data pipelines and analytics platforms using Python, SQL, Spark, Airflow, and AWS to enable real‑time decision‑making across railway operations.
About the role
Key Responsibilities
- Build and maintain scalable ETL/ELT pipelines to ingest structured and unstructured data from operational and enterprise systems.
- Develop and optimise data models and schemas for performance, clarity, and alignment with downstream analytics.
- Collaborate with data scientists, analysts, and business stakeholders to translate requirements into robust data solutions.
- Implement data quality, monitoring, and governance practices to ensure reliability and compliance.
- Leverage AWS services (S3, Redshift, Glue, EMR) and orchestration tools (Airflow) to deliver end‑to‑end data workflows.
Requirements
- Proven experience designing and deploying data pipelines with Python, SQL, and Spark.
- Strong knowledge of ETL/ELT frameworks and data modelling best practices.
- Hands‑on experience with AWS data services and workflow orchestration (Airflow).
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
- Effective communication skills for translating technical concepts to non‑technical stakeholders.
Skills
Python, SQL, Apache Spark, Airflow, AWS