onsite
ML Data Engineer - Landesamt für Zentrale Polizeiliche Dienste NRW
Data Engineer
The ML Data Engineer is responsible for building scalable data pipelines, deploying machine learning models, and optimizing data workflows on AWS. The role requires strong Python, SQL, and Spark skills, plus experience with ML model integration and cloud infrastructure.
About the role
Key Responsibilities
- Design, develop, and maintain end‑to‑end data pipelines for large‑scale datasets using Spark and Python.
- Collaborate with data scientists to deploy and monitor machine learning models in production environments.
- Implement data quality checks, performance tuning, and automated testing for reliability.
- Leverage AWS services (S3, Glue, EMR, SageMaker) to build scalable, cost‑effective solutions.
- Document architecture, processes, and best practices for cross‑functional teams.
Requirements
- Proven experience in data engineering with Python and SQL.
- Hands‑on knowledge of Spark, the Hadoop ecosystem, and distributed computing.
- Strong understanding of machine learning workflows and model deployment.
- Experience with AWS services and infrastructure as code.
- Excellent problem‑solving skills and ability to work in a collaborative environment.
Skills
Python, SQL, Machine Learning, AWS