onsite
KI Data Engineer - Syngenio
Data Engineer
Lead end‑to‑end data pipeline development, integrating AI models into scalable cloud solutions using Python, SQL, AWS, and Spark. Drive data quality, automation, and performance for advanced analytics.
About the role
Key Responsibilities
- Design, build, and maintain robust data pipelines that ingest, transform, and store large volumes of structured and unstructured data.
- Implement and optimize machine learning workflows, ensuring seamless integration of models into production environments.
- Leverage AWS services (S3, Redshift, Glue, Lambda) and containerization (Docker) to deliver scalable, secure data solutions.
- Collaborate with data scientists, analysts, and product teams to translate business requirements into technical specifications.
- Monitor pipeline performance, troubleshoot issues, and continuously improve data quality and processing efficiency.
Requirements
- Proven experience as a data engineer or similar role, with strong Python and SQL skills.
- Hands‑on expertise with AWS data services and container orchestration.
- Solid understanding of machine learning concepts and model deployment pipelines.
- Experience with big data frameworks such as Apache Spark.
- Excellent problem‑solving abilities and a collaborative mindset.
Skills
pythonsqlawsmachine learningapache sparkdocker