On-site
AI Data Engineer - VisiConsult X-ray Systems & Solutions GmbH
Data Engineer
Lead the design and implementation of AI-driven data pipelines, leveraging Python, SQL, and Spark on AWS to deliver scalable, high‑quality datasets for machine learning models.
About the role
Key Responsibilities
- Design, develop, and maintain end‑to‑end data pipelines that ingest, transform, and store large volumes of structured and unstructured data.
- Collaborate with data scientists to prepare feature stores and ensure data quality for AI model training.
- Implement scalable solutions on AWS (S3, Redshift, EMR, Glue) and optimize performance using Spark and SQL.
- Automate data workflows with Airflow or similar orchestration tools, ensuring reliability and observability.
- Monitor pipeline health, troubleshoot issues, and continuously improve data processing efficiency.
Requirements
- 3+ years of experience in data engineering, with strong proficiency in Python and SQL.
- Hands‑on experience with Spark, AWS services, and data pipeline orchestration.
- Solid understanding of machine learning data requirements and feature engineering.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
Python, SQL, Machine Learning, AWS