On-site
AI Data Engineer - Hays Professional Solutions GmbH, Ulm location
Data Engineer
Design and implement scalable data pipelines and AI solutions, leveraging Python, Spark, and cloud services to support advanced analytics and machine learning initiatives.
About the role
Key Responsibilities
- Develop and maintain robust ETL pipelines using Python and Apache Spark to ingest, transform, and store large‑scale datasets.
- Design data models and schemas optimized for machine‑learning workloads and analytical queries.
- Integrate AI/ML models into production data flows, ensuring reliability and performance.
- Collaborate with data scientists, analysts, and engineering teams to define data requirements and deliver end‑to‑end solutions.
- Implement data governance, security, and quality controls in line with best practices.
- Utilize AWS services (e.g., S3, Redshift, Glue, Lambda) to build cloud‑native data platforms.
Requirements
- Strong proficiency in Python and SQL for data manipulation and automation.
- Hands‑on experience with Apache Spark or similar distributed processing frameworks.
- Solid understanding of machine‑learning concepts and experience deploying models in production.
- Proficiency with AWS cloud services and infrastructure-as-code tools.
- Background in data modeling, warehousing, and data‑pipeline architecture.
Skills
Python, SQL, Apache Spark, Machine Learning, AWS