onsite
Data Engineer - Hochschule fur angewandte Wissenschaften Munchen
Data Engineer
Data Engineer responsible for designing, building, and maintaining scalable data pipelines and infrastructure using Python, SQL, AWS, and Spark to support analytics and machine learning initiatives.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines and ETL processes to ingest, transform, and load data from diverse sources into data warehouses and lakes.
- Implement data models and schemas that support business intelligence, reporting, and advanced analytics.
- Optimize data workflows for performance, reliability, and cost-efficiency on AWS services such as S3, Redshift, Glue, and EMR.
- Collaborate with data scientists, analysts, and stakeholders to understand data requirements and deliver high-quality, reproducible datasets.
- Monitor, troubleshoot, and improve data pipeline health, ensuring data quality and compliance with governance policies.
Requirements
- Proven experience as a Data Engineer or similar role, with strong proficiency in Python and SQL.
- Hands‑on experience with AWS data services (S3, Redshift, Glue, EMR) and big‑data processing frameworks (Apache Spark).
- Solid understanding of data modeling, ETL design patterns, and data warehousing concepts.
- Experience with version control (Git), CI/CD pipelines, and automated testing for data workflows.
- Excellent problem‑solving skills and ability to work collaboratively in a cross‑functional team.
Skills
pythonsqlawsapache spark