onsite
Data Engineer - CETEO GmbH
Data Engineer
Data Engineer responsible for designing, building, and maintaining scalable data pipelines and warehouses using Python, SQL, and AWS services to support analytics and machine learning initiatives.
About the role
Key Responsibilities
- Design, develop, and optimize ETL pipelines to ingest, transform, and load data from diverse sources into cloud data warehouses.
- Implement data modeling and schema design for efficient querying and reporting.
- Collaborate with data scientists and analysts to ensure data quality, consistency, and accessibility.
- Monitor pipeline performance, troubleshoot issues, and implement automated alerts and logging.
- Maintain and enhance data infrastructure using AWS services such as S3, Redshift, Glue, and Lambda.
Requirements
- Proven experience with Python, SQL, and data pipeline frameworks.
- Strong knowledge of AWS data services and cloud architecture.
- Experience with big data technologies (e.g., Spark, Hive) is a plus.
- Solid understanding of data modeling, ETL best practices, and data governance.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced environment.