onsite
Data Engineer - alfatraining Bildungszentrum GmbH
Data Engineer
Data Engineer responsible for designing, building, and maintaining scalable data pipelines using Python, SQL, and AWS services to support analytics and machine learning initiatives.
About the role
Key Responsibilities
- Design, develop, and optimize ETL pipelines to ingest, transform, and load large datasets from diverse sources into data warehouses.
- Implement data modeling best practices, ensuring data integrity, consistency, and performance across relational and cloud-based storage solutions.
- Collaborate with data scientists and analysts to provide reliable, high‑quality data for reporting, dashboards, and predictive models.
- Monitor pipeline health, troubleshoot issues, and proactively improve reliability and scalability using AWS monitoring tools.
- Document data flows, schema definitions, and operational procedures for maintainability and compliance.
Requirements
- Proven experience with Python, SQL, and AWS services (S3, Redshift, Glue, Lambda).
- Strong understanding of data modeling, normalization, and performance tuning.
- Hands‑on experience building and maintaining ETL pipelines in a cloud environment.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced team.