onsite
Principal Data Engineer - Alexander Thamm GmbH
Data Engineer
Lead the design and implementation of scalable data pipelines, leveraging Python, SQL, and AWS services to deliver high‑quality data solutions for enterprise analytics.
About the role
Key Responsibilities
- Architect, develop, and maintain end‑to‑end data pipelines using Python, SQL, and AWS services (S3, Redshift, Glue).
- Design and enforce data models, schemas, and metadata management for large‑scale datasets.
- Implement and optimize ETL processes with Apache Spark and Airflow, ensuring performance and reliability.
- Collaborate with data scientists and business stakeholders to translate analytical requirements into robust data solutions.
- Mentor junior engineers, conduct code reviews, and promote best practices in data engineering.
Requirements
- 10+ years of experience in data engineering, with a proven track record in large‑scale data platform development.
- Expertise in Python, SQL, and AWS ecosystem (S3, Redshift, Glue, EMR).
- Strong knowledge of data modeling, ETL design, and performance tuning.
- Hands‑on experience with Apache Spark, Airflow, and related big‑data technologies.
- Excellent communication skills and ability to lead cross‑functional teams.
Skills
pythonsqlawsapache sparkairflow