onsite
Senior Data Engineer - Data Integration & Data Warehouse - Wardow GmbH Potsdam
Data Engineer
Lead end‑to‑end data integration and warehouse development, designing scalable pipelines with Python, SQL, Spark, and AWS services to deliver high‑quality analytics for business stakeholders.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines and ETL processes using Python, SQL, and Spark to ingest, transform, and load data into enterprise data warehouses.
- Architect and optimize data warehouse solutions on AWS (Redshift, S3, Glue) ensuring performance, scalability, and cost efficiency.
- Implement and manage workflow orchestration with Airflow, monitoring job health and troubleshooting failures.
- Collaborate with data scientists, analysts, and product teams to understand data requirements and deliver clean, well‑documented datasets.
- Enforce data quality, governance, and security best practices across all data assets.
Requirements
- 5+ years of experience in data engineering, with a strong background in ETL and data warehousing.
- Proficiency in Python, SQL, and Spark for large‑scale data processing.
- Hands‑on experience with AWS data services (Redshift, Glue, S3, Lambda).
- Solid knowledge of workflow orchestration tools, preferably Airflow.
- Excellent problem‑solving skills and a proactive, collaborative mindset.
Skills
pythonsqlawsapache sparkairflow