onsite
Data Engineer - DAMPSOFT GmbH
The Data Engineer designs, builds, and maintains scalable data pipelines and infrastructure using Python, SQL, AWS, and Spark to support analytics and machine learning initiatives.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines and ETL processes to ingest, transform, and store large volumes of structured and unstructured data.
- Implement data models and schemas in relational and NoSQL databases, ensuring optimal performance and data integrity.
- Leverage AWS services (S3, Redshift, Glue, EMR) to build scalable, cost‑effective data solutions.
- Collaborate with data scientists and analysts to provide clean, well‑documented datasets for modeling and reporting.
- Monitor pipeline health, troubleshoot issues, and continuously improve data quality and processing efficiency.
Requirements
- 3+ years of experience as a Data Engineer or in a similar role.
- Experience with data modeling, ETL design, and performance tuning.
- Excellent problem‑solving skills and the ability to work collaboratively in a fast‑paced environment.
Skills
Python, SQL, AWS, Apache Spark