onsite
Data Engineer - Rheinmetall AG
Data Engineer
Data Engineer responsible for designing, building, and maintaining scalable data pipelines and lakehouse architecture using Python, SQL, and AWS services, ensuring high data quality and performance for analytics and machine learning workloads.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines and lakehouse solutions on AWS, leveraging services such as S3, Glue, Redshift, and Athena.
- Implement ETL processes using Python and Apache Spark to ingest, transform, and load large volumes of structured and semi‑structured data.
- Collaborate with data scientists and business analysts to define data models, schemas, and metadata management strategies.
- Ensure data quality, lineage, and security compliance across all data assets.
- Optimize query performance and storage costs through partitioning, indexing, and cost‑effective data lake design.
- Monitor pipeline health, troubleshoot issues, and continuously improve automation and documentation.
Requirements
- 3+ years of experience in data engineering, preferably in a cloud environment.
Skills
pythonsqlawsapache spark