On-site
Junior Big Data Engineer Consultant - Integration Factory
Data Engineer
Entry‑level consultant focused on designing, building, and maintaining scalable big data pipelines using Spark, Python, and AWS services. Works closely with clients to transform raw data into actionable insights.
About the role
Key Responsibilities
- Design, develop, and deploy data pipelines on AWS using Spark and Python.
- Transform and cleanse large datasets, ensuring data quality and integrity.
- Collaborate with data scientists and business stakeholders to translate requirements into technical solutions.
- Monitor pipeline performance, troubleshoot issues, and implement optimizations.
- Document architecture, code, and best practices for future maintenance.
Requirements
- Strong foundation in SQL and relational database concepts.
- Hands‑on experience with Apache Spark and Python for data processing.
- Familiarity with AWS services such as S3, EMR, Glue, and Redshift.
- Excellent problem‑solving skills and ability to work in a fast‑paced consulting environment.
- Effective communication skills in German and English.
Skills
Apache Spark, Python, SQL, AWS