On-site
Junior Data Engineer - Statista
Data Engineer
The Junior Data Engineer builds and maintains data pipelines, optimizes data workflows, and ensures data quality using Python, SQL, and cloud services such as AWS. The role focuses on ETL, Spark, and data modeling to support analytics and reporting.
About the role
Key Responsibilities
- Design, develop, and maintain scalable ETL pipelines that ingest data from diverse sources, transform it, and load it into data warehouses.
- Collaborate with data scientists and analysts to understand data requirements and deliver clean, well‑documented datasets.
- Optimize data processing workflows using Spark, SQL, and Python, ensuring performance and reliability.
- Implement data quality checks, monitoring, and alerting to maintain high data integrity.
- Manage data storage and compute resources on AWS (S3, Redshift, EMR, Glue).
- Document data models, pipeline logic, and best practices for future reference.
Requirements
- Strong programming skills in Python and SQL.
- Experience with ETL tools and frameworks (e.g., Apache Spark, Airflow).
- Familiarity with cloud data services, preferably AWS.
- Solid understanding of data modeling and relational database concepts.
- Excellent problem‑solving skills and attention to detail.