onsite
Senior Data Engineer - Mozilla
Lead end‑to‑end data pipeline development, architect scalable solutions on AWS, and drive data quality and governance for a high‑traffic, privacy‑focused organization.
About the role
Key Responsibilities
- Design, build, and maintain robust data pipelines that ingest, transform, and store large volumes of structured and unstructured data.
- Leverage AWS services (S3, Redshift, Glue, EMR) to create scalable, cost‑effective data architectures.
- Implement ETL processes using Python and Spark, ensuring data integrity, performance, and compliance with privacy standards.
- Collaborate with data scientists, product teams, and stakeholders to define data requirements and deliver actionable insights.
- Monitor pipeline health, troubleshoot issues, and continuously optimize for speed and reliability.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python and SQL.
- Proven expertise in AWS data services and big‑data processing frameworks such as Spark.
- Solid understanding of data modeling, ETL best practices, and data governance.
- Experience with version control (Git), CI/CD pipelines, and automated testing.
- Excellent communication skills and a collaborative mindset.