remote
Principal Data Engineer - BHP
Data Engineer
Lead the design and implementation of enterprise data pipelines, leveraging Python, Spark, and AWS to deliver scalable, high‑performance data solutions for mining operations.
About the role
Key Responsibilities
- Architect, develop, and maintain large‑scale data pipelines using Python, Spark, and AWS services (Glue, Redshift, S3).
- Design and enforce data models, schemas, and metadata management to support analytics and reporting.
- Collaborate with data scientists, analysts, and business stakeholders to translate requirements into robust data solutions.
- Implement best practices for data quality, lineage, and security across the data platform.
- Mentor and guide junior engineers, fostering a culture of continuous improvement and knowledge sharing.
Requirements
- 10+ years of experience in data engineering, with a strong focus on big data technologies.
- Proficiency in Python, SQL, and Apache Spark for data processing and transformation.
- Hands‑on experience with AWS data services (Glue, Redshift, S3, EMR).
- Deep understanding of data modeling, ETL design, and data governance principles.
- Excellent communication skills and ability to work cross‑functionally in a fast‑paced environment.
Skills
pythonsqlapache sparkaws