remoteonsite
Principal Data Engineer - Amgen
Data Engineer
Lead end‑to‑end data platform design and implementation, driving scalable pipelines, data quality, and analytics enablement using Python, SQL, AWS, and Big Data technologies.
About the role
Key Responsibilities
- Architect and build enterprise‑scale data pipelines and lakehouse solutions on AWS, ensuring performance, reliability, and security.
- Design data models, schemas, and metadata management strategies to support analytics, ML, and reporting workloads.
- Lead cross‑functional teams in developing ETL/ELT processes using Spark, Python, and SQL, optimizing for cost and latency.
- Implement data governance, lineage, and quality frameworks to meet regulatory and compliance standards.
- Mentor and coach junior engineers, fostering best practices in coding, testing, and documentation.
Requirements
- 10+ years of data engineering experience with a proven track record in large‑scale data platform delivery.
- Expertise in AWS services (S3, Redshift, Glue, EMR, Athena) and Big Data processing frameworks.
- Strong programming skills in Python and SQL, with experience in Spark/Databricks.
- Deep understanding of data modeling, ETL design, and data governance principles.
- Excellent communication skills and ability to influence stakeholders across technical and business domains.