Remote
Senior Data Engineer - People Analytics - SAP
Data Engineer
Lead the design and implementation of scalable data pipelines for People Analytics, leveraging Python, Spark, and AWS to transform and model workforce data for actionable insights.
About the role
Key Responsibilities
- Architect, develop, and maintain robust data pipelines that ingest, transform, and store large volumes of workforce data using Python and Apache Spark.
- Collaborate with data scientists and business stakeholders to define data models and ensure data quality for People Analytics use cases.
- Implement and optimize ETL processes using AWS services (Glue, S3, Redshift) to support real‑time and batch analytics.
- Monitor pipeline performance, troubleshoot issues, and continuously improve data processing efficiency.
- Document data architecture, pipeline logic, and best practices for future maintenance and onboarding.
Requirements
- 5+ years of experience in data engineering with a focus on large-scale data processing.
- Strong proficiency in Python, SQL, and Apache Spark.
- Hands‑on experience with AWS data services (Glue, S3, Redshift, Athena).
- Solid understanding of data modeling, ETL design, and data quality principles.
- Excellent communication skills and ability to work cross‑functionally with analytics teams.
Skills
Python, SQL, Apache Spark, AWS