onsite
Forward Deployed Data Engineer Expert - SAP
Data Engineer
Lead end‑to‑end data engineering projects, building scalable pipelines on AWS, Spark, and Airflow, while collaborating closely with data science teams to deploy ML models and optimize data workflows.
About the role
Key Responsibilities
- Design, develop, and maintain large‑scale data pipelines using Python, Spark, and Airflow on AWS.
- Collaborate with data scientists to operationalize machine learning models into production workflows.
- Ensure data quality, governance, and security across all data assets.
- Optimize performance and cost of data infrastructure, implementing best practices for scalability.
- Mentor junior engineers and drive continuous improvement of engineering processes.
Requirements
- 5+ years of experience in data engineering, with strong Python and SQL skills.
- Proven expertise in AWS services (EMR, Redshift, S3, Glue) and Spark.
- Hands‑on experience with Airflow and CI/CD for data pipelines.
- Solid understanding of data modeling, ETL, and data lake architectures.
- Excellent communication skills and a collaborative mindset.
Skills
pythonsqlawsairflowmachine learning