remote
Data Engineering Expert - SAP
Software Engineer
Lead end‑to‑end data pipelines, architect scalable solutions on AWS, and optimize Spark workloads to deliver high‑quality data products for enterprise customers.
About the role
Key Responsibilities
- Design, develop, and maintain large‑scale data pipelines using Python, SQL, and Apache Spark.
- Implement and manage data ingestion, transformation, and storage solutions on AWS (S3, Redshift, Glue).
- Collaborate with data scientists and product teams to define data models and ensure data quality.
- Optimize pipeline performance, troubleshoot issues, and implement monitoring and alerting.
- Document architecture, processes, and best practices for internal and external stakeholders.
Requirements
- 5+ years of experience in data engineering or related roles.
- Proficiency in Python, SQL, and Spark (PySpark).
- Hands‑on experience with AWS data services (S3, Redshift, Glue, Athena).
- Strong understanding of ETL concepts, data modeling, and data warehousing.
- Excellent problem‑solving skills and ability to work in a fast‑paced environment.
Skills
pythonsqlapache sparkaws