onsite
Lead / Principal Data Engineer - Stashfin
Data Engineer
Senior data engineering leader designing, building, and scaling end‑to‑end data pipelines on AWS, leveraging Python, Spark, and Airflow to support product, analytics, and ML teams with robust data models and governance.
About the role
Key Responsibilities
- Architect, develop, and maintain scalable data pipelines and ETL workflows using Python, Spark, and Airflow on AWS.
- Define and enforce data modeling, quality, governance, and security standards across the organization.
- Collaborate with product, analytics, and ML teams to translate business requirements into reliable data solutions.
- Lead technical decision‑making, evaluate new tools, and drive continuous improvement of data infrastructure.
- Mentor and guide a team of data engineers, fostering a culture of best practices and high performance.
Requirements
- 10+ years of data engineering experience with a proven track record in large‑scale pipeline design.
- Expertise in Python, SQL, Apache Spark, and Airflow, with hands‑on experience on AWS services (Redshift, S3, Glue, EMR).
- Strong knowledge of data modeling, governance, and security principles.
- Excellent communication skills and ability to translate technical concepts to non‑technical stakeholders.
- Experience leading and scaling engineering teams in a fast‑moving environment.
Skills
pythonsqlapache sparkairflowaws