remote
Data Platform Architect - Happen Bank
Software Engineer
Lead the design and implementation of a scalable, cloud‑native data platform that powers analytics, reporting, and machine‑learning initiatives across the organization.
About the role
Key Responsibilities
- Architect and evolve a unified data lake and warehouse solution on AWS, ensuring high availability, security, and performance.
- Design and implement robust ETL pipelines using Python and Apache Spark to ingest, transform, and load data from diverse sources.
- Collaborate with data scientists, analysts, and product teams to define data models, schemas, and governance policies that support business intelligence and ML workloads.
- Drive data quality, lineage, and metadata management initiatives, leveraging tools such as AWS Glue, Lake Formation, and Airflow.
- Mentor and guide a small team of data engineers, fostering best practices in coding, testing, and documentation.
Requirements
- 5+ years of experience building enterprise data platforms in a cloud environment.
- Proficiency in Python, SQL, and Spark for large‑scale data processing.
- Hands‑on experience with AWS services (S3, Redshift, Glue, Lake Formation, Athena).
- Strong understanding of data modeling, ETL design patterns, and data governance principles.
- Excellent communication skills and a collaborative mindset.
Skills
pythonapache sparkaws