remote
Data Engineer - Active Care Group
About the role
The Data Engineer is responsible for designing, building, and maintaining robust data pipelines that ingest data from diverse source systems into Snowflake, using Python, SQL, Airflow, and AWS services to enable reliable reporting and operational analytics.
Key Responsibilities
- Design, develop, and maintain scalable ETL pipelines that move data from multiple source platforms into Snowflake.
- Implement workflow orchestration using Apache Airflow, ensuring reliable scheduling, monitoring, and error handling.
- Collaborate with the Head of Software Engineering and cross‑functional teams to define data models, schemas, and integration requirements.
- Write efficient, production‑grade Python and SQL code for data transformation, validation, and enrichment.
- Manage cloud infrastructure on AWS, including S3 storage, IAM permissions, and networking for secure data movement.
- Monitor pipeline performance, troubleshoot issues, and continuously optimise for cost and speed.
Requirements
- Proven experience building data pipelines with Python, SQL, and Airflow in a cloud environment.
- Strong hands‑on knowledge of Snowflake data warehousing, including loading, clustering, and query optimisation.
- Familiarity with AWS services such as S3, Lambda, and IAM for data ingestion and security.
- Solid understanding of data modelling, ETL best practices, and data quality frameworks.
- Ability to work autonomously, prioritise tasks, and communicate technical concepts to non‑technical stakeholders.
Skills
Python, SQL, Snowflake, AWS