Lead the design and implementation of real‑time and batch data pipelines on AWS, building scalable, governed data platforms that empower analytics and AI initiatives.
About the role
Key Responsibilities
Design, develop, and maintain real‑time and batch data pipelines using AWS services such as Kinesis, Glue, Redshift, and S3.
Build and evolve scalable data platforms, ensuring high availability, performance, and cost efficiency.
Implement data governance, quality, and automation frameworks to support analytics, AI, and machine learning workloads.
Collaborate with data scientists, analysts, and stakeholders to translate business requirements into robust data solutions.
Monitor, troubleshoot, and optimize pipeline performance, applying best practices for security and compliance.
Requirements
Proven experience as a Data Engineer with deep knowledge of AWS data services.
Strong programming skills in Python or similar, with experience in ETL tooling and workflow orchestration.
Hands‑on experience with data governance, quality frameworks, and automation pipelines.
Familiarity with machine learning data pipelines and model deployment workflows.
Excellent problem‑solving skills and ability to work in a fast‑paced, regulated environment.