onsite
Engineering Manager, Workload Management - Graphcore
Engineering Manager
Lead and scale AI DevOps teams, architecting Kubernetes-based pipelines and cloud solutions to deliver high‑performance AI workloads. Drive engineering excellence, mentorship, and cross‑functional collaboration in a fast‑paced environment.
About the role
Key Responsibilities
- Lead a multidisciplinary engineering team focused on AI workload management, ensuring delivery of scalable, reliable, and secure solutions.
- Design, implement, and maintain Kubernetes clusters and CI/CD pipelines that support rapid AI model deployment and experimentation.
- Collaborate with data scientists, product managers, and infrastructure teams to translate business requirements into robust, production‑ready architectures.
- Mentor engineers, fostering a culture of continuous improvement, code quality, and knowledge sharing.
- Drive performance optimization, cost efficiency, and operational excellence across cloud and on‑prem environments.
Requirements
- 5+ years of experience in AI/ML engineering or DevOps, with a strong background in Kubernetes and cloud platforms.
- Proven track record of managing and scaling engineering teams in a fast‑moving tech environment.
- Hands‑on expertise in CI/CD tooling, container orchestration, and infrastructure automation.
- Excellent communication skills and the ability to explain complex technical concepts to non‑technical stakeholders.
- Passion for emerging AI technologies and a commitment to building high‑quality, scalable systems.