Site Reliability Engineer
Site Reliability Engineer II focused on building and operating AI infrastructure on Akamai’s Inference Cloud platform, leveraging Linux, automation, monitoring, incident response, and SRE best practices to ensure high availability and performance.
Are you passionate about cutting-edge AI infrastructure?
Do you want to build your SRE career on one of the most exciting platforms in cloud computing?
Join the Akamai Inference Cloud Team
The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, implement, deploy and operate AI platforms that enable customers to run inference models and developers to create AI applications
Partner with the best
As an SRE II, responsibilities include automation, monitoring, incident response, and working collaboratively with skilled team members. Candidates should possess expertise in Linux systems, automation, and SRE practices. Daily activities involve coding, improving dashboards, enhancing alerts, and minimizing repetitive tasks. Opportunities exist to focus on GPU infrastructure, Kubernetes, and ensuring reliability for AI workloads within Akamai's serverless inference platform.
As a Site Reliability Engineer II, you will be responsible for:
Do what you love
To be successful in this role you will:
Work in a way that works for you
FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know fle
Posted June 26, 2026