onsite
SRE - Apple
Site Reliability Engineer
We're looking for a Site Reliability Engineer focused on designing and building scalable technical solutions. This mid level role requires 3+ years of relevant experience.
About the role
SRE at the organization.
Key technologies: Kubernetes, Prometheus, Grafana.
Key Responsibilities
- Define and track SLOs, SLIs and error budgets
- Design and implement observability stacks (metrics, logging, tracing)
- Automate toil and improve system reliability through engineering
- Conduct post-mortems and drive blameless incident retrospectives
Requirements
- 3+ years of relevant experience in site reliability engineer
- Proficiency with monitoring tools (Prometheus, Grafana, Datadog)
- Strong programming skills for automation and tooling
Skills
kubernetesprometheusgrafana