onsite
Staff Site Reliability Engineer - pura
Site Reliability Engineer
Lead the reliability and scalability of Pura’s smart‑home fragrance platform, driving automation, incident response, and performance at scale using Kubernetes, AWS, and advanced monitoring tools.
About the role
Key Responsibilities
- Architect, build, and maintain highly available, scalable infrastructure for Pura’s cloud‑native fragrance platform on AWS.
- Design and implement CI/CD pipelines, automated deployments, and blue‑green/rolling release strategies for microservices running in Kubernetes.
- Define and enforce reliability SLIs/SLOs, conduct post‑mortems, and lead blameless incident response to minimize downtime.
- Collaborate with product, security, and DevOps teams to integrate observability, logging, and alerting across the stack.
- Mentor and coach junior SREs, fostering a culture of continuous improvement and knowledge sharing.
Requirements
- 10+ years of production engineering experience, with 5+ years in a senior SRE or DevOps role.
- Deep expertise in Kubernetes, AWS services (EKS, ECS, RDS, S3, CloudWatch), and container orchestration.
- Proficient with CI/CD tools (GitHub Actions, ArgoCD, Jenkins) and infrastructure as code (Terraform, CloudFormation).
- Strong background in monitoring, alerting, and incident management (Prometheus, Grafana, PagerDuty).
- Excellent communication skills and a proven track record of mentoring technical teams.