onsite
Site Reliability Engineer - Reapit ANZ
Site Reliability Engineer
Drive reliability and scalability for a leading property tech platform, managing containerized services, cloud infrastructure, and automated monitoring using Kubernetes, Docker, AWS, Terraform, and Python.
About the role
Key Responsibilities
- Design, deploy, and maintain highly available, scalable services on Kubernetes clusters in AWS.
- Implement infrastructure as code with Terraform, ensuring repeatable and auditable deployments.
- Develop and maintain CI/CD pipelines, automating build, test, and release processes.
- Monitor application performance and infrastructure health, using Prometheus, Grafana, and custom alerts.
- Collaborate with development teams to embed reliability best practices into the software development lifecycle.
Requirements
- 3+ years of experience in site reliability or DevOps roles.
- Proficiency with Kubernetes, Docker, and AWS services (EKS, EC2, S3, CloudWatch).
- Hands‑on experience with Terraform and CI/CD tooling (GitHub Actions, Jenkins, ArgoCD).
- Strong scripting skills in Python or Bash for automation.
- Excellent problem‑solving skills and a proactive, collaborative mindset.
Skills
kubernetesdockerawsterraformpython