Remote / On-site
Senior Cloud Operations Engineer - Pegasystems
Systems Engineer
Senior Cloud Operations Engineer responsible for the reliability of global cloud services built on AWS, Azure, Kubernetes, and CI/CD pipelines, bringing strong scripting and monitoring expertise.
About the role
Key Responsibilities
- Design, deploy, and maintain highly available cloud infrastructure across AWS and Azure environments.
- Implement and manage Kubernetes clusters, ensuring scalability, security, and performance.
- Develop and maintain CI/CD pipelines for automated application delivery and infrastructure as code.
- Monitor system health, troubleshoot incidents, and conduct root‑cause analysis to improve reliability.
- Collaborate with cross‑functional teams to integrate new services and optimize existing workloads.
Requirements
- 5+ years of experience in cloud operations or site reliability engineering.
- Proficiency with AWS, Azure, and Kubernetes, including IaC tools like Terraform or ARM templates.
- Strong scripting skills in Python or PowerShell for automation.
- Experience with monitoring and alerting platforms (Prometheus, Grafana, Datadog).
- Excellent problem‑solving skills and ability to work in a 24x7 global team.
Skills
AWS, Azure, Kubernetes, CI/CD, Python