onsite
Site Reliability Engineer 9 Month FTC - Evelyn Partners
Site Reliability Engineer
Join a fast‑moving wealth‑management team as a Site Reliability Engineer, building and operating cloud‑native services on AWS using Kubernetes, Terraform, Python and modern CI/CD pipelines.
About the role
Key Responsibilities
- Design, implement, and maintain highly available services on AWS using Kubernetes and Terraform.
- Develop automation scripts and tools in Python and Bash to improve reliability and reduce manual toil.
- Build and manage CI/CD pipelines for seamless code deployment and infrastructure changes.
- Monitor system health with Prometheus, Grafana, and alerting frameworks; respond to incidents and perform root‑cause analysis.
- Collaborate with development and security teams to embed reliability, performance, and compliance into the software lifecycle.
Requirements
- 3+ years of experience in site reliability or DevOps engineering, preferably in a financial services environment.
- Strong proficiency with Linux systems, Python scripting, and infrastructure‑as‑code tools such as Terraform.
- Hands‑on experience managing Kubernetes clusters and AWS services (EC2, RDS, S3, IAM).
- Familiarity with CI/CD platforms (Jenkins, GitLab CI, GitHub Actions) and monitoring solutions like Prometheus/Grafana.
- Excellent problem‑solving skills, ability to work under pressure, and strong communication with cross‑functional teams.
Skills
pythonlinuxkubernetesawsterraformcicdprometheus