remote
Compliance Engineering Site Reliability Engineer Associate - Goldman Sachs
Site Reliability Engineer
The Associate SRE in Compliance Engineering builds and operates scalable, secure platforms that detect and mitigate regulatory risk using cloud-native technologies, automation, and observability tools.
About the role
Key Responsibilities
- Design, develop, and maintain highly available services that support compliance monitoring and risk mitigation.
- Implement infrastructure-as-code using Terraform and automate deployment pipelines on AWS.
- Containerize applications and manage workloads with Kubernetes, ensuring reliability and performance at scale.
- Develop monitoring, alerting, and incident response processes using Prometheus, Grafana, and related tools.
- Collaborate with security, data, and product teams to embed compliance controls into the software development lifecycle.
Requirements
- Strong programming experience in Python or Java and a solid understanding of Linux systems.
- Hands-on experience with cloud platforms (AWS) and container orchestration (Kubernetes).
- Proficiency in infrastructure-as-code (Terraform) and CI/CD pipelines.
- Knowledge of monitoring, logging, and observability frameworks (Prometheus, Grafana, ELK).
- Ability to troubleshoot complex distributed systems and implement automated remediation.
Skills
Python, Java, Kubernetes, AWS, Terraform, Linux, Prometheus