remote
Site Reliability Engineer I - Mastercard
Site Reliability Engineer
Entry‑level Site Reliability Engineer focused on building and maintaining scalable, highly available services using Linux, Kubernetes, AWS, Terraform, and Python.
About the role
Key Responsibilities
- Design, implement, and operate reliable cloud‑native services on AWS.
- Automate infrastructure provisioning and configuration management using Terraform.
- Develop monitoring, alerting, and incident response processes with tools such as Prometheus and Grafana.
- Collaborate with development teams to improve application performance, scalability, and resilience.
- Participate in on‑call rotations, troubleshoot production issues, and drive root‑cause analysis.
Requirements
- Strong foundation in Linux system administration and networking.
- Hands‑on experience with container orchestration platforms, preferably Kubernetes.
- Familiarity with cloud services (AWS) and infrastructure‑as‑code tools (Terraform).
- Proficiency in scripting or programming languages, such as Python.
- Understanding of monitoring, logging, and incident management best practices.
Skills
linuxkubernetesawsterraformpython