onsite
Site Reliability Engineer I - Allegiant Air
Site Reliability Engineer
Entry‑level Site Reliability Engineer focused on designing, maintaining, and scaling on‑premises and cloud systems, driving cloud transformation, and ensuring high availability using AWS, Kubernetes, and automation tools.
About the role
Key Responsibilities
- Design, implement, and operate highly available services across on‑premises and AWS environments.
- Develop automation scripts and infrastructure‑as‑code using Python, Terraform, and CI/CD pipelines.
- Monitor system performance, troubleshoot incidents, and drive root‑cause analysis to improve reliability.
- Collaborate with development and operations teams to embed SRE best practices and cloud‑native patterns.
- Participate in cloud migration initiatives, ensuring seamless transition and scalability.
Requirements
- Strong Linux administration skills and experience with scripting (Python preferred).
- Hands‑on experience with AWS services (EC2, S3, RDS, etc.) and container orchestration using Kubernetes.
- Familiarity with infrastructure‑as‑code tools such as Terraform or CloudFormation.
- Understanding of monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, ELK).
- Ability to work independently and as part of a team, with solid problem‑solving and communication skills.
Skills
pythonlinuxawskubernetesterraform