remote
Cloud Operations Architecture Lead - Loftware, Inc.
Systems Engineer
Lead cloud operations architecture, driving scalable, reliable services on AWS and Kubernetes, while managing incidents, optimizing infrastructure, and mentoring a high‑performing team.
About the role
Key Responsibilities
- Serve as the primary escalation point for complex cloud incidents, ensuring rapid resolution and post‑mortem analysis.
- Design and implement scalable, highly available infrastructure on AWS and Kubernetes, aligning with product and architecture teams.
- Lead continuous improvement of CI/CD pipelines, monitoring, and automation to enhance operational excellence.
- Mentor and coach engineers, fostering a culture of ownership, collaboration, and professional growth.
- Collaborate with cross‑functional stakeholders to define service level objectives, capacity planning, and cost optimization strategies.
Requirements
- 10+ years of experience in cloud operations, with deep expertise in AWS and Kubernetes.
- Proven track record of leading incident response and driving post‑incident improvements.
- Strong architectural skills, including designing resilient, scalable systems and defining best practices.
- Excellent communication and leadership abilities, capable of guiding technical teams and influencing stakeholders.
- Hands‑on experience with CI/CD tooling, monitoring, and automation frameworks.