remote
Senior Cloud Operations Engineer / Team Lead - Loftware
Systems Engineer
Lead a global 24/7 Cloud Operations team, driving reliability and automation across AWS and Azure environments. Hands‑on engineering with Linux/Windows, Kubernetes, and CI/CD pipelines, while mentoring staff and shaping best practices.
About the role
Key Responsibilities
- Lead and mentor a distributed Cloud Operations team, ensuring 24/7 service availability across AWS and Azure platforms.
- Design, implement, and maintain highly available, secure, and scalable infrastructure using Linux, Windows, Kubernetes, and container orchestration.
- Develop and enforce CI/CD pipelines, automation scripts (Python, Bash), and monitoring solutions to accelerate deployment cycles and reduce incidents.
- Collaborate with development, security, and product teams to define operational requirements, incident response plans, and post‑mortem processes.
- Drive continuous improvement initiatives, including cost optimization, performance tuning, and documentation of best practices.
Requirements
- 10+ years of experience in cloud operations, with deep expertise in AWS and Azure.
- Proven leadership in a high‑availability, 24/7 environment, managing cross‑functional teams.
- Strong scripting skills in Python and Bash, and hands‑on experience with Kubernetes and CI/CD tools.
- Excellent troubleshooting, incident management, and communication skills.
- Relevant certifications (e.g., AWS Solutions Architect, Azure Solutions Architect, Kubernetes Administrator) are a plus.
Skills
awsazurelinuxkubernetespythonbashcicd