remote
Technical Project Manager / IT Infrastructure Engineer - Nebius
Devops Engineer
Lead cross‑functional teams to design, deploy, and operate large‑scale AI cloud infrastructure, focusing on compute, storage, networking, and automation using Kubernetes, AWS, and CI/CD pipelines.
About the role
Key Responsibilities
- Plan, coordinate, and deliver complex infrastructure projects for AI workloads, ensuring alignment with product roadmaps and business goals.
- Architect, provision, and maintain scalable cloud environments on AWS, leveraging Kubernetes, Terraform, and container orchestration.
- Implement CI/CD pipelines and automation frameworks to accelerate deployment and reduce operational overhead.
- Collaborate with software engineers, data scientists, and security teams to define requirements, troubleshoot issues, and optimize performance.
- Monitor system health, conduct capacity planning, and drive continuous improvement of reliability and cost efficiency.
Requirements
- 5+ years of experience managing large‑scale cloud infrastructure projects, preferably in AI/ML environments.
- Strong hands‑on expertise with AWS services, Kubernetes, Linux systems, and infrastructure‑as‑code tools (e.g., Terraform, CloudFormation).
- Proven ability to lead cross‑functional teams, communicate technical concepts to stakeholders, and manage timelines and budgets.
- Experience implementing CI/CD pipelines, monitoring, and observability solutions (e.g., Jenkins, GitLab CI, Prometheus).
- Solid understanding of networking, storage, and security best practices in cloud environments.
Skills
project managementkubernetesawscicdlinux