remote
Azure Platform Engineer - Nebius
DevOps Engineer
Azure Platform Engineer driving cloud infrastructure for AI workloads, focusing on Kubernetes orchestration, Terraform automation, and CI/CD pipelines to deliver scalable, GPU‑enabled services on Azure.
About the role
Key Responsibilities
- Design, implement, and maintain Azure-based infrastructure for large‑scale AI workloads, ensuring high availability and performance.
- Build and manage Kubernetes clusters, including GPU node pools, autoscaling, and networking configurations.
- Develop and maintain Terraform modules for reproducible, versioned infrastructure deployments.
- Integrate CI/CD pipelines (GitHub Actions, Azure DevOps) to automate build, test, and deployment of AI services.
- Collaborate with data scientists and ML engineers to optimize inference pipelines and resource utilization.
- Monitor and troubleshoot the platform, and optimize its cost, security, and compliance posture.
Requirements
- 3+ years of experience with Azure cloud services (AKS, Azure Container Registry, Azure Monitor).
- Proficiency in Kubernetes administration and GPU scheduling.
- Hands‑on experience with Terraform and IaC best practices.
- Strong scripting skills in Python and Bash.
- Familiarity with CI/CD tooling and container registry management.
Skills
Azure, Kubernetes, Terraform, CI/CD, Python