remote
Network DevOps Engineer, RDMA Fabric Automation - Vultr
Devops Engineer
Lead the design, deployment, and automation of RDMA fabric solutions across a global cloud platform, leveraging DevOps practices to ensure high-performance, scalable networking for AI and enterprise workloads.
About the role
Key Responsibilities
- Architect and implement RDMA fabric infrastructure across multiple data centers, ensuring low latency and high throughput for cloud services.
- Develop and maintain automation pipelines (CI/CD) for network configuration, monitoring, and troubleshooting using IaC tools.
- Collaborate with platform, security, and reliability teams to integrate network automation into the broader cloud stack.
- Analyze performance metrics, identify bottlenecks, and propose optimizations for RDMA-enabled workloads.
- Document network designs, operational procedures, and best practices for internal and external stakeholders.
Requirements
- 5+ years of experience in network engineering with a focus on high-performance computing or cloud networking.
- Deep knowledge of RDMA, InfiniBand, and related fabric technologies.
- Proficiency in DevOps tools (Git, Jenkins, Terraform, Ansible) and scripting (Python, Bash).
- Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes).
- Strong analytical skills and a track record of automating complex network operations.
Skills
pythongorustphpansiblelinuxjenkinsgithub actions