remote
Forward Deployed Engineer AI Inference - Red Hat, Inc.
Software Engineer
Senior Forward Deployed Engineer focused on AI inference workloads, building scalable backend services with Python and Go on Kubernetes, and driving DevOps excellence for production reliability.
About the role
Key Responsibilities
- Design, develop, and maintain high‑performance AI inference pipelines using Python and Go.
- Deploy and manage inference services on Kubernetes clusters, ensuring scalability and resilience.
- Collaborate with data scientists to optimize model performance and reduce latency.
- Implement CI/CD pipelines, automated testing, and monitoring for continuous delivery.
- Provide on‑call support and troubleshoot production incidents, driving root‑cause analysis.
Requirements
- 5+ years of backend development experience with Python and Go.
- Deep knowledge of Kubernetes architecture, Helm, and container orchestration.
- Hands‑on experience with AI/ML inference frameworks (e.g., TensorFlow, PyTorch, ONNX Runtime).
- Strong DevOps skills: CI/CD, GitOps, observability, and cloud infrastructure.
- Excellent problem‑solving, communication, and collaboration abilities.