remote
Model Serving Engineer - BV Teck
Software Engineer
Model Serving Engineer responsible for designing, deploying, and scaling machine‑learning inference services using Python, TensorFlow Serving, Docker, Kubernetes, and AWS to deliver low‑latency, reliable APIs for business applications.
About the role
Key Responsibilities
- Design and implement scalable model serving pipelines using TensorFlow Serving, TorchServe, or similar frameworks.
- Containerize inference services with Docker and orchestrate deployments on Kubernetes clusters.
- Develop and maintain RESTful APIs that expose model predictions to internal and external applications.
- Integrate serving solutions with cloud platforms (AWS) and implement CI/CD pipelines for automated rollout.
- Monitor performance, latency, and resource utilization; troubleshoot and optimize models in production.
Requirements
- 3+ years of professional experience in Python development and machine‑learning model deployment.
- Hands‑on expertise with containerization (Docker) and orchestration (Kubernetes).
- Strong knowledge of cloud services, preferably AWS (EKS, S3, SageMaker).
- Experience building and maintaining RESTful APIs for model inference.
- Familiarity with monitoring tools (Prometheus, Grafana) and performance tuning of serving systems.
Skills
pythondockerkubernetesaws