remote
Senior Machine Learning Engineer, Agentic Systems - ServiceNow
ML Engineer
Lead the design and deployment of agentic AI solutions, building large‑language‑model pipelines and scalable inference services on cloud infrastructure using Python, PyTorch/TensorFlow, and container orchestration.
About the role
Key Responsibilities
- Architect, develop, and optimize end‑to‑end machine‑learning pipelines for agentic AI assistants, from data ingestion to model serving.
- Design and implement large language model (LLM) fine‑tuning, prompt engineering, and retrieval‑augmented generation techniques.
- Build scalable, low‑latency inference services on AWS using Kubernetes, Docker, and serverless components.
- Collaborate with product, UX, and infrastructure teams to integrate AI capabilities into enterprise workflows.
- Research and prototype novel ML approaches (e.g., reinforcement learning, multimodal models) to improve proactive AI behavior.
Requirements
- 5+ years of professional experience in machine learning engineering, with deep expertise in Python and frameworks such as PyTorch or TensorFlow.
- Hands‑on experience building, fine‑tuning, and deploying large language models in production.
- Strong background in cloud platforms (AWS), containerization (Docker), and container orchestration (Kubernetes).
- Proven ability to design scalable, high‑throughput ML systems and troubleshoot performance bottlenecks.
- Excellent problem‑solving skills and a track record of delivering AI solutions that reach enterprise users at scale.
Skills
Python, PyTorch, TensorFlow, AWS, Kubernetes