onsite
Lead AI Engineer - Generative AI Platform & LLM Orchestration - Capital One
AI Engineer
Lead the design and delivery of a generative AI platform, building agentic AI services and LLM orchestration pipelines using Python, cloud infrastructure, and modern MLOps practices.
About the role
Key Responsibilities
- Architect and implement a scalable generative AI platform that supports agentic AI workflows and large language model (LLM) orchestration.
- Design, develop, and optimize core AI services using Python, PyTorch, and TensorFlow, ensuring high performance and reliability.
- Lead the deployment of AI workloads on Kubernetes and AWS, establishing robust CI/CD and MLOps pipelines.
- Collaborate with cross‑functional product and data science teams to translate business requirements into AI solutions.
- Establish best practices for model versioning, monitoring, security, and responsible AI governance.
Requirements
- 5+ years of hands‑on experience building and scaling AI/ML systems, preferably with LLMs or generative models.
- Strong proficiency in Python and deep‑learning frameworks such as PyTorch or TensorFlow.
- Extensive experience with container orchestration (Kubernetes) and cloud platforms (AWS).
- Demonstrated ability to design end‑to‑end MLOps pipelines, including CI/CD, monitoring, and automated testing.
- Excellent problem‑solving skills and a track record of leading technical teams in fast‑paced environments.
Skills
pythonpytorchtensorflowkubernetesawsmlops