Our client, a leading enterprise enterprise, is seeking a Senior AI Engineer specializing in Agentic Workflows and LLM Integration . This specialized engineering role sits at the cutting edge of AI innovation, commanding an organization-wide mandate to design, deploy, and own multi-step autonomous agent systems. The successful candidate will build robust backend infrastructures, orchestrate tool-calling logic, and manage advanced retrieval architectures to deliver resilient, production-grade AI applications within an enterprise framework.
Key Accountabilities
AI Architecture & Agent Workflow Engineering
- Agent Execution Patterns: Design, build, and test highly complex, multi-step agent workflows utilizing established advanced architectural design patterns such as ReAct, planner-executor, and complex tool-chaining.
- LLM Core Integration: Integrate flagship Large Language Models (including Anthropic Claude and OpenAI) with legacy enterprise APIs and internal microservices, engineering robust fault tolerance for retries, edge cases, and degraded ecosystem states.
- Orchestration & Failure Resilience: Implement programmatic tool calling, function orchestration pipelines, and automated compensating actions to guarantee agent workflows remain stable under catastrophic or unexpected third-party failure conditions.
- Human-in-the-Loop Controls: Architect and deploy conditional human-in-the-loop validation frameworks, including automated executive approvals, smart escalations, and exception-handling logic mandated by business governance or risk considerations.
Data Engineering, Prompts & Retrieval (RAG)
- Context & Memory Architecture: Build and manage advanced agent memory retention layers and data retrieval mechanisms utilizing vector databases and Retrieval-Augmented Generation (RAG), tuning indexing schemas to ensure relevant context.
- Prompt Management: Develop, maintain, optimize, and version-control complex prompt logic, semantic routing rules, and supporting technical documentation in accordance with strict enterprise engineering standards.
Production Deployment, Security & Observability
- Cloud Operations: Deploy mission-critical AI services into production cloud environments, actively monitoring logs, distributed traces, and telemetry metrics to rapidly isolate and patch behavioral anomalies.
- Enterprise Governance: Ensure all deployed solutions strictly mirror enterprise-grade security controls, identity management requirements, and rigorous data governance protocols.
- Reliability Engineering: Partner with QA and Core Operations teams to continually upgrade automated test coverage, build out operational runbooks, establish incident response protocols, and drive system performance optimizations.
Requirements
Education & Experience<