Keshav Kothapalli

AI Engineer

https://www.opentalent.in/keshav-kothapalli

AI Engineer with 6+ years in LLM-powered Applications & Agentic AI Workflows

Freelance AI Engineer

Key Strengths

Extensive experience in designing, building, and deploying LLM-powered applications, agentic AI workflows, and RAG pipelines.
Proficient in multi-agent orchestration using LangChain, LangGraph, and OpenAI/Anthropic APIs.
Strong background in MLOps, including model training, serving, versioning, fine-tuning (LoRA/QLoRA), and CI/CD for ML.
Demonstrated ability to translate complex business problems into production-ready AI systems across healthcare, fintech, and recruitment.
Deep understanding of LLM evaluation frameworks (RAGAS, LangSmith) and telemetry for monitoring AI system performance.
Experience with privacy-first AI applications and custom Model Context Protocol (MCP) servers.

Cultural & Operational Fit

Cultural Fit Analysis

The candidate's project diversity, spanning document Q&A, autonomous agents, real-time speech processing, multi-agent systems, and LLM evaluation, indicates a broad interest in AI applications. Their experience in healthcare, fintech, and recruitment domains shows adaptability and a willingness to tackle varied challenges. The focus on building production-ready, scalable, and privacy-first solutions aligns with a culture that values innovation, reliability, and ethical AI development. The freelance experience also suggests self-motivation and initiative.

Soft Skills & Operational Fit

The candidate's resume highlights full system ownership, cross-functional collaboration, and technical documentation, indicating a strong operational fit. The project descriptions demonstrate a proactive approach to problem-solving and a focus on delivering measurable impact, which aligns well with senior engineering roles. The diverse project portfolio suggests adaptability and a strong learning curve.

About

AI/ML Engineer with 6+ years of experience designing and deploying intelligent systems across healthcare, fintech, and recruitment domains. Specialises in LLM-powered application development, agentic AI workflows, RAG pipelines, and multi-agent orchestration using LangChain, LangGraph, and OpenAI/Anthropic APIs. Proven ability to translate complex business problems into production-ready AI systems, from rapid prototyping to scalable cloud deployment on AWS. Passionate about leveraging cutting-edge AI to automate workflows, surface insights, and deliver measurable impact in real-world environments.

Top Skills

RAG Pipelines, Prompt Engineering, LangGraph, WebSockets

Skills

Text Classification, Clustering, Relation Extraction, Sentiment Analysis, Information Extraction, Transformers, Hugging Face, spaCy, scikit-learn, OpenAI API, Anthropic API, Gemini API, LangChain, LangGraph, LlamaIndex, CrewAI, RAG Pipelines, Prompt Engineering, Agentic Workflows, Function Calling, LangSmith, MCP, MLflow, Automated Retraining, Drift Monitoring, PEFT, Pinecone, ChromaDB, Elasticsearch, Semantic Search, OpenCV, YOLO, Object Detection, OCR, Multimodal AI, Python, FastAPI, Flask, SQL, JavaScript, Pandas, NumPy, PyTorch, Azure, Docker, GitHub Actions, CI/CD, Git, PostgreSQL, MySQL, MongoDB, Redis, Cross-Functional Collaboration, Technical Documentation, WebSockets, LLMs, OpenTelemetry, Ollama, Angular

Experience

Freelance AI Engineer

June 1, 2025 – May 31, 2026

Hyderābād, Telangana, India

Saven Technologies

Software Engineer

December 1, 2023 – May 31, 2025

Hyderābād, Telangana, India

Sids Farm

Freelance Software Engineer

February 1, 2021 – November 30, 2023

India

Cure.fit

Software Engineer

June 1, 2020 – January 31, 2021

Bengaluru, Karnataka, India

Cure.fit

Software Engineer Intern

May 1, 2019 – July 31, 2019

Bengaluru, Karnataka, India

Projects

Document Q&A System over PDFs

June 24, 2026 – Present

Built an end-to-end document Q&A system ingesting PDFs and docs, embedding content into Pinecone, and answering natural language queries with source attribution via a FastAPI REST endpoint. Implemented hybrid search, re-ranking, and metadata filtering; evaluated with RAGAS for context precision, recall, and answer faithfulness in production.

Autonomous Agent with Tool Use

June 24, 2026 – Present

Built a fully autonomous LLM agent dynamically selecting and invoking tools (web search, code execution, external APIs) to complete multi-step tasks using ReAct-style reasoning loops with error recovery. Exposed agent capabilities via a FastAPI streaming endpoint; implemented structured tool outputs and graceful failure handling for production reliability.

Real-Time Speech-to-Text + Summarization Pipeline

June 24, 2026 – Present

Built a real-time audio pipeline that streams microphone input over WebSockets, transcribes it with Whisper, and generates LLM summaries with end-to-end latency under 3 seconds. Handled chunked audio streaming, speaker turn detection, and incremental summarisation for meetings, lectures, and interviews.

Multi-Agent Collaboration System

June 24, 2026 – Present

Architected a multi-agent system with specialised sub-agents (researcher, planner, coder, reviewer) orchestrated by a supervisor — addressing agent loops, context overflow, and conflicting outputs. Benchmarked against single-agent approaches, demonstrating measurable improvements in task completion and output quality for complex research and code generation tasks.

LLM Evaluation Framework

June 24, 2026 – Present

Built a custom evaluation framework to benchmark LLMs (GPT-4, Mistral, Llama 3) on domain-specific tasks, measuring accuracy, hallucination rate, latency, and cost across standardised test sets. Integrated RAGAS for RAG-specific metrics and LangSmith for end-to-end tracing; surfaced results in a structured dashboard for data-driven model selection.

MCP Server & Offline Privacy-First AI App

June 24, 2026 – Present

Designed a custom MCP server exposing domain-specific tools (file I/O, DB queries, API calls) to Claude for structured tool use in agentic workflows. Built a fully local AI application using Ollama + Llama 3 with on-device RAG (nomic-embed-text + ChromaDB); no data leaves the machine, making it viable for healthcare, legal, and enterprise compliance use cases.

Fine-Tuning Pipeline with LoRA/QLoRA

June 24, 2026 – Present

Built a reusable fine-tuning pipeline for instruction-tuning LLMs on domain-specific datasets using QLoRA — 4x GPU memory reduction vs full fine-tuning while retaining model quality. Covered full pipeline: data cleaning, instruction formatting, supervised fine-tuning with PEFT/LoRA, checkpoint management, and evaluation against held-out test sets.

RAG Pipeline with Evaluation & Telemetry

June 24, 2026 – Present

Built a production-grade RAG pipeline with full observability — OpenTelemetry/LangSmith tracing for retrieval quality, latency, and answer faithfulness monitoring. Implemented RAGAS automated evaluation measuring context precision, recall, and answer relevancy to iteratively improve pipeline quality.

Full-Stack Web App - Reddit Clone

June 24, 2026 – Present

Designed and implemented RESTful APIs with FastAPI; the same API patterns apply directly to building LLM inference and embedding endpoints for GenAI products.

Distributed File-Sharing System - Mini Torrent

June 24, 2026 – Present

Built a P2P file-sharing system with parallel multi-piece downloads, demonstrating concurrency and systems-level Python programming skills.
