onsite
Generative AI Developer Specialist - Hexaware Technologies
AI Engineer
Seasoned engineer needed to design, build, and operate a secure, scalable Generative AI platform on AWS, handling LLM inference, retrieval‑augmented generation, agents, and governance for production workloads.
About the role
Key Responsibilities
- Design and implement a cost‑efficient, enterprise‑grade Generative AI platform on AWS, supporting production LLM applications in regulated environments.
- Own the end‑to‑end platform lifecycle: architecture, deployment, operations, monitoring, incident response, and continuous performance tuning.
- Develop and manage AWS Bedrock‑based solutions for LLM inference, Retrieval‑Augmented Generation (RAG), autonomous agents, and AI guardrails, ensuring high availability and SLA compliance.
- Establish robust operational and governance frameworks, including observability, alerting, root‑cause analysis, security controls, and access management.
- Collaborate with cross‑functional teams to integrate AI services into business workflows while maintaining compliance and data privacy.
Requirements
- 8+ years of professional software engineering experience, with a focus on cloud platforms and AI/ML services.
- Deep expertise in AWS services (especially Bedrock, SageMaker, Lambda, CloudFormation) and cloud‑native architecture patterns.
- Strong background in building, deploying, and operating Large Language Model workloads, including RAG and agent frameworks.
- Proven skills in DevOps practices: CI/CD pipelines, infrastructure‑as‑code, monitoring, and incident management.
- Solid understanding of security, governance, and compliance requirements for regulated enterprise environments.