remote
AI Architect Generative AI - Ipeople Infosysteams LLC
AI Engineer
Hands‑on AI Architect who transforms business challenges into production‑grade Generative AI solutions, designing APIs, microservices, and containerized workloads on cloud platforms.
About the role
Key Responsibilities
- Translate business objectives into technical roadmaps and deliver end‑to‑end Generative AI products with measurable impact.
- Design and implement robust APIs and microservices (REST/gRPC) that serve text and image generation workloads.
- Containerize AI services using Docker and orchestrate them on Kubernetes or AWS ECS/EKS for scalable production deployment.
- Collaborate with data scientists and product teams to integrate model training pipelines, monitoring, and continuous improvement processes.
- Ensure security, reliability, and performance of AI systems in cloud environments, applying best practices for CI/CD and observability.
Requirements
- 5+ years of experience building and operating production Generative AI solutions (text and image).
- Strong proficiency in Python and modern AI frameworks (e.g., PyTorch, TensorFlow, LangChain).
- Hands‑on expertise with Docker, Kubernetes, and AWS services (ECS, EKS, S3, IAM).
- Proven ability to design and expose AI functionality via REST and gRPC APIs.
- Experience with CI/CD pipelines, monitoring, and scaling AI workloads in cloud environments.
Skills
generative aipythondockerkubernetesawsgrpc