onsite
LLM / AI Engineer - Programming.com
AI Engineer
We are seeking an experienced LLM/AI Engineer to design, build, and deploy large language model solutions on a cloud‑native enterprise AI platform, leveraging Python, deep‑learning frameworks, and AWS services.
About the role
Key Responsibilities
- Design, develop, and fine‑tune Large Language Models (LLMs) for enterprise financial use‑cases.
- Implement prompt‑engineering strategies and retrieval‑augmented generation pipelines.
- Build scalable, production‑grade AI services using Python, PyTorch/TensorFlow, and AWS (SageMaker, Lambda, EKS).
- Collaborate with data engineers and domain experts to integrate structured financial data into LLM workflows.
- Monitor model performance, conduct A/B testing, and iterate to improve accuracy, latency, and cost efficiency.
Requirements
- 5–8 years of hands‑on experience in AI/ML engineering, with a focus on LLMs or NLP.
- Strong proficiency in Python and deep‑learning frameworks such as PyTorch or TensorFlow.
- Experience deploying models on cloud platforms, preferably AWS (SageMaker, ECS/EKS, Lambda).
- Solid understanding of prompt engineering, retrieval‑augmented generation, and model optimization techniques.
- Track record of delivering production‑grade AI solutions in a fast‑moving, data‑intensive environment.
Skills
pythonpytorchtensorflowaws