onsite
AI Architect - LLM - Syncreon Consulting
Research Engineer
Lead the design and deployment of cutting‑edge LLM solutions, leveraging Python, AWS, Docker and Kubernetes to build scalable, production‑ready AI services that drive business value.
About the role
Key Responsibilities
- Architect and implement large language model pipelines from data ingestion to inference, ensuring performance, scalability and security.
- Collaborate with data scientists and product teams to translate business requirements into robust ML solutions.
- Design and maintain cloud‑native infrastructure on AWS, using Docker, Kubernetes and CI/CD pipelines for rapid deployment.
- Optimize model inference latency and cost through quantization, pruning and efficient serving strategies.
- Establish best practices for model governance, monitoring, and continuous improvement.
Requirements
- 5+ years of experience in AI/ML engineering with a focus on LLMs and NLP.
- Proficiency in Python, PyTorch/TensorFlow, and experience with Hugging Face Transformers.
- Strong background in AWS services (SageMaker, ECS/EKS, Lambda) and container orchestration.
- Hands‑on experience with Docker, Kubernetes, and CI/CD tooling.
- Excellent problem‑solving skills and ability to communicate complex concepts to non‑technical stakeholders.
Skills
nlpmachine learningpythonawsdockerkubernetes