onsite

AI Architect - LLM - Syncreon Consulting

Research Engineer

Lead the design and deployment of cutting‑edge LLM solutions, leveraging Python, AWS, Docker and Kubernetes to build scalable, production‑ready AI services that drive business value.

About the role

Key Responsibilities

Architect and implement large language model pipelines from data ingestion to inference, ensuring performance, scalability and security.
Collaborate with data scientists and product teams to translate business requirements into robust ML solutions.
Design and maintain cloud‑native infrastructure on AWS, using Docker, Kubernetes and CI/CD pipelines for rapid deployment.
Optimize model inference latency and cost through quantization, pruning and efficient serving strategies.
Establish best practices for model governance, monitoring, and continuous improvement.

Requirements

5+ years of experience in AI/ML engineering with a focus on LLMs and NLP.
Proficiency in Python, PyTorch/TensorFlow, and experience with Hugging Face Transformers.
Strong background in AWS services (SageMaker, ECS/EKS, Lambda) and container orchestration.
Hands‑on experience with Docker, Kubernetes, and CI/CD tooling.
Excellent problem‑solving skills and ability to communicate complex concepts to non‑technical stakeholders.

Skills

NLP, machine learning, Python, AWS, Docker, Kubernetes

Company: Syncreon Consulting

Department: Research

Location: Irvine, CA, United States

Experience: 7+ years

Tenure: full-time

Level: Lead

Posted June 19, 2026