onsite
Generative AI Engineer - Apertera
AI Engineer
Lead the design and deployment of large‑language‑model solutions, fine‑tuning state‑of‑the‑art LLMs for high‑stakes content, while building scalable pipelines on AWS and Docker.
About the role
Key Responsibilities
- Design, train, and fine‑tune large language models (LLMs) using PyTorch or TensorFlow to meet client‑specific language and compliance requirements.
- Develop and maintain end‑to‑end AI pipelines, including data ingestion, preprocessing, model serving, and monitoring on AWS infrastructure.
- Implement prompt‑engineering strategies to optimize model outputs for legal, financial, and regulatory domains.
- Collaborate with cross‑functional teams to integrate AI solutions into production workflows, ensuring scalability, security, and compliance.
- Document model architectures, training procedures, and performance metrics for internal knowledge sharing and client reporting.
Requirements
- 3+ years of experience in generative AI or NLP engineering, with a strong portfolio of LLM projects.
- Proficiency in Python, PyTorch/TensorFlow, and experience with prompt‑engineering techniques.
- Hands‑on experience deploying models on AWS (SageMaker, ECS, Lambda) and containerizing with Docker.
- Strong analytical skills, ability to troubleshoot model performance and data quality issues.
- Excellent communication skills and a collaborative mindset for working with multidisciplinary teams.
Skills
pythonpytorchawsdocker