onsite
Lead/Senior Engineer AI/LLM Developer - Hirezy
Research Engineer
About the role
Lead the design, development, and deployment of large‑language‑model applications, optimizing performance and scalability across cloud platforms.
Key Responsibilities
- Architect and implement AI solutions using state‑of‑the‑art LLM technologies.
- Lead LLM integration, fine‑tuning, and deployment pipelines on cloud infrastructure.
- Optimize model inference for latency, throughput, and resource utilization.
- Build and maintain end‑to‑end data pipelines for training and inference.
- Collaborate with cross‑functional teams to translate business problems into AI products.
Requirements
- 5+ years of software engineering experience with a focus on AI/ML.
- Deep expertise in Python, PyTorch/TensorFlow, and LLM fine‑tuning.
- Proficiency with containerization (Docker) and orchestration (Kubernetes).
- Hands‑on experience deploying models on AWS or GCP.
- Strong understanding of NLP concepts and data pipeline engineering.
Skills
Python, PyTorch, Docker, Kubernetes, AWS, NLP