onsite
Senior Software Engineer - AI/ML - GeekyAnts
Software Engineer
Lead the design, optimization, and deployment of advanced AI systems, from data pipelines to production‑grade model serving, using Python, PyTorch, and LLMs while driving innovation in RAG, robotics, and inference optimization.
About the role
Key Responsibilities
- Architect and deploy end‑to‑end AI systems, including data pipelines, model training, and scalable serving infrastructure.
- Design modular SDKs for multi‑provider AI integration, enabling seamless use of LLMs, vision, and speech models across platforms.
- Lead the fine‑tuning, evaluation, and continuous improvement of large language models and retrieval‑augmented generation (RAG) pipelines.
- Drive inference optimization for real‑time robotics and IoT applications, ensuring low latency and high throughput.
- Mentor junior engineers, conduct code reviews, and establish best practices for model explainability and reliability.
Requirements
- 5–7 years of experience in AI/ML engineering with a strong background in deep learning frameworks (PyTorch/TensorFlow).
- Proven expertise in LLMs, RAG, and deploying models at scale.
- Hands‑on experience with inference optimization techniques (quantization, pruning, ONNX, TensorRT).
- Solid understanding of robotics or IoT AI use cases and real‑time constraints.
- Excellent communication skills and a track record of mentoring junior talent.
Skills
pythonpytorchllmrag