onsite
Gen AI/Python Engineer - Assistant Vice President - Citi
Software Engineer
Lead the design and deployment of generative AI solutions, leveraging LLMs, prompt engineering, and fine‑tuning techniques in Python. Drive end‑to‑end AI workflows from research to production using PyTorch/TensorFlow and containerized deployments.
About the role
Key Responsibilities
- Architect and implement generative AI models, applying LLMs, prompt engineering, and fine‑tuning methods such as LoRA and QLoRA.
- Develop and maintain RAG systems, integrating hybrid search techniques to enhance retrieval‑augmented generation.
- Collaborate with data scientists and software engineers to translate research prototypes into production‑ready services using PyTorch or TensorFlow.
- Containerize AI workloads with Docker, ensuring scalable and reproducible deployments across cloud environments.
- Monitor model performance, conduct A/B testing, and iterate on prompts and architectures to meet business objectives.
Requirements
- Strong proficiency in Python and experience with major ML frameworks (PyTorch, TensorFlow, Keras).
- Hands‑on experience with large language models, prompt engineering, and fine‑tuning techniques.
- Knowledge of RAG architectures and hybrid search mechanisms.
- Experience containerizing AI services with Docker and deploying to cloud platforms.
- Excellent problem‑solving skills and ability to communicate complex ideas to cross‑functional teams.
Skills
pythonpytorchtensorflowdocker