About the Role
Oracle Cloud Infrastructure (OCI) blends the speed of a startup with the scale of an enterprise leader. Our Generative AI Solutions team builds advanced AI solutions that run on powerful cloud infrastructure tackling real-world, global challenges. As part of this team, you'll contribute to large-scale cloud solutions utilizing cutting-edge machine learning and Generative AI technologies, aimed at addressing complex global challenges.
Join our OCI Gen-AI Solutions team to build cutting-edge AI applications that tackle global challenges. We’re seeking an experienced Principal Machine Learning Engineer (IC4) to design, develop, and deploy customized Generative AI solutions for strategic customers focusing on Agentic solutions and Retrieval Augmented Generation (RAG). You’ll work closely with applied scientists and product managers to turn innovation into real-world impact.
Responsibilities
- Design, build, and deploy cutting-edge machine learning and generative AI solutions, with a focus on Large Language Models (LLMs), AI agents, Retrieval-Augmented Generation (RAG), and large-scale search.
- Collaborate with scientists, engineers, and product teams to turn complex problems into scalable, cloud-ready AI solutions for enterprises.
- Run experiments, explore new algorithms, and push the boundaries of AI to optimize performance, customer experience, and business outcomes.
- Ensure ethical and responsible AI practices in all solutions.
- Work directly with key customers and accompany them on their Gen AI journey – understanding their requirements, help them envision and design and build the right solutions and work together with their ML engineering to remove blockers.
- Design and implement agentic workflows using diverse frameworks, incorporating prompt engineering to optimize performance.
- Dive deep into model structure to optimize model performance and scalability.
- Build state of art solutions with brand new technologies in this fast-evolving area.
- Configure large scale OpenSearch clusters, setting up data ingestion pipelines to get the data into the OpenSearch.
- Diagnose, troubleshoot, and resolve issues in AI model training and serving.
- Build re-usable solution patterns and reference solutions / showcases that can apply across multiple customers.
- Be an enthusiastic, self-motivated, and a great collaborator.
- Be our product evangelist - engage directly with customers and partners, participate and present in external events and conferences, etc.
Qualifications
- Bachelor’s or Master’s degree in Computer Science or related technical field, with 10+ years of experience in AI, ML, or data-driven solution development.
- Proven track record designing, building, and deploying scalable AI/ML solutions in production environments.
- Deep expertise in Large Language Models (LLMs), Generative AI, Agentic solutions, and advanced ML techniques (fine-tuning, prompt engineering, model optimization).
- Strong experience with OpenSearch, vector databases, data ingestion pipelines, and large-scale search optimization.
- Skilled in diagnosing, troubleshooting, and resolving issues in AI model training and serving.
- Hands-on experience with MCP, NLP, NLU, RAG architectures, Agents, and modern AI frameworks (e.g., LangChain, LangGraph, LlamaIndex).
- Proficient in Python and shell scripting, with familiarity in deep learning frameworks (PyTorch, TensorFlow, JAX, or Transformers).
- Experience with popular model training and serving frameworks like KServe, KubeFlow, Triton etc.
- Excellent communication skills for translating complex technical concepts into clear proposals, designs, and presentations.
- Collaborative mindset with experience working closely with product managers, engineers, and customers.
- Ability to mentor and guide junior data scientists or ML engineers.
- Experience acting as a technical evangelist, presenting at conferences, customer briefings, or industry events.