Senior Software Engineer, Machine Learning Inference
As a Senior Software Engineer on the TensorRT team, you will design and implement inference software optimizations for AI applications on NVIDIA GPUs. Your responsibilities include developing and optimizing TensorRT and TensorRT-LLM using C++, Python, and CUDA for efficient deployment of LLMs and Generative AI models, while collaborating with deep learning experts and GPU architects.
At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world’s most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.
As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications on NVIDIA GPUs. If you're ready to take on challenging projects and make a significant impact in a company that values creativity, excellence, and collaboration, we want to hear from you!
Posted June 9, 2026