OpenTalent

© 2026 Gravity Engineering Services Pvt. Ltd. All rights reserved. hello@opentalent.in

Staff Software Engineer - AI Traffic & Inference Infrastructure


About the role

Company Introduction

We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing e-commerce companies and have established an unparalleled reputation as a dominant and reliable force in South Korean commerce.

We are proud to have the best of both worlds: a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the pace we have kept since our inception. We are all entrepreneurs surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people who like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.

Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world.

Role Overview

As a Staff Engineer on our Coupang intelligent Cloud Infrastructure team, you will design and scale the intelligent nervous system of our CIC Cloud AI platform. You won’t just be moving packets; you’ll be building the orchestration and routing layers that ensure our LLMs and foundation models are highly available, low-latency, and cost-efficient. You will own the end-to-end lifecycle of traffic management, from global load balancing to hardware-aware request routing across thousands of accelerators.

What You Will Do

Intelligent Routing: Design and implement sophisticated load-balancing algorithms tailored for AI workloads (training, inference), optimizing request distribution based on model availability and accelerator health.

Inference Orchestration: Architect and evolve our inference infrastructure to support seamless model deployment, auto-scaling, and multi-AZ failover.

Performance Engineering: Drive initiatives to minimize tail latency (P95/P99) and maximize throughput using advanced batching, caching, and streaming token delivery techniques.

Fleet Automation: Build robust infrastructure-as-code and CI/CD pipelines to manage dynamic compute fleets, ensuring they automatically scale to meet production and research demands.

Observability & Optimization: Leverage deep telemetry data to tune system performance and hardware-agnostic scheduling across diverse GPU/TPU environments.

Technical Leadership: Lead cross-functional initiatives across infrastructure, software, and ML teams, providing mentorship and setting the long-term technical roadmap for traffic management.

Basic Qualifications

Skills

LLM, Python, Kubernetes, AWS, GCP, Azure
Company: Coupang
Department: Engineering
Location: Bengaluru
Experience: 7+ years
Tenure: Full-time
Level: Lead

Posted June 8, 2026
