AI Inference Engineer
As an Applied AI Inference Engineer, you will be responsible for architecting, building, and deploying high-scale production AI inference systems for Evergrid's customers. This hands-on role involves owning customer projects from initial exploration to production deployment, optimizing AI/ML inference pipelines, and collaborating with both internal and customer engineering teams.
Evergrid builds the infrastructure that powers advanced artificial intelligence at scale, with the Most Advanced Neocloud. We design and operate critical GPU and power infrastructure for frontier AI workloads — environments where performance, reliability, and execution are non-negotiable. Our customers are building systems at the edge of what’s possible, and they depend on infrastructure that scales quickly and works under sustained pressure. We care deeply about outcomes, ownership, and building durable systems and long-term partnerships.
As an Applied AI Inference Engineer, you will work directly with Evergrid’s customers to architect, build, and deploy high-scale production AI inference systems on our infrastructure. You will own customer projects end to end — from early exploration through production deployment and monitoring — translating ambiguous goals into observable, reliable services with clear quality, latency, and cost outcomes. This is a hands-on engineering role that blends software development, inference performance engineering, and customer-facing execution. You’ll work closely with internal platform and infrastructure teams while embedding with customer engineering organizations.
Posted June 8, 2026