Engineer, Compute Fleet Management
Principal Engineer for Compute Fleet Management at Databricks, focusing on building and scaling foundational compute infrastructure using Python, Node.js, Machine Learning, and AWS.
# P-725
At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business.
About the Team: Join the Core of Databricks Infrastructure
The Compute Infra Team is the engine behind all of Databricks' products and Control Plane services. We build and scale the foundational compute infrastructure that enables every Databricks customer to succeed, operating one of the largest and most dynamic data and AI clouds in the world.
Mission: Define the Future of Cloud Compute Efficiency and Scale
As the Technical Lead for Compute Fleet Management, you won't just manage a fleet—you will set the standard for how Databricks consumes and optimizes compute across all three major clouds (AWS, Azure, and GCP). This is a mission-critical role with direct impact on our gross margin and customer experience. Your mandate includes:
Outcomes: The Impact You Will Deliver
This role is for an engineer who thrives on owning the most challenging and impactful outcomes:
Requirements: Are You Ready for this Challenge?
We are seeking a seasoned Principal Engineer who has not only built but successfully operated large-scale, mission-critical infrastructure systems in production. You must have a track record of:
Posted June 6, 2026