remote
Senior Staff Engineer, Platform R&D - CRUSOE
Software Engineer
Lead the design and scaling of AI infrastructure, driving GPU and distributed system innovations to deliver energy‑efficient, high‑performance compute for large‑scale AI workloads.
About the role
Key Responsibilities
- Architect and implement next‑generation GPU‑centric compute platforms, optimizing for performance, scalability, and power efficiency.
- Lead cross‑functional teams in developing low‑level drivers, runtime libraries, and orchestration tools for large‑scale AI workloads.
- Drive research and adoption of emerging hardware technologies (e.g., new GPU architectures, AI accelerators) and integrate them into the platform stack.
- Collaborate with data scientists and ML engineers to benchmark, profile, and tune models for maximum throughput and minimal energy consumption.
- Mentor and mentor junior engineers, fostering a culture of technical excellence and continuous improvement.
Requirements
- 10+ years of experience in systems engineering, with deep expertise in C++, Python, and CUDA.
- Proven track record designing and scaling distributed GPU clusters in cloud or on‑prem environments.
- Strong understanding of energy‑efficiency metrics and experience optimizing hardware utilization.
- Excellent communication skills and ability to influence technical direction across teams.
- Passion for AI infrastructure and a desire to shape the future of large‑scale compute.