onsite
Staff Software Engineer, AI/ML Storage Infrastructure - Google
Software Engineer
Lead the design and implementation of large‑scale AI/ML storage infrastructure, driving performance, reliability, and scalability using C++ and distributed systems principles.
About the role
Key Responsibilities
- Architect and develop high‑performance storage solutions for AI/ML workloads, ensuring low latency and high throughput.
- Lead cross‑functional teams in the design, testing, and deployment of distributed systems at scale.
- Drive continuous improvement of software quality through rigorous testing, code reviews, and performance profiling.
- Mentor junior engineers, fostering a culture of technical excellence and knowledge sharing.
- Collaborate with product, research, and infrastructure teams to align storage capabilities with evolving AI/ML requirements.
Requirements
- Bachelor’s degree in Computer Science or a related field (Master’s/PhD preferred).
- 8+ years of C++ programming experience with a deep understanding of data structures and algorithms.
- 5+ years building and operating large‑scale distributed storage or compute infrastructure.
- Strong background in software design, architecture, and performance optimization.
- Proven technical leadership and experience managing complex engineering projects.