remote
Staff Software Engineer/Tech Lead, ML Network Infrastructure - Google
Software Engineer
Lead the design and scaling of large‑scale ML network infrastructure, driving architecture, performance, and team execution using C++ and distributed systems expertise.
About the role
Key Responsibilities
- Architect and implement high‑throughput, low‑latency ML network services in C++ across distributed environments.
- Own end‑to‑end product lifecycle: design, testing, deployment, and continuous improvement of infrastructure components.
- Mentor and lead cross‑functional engineering teams, setting technical direction and ensuring best practices.
- Collaborate with data scientists and product managers to translate ML requirements into scalable system designs.
- Drive performance tuning, reliability, and security across the network stack.
Requirements
- 8+ years of C++ programming with deep knowledge of data structures and algorithms.
- 5+ years building and operating large‑scale distributed systems or network infrastructure.
- Strong background in software design, architecture, and testing.
- Proven leadership experience managing engineering teams and projects.
- Excellent communication skills and a passion for mentoring and knowledge sharing.