onsite
Staff Software Engineer, Site Reliability Engineering, Networking - Google Germany GmbH
Software Engineer
Lead the design and operation of highly available networking services, driving reliability, automation, and performance at scale using Kubernetes, Docker, and cloud infrastructure tools.
About the role
Key Responsibilities
- Architect, build, and maintain resilient networking services that support millions of users worldwide.
- Implement and evolve CI/CD pipelines, infrastructure as code, and automated monitoring for continuous delivery.
- Collaborate with cross‑functional teams to troubleshoot incidents, conduct post‑mortems, and drive root‑cause analysis.
- Mentor and coach junior engineers, fostering a culture of reliability and operational excellence.
- Evaluate and integrate emerging networking technologies to improve performance and reduce operational overhead.
Requirements
- 10+ years of software engineering experience with a focus on large‑scale distributed systems.
- Deep expertise in networking protocols, Kubernetes, Docker, and cloud platforms (AWS, GCP, or Azure).
- Proficiency in Python, Terraform, and scripting for automation.
- Strong problem‑solving skills and a proven track record of driving reliability improvements.
- Excellent communication skills and ability to work in a fast‑paced, collaborative environment.
Skills
kubernetesdockerpythonterraform