onsite
Staff Site Reliability Engineer, Protected Data - Google
Site Reliability Engineer
Senior SRE role focused on protected data platforms, requiring deep Linux internals, networking expertise, and strong programming skills in Python, C++ and Java to design, operate, and scale reliable services.
About the role
Key Responsibilities
- Design, implement, and maintain highly available, secure infrastructure for protected data services.
- Develop automation and monitoring tools using Python, C++ or Java to improve reliability and incident response.
- Diagnose and resolve complex Linux kernel, filesystem, and networking issues across large-scale clusters.
- Collaborate with product, security, and compliance teams to ensure data protection requirements are met.
- Drive capacity planning, performance tuning, and continuous improvement of SRE processes.
Requirements
- Bachelor's degree in Computer Science or equivalent practical experience.
- 5+ years in product demand/supply planning, production, and inventory management.
- 3+ years of hands‑on experience with Unix/Linux internals and networking (TCP/IP, routing, SDN).
- Proficiency in at least one programming language: Python, C++, or Java.
- Demonstrated ability to build scalable, reliable systems in a fast‑moving environment.