onsite
Site Reliability Engineer - Tata Consultancy Services (TCS)
Site Reliability Engineer
Site Reliability Engineer focused on building and maintaining robust, automated systems using Python and DevOps practices, ensuring high availability and continuous improvement in a collaborative, Agile environment.
About the role
Key Responsibilities
- Develop, test, and maintain high‑quality software solutions, frameworks, and automations to support reliability and scalability.
- Collaborate with cross‑functional teams to analyze requirements and design solutions that enhance system stability.
- Participate in code reviews, enforce coding standards, and share knowledge across the engineering team.
- Identify, troubleshoot, and resolve incidents and problems, driving root‑cause analysis and preventive measures.
- Implement and maintain DevOps/SRE best practices, including monitoring, alerting, and continuous deployment pipelines.
- Contribute to continuous improvement initiatives, optimizing processes and tooling for greater efficiency.
Requirements
- Proficiency in Python or a comparable scripting language.
- Strong understanding of DevOps principles and Site Reliability Engineering practices.
- Experience with Agile development methodologies and collaborative teamwork.
- Excellent problem‑solving skills and a proactive approach to incident management.
- Effective communication skills, both written and verbal, to convey technical concepts clearly.