Remote / On-site
Automation Reliability Engineer - SAP
Software Engineer
Lead automation and reliability initiatives for IT services, ensuring high availability and efficient incident management using Python, cloud platforms, and advanced monitoring tools.
About the role
Key Responsibilities
- Design, implement, and maintain automation frameworks to streamline IT service delivery and reduce manual effort.
- Develop and enforce reliability best practices, including capacity planning, performance tuning, and fault tolerance.
- Collaborate with cross‑functional teams to define service level objectives and monitor compliance using cloud and on‑prem solutions.
- Lead incident response, root cause analysis, and post‑mortem activities to continuously improve system resilience.
- Write and maintain Python scripts and tools for data collection, alerting, and remediation automation.
Requirements
- 3+ years of experience in automation and reliability engineering, in IT services or in an SRE role.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
- Effective communication skills in English, both written and verbal.