remoteonsite
Site Reliability Engineer - UBS
Site Reliability Engineer
Lead proactive problem management and reliability initiatives for on‑premises and cloud database environments, ensuring high availability and stability while driving the organization toward an agile, responsive culture.
About the role
Key Responsibilities
- Design, implement, and maintain highly available database infrastructure across on‑premises and cloud platforms.
- Proactively monitor, troubleshoot, and resolve incidents to minimize downtime and maintain service level objectives.
- Collaborate with development and operations teams to embed reliability best practices into CI/CD pipelines.
- Automate configuration, deployment, and scaling of database services using infrastructure-as-code tools.
- Drive continuous improvement initiatives, including capacity planning, performance tuning, and cost optimization.
Requirements
- Strong experience in Site Reliability Engineering for database systems.
- Hands‑on expertise with cloud platforms (AWS, Azure, or GCP) and on‑prem infrastructure.
- Proficiency in monitoring, alerting, and incident response tools.
- Solid understanding of high‑availability architectures and disaster recovery.
- Excellent communication skills and a collaborative mindset in an agile environment.
Skills
pythonsqlazurepostgresqlagile