Remote
Lead Observability and Monitoring Engineer - SimCorp
Software Engineer
Lead the design and implementation of observability and monitoring solutions for a leading FinTech platform, driving reliability, performance, and insight across cloud-native services.
About the role
Key Responsibilities
- Architect and maintain an end‑to‑end observability stack (metrics, logs, traces) for large‑scale financial services applications.
- Design and implement Prometheus/Grafana dashboards, alerting rules, and automated remediation pipelines.
- Collaborate with DevOps, SRE, and product teams to define SLIs/SLOs and improve system resilience.
- Lead incident response, post‑mortem analysis, and root‑cause investigations to drive continuous improvement.
- Mentor and coach engineering teams on best practices for observability, monitoring, and cloud‑native tooling.
Requirements
- 5+ years of experience in software engineering with a focus on observability and monitoring.
- Hands‑on expertise with Prometheus, Grafana, Loki, OpenTelemetry, and related ecosystems.
- Strong background in Kubernetes, container orchestration, and cloud platforms (AWS, GCP, or Azure).
- Proficiency in Python or Go for scripting and automation.
- Excellent communication skills and a proven ability to mentor junior engineers.
Skills
Prometheus, Grafana, Kubernetes, AWS, Python