onsite
Senior Backend Software Engineer, Observability - Jobgether
Software Engineer
Lead the design and implementation of observability services for a next‑generation cloud platform, ensuring reliability and performance at scale using Python, Go, Kubernetes, Prometheus, Grafana and AWS.
About the role
Key Responsibilities
- Architect and develop observability pipelines that ingest, process, and expose telemetry data for millions of services across a distributed cloud platform.
- Design and maintain scalable, fault‑tolerant data stores and streaming solutions using Go and Kubernetes.
- Integrate Prometheus, Grafana, and custom dashboards to provide real‑time visibility into system health and performance.
- Collaborate with cross‑functional teams to define SLAs, alerting rules, and incident response workflows.
- Continuously evaluate and adopt emerging observability tools and best practices to improve reliability and reduce mean time to recovery.
Requirements
- 8+ years of backend engineering experience with a strong focus on observability and monitoring.
- Proficiency in Python and Go, with hands‑on experience building high‑throughput services.
- Deep knowledge of Kubernetes, Prometheus, Grafana, and cloud‑native observability patterns.
- Experience deploying and managing services on AWS (EKS, CloudWatch, S3).
- Excellent problem‑solving skills and a passion for building reliable, scalable systems.
Skills
Python, Go, Kubernetes, Prometheus, Grafana, AWS