remote
Senior Software Engineer - Observability Platform - Grafana Labs
Software Engineer
Senior engineer building and scaling Grafana Cloud's observability platform, focusing on data pipelines, AI‑enhanced analytics, and reliable, high‑performance services using Go, Python, Kubernetes and AWS.
About the role
Key Responsibilities
- Design, develop, and maintain core services of a large‑scale observability platform using Go and Python.
- Implement and optimize data ingestion, storage, and query pipelines for metrics, logs, and traces.
- Collaborate with product and AI teams to integrate intelligent alerting and noise‑reduction features.
- Deploy, monitor, and troubleshoot services in Kubernetes clusters on AWS, ensuring high availability and performance.
- Contribute to open‑source components such as Prometheus and Grafana, and drive best practices for observability tooling.
Requirements
- 5+ years of professional software development experience, with strong expertise in Go and Python.
- Deep understanding of cloud‑native architectures, Kubernetes, and AWS services.
- Hands‑on experience with observability stacks (Prometheus, Grafana, Loki, Tempo) and distributed tracing.
- Proven ability to design scalable, low‑latency systems and troubleshoot production incidents.
- Experience contributing to open‑source projects and working in remote, collaborative teams.
Skills
gopythonkubernetesprometheusgrafanaaws