onsite
Software Engineer - Observability Instrumentation - G Research
Software Engineer
Senior Software Engineer focused on observability instrumentation, building scalable monitoring and tracing systems using Python, Go, Kubernetes, Prometheus, Grafana, and AWS to empower quantitative research teams.
About the role
Key Responsibilities
- Design, develop, and maintain observability pipelines that capture metrics, logs, and traces across distributed services.
- Implement instrumentation libraries in Python and Go to expose custom metrics and trace spans for high‑frequency trading systems.
- Collaborate with platform and research teams to define observability requirements and integrate solutions into CI/CD workflows.
- Optimize data collection and storage on Kubernetes clusters, ensuring low latency and high availability.
- Automate alerting, dashboards, and incident response workflows using Prometheus, Grafana, and AWS monitoring services.
Requirements
- 5+ years of software engineering experience with a strong focus on observability and monitoring.
- Hands‑on experience with Prometheus, Grafana, and related alerting ecosystems.
- Excellent problem‑solving skills and a passion for building reliable, scalable systems.
Skills
pythongokubernetesprometheusgrafanaaws