remote
Senior Site Reliability Engineer C , .NET - Climavision
Site Reliability Engineer
Lead the reliability and scalability of our high‑resolution weather platform using C#/.NET, AWS, Docker, and Kubernetes. Drive automation, monitoring, and incident response to ensure 99.9% uptime for climate‑tech services.
About the role
Key Responsibilities
- Design, implement, and maintain highly available, scalable infrastructure for real‑time weather data pipelines using AWS services.
- Automate deployment, scaling, and configuration of C#/.NET microservices with Docker and Kubernetes.
- Develop and maintain observability stack (metrics, logs, traces) to detect and resolve incidents proactively.
- Collaborate with software engineering teams to embed reliability best practices into CI/CD pipelines.
- Lead post‑mortem analysis and implement root‑cause remediation to improve system resilience.
Requirements
- 5+ years of experience in site reliability or DevOps roles, with strong background in C#/.NET.
- Proficient with AWS, Docker, Kubernetes, and infrastructure as code (Terraform/CloudFormation).
- Hands‑on experience with monitoring/alerting tools (Prometheus, Grafana, Datadog).
- Excellent problem‑solving skills and ability to work in a fast‑paced, remote environment.
Skills
cawsdockerkubernetes