onsite
Senior Site Reliability Engineer - Edge - Dominion Dynamics
Site Reliability Engineer
Senior Site Reliability Engineer focused on building, hardening, and operating edge compute modules. The role integrates hardware, OS, and runtime with mesh radios, sensors, and SDKs, using Linux, Kubernetes, Docker, Terraform, and observability tools.
About the role
Key Responsibilities
- Design, implement, and maintain a reproducible edge compute platform that combines hardware, operating system, and runtime components.
- Automate provisioning, configuration, and deployment pipelines using Terraform, CI/CD tools, Docker containers, and Kubernetes orchestration.
- Ensure security hardening, reliability, and field‑readiness of edge devices from prototype to production.
- Develop monitoring, alerting, and performance dashboards with Prometheus and related observability stacks.
- Collaborate with hardware, firmware, and software teams to integrate mesh radios, sensors, and SDKs into a seamless edge solution.
Requirements
- 5+ years of SRE or DevOps experience with Linux‑based systems and container orchestration.
- Strong expertise in infrastructure as code (Terraform) and CI/CD pipeline automation.
- Hands‑on experience securing and hardening edge devices for field deployment.
- Proficiency in monitoring and observability tools such as Prometheus and Grafana.
- Ability to work across hardware and software teams, translating prototype workflows into production‑grade processes.
Skills
Linux, Kubernetes, Docker, Terraform, CI/CD, Prometheus