remote
Senior Director, Reliability and Security Engineering - Beacon Biosignals
Software Engineer
Lead the reliability and security strategy for a cloud‑native EEG analytics platform, driving SRE practices, security posture, and automation using AWS, Kubernetes, Python, and CI/CD pipelines.
About the role
Key Responsibilities
- Define and execute the reliability and security roadmap for a large‑scale, cloud‑native analytics platform supporting millions of EEG data points.
- Build and mentor a high‑performing SRE and security team, establishing best practices for incident response, observability, and threat modeling.
- Design, implement, and maintain automated CI/CD pipelines, infrastructure‑as‑code, and monitoring solutions on AWS and Kubernetes.
- Partner with product, data science, and compliance teams to embed security controls and reliability standards throughout the development lifecycle.
- Lead root‑cause analysis of production incidents, drive post‑mortems, and continuously improve system resilience and performance.
Requirements
- 10+ years of experience in site reliability engineering, cloud security, or related fields, with at least 5 years in a leadership role.
- Deep expertise in AWS services, Kubernetes orchestration, and infrastructure‑as‑code tools (e.g., Terraform, CloudFormation).
- Proficiency in scripting and automation using Python, and strong knowledge of CI/CD frameworks (e.g., Jenkins, GitLab CI, GitHub Actions).
- Demonstrated success implementing SRE practices: SLIs/SLOs, error budgets, automated remediation, and observability stacks.
- Excellent communication and stakeholder management skills, with a track record of driving cross‑functional initiatives in regulated environments.
Skills
AWS, Kubernetes, Python, CI/CD