remote
Director, Engineering - Detection Platform - Datadog
Software Engineer
Lead a high‑impact engineering organization that builds scalable alerting, event intelligence, and AI‑driven detection systems for enterprise customers, driving product strategy and cross‑functional collaboration.
About the role
Key Responsibilities
- Lead and mentor engineering managers and their teams, setting vision and execution for the Detection Platform’s core infrastructure.
- Architect and deliver scalable alerting, event management, and autonomous detection services that support millions of events per second.
- Collaborate closely with Product Management, Applied Science, Design, and other engineering leaders to define feature roadmaps and technical standards.
- Drive continuous improvement of monitoring, observability, and AI‑powered detection capabilities across the platform.
- Champion best practices in cloud architecture, CI/CD, and automated testing to ensure high reliability and performance.
Requirements
- 10+ years of software engineering experience with a proven track record in large‑scale distributed systems.
- Strong background in Python, Go, and cloud-native technologies (AWS, Kubernetes).
- Experience building and scaling alerting, event intelligence, and machine‑learning‑based detection pipelines.
- Excellent leadership, communication, and cross‑functional collaboration skills.
- Hands‑on expertise in designing for high availability, observability, and security at scale.
Skills
pythongoawskubernetesmachine learning