Remote
Principal Software Engineer - AI Platform - Oracle
Software Engineer
Lead the design and delivery of AI‑powered automation frameworks for OCI’s global network, driving operational excellence through scalable, event‑driven platforms and advanced observability tools.
About the role
Key Responsibilities
- Architect and implement large‑scale, event‑driven automation frameworks that integrate AI/ML models to predict and mitigate network incidents.
- Collaborate with Network Reliability Engineering and GNOC teams to define observability requirements and deliver actionable insights at hyperscale.
- Lead code reviews, performance tuning, and scalability testing for high‑availability services written in Python and Node.js.
- Drive continuous improvement of CI/CD pipelines, ensuring rapid, reliable deployment of new features across OCI’s cloud infrastructure.
- Mentor senior engineers and foster a culture of innovation, knowledge sharing, and best practices in cloud‑native development.
Requirements
- 10+ years of software engineering experience with a strong focus on cloud infrastructure and automation.
- Proficiency in Python and Node.js, with hands‑on experience building event‑driven architectures.
- Deep knowledge of AI/ML concepts and practical experience deploying models in production environments.
- Strong background in observability, monitoring, and incident response for large‑scale distributed systems.
- Excellent communication skills and a proven ability to lead cross‑functional teams.
Skills
python, nodejs, machine learning