On-site
Principal Core Infrastructure Engineer - Oracle
DevOps Engineer
About the role
Lead the design and implementation of highly scalable, fault‑tolerant distributed infrastructure, optimizing code paths for hyper‑scale workloads and building proactive telemetry, validation, and upgrade mechanisms.
Key Responsibilities
- Architect and develop core components of elastic, distributed systems that support hyper‑scale, high‑throughput workloads.
- Define scalability and reliability requirements, applying redundancy, replication, failover, load‑shedding, throttling, and rate‑limiting to meet strict SLOs.
- Optimize code and data paths for performance, ensuring low latency and efficient resource utilization.
- Design in‑service upgrade strategies and fault‑injection validation frameworks to guarantee continuous availability.
- Establish key performance indicators, build dashboards, and configure alerts for proactive monitoring and incident response.
Requirements
- Deep expertise in building and operating large‑scale distributed systems and data‑plane platforms.
- Strong background in performance tuning, scalability engineering, and fault‑tolerant design patterns.
- Proficiency with monitoring, telemetry, and observability tools to drive reliability.
- Experience implementing automated validation, fault injection, and upgrade mechanisms in production environments.
- Excellent problem‑solving skills and ability to lead technical direction across cross‑functional teams.
Skills
- Software development
- System design
- Problem solving