onsite
Principal Data Systems Software Engineer - Oracle
Software Engineer
Lead the design, implementation, and operation of mission‑critical, cloud‑scale data systems, leveraging Kubernetes, Prometheus, ELK, and strong scripting skills in Python and Ruby.
About the role
Key Responsibilities
- Architect, develop, and maintain large‑scale distributed data platforms that meet high availability and performance requirements.
- Design and operate cloud‑native infrastructure using Kubernetes, Prometheus, and ELK for monitoring, logging, and alerting.
- Write robust automation and diagnostic scripts in Python, Ruby, and shell to streamline deployment, troubleshooting, and performance tuning.
- Collaborate with cross‑functional teams to integrate web servers such as Apache Tomcat, Nginx, or Netty into the data pipeline.
- Lead root‑cause analysis of complex networking and operating‑system issues, defining new metrics and dashboards to improve observability.
Requirements
- Minimum six years of experience building and supporting mission‑critical, large‑scale systems.
- Deep expertise in distributed systems design and cloud‑scale infrastructure.
- Proven hands‑on experience with Kubernetes, Prometheus, and ELK stack.
- Strong scripting abilities in Python, Ruby, and shell environments.
- Solid knowledge of web servers such as Apache Tomcat, Nginx, or Netty and excellent problem‑analysis communication skills.
Skills
kubernetesprometheuspythonruby