Onsite
Java SRE Technical Lead - Litmus7 System Consulting
Engineering Manager
Lead a high‑performing SRE team managing Java/J2EE web applications in enterprise environments, driving production support, performance tuning, and incident response using Azure, Kafka, Apigee, Oracle, and modern observability tools.
About the role
Key Responsibilities
- Lead and mentor a cross‑functional SRE team to ensure 99.9% uptime for Java/J2EE web applications.
- Own incident management, including PagerDuty on‑call rotations, root‑cause analysis, and post‑mortem documentation.
- Design and implement monitoring, alerting, and observability solutions with Dynatrace, Splunk, and Grafana.
- Perform deep performance tuning: thread and heap dump analysis, JVM tuning, and database query optimization.
- Collaborate with development and DevOps teams to integrate Azure Integration Services, Kafka, and Google Apigee into the CI/CD pipeline.
- Drive continuous improvement initiatives, automation, and best‑practice adoption across the platform.
Requirements
- 8+ years of production support experience for Java/J2EE applications in enterprise settings.
- Proven leadership skills, with a track record of managing SRE or DevOps teams.
- Hands‑on expertise in Azure, Kafka, Apigee, Oracle, and SQL.
- Strong knowledge of performance troubleshooting, JVM internals, and database tuning.
- Experience with modern observability tools (Dynatrace, Splunk, Grafana) and incident‑management platforms (PagerDuty).