remote
Senior Reliability Engineer - CRM - Medtronic
Software Engineer
Lead the reliability of CRM platforms, ensuring high availability, performance, and rapid incident resolution using Python, AWS, and advanced monitoring tools.
About the role
Key Responsibilities
- Design, implement, and maintain highly available CRM infrastructure on AWS, ensuring 99.9% uptime.
- Develop automated monitoring, alerting, and incident response workflows using Python and observability tools.
- Collaborate with development and product teams to embed reliability best practices into CI/CD pipelines.
- Lead post‑incident reviews, root cause analysis, and continuous improvement initiatives.
- Document reliability standards, runbooks, and knowledge base articles for cross‑functional teams.
Requirements
- 5+ years of experience in reliability or site‑reliability engineering, preferably in CRM or SaaS environments.
- Strong proficiency in Python, AWS services (EC2, RDS, Lambda, CloudWatch), and infrastructure automation.
- Hands‑on experience with monitoring/observability stacks (Prometheus, Grafana, Datadog, etc.).
- Excellent problem‑solving skills and a proactive, data‑driven mindset.
- Effective communication skills and ability to work cross‑functionally in a fast‑paced environment.