remote
Systems Development Engineer, Amazon Core Search - Amazon.com
Software Engineer
Systems Development Engineer focused on maintaining and improving Amazon’s global search infrastructure, ensuring high availability and performance using Python, AWS, and operational excellence practices.
About the role
Key Responsibilities
- Operate and support the front‑end services of one of the world’s largest search infrastructures, ensuring 99.99% uptime.
- Investigate, triage, and resolve high‑severity incidents, collaborating with cross‑functional teams to minimize customer impact.
- Design and implement monitoring, alerting, and automation solutions to detect and remediate performance regressions.
- Participate in continuous improvement initiatives, applying Operational Excellence principles to streamline processes and reduce toil.
- Contribute to capacity planning, scaling strategies, and reliability engineering for distributed search services.
Requirements
- 3+ years of experience in systems or software engineering, with a strong focus on distributed search or web-scale services.
- Proficiency in Python and AWS services (EC2, S3, CloudWatch, Lambda).
- Hands‑on experience with monitoring, alerting, and incident response tools (e.g., Prometheus, Grafana, PagerDuty).
- Solid understanding of distributed systems concepts, performance tuning, and reliability engineering.
- Excellent communication skills and a collaborative mindset for working in a global, cross‑functional team.