remote
Technical Operations Analyst - 2U
Systems Engineer
Analyst driving operational excellence for a leading online education platform, leveraging Python, SQL, AWS, and DevOps practices to diagnose, automate, and optimize system performance and reliability.
About the role
Key Responsibilities
- Analyze and troubleshoot complex system issues across cloud and on‑prem environments, providing root‑cause analysis and actionable recommendations.
- Design, develop, and maintain automation scripts in Python to streamline incident response, data collection, and reporting.
- Collaborate with engineering, security, and product teams to implement monitoring, alerting, and capacity planning solutions on AWS.
- Document processes, create runbooks, and conduct knowledge‑share sessions to elevate team capabilities.
- Participate in on‑call rotations, ensuring rapid resolution of high‑priority incidents and continuous service improvement.
Requirements
- 3+ years of experience in technical operations, site reliability, or DevOps roles.
- Proficiency in Python scripting, SQL querying, and AWS services (EC2, RDS, CloudWatch, Lambda).
- Strong analytical skills with a track record of automating repetitive tasks and improving system reliability.
- Excellent communication and collaboration abilities across cross‑functional teams.
- Experience with CI/CD pipelines, containerization (Docker), and configuration management (Ansible, Terraform) is a plus.