remoteonsite
Specialist I - Software Engineering - UST
Software Engineer
Senior Automation Operations Engineer responsible for incident management, continuous improvement of automation workflows, and AI‑driven process optimization using Python, AWS, and CI/CD tools.
About the role
Key Responsibilities
- Serve as the primary point of contact for all automation‑related incidents, ensuring rapid detection, triage, and resolution.
- Collaborate with cross‑functional teams to analyze root causes, implement corrective actions, and document lessons learned.
- Design, develop, and maintain scalable automation scripts and workflows in Python, leveraging AWS services for deployment and monitoring.
- Integrate AI/ML components to enhance automation efficiency, including anomaly detection and predictive maintenance.
- Drive continuous improvement initiatives by reviewing performance metrics, identifying bottlenecks, and recommending process enhancements.
- Participate in on‑call rotations and support incident response during critical events.
Requirements
- 10+ years of experience in automation engineering, incident management, or related fields.
- Proficiency in Python, AWS, and CI/CD pipelines.
- Strong understanding of DevOps practices and infrastructure as code.
- Experience with AI/ML integration in operational workflows.
- Excellent problem‑solving skills and ability to work independently in a fast‑paced environment.