onsite
Sustaining Engineer - Activ 2
Software Engineer
Sustaining Engineer responsible for maintaining and enhancing production systems, ensuring high availability and performance using Python, AWS, and DevOps practices. Focus on incident response, root cause analysis, and continuous improvement of deployment pipelines.
About the role
Key Responsibilities
- Monitor and troubleshoot production environments, diagnosing and resolving incidents with minimal downtime.
- Develop and maintain automation scripts in Python to streamline operations and reduce manual effort.
- Collaborate with development teams to implement CI/CD pipelines on AWS, ensuring reliable and repeatable deployments.
- Analyze system metrics and logs to identify performance bottlenecks and recommend architectural improvements.
- Document incident post-mortems and contribute to knowledge base for future reference.
Requirements
- 3+ years of experience in a sustaining or operations engineering role.
- Proficiency in Python and experience scripting for automation.
- Strong understanding of AWS services (EC2, S3, RDS, CloudWatch) and infrastructure-as-code tools.
- Hands‑on experience with CI/CD tools such as Jenkins, GitLab CI, or AWS CodePipeline.
- Excellent problem‑solving skills and ability to work under pressure during incidents.