onsite
IT Operations Engineer - airasia
Systems Engineer
Responsible for maintaining and optimizing the company's IT infrastructure, automating processes, and ensuring high availability of cloud‑based services using Linux, Python, AWS, and container technologies.
About the role
Key Responsibilities
- Design, deploy, and manage Linux‑based servers and cloud resources on AWS.
- Develop and maintain automation scripts in Python and Bash to streamline operational tasks.
- Monitor system performance and reliability using Prometheus, Grafana, and alerting tools.
- Implement and support containerized workloads with Docker and Kubernetes.
- Troubleshoot network, security, and infrastructure incidents to meet SLA targets.
- Collaborate with development and security teams to integrate CI/CD pipelines and enforce best practices.
Requirements
- 3+ years of experience in IT operations or site reliability engineering.
- Strong proficiency with Linux administration and shell scripting.
- Hands‑on experience with AWS services (EC2, S3, VPC, IAM).
- Familiarity with container orchestration (Docker, Kubernetes) and monitoring tools (Prometheus, Grafana).
- Solid understanding of networking concepts and security fundamentals.
Skills
linuxpythonbashawsdockerkubernetesprometheus