onsite
Software Engineer, Infrastructure - aaru
Software Engineer
Build and maintain scalable, resilient infrastructure for AI-driven predictive models, leveraging Python, AWS, Docker, Kubernetes, Terraform, and CI/CD pipelines to support rapid deployment and high availability.
About the role
Key Responsibilities
- Design, implement, and operate scalable cloud infrastructure for large‑scale AI workloads using AWS services.
- Containerize applications with Docker and orchestrate them on Kubernetes clusters, ensuring high availability and efficient resource utilization.
- Automate infrastructure provisioning and configuration with Terraform, maintaining versioned, reproducible environments.
- Develop and maintain CI/CD pipelines to streamline code integration, testing, and deployment across multiple environments.
- Monitor system performance, troubleshoot incidents, and implement proactive scaling and resilience strategies.
Requirements
- 3+ years of experience building and managing cloud infrastructure for data‑intensive applications.
- Strong understanding of CI/CD practices and tools such as GitHub Actions, Jenkins, or GitLab CI.
- Experience with monitoring, logging, and alerting solutions (Prometheus, Grafana, CloudWatch).
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced, AI‑focused environment.
Skills
pythonawsdockerkubernetesterraformcicd