Lead end‑to‑end cloud operations, designing scalable CI/CD pipelines, Kubernetes clusters, and infrastructure as code on AWS to support AI‑driven lending services.
About the role
Key Responsibilities
Design, implement, and maintain highly available Kubernetes clusters and Docker workloads on AWS.
Build and optimize CI/CD pipelines using Git, Terraform, and automated testing to accelerate feature delivery.
Implement infrastructure as code (IaC) for reproducible, version‑controlled environments.
Monitor application performance, troubleshoot incidents, and drive continuous improvement of observability tools.
Collaborate with data science, backend, and security teams to ensure compliance and scalability of AI services.
Requirements
5+ years of DevOps experience in a fast‑moving fintech or AI environment.
Proficiency with Kubernetes, Docker, AWS services (EKS, EC2, S3, CloudWatch), and Terraform.
Strong scripting skills (Python, Bash) and experience with CI/CD tools (GitHub Actions, Jenkins, ArgoCD).
Hands‑on experience with monitoring/alerting (Prometheus, Grafana, Datadog).
Excellent problem‑solving skills and a collaborative mindset.