Remote / Onsite
IT Lead - Sarvam
Software Engineer
Lead the IT organization for a sovereign AI platform, driving secure, scalable infrastructure and DevOps excellence across research, models, and applications using Python, Node.js, AWS, Kubernetes, and CI/CD pipelines.
About the role
Key Responsibilities
- Architect, deploy, and maintain highly available, secure cloud infrastructure on AWS for AI research, model training, and production services.
- Lead a cross‑functional DevOps team to implement CI/CD pipelines, automated testing, and blue‑green deployments for Python and Node.js microservices.
- Define and enforce security best practices, including IAM, encryption, network segmentation, and compliance with financial regulations.
- Collaborate with data science, product, and compliance teams to ensure seamless integration of AI models into production workflows.
- Monitor system performance, troubleshoot incidents, and drive continuous improvement of infrastructure reliability and cost efficiency.
Requirements
- 5+ years of experience in cloud architecture and DevOps, with hands‑on expertise in AWS, Kubernetes, and CI/CD tools.
- Strong programming background in Python and Node.js, with experience building scalable microservices.
- Deep understanding of security principles, regulatory compliance, and incident response in a financial context.
- Excellent communication skills and a proven ability to lead technical teams in a fast‑paced environment.
Skills
Python, Node.js, AWS, Kubernetes, CI/CD