onsite
IT / AI SysOps Manager - SureSwift Capital
Software Engineer
Lead the operation and AI enablement of multiple SaaS businesses, designing and automating cloud infrastructure, monitoring performance, and integrating machine‑learning pipelines using Python, AWS, Kubernetes and IaC tools.
About the role
Key Responsibilities
- Design, deploy, and maintain scalable cloud infrastructure for a portfolio of SaaS products using AWS, Kubernetes, Terraform and Ansible.
- Automate monitoring, alerting, and incident response workflows to ensure high availability and performance.
- Collaborate with data science teams to operationalize machine‑learning models, building CI/CD pipelines for AI workloads.
- Develop and maintain Python scripts and tooling for system automation, cost optimization, and security compliance.
- Provide technical guidance and mentorship to cross‑functional engineering teams, fostering best practices in DevOps and AI Ops.
Requirements
- 5+ years of experience in SysOps, DevOps, or Site Reliability Engineering, preferably in SaaS environments.
- Strong proficiency with Linux systems, AWS services, container orchestration (Kubernetes/Docker), and infrastructure‑as‑code tools (Terraform, Ansible).
- Hands‑on experience scripting in Python and automating AI/ML model deployment pipelines.
- Demonstrated ability to troubleshoot complex distributed systems and implement proactive monitoring solutions.
- Excellent communication skills and a collaborative mindset to work across product, engineering, and data science teams.
Skills
pythonlinuxawskubernetesterraformansibledocker