remote
Technical Support Engineer, Tavily - Nebius
Software Engineer
Technical Support Engineer responsible for diagnosing and resolving complex cloud infrastructure issues for an AI platform, leveraging Python, Node.js, AWS, and DevOps practices to ensure high availability and performance.
About the role
Key Responsibilities
- Provide tier‑2/3 support for cloud‑based AI services, troubleshooting performance, connectivity, and configuration issues.
- Collaborate with engineering teams to reproduce, analyze, and resolve bugs in production environments.
- Automate monitoring and alerting workflows using Python scripts and AWS CloudWatch.
- Document root causes, workarounds, and best‑practice guides for internal and external stakeholders.
- Participate in on‑call rotations and incident post‑mortems to improve system resilience.
Requirements
- 3+ years of experience supporting cloud infrastructure and AI workloads.
- Experience with DevOps tools (Docker, Kubernetes, Terraform, CI/CD pipelines).
- Excellent problem‑solving skills and ability to communicate complex technical concepts clearly.