remote
AI Engineer - Zoom Communications
AI Engineer
Develop and scale AI-powered APIs and SDKs for speech, translation, OCR, and summarization, leveraging Python, deep‑learning frameworks, and cloud infrastructure.
About the role
Key Responsibilities
- Design, implement, and maintain high‑performance AI APIs and SDKs that expose speech recognition, machine translation, OCR, and summarization capabilities.
- Collaborate with AI research, platform engineering, and product teams to integrate cutting‑edge models into production services.
- Build scalable, cloud‑native infrastructure using AWS, Docker, and container orchestration to support global API consumption.
- Develop robust RESTful interfaces, authentication, rate‑limiting, and monitoring to ensure reliability and security for external developers.
- Optimize model inference latency and cost through quantization, batching, and efficient resource allocation.
Requirements
- Strong proficiency in Python and experience with deep‑learning frameworks such as TensorFlow or PyTorch.
- Hands‑on experience building and deploying AI services on AWS (e.g., SageMaker, Lambda, ECS/EKS).
- Proven ability to design RESTful APIs and SDKs, with a focus on performance, scalability, and developer experience.
- Familiarity with containerization (Docker) and orchestration (Kubernetes/EKS) for production workloads.
- Solid understanding of machine‑learning concepts, model optimization, and real‑time inference pipelines.
Skills
pythontensorflowpytorchawsdocker