onsite
Senior Machine Learning Engineer, Computer Vision - Uber
ML Engineer
Senior ML Engineer specializing in computer vision and multimodal models, building scalable document understanding and transcription systems using Python, PyTorch/TensorFlow, and cloud infrastructure.
About the role
Key Responsibilities
- Design, train, and fine‑tune state‑of‑the‑art vision and vision‑language models for document understanding, receipt transcription, and menu digitization.
- Develop end‑to‑end pipelines that integrate model inference into high‑throughput production services.
- Optimize model performance and cost on cloud platforms, leveraging AWS services, Docker containers, and orchestration tools.
- Collaborate with product, data, and infrastructure teams to define requirements and ensure reliable deployment at scale.
- Mentor junior engineers and contribute to best‑practice guidelines for computer‑vision development.
Requirements
- 5+ years of experience building and deploying computer‑vision or multimodal ML systems.
- Strong proficiency in Python and deep‑learning frameworks such as PyTorch or TensorFlow.
- Hands‑on experience with large‑scale model training, inference optimization, and cloud deployment (AWS, Docker, Kubernetes).
- Solid understanding of computer‑vision techniques, including object detection, OCR, and vision‑language transformers.
- Proven ability to translate research prototypes into production‑ready services.
Skills
pythonpytorchtensorflowcomputer visiondeep learningawsdocker