remote
Staff Software Engineer, AI/ML - DigitalOcean
Software Engineer
Lead the design and delivery of scalable AI/ML services on cloud infrastructure, driving innovation in data pipelines, model deployment, and performance optimization for a high‑growth SaaS platform.
About the role
Key Responsibilities
- Architect and implement end‑to‑end AI/ML pipelines, from data ingestion to model serving, using Python and AWS services.
- Design scalable, containerized microservices with Kubernetes and Docker, ensuring high availability and fault tolerance.
- Collaborate with cross‑functional teams to define product requirements, translate them into technical specifications, and deliver production‑ready solutions.
- Mentor and guide junior engineers, fostering a culture of code quality, continuous integration, and automated testing.
- Optimize model performance and cost efficiency through profiling, hyperparameter tuning, and resource allocation strategies.
Requirements
- 10+ years of software engineering experience with a strong focus on AI/ML.
- Proficiency in Python, machine learning frameworks (TensorFlow, PyTorch), and cloud-native deployment on AWS.
- Hands‑on experience with Kubernetes, Docker, and CI/CD pipelines.
- Excellent problem‑solving skills and a track record of delivering complex, high‑impact projects.
- Strong communication skills and ability to influence stakeholders across technical and non‑technical domains.
Skills
pythonmachine learningawskubernetesdocker