remoteonsite
Senior Forward Deployed Engineer I AI/ML - DigitalOcean
Software Engineer
Senior engineer focused on operationalizing AI/ML workloads at scale, building robust pipelines, and integrating models into production cloud environments using Python, Kubernetes, Docker, and major cloud services.
About the role
Key Responsibilities
- Design, develop, and deploy end‑to‑end AI/ML pipelines that run reliably at scale on cloud infrastructure.
- Collaborate with product, data science, and SRE teams to translate research prototypes into production‑ready services.
- Implement containerized solutions using Docker and orchestrate them with Kubernetes, ensuring high availability and performance.
- Automate CI/CD workflows for model training, validation, and rollout, incorporating testing and monitoring best practices.
- Optimize resource utilization and cost on AWS (or equivalent cloud platforms) while maintaining security and compliance.
Requirements
- 5+ years of software engineering experience with strong proficiency in Python.
- Hands‑on experience building and operating AI/ML workloads in production, using frameworks such as TensorFlow or PyTorch.
- Deep knowledge of containerization (Docker) and orchestration (Kubernetes) in cloud environments.
- Proven ability to create automated CI/CD pipelines and monitor deployed models.
- Excellent problem‑solving skills, ability to work cross‑functionally, and a growth‑mindset oriented toward continuous improvement.
Skills
pythonkubernetesdockertensorflowpytorchawscicd