onsite
AI Infrastructure Engineer - Wolfel Group
Devops Engineer
Design, build, and maintain scalable AI infrastructure on AWS, leveraging Kubernetes, Docker, and Terraform to support machine learning workloads. Drive automation, performance tuning, and secure deployment pipelines for high‑availability AI services.
About the role
Key Responsibilities
- Architect and deploy AI workloads on AWS using Kubernetes, Docker, and Terraform, ensuring scalability and reliability.
- Implement CI/CD pipelines for model training, testing, and production deployment, integrating with Git, Jenkins, or GitHub Actions.
- Monitor and optimize resource utilization, cost, and performance of AI services, applying autoscaling and spot‑instance strategies.
- Collaborate with data scientists and ML engineers to translate model requirements into infrastructure solutions.
- Maintain security best practices, including IAM, network segmentation, and encryption for data at rest and in transit.
Requirements
- 3+ years of experience building production AI or ML infrastructure on AWS.
- Proficiency in Python, Kubernetes, Docker, and Terraform.
- Strong understanding of CI/CD, monitoring, and observability tools.
- Experience with ML frameworks (TensorFlow, PyTorch) and model serving (SageMaker, TorchServe) is a plus.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
pythonawskubernetesdockerterraformmachine learningcicd