onsite
Senior DevOps / Platform Engineer - AI-Driven Infrastructure & Scaling - PALIGO GmbH
DevOps Engineer
Lead the design and operation of scalable, AI‑enabled cloud infrastructure, driving automation, reliability, and performance across multi‑cloud environments.
About the role
Key Responsibilities
- Architect, deploy, and maintain highly available Kubernetes clusters on AWS, ensuring seamless integration with AI workloads.
- Design and implement end‑to‑end CI/CD pipelines using GitOps principles, Terraform, and automated testing to accelerate feature delivery.
- Collaborate with data science and ML teams to optimize resource allocation, cost, and latency for large‑scale inference pipelines.
- Monitor system health, troubleshoot incidents, and conduct post‑mortem analyses to continuously improve reliability.
- Drive security best practices, including IAM, network segmentation, and compliance audits across the platform.
Requirements
- 5+ years of professional DevOps experience, with a strong focus on Kubernetes and cloud-native technologies.
- Proficient in AWS services (EKS, EC2, S3, RDS) and infrastructure as code tools such as Terraform.
- Hands‑on experience with CI/CD tooling (GitHub Actions, ArgoCD, Jenkins) and container orchestration.
- Solid scripting skills in Python or Bash, and familiarity with monitoring/observability stacks (Prometheus, Grafana, ELK).
- Excellent problem‑solving abilities, strong communication skills, and a proactive, collaborative mindset.
Skills
Kubernetes, AWS, Terraform, CI/CD