remote
SRE / DevOps Engineer - CopeCart GmbH
Site Reliability Engineer
Senior SRE/DevOps Engineer responsible for building and maintaining scalable, highly available infrastructure using Kubernetes, Docker, Terraform, and AWS. Drives CI/CD pipelines, automates deployments, and ensures robust monitoring and incident response for a digital product platform.
About the role
Key Responsibilities
- Design, implement, and manage Kubernetes clusters and containerized workloads across AWS environments.
- Build and maintain CI/CD pipelines with GitHub Actions, Jenkins, or similar tools to enable rapid, reliable deployments.
- Automate infrastructure provisioning and configuration using Terraform, Ansible, or equivalent IaC tools.
- Implement comprehensive monitoring, logging, and alerting with Prometheus, Grafana, ELK stack, or similar solutions.
- Lead incident response, post‑mortem analysis, and continuous improvement of reliability and performance.
- Collaborate with product, security, and support teams to ensure compliance, scalability, and high availability.
Requirements
- 5+ years of experience in SRE or DevOps roles, with a strong focus on cloud-native technologies.
- Proficient in Kubernetes, Docker, and AWS services (EKS, EC2, S3, RDS).
- Hands‑on experience with Terraform, Ansible, or similar IaC tools.
- Solid understanding of CI/CD principles and experience with GitHub Actions, Jenkins, or GitLab CI.
- Strong scripting skills (Python, Bash) and familiarity with monitoring/alerting tools.
Skills
kubernetescicdawsdockerterraform