onsite
Computer Scientist - Open Science & Analysis Facilities
Software Engineer
Lead the design and automation of cloud‑native, containerized data lake solutions for open science, leveraging Python and CVMFS to streamline scientific workflows and infrastructure.
About the role
Key Responsibilities
- Architect and implement cloud‑native, containerized pipelines for large‑scale scientific data lakes.
- Develop and maintain automation scripts using Python to manage CVMFS repositories and data ingestion workflows.
- Collaborate with research teams to translate scientific requirements into scalable, reproducible infrastructure.
- Optimize performance and reliability of data storage, retrieval, and processing across distributed environments.
- Document architecture, best practices, and operational procedures for internal and external stakeholders.
Requirements
- Strong experience with Python, container orchestration (Docker, Kubernetes), and cloud native technologies.
- Hands‑on knowledge of CVMFS, data lake architectures, and big data processing frameworks.
- Proficiency in automation, CI/CD pipelines, and infrastructure as code.
- Excellent problem‑solving skills and ability to work collaboratively in a research‑driven environment.