remote
System Infrastructure / Platform Engineer - Bay Systems Consulting Inc.
DevOps Engineer
Design, develop, and maintain scalable platform services that integrate scientific workflows with supercomputing resources, using Python, C++, Linux, and container orchestration technologies.
About the role
Key Responsibilities
- Develop and maintain APIs and backend services that connect scientific workflow tools to high‑performance computing (HPC) clusters.
- Design, implement, and optimize containerized environments using Docker and Kubernetes for reproducible, scalable execution.
- Collaborate with researchers and software teams to translate scientific requirements into robust, production‑grade infrastructure.
- Automate deployment, monitoring, and troubleshooting of Linux‑based systems across on‑premises and cloud HPC resources.
- Uphold code quality, version control, and documentation standards using Git and CI/CD pipelines.
Requirements
- 3+ years of experience in software or platform engineering, preferably with scientific or HPC workloads.
- Proficiency in Python and C++ for building performance‑critical services.
- Strong background in Linux system administration and HPC job schedulers (e.g., Slurm, PBS).
- Hands‑on experience with container technologies (Docker) and orchestration platforms (Kubernetes).
- Familiarity with RESTful API design, Git workflows, and automated testing/CI pipelines.
Skills
python, c, linux, docker, kubernetes