Remote
HPC & Supercomputing Systems Engineer - Research - StaffRight Associates, LLC
Systems Engineer
Lead the design, deployment, and optimization of high‑performance computing environments for drug discovery research, using GPU clusters, MPI, and Slurm to accelerate large‑scale simulations and data analytics.
About the role
Key Responsibilities
- Architect and maintain scalable HPC clusters, integrating GPU nodes, high‑speed interconnects, and storage solutions to support large‑scale drug discovery workloads.
- Develop and optimize MPI and GPU‑accelerated code, ensuring efficient parallel execution and minimal bottlenecks across multi‑node systems.
- Configure and manage Slurm or equivalent workload managers, tailoring job scheduling policies to maximize resource utilization and reduce queue times.
- Collaborate with computational scientists to translate research requirements into HPC infrastructure specifications and performance benchmarks.
- Implement monitoring, logging, and security best practices, ensuring compliance with institutional data governance and cybersecurity standards.
Requirements
- Advanced degree (Ph.D. or Master’s) in Computer Science, Computational Science, or a related field, with strong HPC experience.
- Proven expertise in HPC system administration, GPU programming (CUDA/CUDA‑C++), and MPI parallelism.
- Hands‑on experience with Slurm, PBS, or similar workload managers and performance profiling tools.
- Strong scripting skills in Python and Bash, with the ability to automate deployment and monitoring tasks.
- Excellent problem‑solving skills, the ability to work independently, and a passion for advancing computational methods in drug discovery.