onsite
Software Integration Engineer HPC
Implementation Engineer
Lead integration of HPC software stacks, optimizing performance and reliability for mission‑critical national security applications using Python, C++, MPI, and GPU technologies on Linux clusters.
About the role
Key Responsibilities
- Design, develop, and integrate HPC software components, ensuring compatibility across distributed computing environments.
- Optimize code for performance, scalability, and memory efficiency on GPU‑enabled clusters.
- Collaborate with system architects to implement MPI and other parallel frameworks.
- Maintain and troubleshoot production HPC workloads, providing root‑cause analysis and performance tuning.
- Document integration procedures, test plans, and deployment scripts for reproducibility.
Requirements
- Strong experience with Python and C++ in a high‑performance computing context.
- Proficiency in MPI, OpenMP, or similar parallel programming models.
- Hands‑on expertise with Linux cluster administration and GPU programming (CUDA/CUPTI).
- Solid understanding of performance profiling, debugging, and optimization techniques.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced, mission‑critical environment.