remote
Engineer II Systems HPC Engineer - Microchip Technology
Software Engineer
Senior HPC Engineer responsible for designing, deploying, and optimizing high‑performance computing clusters, leveraging MPI, GPU acceleration, and Linux environments to deliver scalable solutions for complex computational workloads.
About the role
Key Responsibilities
- Design, build, and maintain HPC clusters, ensuring high availability and scalability for scientific and engineering workloads.
- Implement and optimize MPI and GPU‑accelerated applications, tuning performance across multi‑node systems.
- Develop automation scripts in Python and Bash to streamline deployment, monitoring, and resource management.
- Collaborate with software and hardware teams to integrate new accelerators and storage solutions.
- Provide technical guidance and support to end users, troubleshooting performance bottlenecks and hardware issues.
Requirements
- 3+ years of experience in HPC system administration or software development.
- Solid programming skills in Python and C++ for performance analysis and automation.
- Experience with cluster management tools (e.g., Slurm, PBS) and performance profiling.
- Strong analytical, problem‑solving, and communication skills.