Remote
Senior Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems - SK hynix memory solutions
AI Engineer
Lead the design and deployment of on‑premise AI infrastructure and agentic systems, leveraging Python, C++, CUDA, and container orchestration to deliver high‑performance, scalable solutions for next‑generation memory technologies.
About the role
Key Responsibilities
- Architect and implement on‑premise AI compute platforms that support large‑scale model training and inference.
- Design, develop, and optimize high‑performance kernels using CUDA and C++ for memory‑intensive workloads.
- Build and maintain containerized environments with Docker and Kubernetes to ensure reproducibility and scalability.
- Collaborate with hardware and firmware teams to integrate AI workloads tightly with memory subsystems.
- Lead the creation of agentic AI systems, including reinforcement‑learning pipelines and autonomous decision‑making modules.
Requirements
- 10+ years of experience in AI/ML engineering, with a strong focus on systems‑level programming.
- Expertise in Python and C++ development, GPU programming (CUDA), and Linux system administration.
- Proven track record designing and operating containerized platforms at scale, with Docker for packaging and Kubernetes for orchestration.
- Deep understanding of distributed systems, networking, and performance optimization for AI workloads.
- Experience building or integrating agentic or reinforcement‑learning systems in production environments.
Skills
Python, C, CUDA, Kubernetes, Docker, Linux, machine learning