onsite

Machine Learning Intern Singapore - TetraMem INC

ML Engineer

Machine Learning Intern focused on developing and optimizing Python and C++ tools for neural network compression, conversion, and deployment on analog compute‑in‑memory hardware, enhancing model efficiency through quantization, pruning, and graph optimization.

About the role

Key Responsibilities

Develop, optimize, and maintain Python and C++ software for neural network model compression, conversion, deployment, and runtime environments.
Design and implement tools to improve machine learning model efficiency, including quantization, pruning, graph optimization, and memory reduction techniques.
Collaborate with hardware engineers to integrate software solutions with TetraMem’s analog compute‑in‑memory chips.
Analyze model performance, identify bottlenecks, and propose architectural improvements.
Document code, workflows, and best practices for internal use and future scaling.

Requirements

Strong programming skills in Python and C++.
Solid understanding of machine learning concepts and neural network architectures.
Experience with model compression techniques such as quantization, pruning, and graph optimization.
Familiarity with hardware-software co‑design and performance profiling.
Excellent problem‑solving skills and ability to work independently in a fast‑paced environment.

Skills

pythoncmachine learning

CompanyTetraMem INC

DepartmentResearch

LocationSan Jose, CA, United States

Experience1+ years

Tenurefull-time

LevelEntry-Level

Posted June 20, 2026