onsite
Machine Learning Intern Singapore - TetraMem INC
ML Engineer
Machine Learning Intern focused on developing and optimizing Python and C++ tools for neural network compression, conversion, and deployment on analog compute‑in‑memory hardware, enhancing model efficiency through quantization, pruning, and graph optimization.
About the role
Key Responsibilities
- Develop, optimize, and maintain Python and C++ software for neural network model compression, conversion, deployment, and runtime environments.
- Design and implement tools to improve machine learning model efficiency, including quantization, pruning, graph optimization, and memory reduction techniques.
- Collaborate with hardware engineers to integrate software solutions with TetraMem’s analog compute‑in‑memory chips.
- Analyze model performance, identify bottlenecks, and propose architectural improvements.
- Document code, workflows, and best practices for internal use and future scaling.
Requirements
- Strong programming skills in Python and C++.
- Solid understanding of machine learning concepts and neural network architectures.
- Experience with model compression techniques such as quantization, pruning, and graph optimization.
- Familiarity with hardware-software co‑design and performance profiling.
- Excellent problem‑solving skills and ability to work independently in a fast‑paced environment.
Skills
pythoncmachine learning