onsite

Machine Learning Engineer - Applied AI

TikTok is seeking a Machine Learning Engineer to deploy and optimize machine learning systems, focusing on evaluating existing ML lifecycles, productionizing model pipelines, and enhancing AI model predictive automation. This role involves model optimization, deployment, inference pipeline productionization, service deployment, tracking, and maintenance of scalable infrastructure.

About the role

About the team

The success of data business model hinges on the supply of a large volume of high quality labeled data that will grow exponentially as our business scales up. However, the current cost of data labeling is excessively high. The Data Solutions team is built to understand data strategically at scale for all Global Business Solution (GBS) business needs. Data Solutions Team uses quantitative and qualitative data to guide and uncover insights, turning our findings into real products to power exponential growth. Data Solutions Team responsibility includes infrastructure construction, recognition capabilities management, global labeling delivery management.

We are looking for a highly capable machine learning engineer to deploy and optimise our machine learning systems. You will be evaluating existing machine learning (ML) lifecycle, understanding and productionizing the model pipeline, and enhancing and maintaining the performance of our AI model's predictive automation capabilities.

Responsibilities

What you will do

Model optimisation: collaborate with data scientists to improve existing machine learning model training and evaluation pipelines, optimize the model training pipeline speed for faster iteration;
Model Deployment: optimize the model inferencing performance through quantization and model conversion, define and leverage appropriate resources for model hosting and inferencing;
Inference Pipeline Productionisation: work with data scientists and data engineers to design and implement the data pipelines for machine learning models that will support the current and future needs of our business;
Service Deployment: build continuous integration, testing, and scalable deployment pipelines in cloud computing environments for machine learning services;
Tracking: build logging, tracking, analyzing, monitoring and reporting pipelines for both data and model tracking in cloud computing environments to ensure correct model output and model performance;
Maintenance: build scalable and reliable infrastructure that supports feature engineering, model training, deployment, inferencing, performance monitoring.

Requirements

What you will need

Ability to understand the business use case to optimise and implement scalable solution;
Knowledge of machine learning concepts and fundamentals; deep learning proficiency in at least one of CV and NLP, with solid experience in model training/inferencing optimization such as quantization and conversion;
Solid programming skills with experience writing and maintaining high-quality production code;
Experience in ML pipeline, model training orchestration; large-scale/distributed training experience is desirable;
Ability to work independently and complete projects from beginning to end and in a timely manner;
Great communication skills, both written and oral; comfortable presenting findings and recommendations to non-technical audiences.

Qualifications

BS or above in Computer Science, Software Engineering, Data Science or a related field;
3+ years of industry experience building ML infrastructure at scale; At least 1 year of experience in developing and deploying large-scale systems, version control, scaling and monitoring;
Experience in machine learning frameworks (scikit-learn, Tensorflow, Pytorch), big data frameworks (Spark/Hadoop/Flink) and experience in resource management and task scheduling for large scale distributed systems;
Proficient in Python/SQL and one of C++/Go, with deep knowledge of Linux and CD tools (e.g. git); experience with any Go/Python microservice framework is highly desirable;
Familiar with cloud infrastructure, good understanding of different data storages and message queues for data streaming and pipelining;

About the role

About the team

Responsibilities

What you will do

Model optimisation: collaborate with data scientists to improve existing machine learning model training and evaluation pipelines, optimize the model training pipeline speed for faster iteration;
Model Deployment: optimize the model inferencing performance through quantization and model conversion, define and leverage appropriate resources for model hosting and inferencing;
Inference Pipeline Productionisation: work with data scientists and data engineers to design and implement the data pipelines for machine learning models that will support the current and future needs of our business;
Service Deployment: build continuous integration, testing, and scalable deployment pipelines in cloud computing environments for machine learning services;
Tracking: build logging, tracking, analyzing, monitoring and reporting pipelines for both data and model tracking in cloud computing environments to ensure correct model output and model performance;
Maintenance: build scalable and reliable infrastructure that supports feature engineering, model training, deployment, inferencing, performance monitoring.

Requirements

What you will need

Ability to understand the business use case to optimise and implement scalable solution;
Knowledge of machine learning concepts and fundamentals; deep learning proficiency in at least one of CV and NLP, with solid experience in model training/inferencing optimization such as quantization and conversion;
Solid programming skills with experience writing and maintaining high-quality production code;
Experience in ML pipeline, model training orchestration; large-scale/distributed training experience is desirable;
Ability to work independently and complete projects from beginning to end and in a timely manner;
Great communication skills, both written and oral; comfortable presenting findings and recommendations to non-technical audiences.

Qualifications

BS or above in Computer Science, Software Engineering, Data Science or a related field;
3+ years of industry experience building ML infrastructure at scale; At least 1 year of experience in developing and deploying large-scale systems, version control, scaling and monitoring;
Experience in machine learning frameworks (scikit-learn, Tensorflow, Pytorch), big data frameworks (Spark/Hadoop/Flink) and experience in resource management and task scheduling for large scale distributed systems;
Proficient in Python/SQL and one of C++/Go, with deep knowledge of Linux and CD tools (e.g. git); experience with any Go/Python microservice framework is highly desirable;
Familiar with cloud infrastructure, good understanding of different data storages and message queues for data streaming and pipelining;

Machine Learning Engineer - Applied AI

About the role

About the team

Responsibilities

Requirements

Qualifications

Machine Learning Engineer - Applied AI

About the role

About the team

Responsibilities

Requirements

Qualifications

Skills