remote

Staff Machine Learning Engineer, ML Efficiency - reddit

ML Engineer

Lead the design and deployment of scalable ML efficiency solutions using Python, PyTorch, and AWS, driving performance improvements across large‑scale recommendation and content ranking systems.

About the role

Key Responsibilities

Architect and implement distributed ML pipelines that reduce inference latency and resource consumption for high‑traffic recommendation models.
Collaborate with data scientists and platform engineers to optimize model training, serving, and monitoring on AWS infrastructure.
Develop reusable tooling and libraries for model compression, quantization, and automated hyper‑parameter tuning.
Conduct rigorous performance benchmarking and A/B testing to validate efficiency gains in production.
Mentor junior engineers and contribute to best‑practice documentation for ML operations.

Requirements

5+ years of production ML engineering experience with Python and PyTorch/TensorFlow.
Strong background in distributed systems, cloud services (AWS), and performance optimization.
Proven track record of deploying large‑scale ML models that handle millions of requests per day.
Excellent problem‑solving skills and ability to translate business goals into technical solutions.
Effective communicator who thrives in a fast‑moving, collaborative environment.

Skills

pythonmachine learningawspytorch

Companyreddit

DepartmentResearch

LocationLondon, ENG, United Kingdom

Experience7+ years

Tenurefull-time

LevelLead

Posted June 20, 2026