remote
Staff Machine Learning Engineer, ML Efficiency - reddit
ML Engineer
Lead the design and deployment of scalable ML efficiency solutions using Python, PyTorch, and AWS, driving performance improvements across large‑scale recommendation and content ranking systems.
About the role
Key Responsibilities
- Architect and implement distributed ML pipelines that reduce inference latency and resource consumption for high‑traffic recommendation models.
- Collaborate with data scientists and platform engineers to optimize model training, serving, and monitoring on AWS infrastructure.
- Develop reusable tooling and libraries for model compression, quantization, and automated hyper‑parameter tuning.
- Conduct rigorous performance benchmarking and A/B testing to validate efficiency gains in production.
- Mentor junior engineers and contribute to best‑practice documentation for ML operations.
Requirements
- 5+ years of production ML engineering experience with Python and PyTorch/TensorFlow.
- Strong background in distributed systems, cloud services (AWS), and performance optimization.
- Proven track record of deploying large‑scale ML models that handle millions of requests per day.
- Excellent problem‑solving skills and ability to translate business goals into technical solutions.
- Effective communicator who thrives in a fast‑moving, collaborative environment.
Skills
pythonmachine learningawspytorch