remote
Machine Learning Systems Engineer, Ads ML Platform - reddit
ML Engineer
Design and operate large‑scale machine‑learning infrastructure for Reddit's advertising platform, building performant pipelines, serving models, and ensuring reliability across cloud environments.
About the role
Key Responsibilities
- Develop and maintain high‑throughput data pipelines that feed real‑time features into ad‑targeting models.
- Design, implement, and operate model serving systems that meet low‑latency and high‑availability requirements.
- Collaborate with data scientists to translate research prototypes into production‑ready services.
- Build monitoring, alerting, and automated remediation tools to ensure platform stability.
- Optimize resource utilization and cost on cloud infrastructure using Kubernetes and AWS services.
Requirements
- Strong programming experience in Python and C++ for building performance‑critical components.
- Hands‑on experience with machine‑learning frameworks such as TensorFlow or PyTorch.
- Proficiency in container orchestration (Kubernetes) and cloud platforms (AWS).
- Solid understanding of distributed systems, networking, and data processing at scale.
- Ability to work independently in a remote setting and communicate effectively with cross‑functional teams.
Skills
pythonctensorflowkubernetesaws