Software Engineer - MLOps
Phare Health (R1 RCM, R37 Lab) is seeking a Software Engineer - MLOps to own the production runtime for their ML stack, deploying, serving, and scaling models across inference endpoints and workflows. The role involves building progressive delivery pipelines, managing SLOs, and instrumenting end-to-end observability, utilizing technologies like Terraform, Kubernetes, and CI/CD. This hybrid role in NYC requires a minimum of 5 years of software engineering experience with at least 2 years in ML Ops.
Phare Health, now part of R1 and its AI innovation engine, R37 Lab, is building the first AI-native Healthcare Revenue Operating System. This connected platform leverages frontier clinical reasoning technology to automate medical coding, billing, and follow-up by reasoning over full medical records, payer logic, and financial workflows. Our agentic AI systems are already powering production workflows across 95 of the top 100 U.S. health systems, processing hundreds of millions of patient encounters annually.
This role offers startup-level ownership with enterprise-level impact, focusing on building AI that ships, scales, and measurably improves healthcare operations.
As a Software Engineer - MLOps, you will own the production runtime for Phare’s ML stack. This includes deploying, serving, and scaling models across inference endpoints and batch/streaming workflows. You will be responsible for building progressive delivery pipelines with automated rollouts and rollbacks, managing SLOs for latency and availability, and instrumenting end-to-end observability (metrics, logs, traces, drift, regression). Furthermore, you will harden the platform using Terraform, Kubernetes, and CI/CD to ensure reproducible and auditable ML releases.
We are hiring across several seniority levels, from Mid-level up to Staff, and expect candidates to have at least 5 years of software engineering experience with a minimum of 2 years in ML Ops.
This is an in-person role in NYC, requiring at least 3 days in the SoHo office.
You have a strong background in operating ML systems at scale, where uptime and feedback loops are as crucial as model accuracy. Your experience should include:
We are looking for candidates at various levels:
Posted June 11, 2026