About the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .
Position: Code-Data Eval Author — Software Test Engineer / SDET Type: Contract Compensation: $30–$100/hour Location: Remote Commitment: 30+ hours/week
Role Responsibilities
- Design verifiers and correctness rubrics for coding tasks to ensure AI agent code functionality.
- Enumerate edge cases and build adversarial test cases for comprehensive agent/model evaluation.
- Grade agent trajectories and improve test/rubric quality through detailed review.
- Work independently and asynchronously to meet deadlines while enhancing AI model performance .
- Collaborate with subject matter experts to ensure test consistency and relevance.
Qualifications
Must-Have
- 5+ years as an SDET / software test engineer at a real product organization.
- Proficiency in writing code and tests using automation frameworks like pytest , Playwright , Cypress .
- Experience with CI/CD processes; SDET preferred over manual-only QA.
- Clear written communication skills.
Preferred
- Familiarity with AI tools and evaluations.
Interview Process
- Mercor Technical Screen : Paid $200 for completing all three steps.
- Live Code Review Session .
- Domain Expert Interview.
Compensation & Legal
- Hourly contractor .
- Paid weekly via Stripe Connect .
Application Process (Takes 20–30 mins to complete)
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
- For details about the interview process and platform information, please check: https://talent.docs. mercor .com/welcome
- For any help or support, reach out to: support@ mercor .com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Originally posted on Himalayas