About the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .
Position: AI Safety Experts — English & Gujarati Type: Contract Compensation: $20–$22/hour Location: Remote
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document reproducibly by producing reports, datasets, and attack cases for customer action.
- Work independently and asynchronously to meet deadlines while improving AI model performance .
Qualifications
Must-Have
- Fluent in English and Gujarati .
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to technical and non-technical stakeholders.
Preferred
- Experience with Adversarial ML : jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
- Background in Cybersecurity : penetration testing, exploit development, reverse engineering.
- Knowledge of Socio-technical risk : harassment/disinfo probing, abuse analysis, conversational AI testing.
Application Process (Takes 20–30 mins to complete)
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
- For details about the interview process and platform information, please check: https://talent.docs. mercor .com/welcome
- For any help or support, reach out to: support@ mercor .com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Originally posted on Himalayas