Software Engineer
Freelance specialist designing AI evaluation frameworks, crafting prompts and rubrics, and generating high‑quality scientific training data for model benchmarking, leveraging expertise in scientific writing, data annotation, and Python‑based tooling.
Project Overview
We are sourcing independent Scientific and Technical Service Specialists to provide their expertise for an AI benchmark evaluation project. As AI models increasingly generate professional-grade scientific analyses, technical reports, and STEM-focused deliverables, their accuracy relies entirely on robust, expert-crafted training data. The objective of this project is to autonomously produce high-quality evaluation tasks, strong prompts, and clear, well-structured rubrics that generate clean, reliable data for model training.
Project Deliverables & Scope
Operate autonomously to design complex evaluation frameworks and provide structured training data. Expected deliverables include:
Required Expertise
To successfully fulfill the deliverables of this project, Contractors must possess deep industry knowledge to craft realistic professional scenarios.
Core skillset includes:
We offer a pay range of $10-to-$30 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply.
Engagement Type: Freelance / Independent Contractor
Workplace Type: Remote
Originally posted on Himalayas
Posted June 26, 2026