About The Job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
Benchmark,
General Catalyst,
Peter Thiel,
Adam D'Angelo,
Larry Summers, and
Jack Dorsey.
Position: AI Model Evaluation Specialist
Type:Contract
Compensation:$25$35/hour
Commitment:20 hours/week
Role Responsibilities
- Write realistic prompts reflecting professional and consumer domain-specific guidance.
- Evaluate AI-generated responses for factual accuracy and practical usefulness.
- Identify fabricated claims and misleading reasoning in model outputs.
- Score and rank model responses using structured rubrics.
- Provide written justifications with specific evidence for evaluations.
Qualifications
Must-Have
- Professional experience applying domain expertise in a practitioner or advisory capacity.
- Familiarity with industry-specific standards, regulations, or clinical guidelines.
- Strong written communication and critical reasoning skills.
Application Process (Takes 2030 mins to complete)
- Submit your resume to begin.
- Complete the Model Response Evaluation assessment.
Resources & Support
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
- For any help or support, reach out to: [Confidential Information]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
,