12 days ago

Mathematics AI Evaluator

Mercor

Hybrid
Part Time
$151,840
Hybrid

Job Overview

Job TitleMathematics AI Evaluator
Job TypePart Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$151,840
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Mathematics AI Evaluator

This role is available as full-time or part-time contract work with competitive hourly compensation.

Role Responsibilities

  • Write and refine prompts to guide model behavior in mathematical contexts.
  • Evaluate LLM-generated responses to mathematics-related queries for correctness, rigor, and logical coherence.
  • Verify mathematical claims, derivations, and proofs using domain expertise.
  • Conduct fact-checking using authoritative public sources and domain knowledge.
  • Annotate model responses by identifying strengths, areas of improvement, and inaccuracies.
  • Ensure model responses align with expected conversational behavior and system guidelines.

Qualifications

Must-Have:

  • PhD in Mathematics or a closely related field.
  • Demonstrated experience in Probability & Statistics.
  • Significant experience using large language models (LLMs).
  • Excellent writing skills for explaining complex mathematical concepts.
  • Strong attention to detail to notice subtle issues.
  • Experience reviewing or editing technical or academic writing.

Preferred:

  • Prior experience with RLHF, model evaluation, or data annotation work.
  • Experience teaching or mentoring mathematical concepts to non-experts.
  • Familiarity with evaluation rubrics, benchmarks, or structured review frameworks.

Application Process

Upload resume, complete an AI interview based on your resume, and submit a form. The process takes 20–30 minutes.

Resources & Support

For interview process details, visit this link. For help or support, email support@mercor.com.

Key skills/competency

  • Mathematics
  • LLM Evaluation
  • Probability
  • Statistics
  • Prompt Engineering
  • Proof Verification
  • Fact-checking
  • Technical Writing
  • Attention to Detail
  • AI Models

Tags:

Mathematics AI Evaluator
mathematics
LLM
evaluation
prompt writing
proof verification
Probability
statistics
technical writing
fact-checking
AI models
review

Share Job:

How to Get Hired at Mercor

  • Research Mercor's culture: Understand mission, investor background, and AI focus.
  • Customize your resume: Highlight PhD and math expertise.
  • Prepare examples: Showcase prompt writing and evaluation skills.
  • Practice interviews: Focus on technical and academic experiences.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background