Posted 27 days ago

Mathematics AI Evaluator

Mercor

Hybrid
Part Time
$151,840

Job Overview

Job Title: Mathematics AI Evaluator
Job Type: Part Time
Offered Salary: $151,840
Location: Hybrid

Job Description

About Mercor

Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Mathematics AI Evaluator

This role is available as full-time or part-time contract work and pays $73/hour. We are looking for candidates located in the USA, the UK, Canada, or the EU.

Role Responsibilities

  • Write and refine prompts to guide model behavior in mathematical contexts.
  • Evaluate LLM-generated responses to mathematics-related queries for correctness, rigor, and logical coherence.
  • Verify mathematical claims, derivations, and proofs using domain expertise.
  • Conduct fact-checking using authoritative public sources and domain knowledge.
  • Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
  • Ensure model responses align with expected conversational behavior and system guidelines.

Qualifications

Must-Have
  • PhD in Mathematics or a closely related field.
  • Demonstrated experience in Probability & Statistics.
  • Significant experience using large language models (LLMs).
  • Excellent writing skills for explaining complex mathematical concepts.
  • Strong attention to detail with the ability to notice subtle issues.
  • Experience reviewing or editing technical or academic writing.

Preferred
  • Prior experience with RLHF, model evaluation, or data annotation work.
  • Experience teaching, mentoring, or explaining mathematical concepts to non-expert audiences.
  • Familiarity with evaluation rubrics, benchmarks, or structured review frameworks.

Application Process (takes 20–30 minutes to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Key skills/competencies

  • Mathematical Reasoning
  • Large Language Models (LLMs)
  • Prompt Engineering
  • Content Evaluation
  • Proof Verification
  • Fact-Checking
  • Technical Writing
  • Probability & Statistics
  • Data Annotation
  • Academic Review

How to Get Hired at Mercor

  • Research Mercor's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor, focusing on their AI research initiatives.
  • Tailor your resume: Highlight your PhD in Mathematics, experience with LLMs, and strong writing skills. Use keywords like "mathematical evaluation," "prompt engineering," and "AI model assessment."
  • Excel in the AI interview: Prepare to discuss your expertise in Probability & Statistics and your ability to explain complex mathematical concepts clearly. Practice articulating your experience with AI evaluation or data annotation.
  • Showcase your mathematical rigor: Be ready to demonstrate your ability to verify mathematical claims and proofs, emphasizing your attention to detail and logical coherence.
  • Highlight communication skills: Emphasize your experience in technical or academic writing and your capacity to annotate and provide constructive feedback on complex AI-generated content.
