1 day ago

Mathematics AI Evaluator

Mercor

Hybrid
Part Time
$151,840
Hybrid

Job Overview

Job TitleMathematics AI Evaluator
Job TypePart Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$151,840
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

This role is for a Mathematics AI Evaluator, offering full-time or part-time contract work.

Role Responsibilities

  • Write and refine prompts to guide model behavior in mathematical contexts.
  • Evaluate LLM-generated responses to mathematics-related queries for correctness, rigor, and logical coherence.
  • Verify mathematical claims, derivations, and proofs using domain expertise.
  • Conduct fact-checking using authoritative public sources and domain knowledge.
  • Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
  • Ensure model responses align with expected conversational behavior and system guidelines.

Qualifications

Must-Have
  • PhD in Mathematics or a closely related field.
  • Demonstrated experience in Probability & Statistics.
  • Significant experience using large language models (LLMs).
  • Excellent writing skills for explaining complex mathematical concepts.
  • Strong attention to detail with the ability to notice subtle issues.
  • Experience reviewing or editing technical or academic writing.
Preferred
  • Prior experience with RLHF, model evaluation, or data annotation work.
  • Experience teaching, mentoring, or explaining mathematical concepts to non-expert audiences.
  • Familiarity with evaluation rubrics, benchmarks, or structured review frameworks.

Application Process

The application process takes approximately 20–30 minutes to complete:

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

For details about the interview process and platform information, please check: talent.docs.mercor.com

For any help or support, reach out to: support@mercor.com

Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Key skills/competency

  • Large Language Models (LLMs)
  • Mathematics
  • Probability & Statistics
  • Prompt Engineering
  • AI Model Evaluation
  • Technical Writing
  • Fact-Checking
  • Logical Coherence
  • Data Annotation
  • Research & Analysis

Tags:

Mathematics AI Evaluator
prompt engineering
LLM evaluation
mathematical reasoning
fact-checking
data annotation
technical writing
proof verification
logical coherence
model refinement
AI
machine learning
natural language processing
prompt design
data science
deep learning
Python
TensorFlow
PyTorch
NLP libraries

Share Job:

How to Get Hired at Mercor

  • Research Mercor's mission: Study their focus on connecting top talent with AI research labs, understanding their impact and investor backing.
  • Tailor your resume: Customize your resume to prominently feature your PhD in Mathematics, significant LLM experience, and evaluation skills for AI-centric roles at Mercor.
  • Ace the AI interview: Prepare for Mercor's AI interview by practicing explaining complex mathematical concepts and demonstrating your analytical reasoning abilities.
  • Demonstrate domain expertise: Showcase deep knowledge in Probability & Statistics, mathematical verification, and technical writing pertinent to AI evaluation tasks.
  • Highlight attention to detail: Provide examples of your meticulous review processes and ability to identify subtle inaccuracies in technical or academic content.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background