Mathematics AI Evaluator
Mercor
Job Overview
Job Description
About The Job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
This is a full-time or part-time contract position for a Mathematics AI Evaluator.
Role Responsibilities for a Mathematics AI Evaluator
- Write and refine prompts to guide model behavior in mathematical contexts.
- Evaluate LLM-generated responses to mathematics-related queries for correctness, rigor, and logical coherence.
- Verify mathematical claims, derivations, and proofs using domain expertise.
- Conduct fact-checking using authoritative public sources and domain knowledge.
- Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
- Ensure model responses align with expected conversational behavior and system guidelines.
Qualifications
Must-Have
- PhD in Mathematics or a closely related field.
- Demonstrated experience in Probability & Statistics.
- Significant experience using large language models (LLMs).
- Excellent writing skills for explaining complex mathematical concepts.
- Strong attention to detail with the ability to notice subtle issues.
- Experience reviewing or editing technical or academic writing.
Preferred
- Prior experience with RLHF, model evaluation, or data annotation work.
- Experience teaching, mentoring, or explaining mathematical concepts to non-expert audiences.
- Familiarity with evaluation rubrics, benchmarks, or structured review frameworks.
Application Process
The application process takes approximately 20-30 minutes to complete and includes:
- Uploading your resume
- An AI interview based on your resume
- Submitting the application form
Resources & Support
For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: support@mercor.com
Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Key skills and competencies
- Mathematics
- AI Evaluation
- Large Language Models (LLMs)
- Prompt Engineering
- Probability & Statistics
- Mathematical Verification
- Fact-checking
- Data Annotation
- Technical Writing
- Logical Coherence
How to Get Hired at Mercor
- Research Mercor's mission: Review the company's values, recent news, and employee testimonials on LinkedIn and Glassdoor to understand its role in AI.
- Tailor your resume for AI evaluation: Highlight your PhD in Mathematics, significant LLM experience, and strong technical writing skills to align with Mercor's needs.
- Master the AI interview: Practice explaining complex mathematical concepts clearly and concisely ahead of Mercor's AI-driven interview.
- Showcase advanced domain expertise: Emphasize your background in probability and statistics, along with experience in rigorous mathematical verification and fact-checking.
- Demonstrate communication and review skills: Prepare to provide examples of your excellent writing, technical editing, and ability to give structured feedback on complex content.