AI Red-Teamer
Mercor
Job Overview
Job Description
About Mercor
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: AI Red-Teamer
This role can be full-time or part-time, with compensation ranging from $50–$111/hour. It is remote-friendly but restricted to US, UK, and Canada time zones.
Role Responsibilities
- Red-team AI models and agents through jailbreaks, prompt injections, misuse cases, and exploits.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document findings reproducibly, producing reports, datasets, and attack cases that customers can act on.
- Flex across projects to support different customers, from LLM jailbreaks to socio-technical abuse testing.
Qualifications
Must-Have
- Prior red-teaming experience in AI adversarial work, cybersecurity, or socio-technical probing.
- Curiosity and adversarial instinct to push systems to breaking points.
- Structured approach using frameworks or benchmarks.
- Strong communication skills to explain risks to technical and non-technical stakeholders.
- Adaptability to thrive across various projects and customers.
Preferred
- Experience with Adversarial ML, including jailbreak datasets, prompt injection, RLHF/DPO attacks, and model extraction.
- Cybersecurity skills in penetration testing, exploit development, and reverse engineering.
- Understanding of socio-technical risk, including harassment/disinfo probing and abuse analysis.
- Creative probing skills drawn from psychology, acting, or writing that support unconventional adversarial thinking.
Compensation & Legal
This is an hourly contractor position. Compensation varies by project, customer, and content category.
Application Process
The application process takes 20–30 minutes to complete and involves the following steps:
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
For interview process details and platform information, visit talent.docs.mercor.com/welcome/welcome. For help, contact support@mercor.com.
The Mercor team reviews applications daily. Please complete your AI interview and all application steps for consideration.
Key skills and competencies
- AI Security
- Red Teaming
- Adversarial AI
- Prompt Injection
- Jailbreaking
- Vulnerability Assessment
- Cybersecurity
- Data Annotation
- Risk Analysis
- LLM Exploits
How to Get Hired at Mercor
- Research Mercor's mission: Understand its focus on connecting talent with AI labs and its investor backing.
- Tailor your resume for AI red-teaming: Highlight prior adversarial AI, cybersecurity, or socio-technical probing experience.
- Prepare for the AI interview: Expect questions on your red-teaming methodologies, curiosity, and structured problem-solving.
- Showcase adversarial instinct: Emphasize your ability to creatively find system breaking points and identify risks.
- Demonstrate clear communication: Practice explaining complex technical risks to both experts and non-technical stakeholders.