18 hours ago

AI Red-Teamer

Mercor

Hybrid
Part Time
$166,400
Hybrid

Job Overview

Job TitleAI Red-Teamer
Job TypePart Time
Offered Salary$166,400
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About Mercor

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Red-Teamer

This role focuses on red-teaming AI models and agents to identify vulnerabilities and risks. It can be full-time or part-time, with compensation ranging from $50–$111 per hour. The position is remote-friendly, restricted to US, UK, and Canada time zones and geographies.

Role Responsibilities

  • Red-team AI models and agents through jailbreaks, prompt injections, misuse cases, and exploits.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
  • Document reproducibly to produce reports, datasets, and attack cases that customers can act on.
  • Flex across projects to support different customers, from LLM jailbreaks to socio-technical abuse testing.

Qualifications

Must-Have
  • Prior red-teaming experience in AI adversarial work, cybersecurity, or socio-technical probing.
  • Curiosity and adversarial instinct to push systems to breaking points.
  • Structured approach using frameworks or benchmarks.
  • Strong communication skills to explain risks to technical and non-technical stakeholders.
  • Adaptability to thrive across various projects and customers.

Preferred
  • Experience with Adversarial ML, including jailbreak datasets, prompt injection, RLHF/DPO attacks, and model extraction.
  • Cybersecurity skills in penetration testing, exploit development, and reverse engineering.
  • Understanding of socio-technical risk, including harassment/disinfo probing and abuse analysis.
  • Creative probing skills in psychology, acting, or writing for unconventional adversarial thinking.

Compensation & Legal

This is an hourly contractor position. Compensation varies by project, customer, and content category.

Key skills/competency

  • AI Red-Teaming
  • Cybersecurity
  • Adversarial Machine Learning
  • Prompt Injection
  • Vulnerability Assessment
  • Risk Management
  • Exploit Development
  • Penetration Testing
  • Socio-Technical Probing
  • Technical Communication

Tags:

AI Red-Teamer
red-teaming
jailbreaks
prompt injection
misuse cases
vulnerability classification
risk flagging
structured testing
reporting
documentation
customer support
adversarial instinct
communication
Adversarial ML
jailbreak datasets
RLHF/DPO attacks
model extraction
penetration testing
exploit development
reverse engineering
socio-technical risk
abuse analysis
creative probing

Share Job:

How to Get Hired at Mercor

  • Research Mercor's mission: Understand their focus on connecting talent with AI research labs and their investor backing to align your application.
  • Tailor your resume for AI Red-Teamer: Highlight specific experience in AI adversarial work, cybersecurity, red-teaming, or socio-technical probing.
  • Prepare for the AI interview: Practice articulating your experience with jailbreaks, prompt injection, and structured vulnerability testing methods.
  • Showcase adversarial instinct: Be ready to discuss examples where you've pushed systems to their breaking points and documented findings.
  • Emphasize communication skills: Prepare to demonstrate your ability to explain complex technical risks to both technical and non-technical audiences effectively.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background