AI Red Team Specialist
Mercor
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About Mercor
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position Overview
This is for an AI Red Team Specialist position, available for full-time or part-time contract work, with compensation at $26/hour. This role is fully remote and offers flexible hours.
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document reproducibly by producing reports, datasets, and attack cases that customers can act on.
- Work independently and asynchronously to meet deadlines while improving AI model performance.
Qualifications
Must-Have
- Native-level fluency in English and Spanish.
- Prior red teaming experience in AI adversarial work, cybersecurity, or socio-technical probing.
- Structured approach using frameworks or benchmarks.
- Strong communication skills to explain risks clearly to technical and non-technical stakeholders.
- Adaptability to thrive on moving across projects and customers.
Preferred
- Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
- Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
- Expertise in socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
- Creative probing skills: psychology, acting, writing for unconventional adversarial thinking.
Compensation & Legal
Hourly contractor, Paid weekly.
Application Process
The application process typically takes 20-30 minutes to complete:
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Key skills/competency
- AI Red Teaming
- Adversarial ML
- Prompt Injection
- Jailbreaking
- Cybersecurity
- Penetration Testing
- Risk Analysis
- Conversational AI
- Socio-technical Probing
- English Spanish Fluency
How to Get Hired at Mercor
- Research Mercor's mission: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume: Highlight AI adversarial work, cybersecurity, or socio-technical probing experience.
- Prepare for AI interview: Practice explaining complex risks to diverse technical and non-technical stakeholders.
- Showcase language fluency: Emphasize native English and Spanish skills prominently.
- Demonstrate adaptable mindset: Be ready to discuss creative probing and project versatility.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background