Adversarial AI Specialist
Mercor
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
Adversarial AI Specialist at Mercor
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
This is a full-time or part-time contract position for an Adversarial AI Specialist. The compensation is $50/hour, with a commitment of 20+ hours per week, open to candidates in the USA and Japan.
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to maintain consistent testing.
- Document reproducibly to produce reports, datasets, and attack cases that customers can act on.
- Work independently and asynchronously to meet deadlines while improving AI model performance.
Qualifications
Must-Have
- Fluent Language Skills Required: Native-level fluency in English & Japanese.
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to technical and non-technical stakeholders.
- Adaptability to move across projects and customers.
Preferred
- Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
- Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
- Expertise in Socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
- Skills in Creative probing: psychology, acting, writing for unconventional adversarial thinking.
Compensation & Legal
Hourly contractor, paid weekly via Stripe Connect.
Application Process
The application process takes 20-30 minutes to complete:
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Key skills/competency
- AI adversarial work
- Cybersecurity
- Prompt injection
- Jailbreaks
- Conversational AI
- Red teaming
- Machine learning
- Vulnerability assessment
- Data annotation
- Asynchronous work
How to Get Hired at Mercor
- Research Mercor's mission: Study their focus on connecting elite talent with leading AI research labs and their investor backing.
- Tailor your resume: Customize your resume to highlight adversarial AI, red teaming, cybersecurity, and socio-technical probing experience.
- Prepare for AI interview: Be ready for the AI interview, demonstrating your technical depth in AI adversarial work and problem-solving skills.
- Showcase language fluency: Emphasize your native-level English and Japanese proficiency as it's a critical requirement.
- Demonstrate adaptability: Highlight instances where you've moved across diverse projects and adapted to new customer needs effectively.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background