8 hours ago

Adversarial AI Specialist

Mercor

Hybrid
Part Time
$104,000

Job Overview

Job Title: Adversarial AI Specialist
Job Type: Part Time
Offered Salary: $104,000
Location: Hybrid

Job Description

Adversarial AI Specialist at Mercor

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

This is a contract position, full-time or part-time, for an Adversarial AI Specialist. Compensation is $50/hour with a commitment of 20+ hours per week. The role is open to candidates in the USA and Japan.

Role Responsibilities

  • Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Follow established taxonomies, benchmarks, and playbooks to keep testing consistent.
  • Document findings reproducibly in reports, datasets, and attack cases that customers can act on.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

  • Language skills: native-level fluency in English and Japanese.
  • Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
  • Ability to explain risks clearly to technical and non-technical stakeholders.
  • Adaptability to move across projects and customers.

Preferred

  • Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
  • Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
  • Expertise in Socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
  • Skills in Creative probing: psychology, acting, writing for unconventional adversarial thinking.

Compensation & Legal

Hourly contractor, paid weekly via Stripe Connect.

Application Process

The application process takes 20-30 minutes to complete:

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Key skills/competencies

  • AI adversarial work
  • Cybersecurity
  • Prompt injection
  • Jailbreaks
  • Conversational AI
  • Red teaming
  • Machine learning
  • Vulnerability assessment
  • Data annotation
  • Asynchronous work

Tags:

Adversarial AI Specialist
Red teaming
Prompt injection
Jailbreaking
Vulnerability assessment
Data annotation
Risk identification
AI security
Conversational AI
Documentation
Asynchronous work
Adversarial ML
Cybersecurity
RLHF
DPO
Penetration testing
Exploit development
Reverse engineering
Socio-technical risk
Creative probing
Psychology

How to Get Hired at Mercor

  • Research Mercor's mission: Study their focus on connecting elite talent with leading AI research labs and their investor backing.
  • Tailor your resume: Customize your resume to highlight adversarial AI, red teaming, cybersecurity, and socio-technical probing experience.
  • Prepare for the AI interview: Be ready to demonstrate technical depth in adversarial AI work and your problem-solving skills.
  • Showcase language fluency: Emphasize your native-level English and Japanese proficiency as it's a critical requirement.
  • Demonstrate adaptability: Highlight instances where you've moved across diverse projects and adapted to new customer needs effectively.
