9 days ago

Adversarial AI Specialist

Mercor

Hybrid
Part Time
$104,000
Hybrid

Job Overview

Job TitleAdversarial AI Specialist
Job TypePart Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$104,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About the Adversarial AI Specialist Role at Mercor

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position Details

This is an Adversarial AI Specialist position. It is offered as a full-time or part-time contract work, with compensation at $50/hour. This is a remote role, with geography restricted to the USA and Japan. A commitment of 20+ hours per week is required.

Role Responsibilities

  • Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure by following taxonomies, benchmarks, and playbooks to maintain consistent testing.
  • Document reproducibly to produce reports, datasets, and attack cases that customers can act on.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have
  • Fluent Language Skills Required: Native-level fluency in English & Japanese.
  • Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
  • Ability to explain risks clearly to technical and non-technical stakeholders.
  • Adaptability to move across projects and customers.

Preferred
  • Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
  • Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
  • Expertise in Socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
  • Skills in Creative probing: psychology, acting, writing for unconventional adversarial thinking.

Compensation & Legal

This is an hourly contractor position, paid weekly via Stripe Connect.

Application Process (Takes 20–30 mins to complete)

Please follow these steps to apply:

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome. For any help or support, reach out to: support@mercor.com.

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Key skills/competency

  • AI Red Teaming
  • Prompt Injection
  • Jailbreaking
  • Adversarial ML
  • Cybersecurity
  • Vulnerability Analysis
  • Risk Assessment
  • Conversational AI
  • Native English Fluency
  • Native Japanese Fluency

Tags:

Adversarial AI Specialist
red teaming
prompt injection
jailbreaking
vulnerability analysis
data annotation
risk assessment
documentation
independent work
asynchronous
AI security
machine learning
cybersecurity
penetration testing
exploit development
reverse engineering
conversational AI
socio-technical risk
DPO attacks
RLHF

Share Job:

How to Get Hired at Mercor

  • Research Mercor's mission: Study their focus on connecting top talent with leading AI research labs and their investor backing.
  • Tailor your resume for AI security: Highlight specific experience in adversarial AI, cybersecurity, red teaming, and socio-technical probing.
  • Prepare for the AI interview: Understand that Mercor uses an AI-based interview process; practice articulating your experience concisely and clearly.
  • Showcase language proficiency: Emphasize your native-level fluency in both English and Japanese as it's a critical 'must-have' qualification.
  • Demonstrate independent work ethic: Provide examples of successfully managing projects asynchronously and delivering high-quality results autonomously.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background