Question 1

What is the primary focus of an Adversarial AI Specialist role at Mercor?

Accepted Answer

The core responsibility of an Adversarial AI Specialist at Mercor is to red-team conversational AI models, identifying jailbreaks, prompt injections, and other misuse cases. You'll generate human data by annotating failures and classifying vulnerabilities to improve AI model safety.

Question 2

What are the essential qualifications for Mercor's Adversarial AI Specialist position?

Accepted Answer

To be considered for this role at Mercor, you must have native-level fluency in both English and Japanese. Prior red teaming experience, whether in adversarial AI work, cybersecurity, or socio-technical probing, is also required, along with the ability to communicate clearly with both technical and non-technical stakeholders.

Question 3

Does Mercor's Adversarial AI Specialist role offer flexibility in work arrangements?

Accepted Answer

Yes, Mercor offers this Adversarial AI Specialist position as either full-time or part-time contract work, requiring a minimum commitment of 20 hours per week. It is a fully remote role, specifically open to candidates located in the USA or Japan.

Question 4

How does the application and interview process work for this Mercor role?

Accepted Answer

The application process for the Adversarial AI Specialist at Mercor involves three main steps: uploading your resume, completing an AI-based interview tailored to your experience, and submitting a final form. Mercor's team reviews applications daily, so it is important to complete all steps promptly.

Question 5

What kind of compensation can an Adversarial AI Specialist expect at Mercor?

Accepted Answer

The compensation for the Adversarial AI Specialist role at Mercor is set at $50 per hour. This is an hourly contractor position, and payments are processed weekly via Stripe Connect.

Question 6

Are there specific technical skills preferred for the Adversarial AI Specialist at Mercor?

Accepted Answer

While prior red teaming experience is a must, preferred skills for the Mercor Adversarial AI Specialist include experience in Adversarial ML (e.g., jailbreak datasets, prompt injection, RLHF/DPO attacks), a background in cybersecurity (penetration testing, exploit development), and expertise in socio-technical risk and creative probing techniques.

Question 7

Where is the Mercor Adversarial AI Specialist role geographically restricted?

Accepted Answer

This remote Adversarial AI Specialist position at Mercor is geographically restricted to candidates residing in the USA or Japan. Applicants from other regions will not be considered for this specific opportunity.

Question 8

How can I learn more about the interview platform used by Mercor?

Accepted Answer

Mercor provides detailed information about its interview process and platform. Resources and support are available on its talent documentation site at talent.docs.mercor.com/welcome/welcome.

Question 9

What types of 'misuse cases' will I be identifying for conversational AI models at Mercor?

Accepted Answer

As an Adversarial AI Specialist, you will be identifying various misuse cases in conversational AI models, including, but not limited to, jailbreaks (bypassing safety filters), prompt injections (manipulating model behavior), and other forms of abusive or unintended model interactions that pose systemic risks.

This job post expired on March 18, 2026

Adversarial AI Specialist

Mercor

Application Process (Takes 20–30 mins to complete)
