1 month ago

AI Red Team Specialist

Mercor

Hybrid
Part Time
$104,000

Job Overview

Job Title: AI Red Team Specialist
Job Type: Part Time
Offered Salary: $104,000
Location: Hybrid

Job Description

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Red Team Specialist

  • Type: Full-time or part-time contract work
  • Compensation: $50/hour
  • Location: Remote; geography restricted to USA, Japan
  • Commitment: 20+ hours/week

Role Responsibilities

  • Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Follow established taxonomies, benchmarks, and playbooks to keep testing consistent.
  • Document findings reproducibly in reports, datasets, and attack cases that customers can act on.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have
  • Language skills: Native-level fluency in English and Japanese.
  • Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
  • Ability to explain risks clearly to technical and non-technical stakeholders.
  • Adaptability to move across projects and customers.
Preferred
  • Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
  • Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
  • Expertise in Socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
  • Skills in Creative probing: psychology, acting, writing for unconventional adversarial thinking.

Compensation & Legal

Hourly contractor, paid weekly via Stripe Connect.

Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Key skills/competencies

  • AI Red Teaming
  • Adversarial Machine Learning
  • Cybersecurity
  • Penetration Testing
  • Prompt Injection
  • Vulnerability Assessment
  • Risk Analysis
  • Conversational AI
  • English Fluency
  • Japanese Fluency

How to Get Hired at Mercor

  • Tailor your resume: Highlight AI red teaming, cybersecurity, and language skills.
  • Prepare for the AI interview: Understand Mercor's platform and application process.
  • Showcase your skills: Emphasize problem-solving and clear communication.
  • Be proactive: Complete all application steps promptly for consideration.
