4 days ago

AI Safety Expert

Hackajob

Hybrid
Full Time
$70,000
Hybrid

Job Overview

Job TitleAI Safety Expert
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$70,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About the Role: AI Safety Expert

Why This Role Exists

At Mercor, we believe the safest AI is the one that’s already been attacked — by us. That’s why we’re building a pod of safety specialists to assess the capabilities and limitations of frontier models.

What You’ll Do

  • Generate high-quality human data: annotate failures, classify vulnerabilities, and flag systemic risks.
  • Apply structure: follow taxonomies, benchmarks, and playbooks to keep testing consistent.
  • Document reproducibly: produce reports, datasets, and attack cases customers can act on.
  • Flex across projects: support different customers, from LLM jailbreaks to socio-technical abuse testing.

Who You Are

  • You’re curious and adversarial: you instinctively push systems to breaking points.
  • You’re structured: you use frameworks or benchmarks, not just random hacks.
  • You’re communicative: you explain risks clearly to technical and non-technical stakeholders.
  • You’re adaptable: thrive on moving across projects and customers.

Why Join Mercor

  • Build experience in human data AI work at the frontier of safety.
  • Play a direct role in making AI systems more robust, safe, and trustworthy.

About Mercor

Mercor partners with leading AI labs and enterprises to train frontier models using human expertise. You will work on projects that focus on training and enhancing AI systems.

Key skills/competency

  • AI Safety
  • Vulnerability Assessment
  • Data Annotation
  • Risk Management
  • Adversarial Testing
  • Structured Testing
  • Technical Reporting
  • Large Language Models (LLM)
  • Communication Skills
  • Adaptability

Tags:

AI Safety Expert
AI model assessment
vulnerability classification
data annotation
risk management
adversarial testing
structured testing
technical reporting
abuse testing
frontier models
human expertise
Artificial Intelligence
Machine Learning
Large Language Models
AI Safety
Data Analysis
Prompt Engineering
Cybersecurity
NLP
Python
Benchmarking

Share Job:

How to Get Hired at Hackajob

  • Research Mercor's mission: Study their commitment to AI safety, values, and recent projects on LinkedIn and Glassdoor.
  • Tailor your resume: Highlight experience in AI, data annotation, cybersecurity, or adversarial thinking, customizing for Mercor's specific needs.
  • Showcase critical thinking: Prepare examples demonstrating your ability to identify vulnerabilities and push systems to their breaking points.
  • Emphasize communication skills: Practice explaining complex technical risks clearly to both technical and non-technical audiences.
  • Prepare for AI-specific questions: Understand frontier AI model capabilities, limitations, and ethical considerations relevant to Mercor's work.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background