3 days ago

AI Safety Evaluation Specialist

Keystone Recruitment

Remote
Contractor
$270,000
Remote

Job Overview

Job TitleAI Safety Evaluation Specialist
Job TypeContractor
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$270,000
LocationRemote

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

AI Safety Evaluation Specialist at Keystone Recruitment

Keystone Recruitment is seeking experienced AI Safety Evaluation Specialists on behalf of one of our clients. This project-based, remote contract opportunity focuses on evaluating and strengthening advanced AI systems by identifying vulnerabilities, assessing systemic risks, and contributing to the development of more robust and reliable AI models.

Key Responsibilities

  • Identify and document model limitations, vulnerabilities, and failure patterns.
  • Annotate outputs and classify risks using structured taxonomies and evaluation frameworks.
  • Develop reproducible testing cases and structured reports.
  • Conduct adversarial testing across language models and socio-technical risk scenarios.
  • Evaluate AI outputs against defined safety benchmarks and guidelines.
  • Produce clear, actionable documentation for technical and non-technical stakeholders.
  • Collaborate with project teams to improve model robustness and evaluation methodologies.

Required Qualifications

  • Experience in AI safety, security research, red teaming, model evaluation, or related technical domains.
  • Strong analytical and structured problem-solving skills.
  • Ability to document findings clearly and reproducibly.
  • Familiarity with risk assessment frameworks or benchmarking methodologies.
  • Strong written communication skills.

Preferred Qualifications

  • Experience with adversarial testing or model vulnerability research.
  • Background in cybersecurity, machine learning, or socio-technical risk analysis.
  • Experience working with evaluation datasets or safety benchmarking tools.
  • Ability to translate complex technical findings into clear, structured reports.

Engagement Details

This is an independent contractor engagement offering fully remote work with flexible scheduling. It is project-based, with potential for extensions based on needs and performance. Compensation is hourly, ranging from $85–$185 depending on expertise and scope, with weekly payments via secure platforms. Applicants must be legally authorized to provide independent contractor services in their country of residence.

Key skills/competency

  • AI Safety
  • Model Evaluation
  • Risk Assessment
  • Adversarial Testing
  • Machine Learning
  • Cybersecurity
  • Structured Problem-Solving
  • Technical Documentation
  • Evaluation Frameworks
  • Language Models

Tags:

AI Safety Evaluation Specialist
AI safety
model evaluation
risk assessment
adversarial testing
vulnerability identification
security research
documentation
problem-solving
socio-technical risk
benchmarking
machine learning
language models
AI systems
evaluation frameworks
security testing tools
data annotation
risk analysis software
AI ethics
model robustness
MLOps

Share Job:

How to Get Hired at Keystone Recruitment

  • Research Keystone Recruitment's culture: Explore their mission and values, focusing on their approach to client partnerships and contractor engagement.
  • Tailor your resume for AI Safety: Highlight experience in AI safety, model evaluation, and adversarial testing, aligning with the job description's keywords.
  • Showcase analytical and documentation skills: Prepare examples demonstrating your ability to identify vulnerabilities and produce clear, structured reports.
  • Prepare for technical discussions: Be ready to discuss risk assessment frameworks, socio-technical risks, and AI model limitations.
  • Emphasize independent contractor readiness: Outline your experience with remote, project-based work and your legal authorization for contracting services.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background