13 days ago
AI Safety Evaluation Specialist
Keystone Recruitment
Remote
Contractor
$150,000
Job Overview
Job Title: AI Safety Evaluation Specialist
Job Type: Contractor
Category: Commerce
Experience: 5 Years
Degree: Master
Offered Salary: $150,000
Location: Remote

Job Description
AI Safety Evaluation Specialist
We are hiring experienced safety specialists to evaluate and strengthen advanced AI systems. This role focuses on identifying vulnerabilities, assessing systemic risks, and contributing to the development of robust AI models.
Key Responsibilities
- Identify and document model limitations, vulnerabilities, and failure patterns
- Annotate outputs and classify risks using structured taxonomies and evaluation frameworks
- Develop reproducible test cases and structured reports
- Conduct adversarial testing across language models and socio-technical risk scenarios
- Evaluate AI outputs against defined safety benchmarks and guidelines
- Produce clear, actionable documentation for both technical and non-technical stakeholders
- Collaborate with project teams to improve model robustness and evaluation methodologies
Required Qualifications
- Experience in AI safety, security research, red teaming, model evaluation, or related fields
- Strong analytical and structured problem-solving skills
- Ability to document findings clearly and reproducibly
- Familiarity with risk assessment frameworks or benchmarking methodologies
- Excellent written communication skills
Preferred Qualifications
- Experience with adversarial testing or model vulnerability research
- Background in cybersecurity, machine learning, or socio-technical risk analysis
- Experience with evaluation datasets or safety benchmarking tools
- Ability to translate complex technical findings into clear, structured reports
Engagement Details
This is a project-based, independent contractor opportunity. The role is fully remote with flexible scheduling, offering hourly compensation between USD 85 and USD 185. Weekly payments are made via secure payment platforms.
Key skills and competencies
- AI Safety
- Risk Assessment
- Model Evaluation
- Adversarial Testing
- Documentation
- Cybersecurity
- Analytical Skills
- Structured Reporting
- Red Teaming
- Benchmarking
How to Get Hired at Keystone Recruitment
- Customize resume: Highlight AI safety and evaluation experience.
- Emphasize skills: Detail risk assessment and adversarial testing.
- Research Keystone Recruitment: Understand their client requirements and culture.
- Prepare reports: Practice clear, structured documentation examples.
Frequently Asked Questions
01. What does Keystone Recruitment look for in an AI Safety Evaluation Specialist?
02. How do I showcase my model evaluation skills for Keystone Recruitment’s role?
03. Is prior cybersecurity experience required for the AI Safety Evaluation Specialist position at Keystone Recruitment?
04. What are the work arrangements for the AI Safety Evaluation Specialist role at Keystone Recruitment?