English (U.S. Native) AI Trainer & Evaluator (Remote, Hourly Contrator) at CNTXT AI | Apply at CNTXT AI | Jobs near Brooklyn

About the role

AI Trainer and Evaluator

This is a fully remote, hourly contractor role supporting AI data and language projects on a project-based, flexible hour basis. Project scopes vary and may include:

Content generation: writing high-quality prompts and model responses, or recording high-quality voice samples, to guide AI learning across diverse topics.
Data annotation: labeling, classifying, and structuring documents, tables, and other content to support AI training datasets.
LLM evaluation: reviewing AI-generated responses for accuracy, reasoning quality, coherence, and cultural/linguistic appropriateness.
Localization QA: ensuring terminology, tone, cultural nuance, and locale-specific details (units, references, names, dates) are consistently accurate across outputs.

This position exclusively seeks native speakers of American English born and raised in the United States.

Profile Requirements

Native speakers of American English born and raised in the United States.
Excellent editorial judgment in register, tone, punctuation, inclusivity, and cultural nuance, with extreme attention to detail.
Ability to identify meaning drift, ambiguity, locale inconsistencies, and subtle errors, and to explain corrections clearly in writing.
Ability to rigorously fact-check localized content (units, references, names, dates) using reliable sources and consistent reasoning.
Ability to identify reasoning gaps, methodological errors, and unclear explanations even when language is fluent.
Reliable, self-directed, and able to deliver consistent quality with clear communication and responsiveness across time zones.

Preferred Experience

Familiarity with MQM/LQA concepts (severity, category, and root-cause thinking) for consistent quality decisions.
Familiarity with QA workflows.
Previous experience with AI data training, annotation, or evaluation.

About CNTXT AI

CNTXT AI builds artificial intelligence products and data solutions with a focus on making AI accurate, safe, and globally relevant for impact. Our work spans data services, custom AI solutions, and proprietary AI products, with deep expertise in Arabic-native and secure, sovereign solutions.

Key skills/competency

Native American English Speaker
Content Generation
Data Annotation
LLM Evaluation
Localization QA
Editorial Judgment
Fact-Checking
Attention to Detail
AI Training
Quality Assurance

How to get hired

Tailor your resume: Highlight experience in AI training, data annotation, LLM evaluation, and localization QA.
Craft a compelling cover letter: Emphasize your native American English fluency and keen editorial judgment.
Showcase attention to detail: Provide examples of identifying subtle errors and explaining corrections clearly.
Demonstrate self-direction: Mention your ability to work reliably and communicate effectively across time zones.
Highlight preferred experience: If applicable, detail your familiarity with MQM/LQA concepts and QA workflows.

Frequently asked questions

What are the primary responsibilities of an AI Trainer and Evaluator at CNTXT AI?

As an AI Trainer and Evaluator, you'll be responsible for tasks such as generating content (prompts, responses, voice samples), annotating data for AI training sets, evaluating AI-generated responses for accuracy and appropriateness, and performing localization QA to ensure cultural and linguistic consistency. This role requires a native American English speaker with excellent editorial judgment.

What specific qualifications are essential for the AI Trainer and Evaluator role at CNTXT AI?

The most critical qualification is being a native speaker of American English, born and raised in the United States. Additionally, you need excellent editorial judgment, extreme attention to detail, the ability to identify subtle errors and meaning drift, and the skill to rigorously fact-check content. Self-direction and clear communication are also key.

Is this AI Trainer and Evaluator position remote, and what are the working hours?

Yes, this is a fully remote, hourly contractor role. The hours are flexible and project-based, allowing you to work on your own schedule within project requirements and across different time zones.

What does 'LLM evaluation' entail for an AI Trainer and Evaluator at CNTXT AI?

LLM evaluation involves reviewing responses generated by large language models (LLMs). You'll assess their accuracy, the quality of their reasoning, how coherent they are, and whether they are culturally and linguistically appropriate for the target audience. This requires a sharp editorial eye and an understanding of nuanced language.

How important is localization QA in the AI Trainer and Evaluator role?

Localization QA is very important. It ensures that AI outputs are not only accurate in terms of terminology and tone but also culturally nuanced and locally relevant. This includes verifying locale-specific details like units, references, names, and dates to maintain consistency across different regions.

Does CNTXT AI prefer candidates with specific QA or AI training experience for the AI Trainer and Evaluator role?

While not strictly required, CNTXT AI prefers candidates with familiarity with MQM/LQA concepts (severity, category, root-cause thinking) for quality decisions, general QA workflows, and prior experience in AI data training, annotation, or evaluation. This experience can give you an edge.

English (U.S. Native) AI Trainer & Evaluator (Remote, Hourly Contrator)

Job highlights

About the role

AI Trainer and Evaluator

Profile Requirements

Preferred Experience

About CNTXT AI

Key skills/competency

Skills & topics

How to get hired

Technical preparation

Behavioral questions

Frequently asked questions

Similar roles