
English (U.S. Native) AI Trainer & Evaluator (Remote, Hourly Contrator)
CNTXT AI · New York, NY
- Hybrid
- Full-time
- $40,000 / year
- New York, NY
This role may have been filled. Drop your résumé and we'll check if it's still open — or find you similar roles.
Job highlights
- Remote hourly contract for AI data and language projects.
- Write prompts, record voice, and generate AI content.
- Label, classify, and structure data for AI training.
- Evaluate AI responses for accuracy and appropriateness.
- Requires U.S. native English speakers born and raised.
About the role
English (U.S. Native) AI Trainer & Evaluator
This is a fully remote, hourly contractor role supporting AI data and language projects on a project-based, flexible hour basis. Project scopes vary and may include:
Project Responsibilities:
- Content generation: writing high-quality prompts and model responses, or recording high-quality voice samples, to guide AI learning across diverse topics.
- Data annotation: labeling, classifying, and structuring documents, tables, and other content to support AI training datasets.
- LLM evaluation: reviewing AI-generated responses for accuracy, reasoning quality, coherence, and cultural/linguistic appropriateness.
- Localization QA: ensuring terminology, tone, cultural nuance, and locale-specific details (units, references, names, dates) are consistently accurate across outputs.
Profile Requirements:
- Native speakers of American English born and raised in the United States.
- Excellent editorial judgment in register, tone, punctuation, inclusivity, and cultural nuance, with extreme attention to detail.
- Ability to identify meaning drift, ambiguity, locale inconsistencies, and subtle errors, and to explain corrections clearly in writing.
- Ability to rigorously fact-check localized content (units, references, names, dates) using reliable sources and consistent reasoning.
- Ability to identify reasoning gaps, methodological errors, and unclear explanations even when language is fluent.
- Reliable, self-directed, and able to deliver consistent quality with clear communication and responsiveness across time zones.
Preferred Experience:
- Familiarity with MQM/LQA concepts (severity, category, and root-cause thinking) for consistent quality decisions.
- Familiarity with QA workflows.
- Previous experience with AI data training, annotation, or evaluation.
About CNTXT AI:
CNTXT AI builds artificial intelligence products and data solutions with a focus on making AI accurate, safe, and globally relevant for impact. Our work spans data services, custom AI solutions, and proprietary AI products, with deep expertise in Arabic-native and secure, sovereign solutions.
Key skills/competency:
- AI Training
- Data Annotation
- LLM Evaluation
- Content Generation
- Localization QA
- American English Native Speaker
- Editorial Judgment
- Attention to Detail
- Fact-Checking
- Quality Assurance
Skills & topics
- AI Trainer
- AI Evaluator
- Data Annotation
- LLM Evaluation
- Content Generation
- Localization QA
- Remote Work
- Contractor
- American English
- CNTXT AI
How to get hired
- Tailor your resume: Highlight experience in AI training, data annotation, LLM evaluation, and content generation, using keywords from the job description.
- Showcase language expertise: Emphasize your native American English proficiency, born and raised in the U.S., and your keen editorial judgment.
- Demonstrate detail orientation: Provide examples of your ability to identify subtle errors, meaning drift, and inconsistencies.
- Explain your reliability: Illustrate your self-directed work ethic and ability to communicate effectively across time zones.
- Prepare for evaluation: Be ready to discuss your understanding of AI ethics, quality assurance, and localization nuances.
Technical preparation
Practice writing clear, concise AI prompts.,Review grammar and style guides for American English.,Familiarize with data annotation tools and methods.,Understand AI response evaluation criteria.
Behavioral questions
Describe a time you found a subtle error.,How do you ensure accuracy in your work?,How do you handle feedback on your writing?,How do you manage tasks with flexible hours?
Frequently asked questions
- What does CNTXT AI do?
- CNTXT AI focuses on building accurate, safe, and globally relevant AI products and data solutions. They specialize in data services, custom AI solutions, and proprietary AI products, with a strong emphasis on Arabic-native and secure, sovereign AI.
- Is this AI Trainer & Evaluator role remote?
- Yes, this AI Trainer & Evaluator role is fully remote.
- What are the primary responsibilities of an AI Trainer & Evaluator at CNTXT AI?
- The primary responsibilities include content generation (writing prompts/responses, recording voice samples), data annotation (labeling/structuring data), LLM evaluation (reviewing AI responses), and localization QA (ensuring cultural and locale accuracy).
- What specific language and origin requirements are there for this AI Trainer & Evaluator position?
- This position exclusively seeks native speakers of American English who were born and raised in the United States.
- What kind of experience is preferred for this AI Trainer & Evaluator role?
- Preferred experience includes familiarity with MQM/LQA concepts, QA workflows, and previous work in AI data training, annotation, or evaluation.
- How flexible are the hours for this AI Trainer & Evaluator role?
- The role offers flexible hours on a project-based, hourly contractor basis.
- What skills are crucial for an AI Trainer & Evaluator to succeed at CNTXT AI?
- Key skills include excellent editorial judgment, attention to detail, ability to identify subtle errors and meaning drift, rigorous fact-checking, and clear written communication.
- Does CNTXT AI focus on specific languages or regions?
- While this role focuses on American English, CNTXT AI has deep expertise in Arabic-native solutions and aims for globally relevant AI.
Similar roles
Open positions we recommend based on this role.