PitchMeAI
Jobs Ai

AI Quality Engineer (Remote)

Jobs Ai · France

  • Hybrid
  • Part-time
  • $75,000 / year
  • France

Job highlights

  • Evaluate autonomous AI agents across LLMs.
  • Design and debug AI agent behavior.
  • Assess production-grade software architecture.
  • Provide technical feedback for LLM training.
  • Work remotely on complex AI systems.

About the role

AI Quality Engineer (Remote)

Location: Remote (Finland, France, Italy, Norway)

Work Mode: Fully Remote

Role Overview

Help design and evaluate autonomous AI agents across multiple LLMs, spanning health, education, daily life, and other real-world domains (all coding work). Shape the future of agentic AI systems by providing expert human feedback to leading AI organisations. Help train Large Language Models (LLMs) for complex, multi-step architectural workflows.

Key Responsibilities

AI Agent Evaluation
  • Write evaluation rubrics with objective pass/fail criteria
  • Debug agent traces to identify failure patterns
  • Stress test agents against edge cases, prompt injection, and tool misuse
Technical Assessment
  • Assess production-grade modular software architecture
  • Analyse multi-turn system interactions and behaviours
  • Provide high-density technical feedback for LLM training

Project Workflow

  • Create an account and upload a resume/ID
  • Complete the onboarding assessment
  • Start earning through flexible task assignments

Qualifications

  • Experience in backend engineering, AI automation, or complex systems integration
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting)
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions

Preferred (Nice to Have)

  • Experience integrating agents with live tools such as Supabase, Gmail, and other APIs
  • Familiarity with persistent state and session-tracking patterns
  • Experience identifying privacy leaks, authority escalation, or indirect prompt injection vulnerabilities

Compensation

Hourly compensation ranges from USD $30–$50, depending on experience and task complexity. Payments are issued weekly via supported payout platforms (e.g., PayPal or AirTM). Full compensation details are provided prior to task acceptance.

Equal Opportunity Statement

Selection decisions are based solely on skills, qualifications, and project requirements. We are committed to inclusive and fair engagement practices and consider all qualified applicants without regard to legally protected characteristics.

Key skills/competency

  • AI Quality Engineering
  • AI Agent Evaluation
  • LLM Training
  • Backend Engineering
  • Software Architecture
  • Python
  • JavaScript
  • SQL Databases
  • Systems Integration
  • Prompt Engineering

Skills & topics

  • AI Quality Engineer
  • AI
  • LLM
  • Backend Engineering
  • Software Architecture
  • AI Automation
  • Systems Integration
  • Python
  • JavaScript
  • SQL
  • Remote

How to get hired

  • Tailor your resume: Highlight backend engineering, AI automation, and complex systems integration experience. Emphasize modular software architecture and multi-turn interaction handling.
  • Prepare your portfolio: Showcase projects demonstrating your ability to build and maintain production-grade software in live environments. Include examples of your work with Python, JavaScript, Go, or Java and SQL databases.
  • Master the assessment: Understand the onboarding assessment will likely test your technical skills in AI agent evaluation and software architecture. Practice debugging and stress-testing agents.
  • Showcase your expertise: Be ready to discuss your experience with LLMs, AI agent evaluation rubrics, and providing technical feedback for AI model training during interviews.

Technical preparation

Practice debugging agent traces and identifying failure patterns.,Review modular software architecture principles.,Brush up on Python, JavaScript, Go, or Java.,Strengthen SQL database query skills.

Behavioral questions

Describe a complex system you integrated.,How do you identify potential vulnerabilities?,Share an experience with LLM feedback.,How do you handle edge cases in testing?

Frequently asked questions

What is the compensation for an AI Quality Engineer at Jobs Ai?
The hourly compensation for an AI Quality Engineer at Jobs Ai ranges from USD $30 to $50, depending on your experience level and the complexity of the tasks assigned. Payments are processed weekly.
What are the primary responsibilities of an AI Quality Engineer at Jobs Ai?
As an AI Quality Engineer, you will design and evaluate autonomous AI agents, write evaluation rubrics, debug agent traces, and stress test agents. You will also assess software architecture, analyze system interactions, and provide technical feedback for LLM training.
What technical skills are required for the AI Quality Engineer role at Jobs Ai?
Required technical skills include experience in backend engineering, AI automation, or complex systems integration. You need a proven ability to build production-grade software with modular separation and strong command of languages like Python, JavaScript, Go, or Java, along with SQL database experience.
What are the preferred qualifications for this remote AI Quality Engineer position?
Preferred qualifications include experience integrating agents with live tools (e.g., Supabase, Gmail, APIs), familiarity with persistent state and session-tracking, and experience identifying privacy leaks or prompt injection vulnerabilities.
Is the AI Quality Engineer position at Jobs Ai a remote role?
Yes, the AI Quality Engineer position at Jobs Ai is a fully remote role, with opportunities for individuals located in Finland, France, Italy, and Norway.
How does the application process work for the AI Quality Engineer role at Jobs Ai?
The application process involves creating an account, uploading your resume and ID, and completing an onboarding assessment. Once approved, you can start accepting flexible task assignments.
What programming languages are important for an AI Quality Engineer at Jobs Ai?
A strong command of at least two major programming languages such as Python, JavaScript, Go, or Java is required for this role. Experience working with SQL databases is also essential.