PitchMeAI
TELUS Digital

Multimodal AI Content Expert (AI Community)

TELUS Digital · Auvergne-Rhône-Alpes, France

  • Hybrid
  • Part-time
  • $75,000 / year
  • Auvergne-Rhône-Alpes, France

Job highlights

  • Analyze AI outputs across text, image, and video.
  • Ensure AI accuracy, context, and cultural resonance.
  • Audit AI for safety, bias, and consistency.
  • Requires Bachelor's degree and native language proficiency.
  • Pass a qualification exam and ID verification.

About the role

About the Role

At TELUS Digital, we are teaching AI to see, hear, and understand the world just as humans do. As a Multimodal AI Content Expert in our Global Community, you are at the forefront of the most exciting frontier in technology. We are moving beyond text-only models to create AI that can reason across images, videos, and audio in real-time. We look to you to ensure these complex, multi-layered outputs are accurate, contextually aware, and culturally resonant. At TELUS, you are helping to build the "eyes and ears" of the next generation of artificial intelligence.

Key Responsibilities

  • Cross-Modal Verification: Evaluate the relationship between different data types (e.g., verifying if an AI-generated video perfectly matches a complex text prompt).
  • Visual-Semantic Analysis: Audit image and video datasets to ensure the AI correctly identifies complex objects, spatial relationships, and subtle cultural nuances.
  • Temporal & Contextual Auditing: Review long-form video content to ensure the AI maintains logical consistency and "memory" from the beginning of the clip to the end.
  • Multimodal Safety & Bias Detection: Identify safety risks that only appear when media are combined (e.g., an image that is safe on its own but becomes harmful when paired with specific text).
  • Instruction Tuning for Media: Help design complex prompts that teach models how to describe visual scenes with high technical or artistic precision.

Qualification Path

  • Mandatory Qualifications: Minimum of a Bachelor’s Degree in Cybersecurity, Criminal Justice, Forensic Science, or Information Security.
  • Native Language: Native-level proficiency in your primary language is mandatory to identify localized document types and regional identity nuances.
  • English Proficiency: Minimum B1 (Intermediate) level English.
  • Analytical Eye: Exceptional attention to detail, especially in identifying pixel-level anomalies in digital images.

Assessment

In order to be hired into our community, you’ll go through a subject-specific qualification exam that will determine your suitability for the position and complete ID verification.

Key skills/competency

  • Multimodal AI
  • Content Expert
  • AI Safety
  • Bias Detection
  • Instruction Tuning
  • Cross-Modal Verification
  • Visual-Semantic Analysis
  • Temporal Auditing
  • Cultural Nuances
  • Analytical Skills

Skills & topics

  • Multimodal AI
  • AI Content Expert
  • AI Safety
  • Bias Detection
  • Instruction Tuning
  • Cross-Modal Verification
  • Visual-Semantic Analysis
  • Temporal Auditing
  • Cultural Nuances
  • Analytical Skills
  • Cybersecurity
  • Forensic Science
  • Information Security
  • Bachelor's Degree
  • Native Language
  • English Proficiency

How to get hired

  • Tailor your resume: Highlight your analytical skills and experience with digital content verification.
  • Prepare for assessment: Study AI concepts, multimodal data, and safety/bias detection.
  • Demonstrate native language: Be ready to showcase your proficiency and understanding of nuances.
  • Highlight attention to detail: Emphasize your ability to spot anomalies in digital media.
  • Express passion for AI: Show genuine interest in the future of artificial intelligence.

Technical preparation

Review AI and machine learning fundamentals.,Understand multimodal data types and interactions.,Practice identifying digital image anomalies.,Familiarize with AI safety and bias concepts.

Behavioral questions

Describe a time you found a subtle error.,How do you ensure accuracy in your work?,How do you handle complex, multi-layered data?,Discuss your understanding of cultural nuances.

Frequently asked questions

What are the mandatory qualifications for the Multimodal AI Content Expert role at TELUS Digital?
To be considered for the Multimodal AI Content Expert position at TELUS Digital, you must have a minimum of a Bachelor’s Degree in Cybersecurity, Criminal Justice, Forensic Science, or Information Security. Additionally, native-level proficiency in your primary language and a minimum B1 (Intermediate) level of English proficiency are required. Exceptional attention to detail, particularly in identifying pixel-level anomalies in digital images, is also a key requirement.
What does the assessment process involve for the Multimodal AI Content Expert job at TELUS Digital?
The assessment process for the Multimodal AI Content Expert role at TELUS Digital includes a subject-specific qualification exam designed to evaluate your suitability for the position. You will also need to complete an ID verification process as part of being hired into the TELUS Digital community.
Can I apply for the Multimodal AI Content Expert role if English is not my native language?
Yes, you can apply if English is not your native language, provided you have a minimum B1 (Intermediate) level of English proficiency. The mandatory requirement is native-level proficiency in your primary language, which is crucial for identifying localized document types and regional identity nuances.
What kind of AI capabilities is TELUS Digital working on for this role?
TELUS Digital is focused on teaching AI to perceive and understand the world like humans do. This involves developing AI that can reason across multiple data types simultaneously, including images, videos, and audio, moving beyond text-only models to create more comprehensive artificial intelligence.
How does the Multimodal AI Content Expert contribute to AI safety and bias detection?
The Multimodal AI Content Expert plays a critical role in identifying safety risks that emerge specifically when different media types are combined. This includes detecting scenarios where an image or video might be benign on its own but becomes harmful when paired with specific text or other media elements.