21 hours ago

Multimodal AI Engineer

BMO

On Site
Full Time
CA$120,000
Toronto, ON

Job Overview

Job TitleMultimodal AI Engineer
Job TypeFull Time
Offered SalaryCA$120,000
LocationToronto, ON

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

The Team

We accelerate BMO’s AI journey by building enterprise-grade, cloud-native AI solutions. Our team combines engineering excellence with cutting-edge AI to deliver scalable, secure, and responsible solutions that power business innovation across the bank. We enable and accelerate our partners on their AI journeys across the enterprise, helping teams across BMO unlock value at scale. We are engineers, AI practitioners, platform builders, thought leaders, multipliers, and coders. Above all, we are a global team of diverse individuals who enjoy working together to create smart, secure, and scalable solutions that make an impact across the enterprise. Our ambition is bold: deploy our capital and resources to their highest and most profitable use through a digital-first operating model, powered by data and AI-driven decisions.

About The Role

As a Multimodal AI Engineer, you will contribute to a multi-year initiative dedicated to advancing our digital-first, AI-powered business for enhanced value and future readiness. In this pivotal role, you will help shape and deliver agentic systems by integrating Large Language Models (LLMs) to orchestrate and automate business workflows, driving operational efficiency and optimizing user experiences. You will be hands-on in solution design, demonstrate engineering excellence, and provide technical leadership across high-impact capabilities, ensuring robust and scalable AI solutions for our organization.

Role Summary

We’re seeking a Multimodal AI Engineer to build next-generation AI interfaces. You will be the subject matter expert responsible for creating seamless interactions where users speak to a digital avatar, which then integrates with AI agents to provide real-time, intelligent responses.

  • Drive the development by designing, building, and operationalizing enterprise-grade AI interfaces
  • Serve as a player-coach, balancing hands-on engineering, building agent prototypes and platform components, with strategic guidance, including shaping product direction, advising on implementation best practices, and fostering a culture of technical excellence.
  • Initially focus on creating foundational patterns and frameworks that can be leveraged across, enabling scalability and reusability.

Key Responsibilities:

  • Orchestration – design and deploy multimodal solutions using Microsoft technologies, eg. Azure OpenAI, AI Speech and AI Vision.
  • Avatar development – design and implement lifelike, interactive digital personas using methodologies like neural text-to-speech, preferably with experience in Microsoft technologies, eg. Azure AI Speech Avatar.
  • Speech systems – design and build robust speech-to-text pipelines, specializing in custom speech models in multiple languages and noise reduction techniques.
  • System integration

Skill Requirements:

  • Expert in Azure AI Studio, python, C#/.NET, Azure DevOps and GitHub Actions for LLMOps.
  • Strong knowledge in AI Speech (Speech-to-Text, Text-to-Speech), Voice modeling, Speech synthesis markup language, Azure AI Speech Avatar and LLM integration.
  • Knowledge in Unity/ Unreal integration with Azure or Azure-hosted NVIDIA Audio2Face a plus.

Preferred Qualifications:

  • Microsoft certifications on Azure AI Engineer.
  • 5-7 years of AI software engineering experience.
  • Experience with Semantic Kernel or LangChain for orchestration complex AI workflows.
  • Proven delivery on multiple AI initiatives—comfortable shaping ambiguity into “the right questions,” crisp requirements, and practical design.

Key skills/competency

  • Azure AI
  • Multimodal AI
  • LLM integration
  • Python
  • C#
  • Azure DevOps
  • GitHub Actions
  • Speech-to-Text
  • Text-to-Speech
  • Voice modeling
  • Agentic Systems
  • AI Orchestration

Tags:

Multimodal AI Engineer
AI engineering
LLM integration
Azure AI
Speech AI
Visual AI
Avatar development
System orchestration
Cloud-native AI
Agentic systems
Technical leadership
Azure OpenAI
Python
C#
.NET
Azure DevOps
GitHub Actions
Azure AI Studio
Speech-to-Text
Text-to-Speech
Semantic Kernel
LangChain

Share Job:

How to Get Hired at BMO

  • Research BMO's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
  • Customize your BMO resume: Tailor your resume to highlight experience in Multimodal AI, Azure technologies, and enterprise solutions, aligning with the job description for the Multimodal AI Engineer role.
  • Prepare for BMO interviews: Practice explaining complex AI projects, demonstrate strong problem-solving skills, and showcase your understanding of cloud-native AI development and agentic systems.
  • Showcase your AI expertise: Be ready to discuss your experience with LLMs, speech/vision AI, Python, C#, and Azure AI services, emphasizing practical application and impact.
  • Network within BMO: Connect with current BMO employees on LinkedIn to gain insights into the company's AI initiatives and team dynamics.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background