Multimodal AI Engineer
BMO
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
The Team
We accelerate BMO’s AI journey by building enterprise-grade, cloud-native AI solutions. Our team combines engineering excellence with cutting-edge AI to deliver scalable, secure, and responsible solutions that power business innovation across the bank. We enable and accelerate our partners on their AI journeys across the enterprise, helping teams across BMO unlock value at scale. We are engineers, AI practitioners, platform builders, thought leaders, multipliers, and coders. Above all, we are a global team of diverse individuals who enjoy working together to create smart, secure, and scalable solutions that make an impact across the enterprise. Our ambition is bold: deploy our capital and resources to their highest and most profitable use through a digital-first operating model, powered by data and AI-driven decisions.
About The Role
As a Multimodal AI Engineer, you will contribute to a multi-year initiative dedicated to advancing our digital-first, AI-powered business for enhanced value and future readiness. In this pivotal role, you will help shape and deliver agentic systems by integrating Large Language Models (LLMs) to orchestrate and automate business workflows, driving operational efficiency and optimizing user experiences. You will be hands-on in solution design, demonstrate engineering excellence, and provide technical leadership across high-impact capabilities, ensuring robust and scalable AI solutions for our organization.
Role Summary
We’re seeking a Multimodal AI Engineer to build next-generation AI interfaces. You will be the subject matter expert responsible for creating seamless interactions where users speak to a digital avatar, which then integrates with AI agents to provide real-time, intelligent responses.
- Drive the development by designing, building, and operationalizing enterprise-grade AI interfaces
- Serve as a player-coach, balancing hands-on engineering, building agent prototypes and platform components, with strategic guidance, including shaping product direction, advising on implementation best practices, and fostering a culture of technical excellence.
- Initially focus on creating foundational patterns and frameworks that can be leveraged across, enabling scalability and reusability.
Key Responsibilities:
- Orchestration – design and deploy multimodal solutions using Microsoft technologies, eg. Azure OpenAI, AI Speech and AI Vision.
- Avatar development – design and implement lifelike, interactive digital personas using methodologies like neural text-to-speech, preferably with experience in Microsoft technologies, eg. Azure AI Speech Avatar.
- Speech systems – design and build robust speech-to-text pipelines, specializing in custom speech models in multiple languages and noise reduction techniques.
- System integration
Skill Requirements:
- Expert in Azure AI Studio, python, C#/.NET, Azure DevOps and GitHub Actions for LLMOps.
- Strong knowledge in AI Speech (Speech-to-Text, Text-to-Speech), Voice modeling, Speech synthesis markup language, Azure AI Speech Avatar and LLM integration.
- Knowledge in Unity/ Unreal integration with Azure or Azure-hosted NVIDIA Audio2Face a plus.
Preferred Qualifications:
- Microsoft certifications on Azure AI Engineer.
- 5-7 years of AI software engineering experience.
- Experience with Semantic Kernel or LangChain for orchestration complex AI workflows.
- Proven delivery on multiple AI initiatives—comfortable shaping ambiguity into “the right questions,” crisp requirements, and practical design.
Key skills/competency
- Azure AI
- Multimodal AI
- LLM integration
- Python
- C#
- Azure DevOps
- GitHub Actions
- Speech-to-Text
- Text-to-Speech
- Voice modeling
- Agentic Systems
- AI Orchestration
How to Get Hired at BMO
- Research BMO's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Customize your BMO resume: Tailor your resume to highlight experience in Multimodal AI, Azure technologies, and enterprise solutions, aligning with the job description for the Multimodal AI Engineer role.
- Prepare for BMO interviews: Practice explaining complex AI projects, demonstrate strong problem-solving skills, and showcase your understanding of cloud-native AI development and agentic systems.
- Showcase your AI expertise: Be ready to discuss your experience with LLMs, speech/vision AI, Python, C#, and Azure AI services, emphasizing practical application and impact.
- Network within BMO: Connect with current BMO employees on LinkedIn to gain insights into the company's AI initiatives and team dynamics.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background