Lead ML Platform Engineer
RBC
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
What's the opportunity?
We’re looking for an experienced Lead ML Platform Engineer who will bring focus and subject-matter expertise around designing and implementing machine learning infrastructure and automation tools (MLOps and DevOps). This is a unique opportunity to grow in the world of machine learning infrastructure and work with a team of passionate individuals committed to the mission of bringing ML to enterprise.
At RBC Borealis, you’ll be joining a team that works directly with leading researchers in machine learning, has access to rich and massive datasets, and offers the computational resources to support ongoing development in areas such as reinforcement learning, unsupervised learning and computer vision. You can find out more about our research areas at rbcborealis.com.
Your responsibilities include:
- Designing, building, and optimizing machine learning deployment tools and automation systems that operate the business’s data and ML applications;
- Designing and implementing best practices and standards for data and machine learning pipelines across the organization;
- Collaborating with engineers, and machine learning researchers to automate code analysis, build, integration and deployment of ML applications;
- Supporting applications and projects with infrastructure design decision, and monitoring solution;
- Building highly scalable, resilient cloud and on-premise systems for hosting machine learning systems using state-of-the-art technologies.
You're our ideal candidate if you have:
- 5+ years of experience designing and implementing distributed systems and Machine Learning systems;
- Working with building and maintaining DevOps pipeline such as Jenkins, GitHub actions;
- Previous experience with MLOps orchestration tools such as AirFlow, KubeFlow, Dagster, Flyte, or MetaFlow;
- In-depth knowledge of various stages of the machine learning application deployment process;
- Experience with building tools and applications to automate various infrastructure and DevOps tasks;
- Proficiency with programming languages such as Python, Bash, or JavaScript;
- Solid understanding of the UNIX operating system;
- Implementing monitoring solutions to identify system bottlenecks and production issues;
- Knowledge of professional software engineering best practices for the full software development life cycle, including testing methods, coding standards, code reviews and source control management;
- Hands-on experience building and deploying hybrid environments on-prem and major cloud environments, such as AWS and Azure;
- Familiarity with machine learning frameworks such as PyTorch, TensorFlow and/or similar.
What's in it for you?
- Become part of a team that thinks progressively and works collaboratively. We care about seeing each other reach full potential;
- A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock options where applicable;
- Leaders who support your development through coaching and managing opportunities;
- Ability to make a difference and lasting impact from a local-to-global scale.
About RBC Borealis
RBC Borealis is the driving force behind Royal Bank of Canada’s AI and data innovation. As part of Canada’s largest financial institution, we bring together a team of architects, engineers, scientists, and product experts on a mission to revolutionize finance through world-class research, solutions, and a resilient data platform. With locations across Toronto, Waterloo, Montreal, Calgary, and Vancouver, we’re at the forefront of AI research and platform development. With a focus on cutting-edge research in areas like time series forecasting, causal machine learning, and responsible AI, we are seamlessly integrating AI research and data engineering, to solve critical challenges in the financial industry. We are building intelligent, and scalable, data-driven solutions that will help communities thrive and drive innovation for our customers across the bank.
Key skills/competency
- Machine Learning Infrastructure
- MLOps & DevOps
- Distributed Systems Design
- Cloud Deployment (AWS, Azure)
- Python Programming
- Data Pipelines Automation
- System Monitoring
- Software Engineering Best Practices
- Kubernetes/Container Orchestration
- Hybrid Cloud Environments
How to Get Hired at RBC
- Research RBC's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume for ML Platform roles: Highlight your expertise in MLOps, DevOps, distributed systems, and cloud technologies.
- Showcase your technical projects: Provide examples of designing and implementing ML infrastructure and automation in your portfolio or resume.
- Prepare for technical deep-dives: Expect questions on Python, cloud platforms (AWS/Azure), MLOps tools like AirFlow, and distributed system architecture.
- Demonstrate collaborative leadership: Emphasize experience working with ML researchers and cross-functional engineering teams in your interview answers.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background