4 days ago

MLOps Engineer

DarkVision

On Site
Full Time
CA$130,000
North Vancouver, BC

Job Overview

Job TitleMLOps Engineer
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered SalaryCA$130,000
LocationNorth Vancouver, BC

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

MLOps Engineer at DarkVision

DarkVision is actively seeking an MLOps Engineer to join its innovative Imaging & AI team. In this pivotal role, you will be instrumental in designing, building, and maintaining the advanced automation platforms that power the entire machine learning lifecycle. As a crucial force multiplier, you will empower researchers and engineers to deploy models with greater speed and enhanced reliability.

DarkVision's cutting-edge ultrasound imaging system generates massive datasets, often on the order of terabytes, meticulously detecting sub-millimetric defects across hundreds of kilometers of industrial assets. Effectively managing this vast data flow necessitates robust automation. Your primary focus will be on "productionalizing" the machine learning process, which includes constructing robust CI/CD pipelines, developing scalable training infrastructure, and implementing sophisticated deployment systems to ensure consistent model performance in real-world scenarios.

This is an on-site position at our North Vancouver, BC HQ. Employees at our headquarters enjoy an extensive range of amenities, including a fully equipped gym, a squash court, a steam room, a climbing wall, and much more, fostering a vibrant and supportive work environment.

Our Team

Working within the dynamic Imaging & AI team, you will become part of a multidisciplinary group of talented scientists and engineers. This team is at the forefront of early-stage ideation, rigorous research, experimentation, and advanced development. You will engage in close collaboration with Machine Learning Scientists and Cloud Infrastructure Engineers, effectively bridging the critical gap between experimental code and robust production systems.

What You Will Do

  • Build ML Platforms: Design and maintain the cloud-based infrastructure, primarily on AWS, to support scalable model training and efficient batch inference pipelines.
  • Automate the Lifecycle: Develop and manage comprehensive CI/CD pipelines specifically for machine learning projects, ensuring that model training, testing, and deployment processes are fully automated and reproducible.
  • Model Operations: Implement industry best practices for model versioning, registry management, and meticulous artifact tracking. You will be responsible for ensuring every model in production is fully traceable and secure.
  • Reliability & Monitoring: Proactively manage system health by implementing advanced monitoring and logging tools. You will lead incident responses for the ML stack and guarantee high availability for all data processing workflows.

Who You Are (Basic Qualifications)

  • Bachelor’s degree in Computer Science, Engineering, or a closely related technical field.
  • A minimum of 3 years of professional experience in a DevOps, MLOps, or Software Engineering capacity.
  • Demonstrated proficiency in Python and hands-on experience with automation scripting.
  • Extensive experience with cloud infrastructure, specifically within AWS (including services like SageMaker, Batch, Lambda, and S3).
  • Proficiency with containerization and orchestration tools such as Docker and Kubernetes.
  • Proven experience in building CI/CD pipelines for either software development or machine learning projects.

What Will Put You Ahead

  • Hands-on experience with workflow orchestration tools (e.g., Prefect, Airflow, Kubeflow).
  • Experience with Infrastructure as Code (IaC) tools like Terraform and Ansible.
  • Familiarity with specialized MLOps tools for experiment tracking (e.g., Weights & Biases, DVC, MLFlow).
  • Knowledge of model optimization techniques (such as quantization and pruning) specifically for deployment.
  • A solid understanding of security best practices within a cloud environment.
  • Strong communication skills and a proven ability to collaborate effectively within a team setting.

About DarkVision

DarkVision Technologies Inc. is a Canada-based technology company that has been revolutionizing the industrial imaging market since 2013. We have developed the world’s most advanced acoustic-based imaging platform, which is being integrated into multiple new product lines, fundamentally transforming how our clients quantify and visualize the integrity of their critical assets.

Backed by Koch Inc., one of the world’s largest privately held companies, DarkVision’s diverse team of Mechanical, Skunkworks, Electrical, Software, and Machine Learning Engineers is rapidly expanding to meet the increasing demand for the company’s current and upcoming products.

We offer employees the unique opportunity to work on cutting-edge technologies that seamlessly blend advanced science with real-world applications. We invite you to join us on this exciting journey as we strive to become the global leader in industrial imaging.

Key skills/competency

  • MLOps
  • AWS
  • CI/CD
  • Python
  • Kubernetes
  • Docker
  • SageMaker
  • Automation
  • Machine Learning Lifecycle
  • Cloud Infrastructure

Tags:

MLOps Engineer
automation
CI/CD
deployment
monitoring
reliability
data management
infrastructure
model operations
machine learning lifecycle
Python
AWS
Docker
Kubernetes
SageMaker
Airflow
Terraform
MLFlow
S3
Lambda

Share Job:

How to Get Hired at DarkVision

  • Research DarkVision's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor. Understand their affiliation with Koch Inc.'s entrepreneurial philosophy.
  • Tailor your MLOps resume: Highlight specific experience with AWS (SageMaker, Batch, Lambda, S3), Python, Docker, Kubernetes, and building CI/CD pipelines for ML. Showcase projects involving model productionalization.
  • Showcase practical MLOps experience: Prepare to discuss real-world scenarios where you've designed, built, and maintained ML platforms, automated lifecycles, and ensured model reliability and monitoring.
  • Prepare for technical assessments: Expect questions on cloud infrastructure, containerization, scripting, and MLOps tools. Demonstrate problem-solving skills related to scalable ML systems and data flow management.
  • Emphasize collaboration and communication: DarkVision values teamwork. Be ready to discuss how you've collaborated with ML Scientists and Cloud Infrastructure Engineers to bridge experimental work with production systems.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background