Software Engineer II Backend Data Pipelines
@ Scribd, Inc.

Vancouver, BC
$126,000
On Site
Full Time
Posted 11 hours ago

Job Details

About Scribd, Inc.

At Scribd, our mission is to spark human curiosity by democratizing ideas and empowering expertise. We create a world of stories and knowledge across our three products: Everand, Scribd, and Slideshare. We value authenticity, boldness, and a GRIT approach that combines Goals, Results, Innovation, and Teamwork.

About the Role

The Software Engineer II Backend Data Pipelines role is designed for experienced backend developers who enjoy solving complex data challenges at scale. You will design, build, and optimize distributed systems that handle metadata extraction, enrichment, and processing across millions of documents and images, as well as audio content. This position involves collaborating with ML engineers, product managers, and cross-functional teams to integrate ML and LLM-powered solutions into high-performance pipelines.

Team Overview

The ML Data Engineering team works on metadata extraction and enrichment for Scribd brands. They handle hundreds of millions of documents and billions of images, leveraging technologies like machine learning, data engineering, and distributed systems to deliver scalable solutions.

Key Responsibilities

  • Design and build scalable systems for metadata extraction and enrichment.
  • Integrate LLM capabilities for summarization, classification, and extraction.
  • Collaborate with ML engineers and product managers to optimize pipelines.
  • Ensure data accuracy and quality with automated validation.
  • Participate in code reviews and maintain high-quality standards.
  • Manage data pipelines, security, and infrastructure maintenance.

Required Skills & Experience

  • 4+ years of professional software engineering experience.
  • Proficiency in Python, Scala, Ruby, or similar languages.
  • Experience building distributed systems at scale.
  • Hands-on experience with AWS services and infrastructure-as-code (Terraform).
  • Familiarity with data processing frameworks like Spark or Databricks.
  • Bachelor’s degree in Computer Science or equivalent.
  • Bonus: Experience with LLMs or integrating ML models in production.

Compensation & Benefits

Salary ranges vary by location, and compensation includes competitive equity and a comprehensive benefits package. Scribd offers flexible work options through Scribd Flex, with occasional in-person attendance required. Benefits include healthcare, parental leave, retirement plans, learning allowances, and more.

Location Requirements

Candidates must reside in designated cities in the United States, Canada, or Mexico. Eligible locations include major cities such as San Francisco, New York City, Toronto, and Mexico City.

Key Skills/Competencies

  • Python
  • Scala
  • Ruby on Rails
  • AWS
  • Terraform
  • Spark
  • Databricks
  • Distributed Systems
  • Machine Learning
  • Data Pipelines

How to Get Hired at Scribd, Inc.

🎯 Tips for Getting Hired

  • Customize your resume: Highlight backend development and AWS skills.
  • Showcase project experience: Emphasize distributed systems accomplishments.
  • Research Scribd culture: Understand their GRIT values and flexible work style.
  • Prepare for technical rounds: Brush up on Python, Scala, and cloud technologies.

📝 Interview Preparation Advice

Technical Preparation

  • Review AWS Lambda and ECS configurations.
  • Practice coding in Python and Scala.
  • Study distributed systems design patterns.
  • Brush up on Terraform and Spark basics.

Behavioral Questions

  • Describe a successful teamwork experience.
  • Explain a challenging project and its resolution.
  • Discuss your approach to continuous learning.
  • Share an instance of innovation under pressure.

Frequently Asked Questions