3 days ago

Software Engineer II, Backend and Data Pipelines

Scribd, Inc.

On Site
Full Time
$170,000
Vancouver, BC

Job Overview

Job Title: Software Engineer II, Backend and Data Pipelines
Job Type: Full Time
Category: Commerce
Experience: 5 Years
Degree: Master
Offered Salary: $170,000
Location: Vancouver, BC

Job Description

Software Engineer II, Backend and Data Pipelines at Scribd, Inc.

At Scribd Inc., our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our four products: Everand, Scribd, Slideshare, and Fable.

This posting reflects an approved, open position within the organization.

We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.

When it comes to workplace structure, we believe in balancing individual flexibility and community connections. It’s through our flexible work benefit, Scribd Flex, that employees – in partnership with their manager – can choose the daily work-style that best suits their individual needs. A key tenet of Scribd Flex is our prioritization of intentional in-person moments to build collaboration, culture, and connection. For this reason, occasional in-person attendance is required for all Scribd Inc. employees, regardless of their location.

So what are we looking for in new team members? Well, we hire for “GRIT”. The textbook definition of GRIT is demonstrating the intersection of passion and perseverance towards long-term goals. At Scribd Inc., we are inspired by the potential that this can unlock, and ask each of our employees to pursue a GRIT-ty approach to their work. In a tactical sense, GRIT is also a handy acronym that outlines the standards we hold ourselves and each other to. Here’s what that means for you: we’re looking for someone who showcases the ability to set and achieve Goals, achieve Results within their job responsibilities, contribute Innovative ideas and solutions, and positively influence the broader Team through collaboration and attitude.

About The Team

The ML Data Engineering team powers metadata extraction, enrichment, and content understanding across all Scribd brands. We process hundreds of millions of documents, billions of images, and deliver high-quality metadata to enable content discovery and trust for millions of users worldwide.

Our systems operate at massive scale, supporting diverse datasets like user-generated content (UGC), ebooks, audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with applied research and product teams to deploy scalable ML and LLM-powered solutions in production.

Role Overview

We’re seeking a Software Engineer II, Backend and Data Pipelines with strong backend development experience and a passion for solving complex data challenges at scale. In this role, you’ll design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You’ll work closely with ML engineers, product managers, and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful, high-performance solutions. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.

Tech Stack

Our team uses a range of technologies. The ones we use regularly are: Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, SageMaker, CloudWatch), Datadog, and Terraform.

Key Responsibilities

  • Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content.
  • Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines.
  • Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
  • Optimize and refactor existing systems for performance, scalability, and reliability.
  • Ensure data accuracy, integrity, and quality through automated validation and monitoring.
  • Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.
  • Manage and maintain data pipelines, security, and infrastructure.

Requirements

  • 5+ years of professional software engineering experience.
  • Proficiency in Python, Scala, Ruby, or similar languages.
  • Experience designing and building distributed systems at scale.
  • Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda.
  • Experience with infrastructure-as-code tools like Terraform (or similar).
  • Experience working with a public cloud provider (AWS, Azure, or Google Cloud).
  • Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads.
  • Proven ability to test, profile, and optimize systems for performance, scalability, and reliability.
  • Bachelor’s degree in Computer Science or equivalent professional experience.
  • Bonus: Experience working with LLMs or integrating ML models into production systems.

Working at Scribd Inc.

Are you currently based in a location where Scribd Inc. can employ you? Employees must have their primary residence in or near one of the following cities. This includes surrounding metro areas or locations within a typical commuting distance:

  • United States: Atlanta | Austin | Boston | Dallas | Denver | Chicago | Houston | Jacksonville | Los Angeles | Miami | New York City | Phoenix | Portland | Sacramento | Salt Lake City | San Diego | San Francisco | Seattle | Washington D.C.
  • Canada: Ottawa | Toronto | Vancouver
  • Mexico: Mexico City

Benefits, Perks, And Wellbeing At Scribd Inc.

Benefits/perks listed may vary depending on the nature of your employment with Scribd Inc. and the geographical location where you work.

  • Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees.
  • 12 weeks paid parental leave.
  • Short-term/long-term disability plans.
  • 401k/RSP matching.
  • Onboarding stipend for home office peripherals + accessories.
  • Learning & Development allowance.
  • Learning & Development programs.
  • Quarterly stipend for Wellness, WiFi, etc.
  • Mental Health support & resources.
  • Free subscription to the Scribd Inc. suite of products.
  • Referral Bonuses.
  • Book Benefit.
  • Sabbaticals.
  • Company-wide events.
  • Team engagement budgets.
  • Vacation & Personal Days.
  • Paid Holidays (+ winter break).
  • Flexible Sick Time.
  • Volunteer Day.
  • Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.
  • Access to AI Tools: We provide free access to best-in-class AI tools, empowering you to boost productivity, streamline workflows, and accelerate bold innovation.

Want to learn more about life at Scribd? www.linkedin.com/company/scribd/life

We want our interview process to be accessible to everyone. At any point in the interview process, you can email accommodations@scribd.com to tell us about any reasonable adjustments we can make to better accommodate your needs.

Scribd Inc. is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences creates a foundation for the best ideas. Come join us in building something meaningful.

Key skills/competencies

  • Backend Development
  • Data Pipelines
  • Distributed Systems
  • Machine Learning Integration
  • LLM
  • Cloud Platforms (AWS)
  • Python/Scala
  • Spark/Databricks
  • System Optimization
  • Metadata Processing

Tags:

Software Engineer
Backend Engineer
Data Engineer
Distributed Systems
Python
Scala
AWS
Spark
LLM
Terraform
Metadata Processing
Data Pipelines
Cloud Engineering

How to Get Hired at Scribd, Inc.

  • Research Scribd, Inc.'s culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
  • Tailor your resume: Highlight backend development, data pipeline construction, and ML integration experience for the Software Engineer II role.
  • Showcase distributed systems expertise: Emphasize hands-on experience with AWS services like ECS, Lambda, and SQS, alongside data processing with Spark or Databricks.
  • Prepare for technical interviews: Focus on Python/Scala proficiency, system design for scale, and optimizing complex data workflows.
  • Demonstrate collaborative problem-solving: Be ready to discuss how you've worked cross-functionally to deliver high-impact technical solutions.
