Software Engineer II Backend Data Pipelines @ Scribd, Inc.
Your Application Journey
Email Hiring Manager
Job Details
About Scribd, Inc.
At Scribd, our mission is to spark human curiosity by democratizing ideas and empowering expertise. We create a world of stories and knowledge across our three products: Everand, Scribd, and Slideshare. We value authenticity, boldness, and a GRIT approach that combines Goals, Results, Innovation, and Teamwork.
About the Role
The Software Engineer II Backend Data Pipelines role is designed for experienced backend developers who enjoy solving complex data challenges at scale. You will design, build, and optimize distributed systems that handle metadata extraction, enrichment, and processing for millions of documents, images, and audio content. This position involves collaborating with ML engineers, product managers, and cross-functional teams to integrate ML and LLM-powered solutions into high-performance pipelines.
Team Overview
The ML Data Engineering team works on metadata extraction and enrichment for Scribd brands. They handle hundreds of millions of documents and billions of images, leveraging technologies like machine learning, data engineering, and distributed systems to deliver scalable solutions.
Key Responsibilities
- Design and build scalable systems for metadata extraction and enrichment.
- Integrate LLM capabilities for summarization, classification, and extraction.
- Collaborate with ML engineers and product managers to optimize pipelines.
- Ensure data accuracy and quality with automated validation.
- Participate in code reviews and maintain high-quality standards.
- Manage data pipelines, security, and infrastructure maintenance.
Required Skills & Experience
- 4+ years of professional software engineering experience.
- Proficiency in Python, Scala, Ruby, or similar languages.
- Experience building distributed systems at scale.
- Hands-on experience with AWS services and infrastructure-as-code (Terraform).
- Familiarity with data processing frameworks like Spark or Databricks.
- Bachelor’s degree in Computer Science or equivalent.
- Bonus: Experience with LLMs or integrating ML models in production.
Compensation & Benefits
Salary ranges vary by location with competitive equity and a comprehensive benefits package. Scribd offers flexible work options through Scribd Flex with occasional in-person attendance required, along with benefits like healthcare, parental leave, retirement plans, learning allowances, and more.
Location Requirements
Candidates must reside in designated cities in the United States, Canada, or Mexico. Specific locations include major cities such as San Francisco, New York City, Toronto, and Mexico City among others.
Key skills/competency
- Python
- Scala
- Ruby on Rails
- AWS
- Terraform
- Spark
- Databricks
- Distributed Systems
- Machine Learning
- Data Pipelines
How to Get Hired at Scribd, Inc.
🎯 Tips for Getting Hired
- Customize your resume: Highlight backend development and AWS skills.
- Showcase project experience: Emphasize distributed systems accomplishments.
- Research Scribd culture: Understand their GRIT values and flexible work style.
- Prepare for technical rounds: Brush up on Python, Scala, and cloud technologies.