
Senior Product Operations Manager, Evaluation
Harvey · San Francisco, CA
- On site
- Full-time
- $150,000 / year
- San Francisco, CA
Email the hiring manager to get a response.
Get their verified email + an intro that's ready to send.
Subject: Interested in the Senior Product Operations Manager, Evaluation role at Harvey
Hi Riley — I came across the Senior Product Operations Manager, Evaluation opening and wanted to reach out directly. I've spent the last few years doing exactly this kind of work, and Harvey stood out because…
✎ Personalized to your résumé after sign-up.
- ✓ Verified email of the hiring manager
- ✓ Intro email personalized to your résumé
- ✓ $9/mo = unlimited — any job link
Secure checkout · cancel anytime
Job highlights
- Build and scale AI evaluation systems globally.
- Operationalize evaluation methodologies into product lifecycle.
- Manage data providers and internal pipelines.
- Improve evaluation tooling and automation.
- Ensure model accuracy, reliability, and trust.
About the role
Senior Product Operations Manager, Evaluation
Why Harvey
At Harvey, we’re transforming how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come. This is a rare chance to help build a generational company at a true inflection point. With 1500+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched. Our team moves fast, takes ownership, and is deeply committed to the mission — operating with intensity, staying close to our customers, and pushing each other for excellence. We live by three values: Decisiveness, Simplicity, and Job's Not Finished. We act quickly on clear judgment over perfect information, we believe simplicity is what scales, and we're never satisfied with where we are. If you want to do the best work of your career alongside people who share that drive, we'd love to build with you. At Harvey, the future of professional services is being written today — and we’re just getting started.
Role Overview
We’re looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harvey’s platform. As we expand globally, ensuring our models behave reliably, accurately, and jurisdictionally correctly is mission-critical—and evaluation complexity is increasing 10x. As a member of our Product Operations team, you’ll work closely with Applied Legal Researchers, Product, Engineering, AI Research, and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. You’ll create the workflows, systems, and tooling that make evaluation a first-class product capability at Harvey. This is a high-ownership role for someone who thrives in ambiguity, loves building structure, and wants to help scale the evaluation infrastructure of a global AI company.
What You'll Do
- Build and scale the systems that power model and product evaluations across Harvey
- Run intake, triage, and prioritization for the evaluation request queue, routing capacity to the highest-value coverage gaps
- Embed evaluation workflows and readiness checkpoints into the product development lifecycle
- Create the single source of truth for evaluation status, results, history, and launch readiness
- Turn Expert-designed evaluation methodologies into scalable, repeatable operational processes
- Manage human data providers and stand up our internal contract-attorney pipeline, ensuring evaluation quality meets legal standards
- Work with Engineering and Research to improve evaluation tooling, automation, and dashboards
- Drive evaluation readiness for major product and model launches across geographies and jurisdictions
- Document and operationalize evaluation governance as complexity increases
- Help define how Harvey ensures model accuracy, reliability, and trust at global scale
What You Have
- 4–7+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles
- Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows
- Comfort with statistical methodologies and SQL or Python, or similar tools to interpret evaluation data (either natively or with AI tool support)
- Strong business acumen with an ability to apply an ROI-focused mindset to scaling
- Ability to work deeply with legal experts and operationalize complex evaluation methodologies
- Strong cross-functional coordination skills across Product, Engineering, Research, and data providers/vendors
- High attention to detail and a bias toward clarity, rigor, and reproducibility
- Ability to navigate an evolving landscape and bring order to complex systems
- Strong communication skills and comfort translating technical nuance for diverse stakeholders
- Desire to do whatever it takes to make evaluation systems successful—from writing documentation to diagnosing pipeline issues
Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here]. Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing accommodations@harvey.ai
Key skills/competency
- Product Operations Manager
- AI Evaluation
- ML/AI Benchmarking
- Technical Program Management
- Python
- SQL
- Statistical Methodologies
- Cross-functional Coordination
- Operational Process Scaling
- System Building
Skills & topics
- Product Operations Manager
- AI Evaluation
- ML/AI Benchmarking
- Technical Program Management
- Python
- SQL
- Statistical Methodologies
- Cross-functional Coordination
- Operational Process Scaling
- System Building
- AI
- Machine Learning
- Product Development
- Research Operations
- Legal Tech
How to get hired
- Tailor your resume: Highlight experience with ML/AI evaluations, statistical methods, and SQL/Python for the Senior Product Operations Manager role.
- Showcase operational skills: Emphasize experience in building scalable systems, managing data providers, and improving workflows for evaluation processes.
- Demonstrate cross-functional ability: Provide examples of collaborating with Product, Engineering, and Research teams on complex projects.
- Prepare for technical questions: Be ready to discuss your understanding of ML/AI evaluation frameworks and how to apply them operationally.
- Research Harvey's values: Align your responses with Decisiveness, Simplicity, and 'Job's Not Finished' during interviews.
Technical preparation
Behavioral questions
Frequently asked questions
- What is the primary focus of the Senior Product Operations Manager, Evaluation role at Harvey?
- The Senior Product Operations Manager, Evaluation at Harvey will focus on building and scaling the engine that evaluates the company's AI models and platform. This involves operationalizing evaluation methodologies, embedding them into the product development lifecycle, and ensuring model accuracy, reliability, and jurisdictional correctness at a global scale.
- What technical skills are most important for this Senior Product Operations Manager position at Harvey?
- Key technical skills for this role include comfort with statistical methodologies and proficiency in SQL or Python for interpreting evaluation data. Experience with ML/AI evaluations, benchmarking frameworks, and scientific workflows is also highly valued.
- How does Harvey approach product development and evaluation?
- Harvey emphasizes a fast-paced approach with a focus on ownership, intensity, and customer proximity. The company values Decisiveness, Simplicity, and 'Job's Not Finished.' For evaluation, the goal is to embed these processes into the product development lifecycle, turning expert methodologies into scalable, repeatable operational procedures.
- What kind of experience is expected for the Senior Product Operations Manager role at Harvey?
- The role requires 4-7+ years of experience in technical program management, product operations, research operations, or evaluation/benchmarking. Candidates should also possess strong business acumen, cross-functional coordination skills, and the ability to work deeply with legal experts.
- How does Harvey ensure the quality and rigor of its evaluations?
- Harvey ensures evaluation quality through operationalizing expert-designed methodologies into scalable processes, managing human data providers and internal pipelines to meet legal standards, and improving evaluation tooling for automation and dashboards. A high attention to detail and a bias toward clarity, rigor, and reproducibility are crucial.
- What opportunities for growth are available for a Senior Product Operations Manager at Harvey?
- Harvey offers unmatched opportunities for personal, professional, and financial growth, especially for those who want to build a generational company. The company's rapid scaling and definition of a new category provide a dynamic environment for career advancement.
Similar roles
Open positions we recommend based on this role.
