Senior Product Operations Manager, Evaluation (San Francisco) Job at The Rundown AI, Inc., San Francisco, CA

VzhzYXl2SndEQlRlM0RYckdnZW5leTBWQ1E9PQ==
  • The Rundown AI, Inc.
  • San Francisco, CA

Job Description

Why Harvey

At Harvey, were transforming how legal and professional services operate not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, were reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 700+ customers in 58+ countries, strong product-market fit, and world-class investor support, were scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth personal, professional, and financial is unmatched.

Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle from early thinking to long-term outcomes. We stay close to our customers from leadership to engineers and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.

At Harvey, the future of professional services is being written today and were just getting started.

Role Overview

Were looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harveys platform. As we expand globally, ensuring our models behave reliably, accurately, and jurisdictionally correctly is missioncriticaland evaluation complexity is increasing 10x.

As a member of our Product Operations team, youll work closely with Applied Legal Researchers, Product, Engineering, AI Research, and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. Youll create the workflows, systems, and tooling that make evaluation a firstclass product capability at Harvey.

This is a highownership role for someone who thrives in ambiguity, loves building structure from ambiguity, and wants to help scale the evaluation infrastructure of a global AI company.

What Youll Do

  • Build and scale the systems that power model and product evaluations across Harvey

  • Embed evaluation workflows and readiness checkpoints into the product development lifecycle

  • Create the single source of truth for evaluation status, results, history, and launch readiness

  • Turn Expertdesigned evaluation methodologies into scalable, repeatable operational processes

  • Manage relationships with human data vendors and ensure evaluation quality meets legal standards

  • Work with Engineering and Research to improve evaluation tooling, automation, and dashboards

  • Drive evaluation readiness for major product and model launches across geographies and jurisdictions

  • Document and operationalize evaluation governance as complexity increases

  • Help define how Harvey ensures model accuracy, reliability, and trust at global scale

What You Have

  • 47+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles

  • Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows

  • Comfort with statistical methodologies and SQL or Python, or similar tools to interpret evaluation data

  • Ability to work deeply with legal experts and operationalize complex evaluation methodologies

  • Strong crossfunctional coordination skills across Product, Engineering, Research, and data providers/vendors

  • High attention to detail and a bias toward clarity, rigor, and reproducibility

  • Ability to navigate extreme ambiguity and bring order to complex systems

  • Strong communication skills and comfort translating technical nuance for diverse stakeholders

  • Desire to do whatever it takes to make evaluation systems successfulfrom writing documentation to diagnosing pipeline issues

Bonus Points

  • Experience in legal tech or working with domain experts in regulated industries

  • Experience managing human data providers or humanintheloop evaluation pipelines

  • Background in ML research, data quality management, or evaluation science

  • Early employee at a hypergrowth startup

  • Experience at worldclass product or platform operations orgs (ex: Stripe, Ramp)

Compensation

$178,500 - $210,000 USD

Please find our CA applicant privacy notice here.

#LI-CL1

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing accommodations@harvey.ai

#J-18808-Ljbffr

Job Tags

Full time,

Similar Jobs

Hustle Notice Biz

Entry Level Project Coordinator Job at Hustle Notice Biz

 ...Job Title: Entry Level Project Coordinator Location: San Antonio, TX Job Type: Full-time About Us We are seeking a motivated...  ...opportunity for someone who is looking to start a career in project management and gain hands-on experience in coordinating projects from... 

Allied Universal

Elite Armed Security Officer Job at Allied Universal

Allied Universal seeks a highly trained Elite Armed Security Officer to join our dedicated team in Conshohocken, Pennsylvania. In this role, you will combine your comprehensive security training with the latest in technology and strategic practices to protect our clients... 

Moffitt Cancer Center Partnership

CLINICAL DATA ENGINEER SNOWFLAKE Job at Moffitt Cancer Center Partnership

 ...SummaryDr. Ciara Freeman at the esteemed Moffitt Cancer Center is on the lookout for a talented Clinical Data Engineer with expertise in crafting Snowflake pipelines. Join us in pioneering the next generation of regulatory-grade AI models in oncologyyour expertise will... 

B Lab Global

Software Engineer II (Washington, D.C.) Job at B Lab Global

 ...This is a Full-Time Role (40 hours per week) with no option for part-time work. While this is a remote-first opportunity, the candidate filling this...  ...related tools. About the Opportunity As a Software Engineer II on the Assessment Squad, you will help... 

Petco

Veterinary Assistant Job at Petco

 ...patients and phenomenal customer care to their owners. The Veterinary Assistant represents the mission and values to all clients. Our Veterinary...  ...to do what it takes to create an exceptional customer experience.* contentious issues are dealt with and resolved as they...