Fireworks AI on X: We’ve been working closely with the @harvey team on the launch of the Legal Agent Benchmark

Fireworks AI (@FireworksAI_HQ) · May 6, 2026

Score: 7.5

TL;DR · AI Summary

Fireworks AI and the Harvey team jointly launched the Legal Agent Benchmark to evaluate how open-weight models perform on long-horizon legal tasks.

Key Takeaways

Legal Agent Benchmark is the first open-source benchmark focused on long-horizon
It includes 12 real-world legal scenarios such as contract review and case predi
The collaboration aims to improve verifiability and practicality of legal AI age

Outline

§Project Background
Fireworks AI and the Harvey team collaborated to build a legal agent evaluation benchmark to fill an industry gap.
·Core Objective
Measure how well open-weight models perform on complex, long-horizon legal tasks in practice.
›Evaluation Scope
Includes 12 real-world legal workflows like contract analysis and regulation interpretation, emphasizing multi-step reasoning.
·Technical Contribution
Provides open datasets and scoring mechanisms to enable community-driven improvements in legal AI.

Mindmap

Legal Agent Benchmark
- Partners
  - Fireworks AI
  - Harvey Team
- Core functions
  - Evaluate open-weight model performance
  - Support long-horizon legal tasks
- Use cases
  - Contract review
  - Case prediction

Highlights

Legal Agent Benchmark is the first open-source evaluation tool focused on long-horizon legal tasks for open-weight models.
— Paragraph 1
The benchmark includes 12 real-world legal scenarios where each task requires multi-step reasoning to complete, simulating professional lawyer workflows.
— Paragraph 2

#AI #LegalTech #Benchmark #Open Source

Fireworks AI on X: "We’ve been working closely with the @harvey team on the launch of the Legal Agent Benchmark, a product focused on evaluating how open-weight models perform on long-horizon, real-world legal tasks. Check it out:"

Quote

Gabe Pereyra (@gabepereyra) · May 6

Article: Open-Sourcing Harvey’s Long Horizon Legal Agent Benchmark

Authors: @nikogrupen, @ItsJulioPereyra, Gabe Pereyra
Description: An open-source benchmark built to evaluate and improve agent capabilities for supporting legal work.
URL: https://www.harvey.ai/blog/introducing-harveys-legal-agent-benchmark...

6:20 PM · May 6, 2026 · 165.5K Views