什么是 Harness？

LangChain

LangChainVideo2026年6月5日

What's the tea on harnesses?

7.2Score

Watchable video resourceOpen original video

TL;DR · AI Summary

A harness is the core infrastructure for building AI Agents, consisting of tools, execution environments, system prompts, and file systems. By optimizing harness engineering, developers can significantly boost Agent performance on benchmarks like Terminal Bench without changing the underlying model.

Key Takeaways

A harness is defined as the collection of tools, execution environments, system
The ability of coding agents (e.g., Claude Code) to decompose complex problems i
Harness engineering alone can improve Terminal Bench rankings from 30th to 5th w

Outline

Jump quickly between sections.

§Definition of a Harness
A harness is a comprehensive environment comprising tools, execution environments, system prompts, and file systems to turn a model into an agent.
§Drivers of Harness Popularity
The rise of harnesses is driven by increasing model capabilities and specific fine-tuning by model labs for these environments.
§Generalization from Coding to General Tasks
The methodology of breaking down complex problems used by coding agents is applicable to domains like data analysis and research.
§Efficacy of Harness Engineering
Optimizing core harness components like system prompts and context can drastically improve benchmark performance without changing the model.

Mindmap

See how the topics connect at a glance.

查看大纲文本（无障碍 / 无 JS 友好）

AI Agent Harness
- 组成部分
  - 工具 (Tools)
  - 执行环境 (Execution Env)
  - 系统提示词 (System Prompt)
  - 文件系统 (File System)
- 核心价值
  - 任务分解泛化 (Generalization)
  - 性能提升 (Harness Engineering)
- 典型案例
  - Claude Code
  - Codex
  - Terminal Bench

Highlights

Key sentences worth saving and sharing.

A harness is the tools, execution environment, system prompt, and file system that a model has access to — to make an agent.
— 0:05
⬇︎ 下载 PNG 𝕏 分享到 X
The way coding agents break complex problems down into manageable sub-tasks is generalizable across domains like data analysis and deep research.
— 0:32
⬇︎ 下载 PNG 𝕏 分享到 X
We moved from 30th to 5th on Terminal Bench just by doing some harness engineering, without even changing the underlying model.
— 0:51
⬇︎ 下载 PNG 𝕏 分享到 X

#AI Agents#Harness Engineering#LLM#LangChain