T
traeai
Sign in

产品

academy.dair.ai

别名:Dair AI Academy

提供构建有效 AI 代理的实践教学与资源平台。

相关材料

已收录 5 条与 academy.dair.ai 相关的内容,按评分排序。

New research from Google.

Just shows the impressive results you can get from custom agent harnesses...

Google's LEAP framework wraps a general-purpose LLM in an agentic scaffold that grounds every step in the Lean compiler and iterates against verifier feedback. It solves all 12 Putnam 2025 problems with one model, lifting the one-shot solve rate of the Lean-IMO-Bench from under 10% to 70%, outperforming a specialized gold-medal system that scores 48. Paper: arXiv:2606.03303. Learn to build effective AI agents at academy.dair.ai.

入选理由:LEAP 通用 LLM 一模型解决全部 12 道 Putnam 2025 题。

FeaturedTweet#LEAP#Lean compiler#Putnam 2025#agentic framework#general-purpose LLM英文
NEW paper from Meta.

(bookmark it)

It's an agent system that autonomously discovers neural archite...

NEW paper from Meta.

elvis(@omarsar0)198 字 (约 1 分钟)
87

Meta proposes AIRA, a dual-agent system that autonomously discovers neural architectures outperforming Llama 3.2 at 350M, 1B, and 3B scales within a 24-hour compute budget, offering a reusable engineering paradigm for AI agent design.

入选理由:AIRA系统在24小时内自动发现超越Llama 3.2的350M/1B/3B参数模型架构。

FeaturedTweet#AI Agent#Neural Architecture Search#Meta#Llama 3.2#AIRA英文
Interesting position paper on agentic AI as a foreseeable pathway to AGI.

(bookmark it)

There has ...

Interesting position paper on agentic AI as a foreseeable pathway to AGI

elvis(@omarsar0)188 字 (约 1 分钟)
75

The article argues that agentic AI systems are more promising for achieving AGI than simply scaling up foundation models.

入选理由:代理AI系统比更大基础模型更可能实现AGI

FeaturedTweet#AGI#AI Agent#Machine Learning英文
Interesting interpretability paper on tool-using agents.

The authors probe hidden states and find t...

Interpretability Study on Tool-Using Agents

elvis(@omarsar0)212 字 (约 1 分钟)
65

Paper reveals significant discrepancy between model's recognition and execution of tool calls, with match rate 26-54%, concentrated in cognition-to-action transition.

入选理由:模型识别应调用工具但实际未执行,匹配率26-54%

FeaturedTweet#AI#Tool Use#Interpretability#Model Behavior Analysis中文

跨材料问答 · academy.dair.ai

回答基于:academy.dair.ai 相关 5 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.