T
traeai
Sign in

人物

ClementDelangue

别名:clem

社交媒体用户,发布关于飞行和本地AI的内容。

已跟踪 8 条高相关材料

TraeAI 观察

相关材料

已收录 8 条与 ClementDelangue 相关的内容,按评分排序。

Most people training agentic LLMs with RL right now have a silently broken training loop and have no...

Most people training agentic LLMs with RL right now have a silently broken training loop and have no idea. Here's the trap: single-turn RL works beautifully. Clean curves, sane rewards, everything converges. Then you add tools so the model can act mid-rollout, and things get weird. Loss spikes for no reason. Eventually a shape-mismatch error. The culprit: every time you parse the model's output to detect a tool call, then re-tokenize the updated conversation for the next turn, you're rolling the

入选理由:单轮RL效果好,但加入工具后需小心处理,避免形状不匹配错误。

FeaturedTweet#Reinforcement Learning#LLM中文
Arcee needs more attention that it gets! There aren't a lot of great American open-source AI model c...

Arcee Needs More Attention!

clem 🤗(@ClementDelangue)82 字 (约 1 分钟)
35

The tweet calls for more attention to Arcee, a U.S.-based open-source AI model company, but lacks depth, novelty, and practicality, serving only as a simple recommendation.

入选理由:Arcee 是一家美国开源 AI 模型公司,但未提供具体模型或技术细节。

FeaturedTweet#AI#Open Source#US Company#Arcee英文
@josefchen Nice! Could you add the model and dataset to HF?

@josefchen Nice! Could you add the model and dataset to HF?

clem 🤗(@ClementDelangue)47 字 (约 1 分钟)
25

ClementDelangue 请求 Josefchen 将模型和数据集添加到 HF。

入选理由:ClementDelangue 请求将模型和数据集添加到 HF。

FeaturedTweet#Hugging Face#数据集#模型中文
Did @Jason just pull up in Miami?

Did @Jason just pull up in Miami?

clem 🤗(@ClementDelangue)50 字 (约 1 分钟)
25

ClementDelangue posted a photo on X, asking if Jason had just arrived in Miami.

入选理由:ClementDelangue 发布了关于 Jason 抵达迈阿密的推文。

FeaturedTweet#Social Media#X英文

跨材料问答 · ClementDelangue

回答基于:ClementDelangue 相关 8 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.