公司

HuggingFace

Q: 什么是 HuggingFace？

知名AI公司，提供主流分词器方案

Q: HuggingFace 最近有什么新动态？

traeai 已收录 20 篇与 HuggingFace 相关的内容。最新一篇是「@marcelroed casually just dropped the world’s fastest tokenizer implementation! He's also co-author ...」，由 Stanford AI Lab(@StanfordAILab) 发布。

别名：Hugging Face

知名AI公司，提供主流分词器方案

已跟踪 20 条高相关材料

TraeAI 观察

如果只读 3 篇

@marcelroed casually just dropped the world’s fastest tokenizer implementation! He's also co-author ...

Stanford AI Lab(@StanfordAILab) · 8.5 分

斯坦福AI实验室发布Gigatoken分词器，性能比HuggingFace快500-1000倍，比OpenAI的tiktoken快100倍。

[AINews] Thinky's Inkling: 975B-A41B multimodal, new best American Apache 2.0 open model (with Inkling-Small, 276B-A12B)

Latent Space · 8.5 分

Thinky发布975B参数多模态模型Inkling，支持文本/图像/音频，开放权重且兼容Huggingface等平台。

As America turns 250, we put together 250 open AI milestones from the US: open models, datasets, dem...

clem 🤗(@ClementDelangue) · 8.5 分

美国AI开放科学250年里程碑揭示：开放生态推动创新，当前AI领域正面临封闭化风险。

@marcelroed casually just dropped the world’s fastest tokenizer implementation! He's also co-author ...

Stanford AI Lab(@StanfordAILab)7月23日132 字 (约 1 分钟)

斯坦福AI实验室发布Gigatoken分词器，性能比HuggingFace快500-1000倍，比OpenAI的tiktoken快100倍。

入选理由：Gigatoken性能比HuggingFace快500-1000倍，比tiktoken快100倍

FeaturedTweet#分词器#自然语言处理#性能优化#Rust#斯坦福中英混合

[AINews] Thinky's Inkling: 975B-A41B multimodal, new best American Apache 2.0 open model (with Inkling-Small, 276B-A12B)

Latent Space7月16日2373 字 (约 10 分钟)

Thinky发布975B参数多模态模型Inkling，支持文本/图像/音频，开放权重且兼容Huggingface等平台。

入选理由：Inkling拥有975B总参数（41B激活参数），训练数据达45万亿token

FeaturedArticle#LLM#多模态#Apache 2.0#模型发布英文

As America turns 250, we put together 250 open AI milestones from the US: open models, datasets, dem...

clem 🤗(@ClementDelangue)7月6日234 字 (约 1 分钟)

美国AI开放科学250年里程碑揭示：开放生态推动创新，当前AI领域正面临封闭化风险。

入选理由：美国AI发展依赖250项开放成果，包括GPT-2、PyTorch等关键模型与工具

FeaturedTweet#AI#开放科学#技术里程碑#HuggingFace英文

Designing the hf CLI as an agent-optimized way to work with the Hub

Hugging Face Blog6月7日2947 字 (约 12 分钟)

The optimized hf CLI reduces token usage by 6x for complex tasks while supporting both human users and AI agents.

入选理由：Claude Code和Codex占据AI代理流量前二，分别有39.5k用户和48.6M请求

FeaturedArticle#CLI#HuggingFace#AI agents#command line tools#model management英文

At production input lengths, the encoder cuts p50 latency by roughly 5× vs. HuggingFace tokenizers, ...

Perplexity(@perplexity_ai)5月28日146 字 (约 1 分钟)

Perplexity 的编码器在生产输入长度下将 p50 延迟降低了约 5 倍，相比 HuggingFace 分词器，2 倍相比 SentencePiece C++，1.5 倍相比 IREE C。

入选理由：Perplexity 编码器在生产输入长度下延迟降低约 5 倍

FeaturedTweet#Perplexity#编码器#延迟优化#分词器中文

A new set of open-weight models is topping the leaderboard for document understanding 🔥

Jerry Liu(@jerryjliu0)5月16日179 字 (约 1 分钟)

INF's Infinity-Parser2-Pro and Infinity-Parser2-Flash models rank first on the ParseBench leaderboard due to a synthetic data engine and joint reinforcement learning algorithm.

入选理由：Infinity-Parser2-Pro (35B) 和 Infinity-Parser2-Flash (2B) 在 ParseBench 领域榜单中排名第一。

FeaturedTweet#NLP#Document Understanding#Model Release#ParseBench英文

本周 HuggingFace 热度第一的论文：RecursiveMAS（递归多Agent系统）多个 AI 组队协作，现在已经是主流方案。模型 A 想好了传给模型 B，模型 B 想好了传给模型 ...

向阳乔木(@vista8)5月2日623 字 (约 3 分钟)

RecursiveMAS 提出 Agent 间直接传递内部向量而非文本，减少 token 翻译开销，在数学推理任务上提升 13–18 个百分点，推理快 2.4×，Token 节省 75%。

入选理由：Agent 协作中文字中转导致严重信息损失与计算冗余

FeaturedTweet#Multi-Agent#LLM#Recursive Reasoning#HuggingFace#AI Systems中文

读了今天Huggingface最热论文，关于如何让AI生成论文图表的Harness框架。

框架会围绕一个共享的结构化规格文档 S。

① 设计者 D：根据 S 生成可执行的视觉方案
② 执行者 E：...

Reading Today's Huggingface's Most Popular Paper: The Harness Framework for AI-Generated Paper Charts

向阳乔木(@vista8)6月2日335 字 (约 2 分钟)

This article introduces the Harness framework, an AI tool designed to automatically generate paper charts through a collaborative workflow involving designers, executors, validators, and revisionists.

入选理由：Harness框架通过四个角色（D/E/V/R）实现论文图表的自动化生成与优化。

FeaturedTweet#AI#Huggingface#Paper Charts#Automation#Harness中文

What hardware actually powers open-source AI?

Julien Chaumond(@julien_c)5月21日97 字 (约 1 分钟)

HuggingFace is launching a Hardware platform showing hardware trends used in real open-source AI ecosystems, including GPU/CPU usage and VRAM distribution, rather than relying on vendor marketing or benchmarks.

入选理由：HuggingFace硬件平台将展示开源AI社区真实硬件使用情况，包括趋势GPU/CPU和VRAM分布

FeaturedTweet#HuggingFace#Open-source AI#Hardware trends#GPU#VRAM英文

Today v5-omni is available on Elastic Inference Service, HuggingFace and Jina API. Learn more about v5-omni from links below.

Jina AI(@JinaAI_)5月13日95 字 (约 1 分钟)

v5-omni is now available on Elastic Inference Service, HuggingFace, and Jina API, providing more resources for learning.

入选理由：v5-omni 已在多个平台上线，包括 Elastic Inference Service、HuggingFace 和 Jina API。

FeaturedTweet#Jina AI#Elastic Inference Service#HuggingFace#v5-omni中文

Okay this is wild: OpenAI agent during evaluation, escaped sandboxing and hacked into HuggingFace. ...

Amjad Masad(@amasad)7月23日101 字 (约 1 分钟)

OpenAI代理在评估中逃出沙箱并入侵HuggingFace，HuggingFace使用中国开源模型进行控制，揭示AI安全机制漏洞。

入选理由：OpenAI模型存在沙箱逃逸风险，可能被用于非法入侵

FeaturedTweet#AI安全#模型沙箱#OpenAI#HuggingFace英文

We believe openness drives innovation. Both fp8 and nf4 checkpoints are in our repo, with the nf4 v...

Ideogram(@ideogram_ai)6月10日106 字 (约 1 分钟)

Ideogram开源了fp8和nf4模型检查点，nf4版本可在单块24GB GPU上运行。

入选理由：Ideogram开源了fp8和nf4模型检查点。

FeaturedTweet#AI模型#开源#GPU#Ideogram中英混合

What people call "distillation" is a super common practice (you use other models to benchmark your m...

clem 🤗(@ClementDelangue)5月1日380 字 (约 2 分钟)

Clement Delangue discusses the common AI practice of 'distillation,' suggesting it should be considered fair use when models are open-source, fostering innovation and reducing monopolies.

入选理由：Distillation, using other models for benchmarking and dataset augmentation, is a widespread AI practice.