T
traeai
Sign in

模型

Qwen

别名:通义千问

阿里云大语言模型系列,Holo3.1 基于其架构设计。

已跟踪 30 条高相关材料

TraeAI 观察

相关材料

已收录 30 条与 Qwen 相关的内容,按评分排序。

DeepSeek 的 10 万亿美元大战略

DeepSeek's 10 Trillion USD Grand Strategy

宝玉的分享5756 字 (约 24 分钟)
92

DeepSeek reduces KV cache requirements through innovations, driving China's AI hardware ecosystem toward a $10 trillion industry.

入选理由:DeepSeek V4 Pro仅需5.48GB HBM,相比GLM5的60GB和Qwen3-235B-A22B的89GB显著节省显存

FeaturedArticle#AI Model#Hardware Ecosystem#KV Cache#DeepSeek#China AI中文
Hugging Face Blog 图标

Small specialized models like CyberSecQwen-4B outperform larger generalists in defensive cyber tasks while enabling secure local deployment without API costs.

入选理由:CyberSecQwen-4B 在 CTI-MCQ 测试中以 0.5868 准确率超越 Cisco 8B 模型 8.7 个百分点。

FeaturedArticle#LLM#Cybersecurity#AMD#Fine-tuning#Open Source英文
#558.AI时代的个人革命:Garry Tan 谈开源 AI、创业信仰、创伤动力

Garry Tan argues that AI is triggering the next personal computing revolution, where open-source Agents and personal AI will empower ordinary people with unprecedented creative capacity; YC’s core tenet is “make something people want”; entrepreneurs must convert trauma into creativity through authentic perception and strong agency.

入选理由:Garry Tan 提出‘个人AI必须由自己拥有和控制’,并正在开发 G Brain——整合邮件、日历、联系人与笔记的个人知识记忆系统。

FeaturedPodcast#AI#Open Source#Startup#YC#Personal Computing中文
Holo3.1: Fast & Local Computer Use Agents

Holo3.1: Fast & Local Computer Use Agents

Hugging Face Blog808 字 (约 4 分钟)
85

Holo3.1 is Hugging Face's new computer-use agent model supporting cross-platform, multi-framework deployment and first releasing quantized weights (FP8/Q4 GGUF/NVFP4) for local inference.

入选理由:Holo3.1 在 AndroidWorld 上 35B-A3B 模型准确率从 67% 提升至 79.3%

FeaturedArticle#Computer Use Agent#Hugging Face#Quantized Model#Mobile Automation英文
Databricks 图标

Reliable LLM Inference at Scale

Databricks1967 字 (约 8 分钟)
85

Databricks 提供了一个可靠的 LLM 推理平台,支持大规模多租户系统,通过先进的硬件和软件优化实现高可用性和低延迟。

入选理由:Databricks 平台支持多种前沿模型,包括开源和专有模型。

FeaturedArticle#LLM#推理平台#Databricks#GPU#多租户英文
Learn how to use explicit caching with Qwen models:
https://t.co/ooU4l36ALM

Learn how to use explicit caching with Qwen models: https://t.co/ooU4l36ALM

OpenRouter(@OpenRouterAI)56 字 (约 1 分钟)
85

本文介绍了如何通过显式缓存优化Qwen模型的使用,包括缓存的工作原理、实现方法和最佳实践,帮助用户提高效率并降低成本。

入选理由:显式缓存可以显著减少重复请求的处理时间,提高响应速度。

FeaturedTweet#Qwen#缓存#API优化#成本控制中文
Self-Evolving in the Wild:Over the course of ~35 hours of continuous autonomous execution, the model...

Qwen在自主执行过程中,通过连续运行约35小时,进行了1158次工具调用,完成了432次内核评估,自主编写、编译、分析和迭代改进了Extend Attention Kernel,实现了10.0倍的几何提升。

入选理由:Qwen在35小时内自主执行,进行了1158次工具调用和432次内核评估。

FeaturedTweet#Qwen#自主执行#内核优化#Extend Attention Kernel#性能提升中文
🚀🚀

Qwen on X: AI Advancements

Qwen(@Alibaba_Qwen)43 字 (约 1 分钟)
85

Qwen on X highlights the latest developments in AI, particularly the Qwen3.7 Plus Preview and its impact on the Vision Arena.

入选理由:Qwen's Qwen3.7 Plus Preview enhances performance with new features, making it competitive in the lab rankings.

FeaturedTweet#AI#Vision Arena中文
Qwen 3.7 Max 在 Arena Coding Agent 上排第四

Qwen 3.7 Max in Arena Coding Agent Ranks Fourth

歸藏(guizang.ai)(@op7418)98 字 (约 1 分钟)
80

Qwen 3.7 Max ranks fourth in Arena Coding Agent, surpassing GLM-5.1 and matching Claude Opus 4.6 in agentic web development tasks.

入选理由:Qwen 3.7 Max 排名第四

FeaturedTweet#Qwen#Arena Coding Agent#Programming Competition中文
Self-improving AI is a big deal!

As a first step, I've been exploring how much of the post-training...

Self-improving AI is a big deal!

elvis(@omarsar0)242 字 (约 1 分钟)
80

Using FireworksAI Agent to automate LLM fine-tuning demonstrates the feasibility of self-improving AI systems, enabling model iteration through natural language interaction. Future recursive self-improvement systems could revolutionize knowledge discovery and end-to-end research automation.

入选理由:FireworksAI Agent已实现LLM微调自动化,成功优化Qwen模型输出风格以适配PaperWiki知识库

FeaturedTweet#Self-improving AI#FireworksAI#LLM fine-tuning#PaperWiki英文
林俊旸果然创业了!一个“Qwen负责人”头衔值135亿

林俊旸果然创业了!一个“Qwen负责人”头衔值135亿

量子位1952 字 (约 8 分钟)
78

林俊旸离职阿里后创立的新公司种子轮融资目标估值高达20亿美元,其核心理念是推动AI从推理式思考转向智能体式思考,即让模型为了行动而思考,强调环境设计、轨迹采样和编排工程的重要性。

入选理由:林俊旸离职阿里后创立的新公司种子轮融资目标估值高达20亿美元。

FeaturedArticle#AI#创业#智能体#推理#Qwen中文
苏姿丰上海开讲:AI正在重新定义计算的每一层

Lisa Su Speaks in Shanghai: AI is Redefining Every Layer of Computing

量子位3330 字 (约 14 分钟)
75

At the AMD AI Developer Conference in Shanghai, CEO Lisa Su stated that AI competition is shifting from model capabilities to systems engineering and full-stack optimization. Developers need a deployable, optimizable, and continuously evolving engineering system. AMD, centered on its ROCm open-source platform, provides full-stack computing power from cloud to edge, while continuously strengthening its developer ecosystem in China.

入选理由:AI行业竞争焦点正从模型能力转向系统工程与全栈优化能力,开发者需要可落地、可优化、可持续演进的工程体系

FeaturedArticle#AMD#AI Engineering#ROCm#Lisa Su#Open Ecosystem中文
MiniCPM-V 4.6: The Agent Vision Model

MiniCPM-V 4.6: The Agent Vision Model

Sam Witteveen3945 字 (约 16 分钟)
75

MiniCPM-V 4.6 is a compact 1.3B parameter multimodal vision-language model using SIGLIP visual encoder and Qwen language model architecture, supporting image, document and video inputs for edge device deployment.

入选理由:模型仅 13 亿参数,支持 262K 上下文窗口处理多图像和视频

FeaturedVideo#MiniCPM-V#Multimodal Model#Edge Computing#OpenBMB#Vision-Language Model英文
Hermes Agent Explained

Hermes Agent Explained

Greg Isenberg273 字 (约 2 分钟)
72

Hermes Agent is a local AI assistant framework that combines OpenRouter and Qwen models for efficient reasoning, supports memory layers, Obsidian integration, and local automation scripts, running on Android devices.

入选理由:Hermes Agent 使用 Qwen 3.6+ 模型实现推理成本降低至原价的十分之一

FeaturedVideo#AI Agent#Local AI#Qwen#OpenRouter#Automation英文
Hermes Agent Explained

Hermes Agent Explained

Greg Isenberg273 字 (约 2 分钟)
72

Hermes Agent is a local-running intelligent agent system that combines OpenRouter and Qwen models for efficient reasoning, supports memory layers and automation scripts, and can run on Android devices.

入选理由:Hermes Agent 使用 Qwen 3.6+ 实现低成本高效率推理,将 token 成本从 $100 降至 $10。

FeaturedVideo#AI Agent#LLM#Local Execution#Automation#Qwen英文
Qwen系列多尺寸模型登陆SiliconFlow平台

Qwen Series Multi-Size Models Launch on SiliconFlow Platform

AI HOT 精选127 字 (约 1 分钟)
65

Alibaba Cloud's Qwen series multi-size models are now available on the SiliconFlow platform.

入选理由:Qwen3.5-397B-A17B是最大参数模型

FeaturedArticle#Large Model#AI Platform中文
Qwen3.7预览版登陆竞技场,阿里视觉排名升至第五

Qwen3.7 Preview Lands on Arena, Alibaba Vision Ranks Fifth

AI HOT 精选111 字 (约 1 分钟)
60

Qwen3.7 preview version is now on Arena, Alibaba's vision ranking rises to fifth, and the model series will be released soon.

入选理由:Qwen3.7-Plus-Preview在Arena视觉竞技场排名第五,整体排名第十六

FeaturedArticle#Qwen#Vision Model#Alibaba Cloud中文
已经用上了,Nous Portal 要绑卡。

Already in Use, Nous Portal to Bind Card.

Geek(@geekbb)72 字 (约 1 分钟)
60

Geekbb announced that Alibaba's Qwen3.6-Plus has gone live on Nous Portal and is free for a limited time. This marks a new phase in the collaboration between Qwen and Nous Research.

入选理由:阿里巴巴的 Qwen3.6-Plus 已经上线 Nous Portal 并且限时免费。

FeaturedTweet#AI#Qwen#Nous Portal#Hermes Agent中文
Demo3:Browser Agent

Demo3: Browser Agent

Qwen(@Alibaba_Qwen)33 字 (约 1 分钟)
50

Qwen released a demo video of Browser Agent on X, but no technical details are provided, only showing examples of automated browser operations.

入选理由:Qwen于2026年6月1日在X平台发布Browser Agent演示视频。

FeaturedTweet#Browser Agent#Qwen#Alibaba#AI#Demo中英混合
🚀🚀

🚀🚀

Qwen(@Alibaba_Qwen)42 字 (约 1 分钟)
50

Qwen 3.7 Max is now supported in Hermes Agent.

入选理由:Qwen 3.7 Max 在 Hermes Agent 中支持

FeaturedTweet#Qwen#Hermes Agent#Nous Research中文
👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one v...

Qwen3.7-Plus is a multimodal agent model from Qwen that unifies vision and language capabilities, supporting GUI/CLI operations and coding tasks, now available via Alibaba Cloud API.

入选理由:Qwen3.7-Plus支持多模态交互,统一GUI和CLI操作处理视觉和文本任务。

FeaturedTweet#Qwen#Multimodal#Agent Model#Alibaba Cloud#API英文
🚀Qwen3.6-Plus is on Nous Portal now and FREE for a limited time.
Hermes Agent, here we go!! ⚡️ @Nou...

🚀Qwen3.6-Plus is now free on Nous Portal for a limited time.

Qwen(@Alibaba_Qwen)111 字 (约 1 分钟)
45

Qwen3.6-Plus is now free on Nous Portal for a limited time, allowing users to access over 300 models with a single subscription, enjoy exclusive discounts, and simplified billing.

入选理由:Qwen3.6-Plus 在 Nous Portal 上限时免费。

FeaturedTweet#Qwen#Nous Portal#AI Model#Free Trial中文
📣We're calling for ambassadors!

Whether you're a developer with great technical taste or a local c...

📣We're Calling for Ambassadors!

Qwen(@Alibaba_Qwen)197 字 (约 1 分钟)
45

The article is an announcement from Alibaba's Qwen team recruiting ambassadors, offering early model access and API credits.

入选理由:招募开发者和社区领袖担任大使

FeaturedTweet#Recruitment#Community Management中文
Demo2: Multimodal Interactive Hybrid Agent

Demo2: Multimodal Interactive Hybrid Agent

Qwen(@Alibaba_Qwen)47 字 (约 1 分钟)
40

Qwen announced Demo2: Multimodal Interactive Hybrid Agent, but no technical details or implementation principles were provided.

入选理由:Qwen于2026年6月1日在X上宣布了Demo2: Multimodal Interactive Hybrid Agent。

FeaturedTweet#Qwen#Alibaba#Multimodal#Agent英文
Anthropic 蒸馏中国模型这回事,我第一次听到是在线下,从投资人的嘴里听到的
那时候他们说的是 Claude 蒸馏了 Kimi
我说你确定不是 Cursor 蒸馏了 Kimi 吗?
他非常肯定地...

Orange AI on X: 'Anthropic distilling Chinese models'

orange.ai(@oran_ge)380 字 (约 2 分钟)
40

Orange AI on X shared a tweet about Anthropic distilling Chinese models, indicating evidence that Claude distilled Kimi and Qwen, but the responsibility is questionable.

入选理由:Anthropic 蒸馏了中国的 Kimi 和 Qwen 模型。

FeaturedTweet#Anthropic#distillation#models#Twitter中文
orange.ai(@oran_ge) 图标

This post spreads a clearly false rumor: Claude is developed by Anthropic, Kimi by Moonshot AI; there is zero technical linkage—this is a typical AI misinformation case.

入选理由:Claude由Anthropic公司研发,与月之暗面的Kimi模型无任何训练或架构关系

FeaturedTweet#AI Rumor#Claude#Kimi#Anthropic#Moonshot AI中文

跨材料问答 · Qwen

回答基于:Qwen 相关 30 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.