概念

Artificial Analysis

Q: Artificial Analysis 最近有什么新动态？

traeai 已收录 8 篇与 Artificial Analysis 相关的内容。最新一篇是「I remember when people were saying "It's useless to open-source big models because nobody will be ab...」，由 clem 🤗(@ClementDelangue) 发布。

别名：artificialanalysis.ai

用于评估代码生成模型性能的基准测试平台。

已跟踪 8 条高相关材料

TraeAI 观察

如果只读 3 篇

I remember when people were saying "It's useless to open-source big models because nobody will be ab...

clem 🤗(@ClementDelangue) · 9.2 分

Cerebras 在企业环境中以每秒约1000个token的速度运行千亿参数模型Kimi K2.6，彻底颠覆了‘大模型开源无用’的旧认知，证明硬件创新可使开源大模型实现实用化部署。

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Simon Willison's Weblog · 8.7 分

Google发布Gemini 3.5 Flash，定价提升至3.1 Flash-Lite的6倍，却广泛部署于搜索、AI助手和企业平台，反映大模型厂商正通过高价模型试探API客户支付意愿，同时性能与成本显著高于前代产品。

Introducing North Mini Code: Cohere’s First Model For Developers

Hugging Face Blog · 8.5 分

Cohere 发布 North Mini Code，一个 30B 参数的 Mixture-of-Experts 模型，专为开发者设计，在多个代码生成基准测试中表现优异。

I remember when people were saying "It's useless to open-source big models because nobody will be ab...

clem 🤗(@ClementDelangue)5月20日102 字 (约 1 分钟)

Cerebras is now running the trillion-parameter Kimi K2.6 model in enterprise trials at ~1,000 tokens/s, shattering the old belief that open-source large models are impractical due to hardware limitations.

入选理由：Cerebras 在企业测试中以约1000 tokens/s的速度运行Kimi K2.6（千亿参数模型），创当前最快推理记录。

FeaturedTweet#Cerebras#Kimi K2.6#Open-Source LLM#Inference Performance#AI Hardware英文

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Simon Willison's Weblog5月20日615 字 (约 3 分钟)

Google released Gemini 3.5 Flash at six times the price of its predecessor, yet deployed it across Search, AI Assistant, and enterprise tools—revealing a strategic shift toward internal model saturation over API monetization.

入选理由：Gemini 3.5 Flash输入价格为$1.50/百万token，输出为$9/百万token，是3.1 Flash-Lite的6倍。

FeaturedArticle#Gemini#Google#AI Model#API Pricing#Large Model Deployment英文

Introducing North Mini Code: Cohere’s First Model For Developers

Hugging Face Blog6月11日2871 字 (约 12 分钟)

Cohere 发布 North Mini Code，一个 30B 参数的 Mixture-of-Experts 模型，专为开发者设计，在多个代码生成基准测试中表现优异。

入选理由：North Mini Code 是 Cohere 首个专为开发者设计的模型，参数量为 30B，其中 3B 为活跃参数。

FeaturedArticle#Cohere#模型#代码生成#Mixture-of-Experts#AI英文

Artificial Analysis放榜：千问3.7问鼎国产模型冠军，全球前五

Artificial Analysis Ranking: Qwen3.7-Max Tops Domestic Models, Ranks Global Top 5

量子位5月21日1050 字 (约 5 分钟)

Alibaba Cloud's Qwen3.7-Max scores 56.6 to rank 5th globally and 1st domestically in Artificial Analysis benchmark, soon available via Alibaba Cloud's BaiLian API.

入选理由：Qwen3.7-Max得分56.6分，超越国产所有模型，逼近GPT-5.4、Gemini3.1 Pro等国际顶尖模型

FeaturedArticle#Qwen3.7-Max#Artificial Analysis#Model Benchmarking#Alibaba Cloud中文

StepAudio 2.5 TTS Ranks Top 3 Globally in Blind Test on Speech Arena

AI HOT 精选5月9日121 字 (约 1 分钟)

StepAudio 2.5 TTS ranked #3 globally on the Artificial Analysis Speech Arena leaderboard in a blind test, behind only Inworld’s Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS.

入选理由：StepAudio 2.5 TTS 在盲测中排名全球第3，仅落后于 Inworld 和 Google 的模型。

FeaturedArticle#TTS#Speech Synthesis#AI Model#StepFun中文

AI Engineer Melbourne 2026 Keynote Livestream | Day 1

AI Engineer6月3日20378 字 (约 82 分钟)

AI is evolving from mere models to a combination of services, data, and products, with a surge in code generation and cost considerations impacting product strategies.

入选理由：AI不仅限于模型，还包括服务、数据和品牌，这对AI工程师有利。

FeaturedVideo#AI#Tech Trends#Code Generation英文

📢Qwen3.7-Max just hit #3 on ITbench-AA — a fresh benchmark testing how well models handle real-worl...

Qwen(@Alibaba_Qwen)5月29日125 字 (约 1 分钟)

Qwen3.7-Max 在 ITbench-AA 基准测试中排名第三，该测试评估模型处理企业级 IT 任务的能力。

入选理由：Qwen3.7-Max 在 ITbench-AA 测试中表现优异，排名第三。

FeaturedTweet#Qwen#ITbench-AA#AI模型#企业IT中英混合

And read more about it from @ArtificialAnlys:

OpenRouter(@OpenRouterAI)5月2日252 字 (约 2 分钟)

xAI发布Grok 4.3，AI指数达53，代理能力提升，输入/输出价格分别下降约40%和60%，但原文仅为推特转发，无技术细节或验证依据。

入选理由：Grok 4.3在Artificial Analysis Intelligence Index中得分为53

FeaturedTweet#xAI#Grok#LLM#AI Benchmark中文

跨材料问答 · Artificial Analysis

回答基于：Artificial Analysis 相关 8 条材料