T
traeai
Sign in

公司

Fireworks AI

别名:FireworksAI_HQ

提供 AI 模型服务的公司,上线了 Nemotron 3 Ultra 模型。

已跟踪 30 条高相关材料

TraeAI 观察

相关材料

已收录 30 条与 Fireworks AI 相关的内容,按评分排序。

Frontier models are powerful advisors.

On @harvey's Legal Agent Benchmark, a GLM 5.1 worker using C...

Frontier models are powerful advisors.

Fireworks AI(@FireworksAI_HQ)188 字 (约 1 分钟)
87

Fireworks AI demonstrates that GLM 5.1, when using Claude Opus 4.7 as a sparse advisor in the Legal Agent Benchmark, achieves 18/100 all-pass versus 14/100 for Opus alone at 39% of the cost.

入选理由:在 Harvey 法务代理基准上,GLM 5.1 + Claude Opus 4.7 稀疏顾问方案全对数达 18/100。

FeaturedTweet#Frontier Models#Legal Agent Benchmark#harness design#advisor pattern#Claude Opus 4.7英文
Many research labs only consider inference efficiency after the fact. Step 3.7 Flash is a 196B MoE m...

Step 3.7 Flash: A 196B MoE Model Built for Inference Efficiency

Fireworks AI(@FireworksAI_HQ)183 字 (约 1 分钟)
85

Step 3.7 Flash is a 196B MoE model designed from the ground up for inference efficiency, using MFA and AFD techniques to reduce KV-cache usage to ~22% of DeepSeek, supporting agent, coding, and multimodal workflows, open-sourced under Apache 2.0 and available on Fireworks.

入选理由:Step 3.7 Flash 是 196B MoE 模型,从设计之初就聚焦推理效率,而非事后优化。

FeaturedTweet#Step 3.7 Flash#MoE#Inference Optimization#Fireworks AI#Apache 2.0英文
10/ The bigger point: your product is the best RL environment you'll ever have.

Frontier labs ship ...

10/ The bigger point: your product is the best RL environment you'll ever have.

Fireworks AI(@FireworksAI_HQ)201 字 (约 1 分钟)
85

Fireworks AI provides a view that the success of a product lies in its uniqueness and adaptability as an RL environment.

入选理由:产品的独特性是其最大的护城河。

FeaturedTweet#Reinforcement Learning#Product Design#Moat中文
We ran 720 browser agent tasks with @nottecore across frontier models. 

One baseline model produced...

Fireworks AI on X: We ran 720 browser agent tasks with @nottecore across frontier models

Fireworks AI(@FireworksAI_HQ)236 字 (约 1 分钟)
85

Fireworks AI tests show baseline models had 20% retry rates in browser agent tasks, while Kimi K2.5/GLM-5/MiniMax M2.5 achieved near-zero retries with stable latency, directly impacting production system costs/delays/reliability.

入选理由:基线模型在5次调用中约1次输出畸形,导致多步骤工作流重试

FeaturedTweet#Fireworks AI#Browser Agents#Model Execution#Retry Rates#Cost Optimization英文
Fine-tuning used to mean a team, a GPU cluster, and weeks of iteration.

Now it's just a CLI command...

Fine-tuning used to mean a team, a GPU cluster, and weeks of iteration.

Fireworks AI(@FireworksAI_HQ)167 字 (约 1 分钟)
85

Fireworks AI states that model fine-tuning now requires only a CLI command, 10 minutes of GPU time, and a few cents of compute cost, with full ownership of the weights. While 2026's off-the-shelf open models are sufficient for deployment, they remain just a starting point.

入选理由:模型微调时间从数周团队协作缩短到10分钟GPU计算,成本仅需几美分

FeaturedTweet#Fireworks AI#Model Fine-tuning#LLM#CLI#GPU Optimization英文
Many research labs only consider inference efficiency after the fact. Step 3.7 Flash is a 198B spars...

Fireworks AI introduces Step 3.7 Flash: a 198B sparse MoE VLM designed for inference from the start, with a 196B language backbone and 1.8B vision encoder, achieving up to 400 token/s on real-world agent workloads.

入选理由:从设计阶段即优化推理效率,非事后补强。

FeaturedTweet#Step3.7 Flash#sparse MoE#VLM#198B#400 token/s英文
Routing and post-training open-source models won't only give you more accurate systems but also mean...

Routing and post-training open-source models significantly improve AI system accuracy, speed, and cost-efficiency. Harvey and Fireworks AI demonstrated that a hybrid architecture using GLM 5.1 as the primary worker with selective frontier model routing achieves superior quality and lower costs in legal tasks, proving this approach is a viable alternative to pure frontier models.

入选理由:Harvey实测显示混合法律Agent在质量和成本上均优于单一前沿模型。

FeaturedTweet#Model Routing#Post-training#Open Source LLM#Hybrid Agent#Legal AI英文
This tracks. 30 trillion tokens a day on our end, and open model share keeps climbing. 

Our partner...

Fireworks AI 每日处理 30 万亿 token,开放模型使用量持续增长,Factory AI 的开放模型使用量在过去一个月内增长了 3 倍。

入选理由:Fireworks AI 每日处理 30 万亿 token,显示其在大规模数据处理能力上的优势。

FeaturedTweet#AI#Open Models#Fireworks AI#Factory AI英文
Nathan's @cursor_ai team didn't prompt-engineer their way to Composer 2.5. They trained it. The mass...

The Cursor team achieved Composer 2.5 through reinforcement learning training rather than prompt engineering, with their large-scale RL program running inference on Fireworks, indicating that self-trained models will be the only way to maintain competitive moats after 2027.

入选理由:Cursor团队使用强化学习训练Composer 2.5,而非提示工程方法

FeaturedTweet#AI Training#Reinforcement Learning#Cursor#Fireworks#Model Training英文
The @cursor_ai team shipped Composer 2 and now Composer 2.5 on the same Kimi K2.5 base model. Perfor...

Cursor AI launched Composer 2.5 on the Kimi K2.5 base model, achieving 85% performance gains from reinforcement learning, with Fireworks AI providing the RL infrastructure for scalable deployment.

入选理由:Composer 2.5基于Kimi K2.5模型,性能显著提升,85%的算力增益来自强化学习(RL)。

FeaturedTweet#Composer#Kimi K2.5#Reinforcement Learning#Fireworks AI#Cursor AI英文
Day 2 at #MSBuild is about what it takes to move beyond generic foundation models. 

Think customiza...

Day 2 at #MSBuild is about what it takes to move beyond generic foundation models.

Fireworks AI(@FireworksAI_HQ)93 字 (约 1 分钟)
65

Day 2 at #MSBuild focuses on moving beyond generic foundation models, emphasizing customization, inference performance, and scalable deployment, with a live demo and real-world case study by @chahvivi.

入选理由:MSBuild 第二天主题:超越通用基础模型,关注定制化、推理性能与规模化部署。

FeaturedTweet#MSBuild#AI#Customization#Inference Performance#Scalable Deployment英文
Fine-tuning to production inference is the gap where teams get stuck.

At #MSBuild today, our own Ro...

From Fine-tuning to Production Inference: Where Teams Get Stuck

Fireworks AI(@FireworksAI_HQ)88 字 (约 1 分钟)
65

The gap between fine-tuning and production inference is where teams get stuck; Fireworks AI at MSBuild discusses customization trade-offs, serving infrastructure choices, and scaling cost/latency optimization.

入选理由:微调到生产推理存在落地缺口,团队常在定制化与性能之间权衡。

FeaturedTweet#fine-tuning#production inference#serving architecture#cost optimization#latency optimization英文
Weekends are for vibe coding. But are your vibes continuously improving?
Fine-tune your own model → ...

Fireworks AI New Training Update

Fireworks AI(@FireworksAI_HQ)213 字 (约 1 分钟)
65

Fireworks AI introduces a new training update with Gemma 4 Dense, supporting 256K context training with SFT, DPO, or RL.

入选理由:Gemma 4 Dense is now available for Full Param + LoRA RL on 256K context.

FeaturedTweet#AI#Training#Model Development#Self-Training中文
Weekends are for vibe coding. But are your vibes continuously improving?

Fine-tune your own model →...

Fireworks AI Model Update: Gemma 4 Dense Launch

Fireworks AI(@FireworksAI_HQ)204 字 (约 1 分钟)
65

Fireworks AI launches Gemma 4 Dense model, supporting custom fine-tuning, 256K context, and multiple training methods.

入选理由:Fireworks AI 推出 Gemma 4 Dense 模型,支持 Full Param + LoRA RL 训练

FeaturedTweet#AI#Model#Training#Fine-tuning中文
Fireworks Training Platform continues to expand.
Today GLM 5.1 LoRA RL is now live via Training API:...

Fireworks Training Platform continues to expand.

Fireworks AI(@FireworksAI_HQ)221 字 (约 1 分钟)
65

Fireworks AI announced the addition of GLM 5.1 LoRA RL functionality to its training platform, supporting SFT, DPO, and full RL.

入选理由:GLM 5.1 LoRA RL支持SFT、DPO和完整RL训练

FeaturedTweet#AI training#LoRA#GLM中文
𝐅𝐮𝐥𝐥-𝐏𝐚𝐫𝐚𝐦 𝐑𝐋 𝐧𝐨𝐰 𝐚𝐯𝐚𝐢𝐥𝐚𝐛𝐥𝐞 𝐟𝐨𝐫 𝐊𝐢𝐦𝐢 𝐊𝟐.𝟔

You've been told only 3 ...

_FULL-PARAM RL NOW AVAILABLE FOR KIMI K2.6_

Fireworks AI(@FireworksAI_HQ)216 字 (约 1 分钟)
65

Fireworks AI announces its Kimi K2.6 model now supports full-parameter reinforcement learning (Full-Param RL), allowing users to fully customize training on it to achieve the maximum data moat.

入选理由:Fireworks AI的Kimi K2.6模型现在支持全参数强化学习,提供256K上下文。

FeaturedTweet#AI#Machine Learning#Reinforcement Learning#Fireworks AI#Kimi K2.6中文
Microsoft MAI models. Coming soon to Fireworks.

Intelligence you control. End-to-end lineage you ca...

Microsoft MAI Models Coming Soon to Fireworks.

Fireworks AI(@FireworksAI_HQ)102 字 (约 1 分钟)
55

Fireworks AI announces the upcoming release of Microsoft MAI models, offering enterprise-level customized reasoning model services.

入选理由:Microsoft MAI 模型将集成至 Fireworks 平台,提供企业级定制化推理能力。

FeaturedTweet#AI#Model Fine-Tuning#Enterprise Applications#Data Privacy#Traceability中英混合
Move from test to production by running high-performance inference directly on Foundry.

At #MSBuild...

Fireworks AI: High-Performance Inference on Foundry, Moving from Test to Production

Fireworks AI(@FireworksAI_HQ)177 字 (约 1 分钟)
55

Fireworks AI demonstrated an end-to-end workflow for high-performance inference directly on Foundry at MSBuild, emphasizing how unified infrastructure reduces latency, cost, and simplifies deployment.

入选理由:Fireworks AI 在 MSBuild 展示 Foundry 上的高性能推理解决方案。

FeaturedTweet#AI#Foundry#MSBuild#Enterprise Applications#High-Performance Inference英文
If you're calling a third-party API, your competitor can make the same call tomorrow.

@lqiao is the...

The article points out that using third-party APIs may allow competitors to gain the same capabilities, emphasizing the importance of building a durable moat through fine-tuned models and tight feedback loops.

入选理由:使用第三方API可能让竞争对手获得相同能力

FeaturedTweet#AI#API#Competitive Strategy中文
Most teams can pick frontier models.

Fewer can run them at production scale without hitting constra...

Fireworks AI on X: "Most teams can pick frontier models."

Fireworks AI(@FireworksAI_HQ)165 字 (约 1 分钟)
55

Most teams can choose frontier models, but few can deploy them at production scale without facing constraints.

入选理由:多数团队可选前沿模型

FeaturedTweet#AI#Model Deployment#Azure中文
a new era of hackathons:
2005→ look what i built with web search 
2016 → can you build a rails app i...

Fireworks AI on X: A New Era of Hackathons

Fireworks AI(@FireworksAI_HQ)184 字 (约 1 分钟)
50

Fireworks AI announces a new era of hackathons, showcasing rapid development with AI tools and platforms.

入选理由:2025: Spin up a CX bot with Claude in 5 minutes.

FeaturedTweet#AI#Hackathon#Product#Tech#Engineering中文
great agents need great infrastructure. proud to be @LangChain's Deep Agents Inference Partner at In...

Fireworks AI on X: 'Great agents need great infrastructure'

Fireworks AI(@FireworksAI_HQ)162 字 (约 1 分钟)
50

Fireworks AI announced its partnership with LangChain for Deep Agents Inference at Interrupt 2026.

入选理由:Fireworks AI成为LangChain Deep Agents Inference合作伙伴

FeaturedTweet#AI Agents#Infrastructure#LangChain#Fireworks AI中文
We’re looking forward to seeing how developers and enterprises use Fireworks AI on @Microsoft Foundr...

Fireworks AI Demonstrates on Microsoft Foundry

Fireworks AI(@FireworksAI_HQ)103 字 (约 1 分钟)
35

Fireworks AI announces its demonstration of AI tools on Microsoft Foundry and invites developers and enterprises to experience it at booth F111 during the MSBuild conference.

入选理由:Fireworks AI 将在 Microsoft Foundry 平台上展示其 AI 工具。

FeaturedTweet#AI#Microsoft#MSBuild#Fireworks AI#Enterprise Applications英文
Production AI systems place very different demands on infrastructure once workloads scale.

How?

Jo...

Production AI systems place very different demands on infrastructure once workloads scale

Fireworks AI(@FireworksAI_HQ)88 字 (约 1 分钟)
35

Production AI systems place very different demands on infrastructure once workloads scale, but the tweet provides no technical details or mechanisms, only serving as event promotion.

入选理由:生产级AI系统在扩展时对基础设施有特殊需求,但文中未说明具体表现。

FeaturedTweet#AI#Infrastructure#MSBuild#Fireworks AI英文

跨材料问答 · Fireworks AI

回答基于:Fireworks AI 相关 30 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.