I remember when people were saying "It's useless to open-source big models because nobody will be ab...
Cerebras is now running the trillion-parameter Kimi K2.6 model in enterprise trials at ~1,000 tokens/s, shattering the old belief that open-source large models are impractical due to hardware limitations.
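Why it was selected: In enterprise trials, Cerebras is running Kimi K2.6 (a trillion-parameter model) at about 1,000 tokens/s, setting the current record for fastest inference.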










