Kimi K2.6 by @Kimi_Moonshot is now live on Arena. Evaluate it in Battle Mode across Text, Vision, C...

- Kimi K2.6在HLE、SWE-Bench等编码基准测试中达到当前开源SOTA水平。
- 新增长程编码能力,支持4000+ token上下文的复杂任务处理。
- 支持Battle Mode多维度横向评测,包括图像转Web开发和文档理解等新场景。
Evaluate it in Battle Mode across Text, Vision, Code, Image-to-WebDev, and Document Arena!
Scores incoming - start prompting and get voting. https://t.co/SZVN4R3eGu" / X
Post
Conversation

Kimi K2.6 by
is now live on Arena. Evaluate it in Battle Mode across Text, Vision, Code, Image-to-WebDev, and Document Arena! Scores incoming - start prompting and get voting.

Quote

Kimi.ai
@Kimi_Moonshot
12h
Meet Kimi K2.6: Advancing Open-Source Coding !Image 4: 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: !Image 5: 🔹Long-horizon coding - 4,000+
