Tencent Hunyuan Open-Sources Chronicles-OCR Benchmark for Ancient Chinese Characters
Tencent Hunyuan has open-sourced Chronicles-OCR, a visual perception benchmark that evaluates VLLMs on ancient Chinese characters. The dataset spans 3,000 years of script evolution, covering 7 historical scripts from Oracle Bone to Cursive in 2,800 balanced images. It defines 4 core tasks: Character Spotting, Fine-grained Recognition, Ancient Text Parsing, and Script Classification.
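Why it was selected: the dataset covers 3,000 years of Chinese character evolution and includes 7 historical scripts and 2,800 balanced images.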
