Introducing Gemma 4 12B: a unified, encoder-free multimodal model
The Keyword (blog.google)693 字 (约 3 分钟)
87
Gemma 4 12B is a unified, encoder-free multimodal model bringing high-performance multimodal intelligence to your laptop. It matches the performance of our 26B MoE at less than half the memory footprint, supports native audio inputs, and runs locally on 16GB VRAM hardware with low-latency multi-step reasoning.
入选理由:Gemma 4 12B 性能接近 26B MoE,内存仅其一半,适合在 16GB VRAM 现代本机运行。
FeaturedArticle#Gemma 4#12B#multimodal#unified architecture#encoder-free英文
