Gemma-4 12B + Hermes, Google AI Edge: EASY, GOOD & LOCAL!
Gemma-4 12B is an encoder-free, unified multimodal model that runs directly on laptops with 16GB VRAM. It matches the performance of the 26B MOE with less than half the memory footprint, ships with Hermes and agent tools, macOS Edge Gallery, and RTLM, and is released under Apache 2.0.
入选理由:Gemma-4 12B 无需分别的视觉/音频编码器,图像与音频直接映射到 LLM,减少延迟与内存开销。




