Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! 

They reduce memory ...

ollama(@ollama)

ollama(@ollama)2026年6月5日

Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! They reduce memory ...

8.5Score

TL;DR · AI Summary

Gemma 4 QAT模型在Ollama和Hugging Face上线，显著降低内存需求并保持模型质量。

Key Takeaways

Gemma 4 QAT模型在Ollama和Hugging Face上线，支持多种模型大小。
QAT技术可降低内存需求，同时保持模型质量。
用户可通过ollama run命令运行不同版本的Gemma 4 QAT模型。

Outline

Jump quickly between sections.

§Gemma 4 QAT模型发布
Gemma 4 QAT模型在Ollama和Hugging Face上线，支持多种模型大小。
·QAT技术优势
QAT技术可降低内存需求，同时保持模型质量。
›模型运行方式
用户可通过ollama run命令运行不同版本的Gemma 4 QAT模型。

Mindmap

See how the topics connect at a glance.

查看大纲文本（无障碍 / 无 JS 友好）

Gemma 4 QAT模型发布
- QAT技术优势
  - 降低内存需求
  - 保持模型质量
- 模型运行方式
  - ollama run命令

Highlights

Key sentences worth saving and sharing.

Gemma 4 QAT weights are now available on Ollama! They reduce memory requirements while maintaining model quality.
— 第 1 段
⬇︎ 下载 PNG 𝕏 分享到 X
All Gemma 4 model sizes and their drafters are now optimized with QAT to cut memory requirements and maximize on-device performance!
— 第 2 段
⬇︎ 下载 PNG 𝕏 分享到 X
E2B: ollama run gemma4:e2b-it-qat E4B: ollama run gemma4:e4b-it-qat 12B: ollama run gemma4:12b-it-qat 26B: ollama run gemma4:26b-a4b-it-qat 31B: ollama run gemma4:31b-it-qat
— 第 1 段
⬇︎ 下载 PNG 𝕏 分享到 X

#Gemma#QAT#Ollama#Hugging Face#模型优化

Open original article

ollama on X: "Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! They reduce memory requirements while maintaining model quality. E2B: ollama run gemma4:e2b-it-qat E4B: ollama run gemma4:e4b-it-qat 12B: ollama run gemma4:12b-it-qat 26B: ollama run" / X

ollama

@ollama

Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! They reduce memory requirements while maintaining model quality. E2B: ollama run gemma4:e2b-it-qat E4B: ollama run gemma4:e4b-it-qat 12B: ollama run gemma4:12b-it-qat 26B: ollama run gemma4:26b-a4b-it-qat 31B: ollama run gemma4:31b-it-qat Try them with ollama launch integrations to use with your favorite tools 👇👇👇

Google Gemma

@googlegemma

Jun 5

We just dropped Gemma 4 Quantization-Aware Training (QAT) checkpoints on Hugging Face! All Gemma 4 model sizes and their drafters are now optimized with QAT to cut memory requirements and maximize on-device performance!

6:32 PM · Jun 5, 2026

107.3K

Views

4

1

41

5

9

159

.

K

1.5K

7

3

2

732

Read 41 replies