T
traeai
Sign in
返回首页
ollama(@ollama)

Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! They reduce memory ...

8.5Score
Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! 

They reduce memory ...

TL;DR · AI Summary

Gemma 4 QAT模型在Ollama和Hugging Face上线,显著降低内存需求并保持模型质量。

Key Takeaways

  • Gemma 4 QAT模型在Ollama和Hugging Face上线,支持多种模型大小。
  • QAT技术可降低内存需求,同时保持模型质量。
  • 用户可通过ollama run命令运行不同版本的Gemma 4 QAT模型。

Outline

Jump quickly between sections.

  1. §Gemma 4 QAT模型发布

    Gemma 4 QAT模型在OllamaHugging Face上线,支持多种模型大小。

  2. QAT技术可降低内存需求,同时保持模型质量。

  3. 用户可通过ollama run命令运行不同版本的Gemma 4 QAT模型。

Mindmap

See how the topics connect at a glance.

查看大纲文本(无障碍 / 无 JS 友好)
  • Gemma 4 QAT模型发布
    • QAT技术优势
      • 降低内存需求
      • 保持模型质量
    • 模型运行方式
      • ollama run命令

Highlights

Key sentences worth saving and sharing.

  • Gemma 4 QAT weights are now available on Ollama! They reduce memory requirements while maintaining model quality.

    第 1 段

    ⬇︎ 下载 PNG𝕏 分享到 X
  • All Gemma 4 model sizes and their drafters are now optimized with QAT to cut memory requirements and maximize on-device performance!

    第 2 段

    ⬇︎ 下载 PNG𝕏 分享到 X
  • E2B: ollama run gemma4:e2b-it-qat E4B: ollama run gemma4:e4b-it-qat 12B: ollama run gemma4:12b-it-qat 26B: ollama run gemma4:26b-a4b-it-qat 31B: ollama run gemma4:31b-it-qat

    第 1 段

    ⬇︎ 下载 PNG𝕏 分享到 X
#Gemma#QAT#Ollama#Hugging Face#模型优化
Open original article

ollama on X: "Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! They reduce memory requirements while maintaining model quality. E2B: ollama run gemma4:e2b-it-qat E4B: ollama run gemma4:e4b-it-qat 12B: ollama run gemma4:12b-it-qat 26B: ollama run" / X

ollama

@ollama

Gemma 4 Quantization-Aware Training (QAT) weights are now available on Ollama! They reduce memory requirements while maintaining model quality. E2B: ollama run gemma4:e2b-it-qat E4B: ollama run gemma4:e4b-it-qat 12B: ollama run gemma4:12b-it-qat 26B: ollama run gemma4:26b-a4b-it-qat 31B: ollama run gemma4:31b-it-qat Try them with ollama launch integrations to use with your favorite tools 👇👇👇

Google Gemma

@googlegemma

Jun 5

We just dropped Gemma 4 Quantization-Aware Training (QAT) checkpoints on Hugging Face! All Gemma 4 model sizes and their drafters are now optimized with QAT to cut memory requirements and maximize on-device performance!

6:32 PM · Jun 5, 2026

107.3K

Views

4

1

41

5

9

159

.

K

1.5K

7

3

2

732

Read 41 replies

AI may generate inaccurate information. Please verify important content.