New @GoogleGemma 4 QAT (Quantization-Aware Training) checkpoints are here, so you can run models locally on consumer GPUs and mobile devices with minimal quality loss.
Google AI Developers(@googleaidevs)159 字 (约 1 分钟)
72
Google releases Gemma 4 QAT checkpoints, enabling local inference on consumer GPUs and mobile devices with Q4_0 GGUF format, keeping memory below 1GB while preserving high inference quality.
入选理由:Gemma 4 QAT 检查点采用 Q4_0 GGUF 格式,兼容所有尺寸模型,提升本地推理性能。
FeaturedTweet#Gemma#QAT#GGUF#mobile inference#quantization中文
