T
traeai
Sign in

概念

NVFP4

NVIDIA 模型优化器生成的量化格式,用于本地快速推理。

相关材料

已收录 3 条与 NVFP4 相关的内容,按评分排序。

Holo3.1: Fast & Local Computer Use Agents

Holo3.1: Fast & Local Computer Use Agents

Hugging Face Blog808 字 (约 4 分钟)
85

Holo3.1 is Hugging Face's new computer-use agent model supporting cross-platform, multi-framework deployment and first releasing quantized weights (FP8/Q4 GGUF/NVFP4) for local inference.

入选理由:Holo3.1 在 AndroidWorld 上 35B-A3B 模型准确率从 67% 提升至 79.3%

FeaturedArticle#Computer Use Agent#Hugging Face#Quantized Model#Mobile Automation英文
Long video generation is a systems problem.

Introducing LongLive-2.0 from NVIDIA Research: an end-t...

NVIDIA Research releases LongLive-2.0 system that adopts end-to-end NVFP4 training and inference architecture to solve long video generation problems, eliminating model deployment gaps through unified training-inference precision alignment while improving speed and memory efficiency.

入选理由:LongLive-2.0采用NVFP4低精度训练推理架构

FeaturedTweet#NVIDIA#Video Generation#Low Precision Computing#AI Systems英文
Nvidia presents LongLive-2.0

An NVFP4 Parallel Infrastructure for Long Video Generation

Nvidia presents LongLive-2.0

AK(@_akhaliq)52 字 (约 1 分钟)
45

Nvidia releases LongLive-2.0, an NVFP4 parallel infrastructure for long video generation, but the tweet only announces the product name without disclosing any technical implementation details.

入选理由:Nvidia发布LongLive-2.0长视频生成基础设施

FeaturedTweet#Nvidia#Video Generation#NVFP4#Parallel Computing#AI Infrastructure英文

跨材料问答 · NVFP4

回答基于:NVFP4 相关 3 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.