T
traeai
Sign in

产品

CUDA

别名:CUDA生态

NVIDIA 的并行计算平台与应用模型,广泛用于AI与高性能计算。

已跟踪 8 条高相关材料

TraeAI 观察

相关材料

已收录 8 条与 CUDA 相关的内容,按评分排序。

#567. 黄仁勋:Agent 时代普通人和企业的新生产力,AI 基础设施竞赛下的计算革命

Jensen Huang announced at GTC Taipei 2026 that the Agentic AI era has arrived, shifting AI from content generation to autonomous task execution. NVIDIA launched infrastructure products like Vera Rubin and Vera CPU, driving a computing paradigm shift where AI becomes a direct generator of profit and GDP.

入选理由:NVIDIA发布Vera Rubin超级计算系统,专为Agent设计,支持解耦、异构和分布式AI工作负载。

FeaturedPodcast#AI Agent#NVIDIA#Vera Rubin#Agentic AI#AI Infrastructure中文
Introducing NVIDIA Nemotron 3 Ultra: An Open 550B Model for Long-Running Agents

NVIDIA today launches Nemotron 3 Ultra, a 550B-parameter open model built on the same architecture as Nemotron 3 Super, optimized for long-running AI agents. It employs LatentMoE to quadruple the number of experts at the same inference cost, introduces multi-token prediction to boost single-user inference speed, and is released under the Linux Foundation’s Open MDW license to enable enterprise deployment.

入选理由:Nemotron 3 Ultra 为 550B 参数模型,基于与 Nemotron 3 Super 相同架构,面向长时运行的智能代理场景。

FeaturedVideo#NVIDIA#Nemotron#AI Agent#LatentMoE#OpenMDW英文
Your Coding Agent Should Do AI System Engineering — Ben Burtenshaw, Hugging Face

Your Coding Agent Should Do AI System Engineering

AI Engineer4747 字 (约 19 分钟)
85

This talk proposes that AI system engineering should be handled by coding agents through three progressive steps addressing hardware optimization, model training, and automated research, emphasizing standardized repositories and Hugging Face Hub's role.

入选理由:编码代理能有效编写优化的CUDA内核,提升推理速度达30%-50%(如AMD hackathon案例)

FeaturedVideo#AI System Engineering#CUDA#Hugging Face#LLM#Multi-Agent Systems英文
DeepSeek V4 Flash 可以在 128GB 的 M3 Max 运行,还是 1M 上下文

DeepSeek V4 Flash 可以在 128GB 的 M3 Max 运行,还是 1M 上下文

掘金本周最热3702 字 (约 15 分钟)
85

DeepSeek V4 Flash 模型通过不对称优化和硬件特性绑定,在 128GB 内存的 M3 Max MacBook Pro 上实现了 1M 上下文的稳定运行。

入选理由:DeepSeek V4 Flash 使用不对称 2-bit 量化,仅对 MoE 专家部分进行量化,保持关键路径全精度。

FeaturedArticle#DeepSeek#MoE#量化#Apple Silicon#CUDA中文
Private, Local AI CUDA Coding Assistance on DGX Spark

Private, Local AI CUDA Coding Assistance on DGX Spark

NVIDIA Developer354 字 (约 2 分钟)
82

Nsight Copilot runs offline on DGX Spark using 128GB VRAM to deploy GPT OSS 12B NIM + CUDA RAG pipeline, delivering privacy-preserving, cloud-cost-free AI coding assistance for CUDA developers.

入选理由:Nsight Copilot 支持在 DGX Spark(128GB 显存)上本地部署 GPT OSS 12B NIM + CUDA RAG 管道,实现完全离线运行。

FeaturedVideo#CUDA#AI Coding Assistant#NVIDIA#Local LLM#DGX Spark英文
CUDA Proves Nvidia Is a Software Company

CUDA Proves Nvidia Is a Software Company

Wired AI757 字 (约 4 分钟)
75

The article analyzes how CUDA proves that NVIDIA is a software company, emphasizing its software strategy in the GPU computing ecosystem.

入选理由:CUDA是NVIDIA构建软件生态的核心工具

FeaturedArticle#CUDA#NVIDIA#Software Ecosystem中文
Mixpanel 创始人 @Suhail 提出了一个对于美国 AI 公司们很实现的问题:

当中国在浮点运算层面的算力实现独立后,他们的开源贡献会逐渐迁移到一套美国 "用不了、也不能用" 的技术栈上。...

Suhail's Concern About U.S. AI Companies and China's Compute Independence

meng shao(@shao__meng)498 字 (约 2 分钟)
65

China's independence in floating-point compute may lead its open-source contributions to shift toward tech stacks unusable by the U.S., posing risks to American AI research and infrastructure.

入选理由:中国算力独立后,开源贡献可能转向美国无法使用的技术栈

FeaturedTweet#AI#Compute#Open Source#China-US Tech#Tech Stack中英混合
老黄当年决定做 CUDA 的时候
大概也没想到后来会有一万个人为这个决定辩经
人类太沉迷寻找证据寻找因果了

Human decisions are more driven by hormones than rational analysis, prompting reflection on the motivations behind technological choices.

入选理由:人类决策主要由激素推动,而非知识或经验。

FeaturedTweet#Decision Psychology#CUDA#Artificial Intelligence中英混合

跨材料问答 · CUDA

回答基于:CUDA 相关 8 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.