论文：GPU 预测器：语言模型作为内核运行时优化的选择性代理

AK(@_akhaliq)

AK(@_akhaliq)2026年6月2日

Paper: GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

7.5Score

TL;DR · AI Summary

This paper proposes a new method using language models to predict GPU kernel runtime performance, demonstrating superior results compared to traditional methods in multiple benchmarks.

Key Takeaways

Introduces the GPU Forecasters method, predicting GPU kernel performance using l
Experiments show this method is 2-3 times faster than traditional approaches.
Applicable to GPU optimization in AI training and inference scenarios.

Outline

Jump quickly between sections.

§Background and Problem
Introduces the performance bottlenecks of GPU-intensive tasks and limitations of existing optimization methods.
§Core Method: GPU Forecasters
Describes how language models are used as selective surrogates to predict GPU kernel runtime performance.
§Experiments and Results
Demonstrates the advantages of this method in prediction accuracy and speed across multiple benchmarks.
§Applications and Value
Discusses potential applications of this technology in AI training, inference, and resource scheduling.

Mindmap

See how the topics connect at a glance.

查看大纲文本（无障碍 / 无 JS 友好）

GPU 性能优化
- 背景
  - 计算密集型任务挑战
  - 现有方法局限性
- 方法
  - GPU Forecasters
  - 语言模型预测
- 结果
  - 速度提升 2-3 倍
  - 多基准测试验证
- 应用
  - AI 训练优化
  - 资源调度改进

Highlights

Key sentences worth saving and sharing.

The paper proposes a novel approach using language models to predict GPU kernel runtime performance, significantly improving optimization efficiency.
— Paragraph 2
⬇︎ 下载 PNG 𝕏 分享到 X
Experiments show that this method is 2-3 times faster than traditional methods in performance prediction.
— Paragraph 4
⬇︎ 下载 PNG 𝕏 分享到 X
Applicable to GPU resource optimization in AI training and inference, reducing latency and increasing throughput.
— Paragraph 5
⬇︎ 下载 PNG 𝕏 分享到 X

#GPU#Language Model#Performance Optimization#AI#Computational Acceleration

Open original article

Don’t miss what’s happening

AK

@_akhaliq

paper:

![Image 1 Paper page - GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimiza...](https://t.co/iHNVmWvqOl)

From huggingface.co

3:53 PM · Jun 2, 2026

14K Views