Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)

- 探讨了生成模型在扩展性上的关键挑战与解决方案。
- 分享了训练超大规模生成模型的硬件与软件优化策略。
- 提供了对生成式 AI 在实际应用中的未来展望。
Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana) - YouTube
Back 
Skip navigation
Search
Search with your voice
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)

[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
Tap to unmute
2x

Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)
AI Engineer 1,795 views 14 hours ago
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
Search
Copy link
Info
Shopping

If playback doesn't begin shortly, try restarting your device.
•
You're signed out
Videos you watch may be added to the TV's watch history and influence TV recommendations. To avoid this, cancel and sign in to YouTube on your computer.
Cancel Confirm
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
Share
[](http://www.youtube.com/watch?v=xOP1PM8fwnk "Share link")- [x] Include playlist
An error occurred while retrieving sharing information. Please try again later.
0:00
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)[](https://www.youtube.com/watch?v=wjk0ulMAkbc "Next (SHIFT+n)")
0:00 / 0:00
Live
•Watch full video
•
•
20:37 How Google DeepMind is researching the next Frontier of AI for Gemini —Raia Hadsell, VP of Research AI Engineer 67K views • 3 days ago Live Playlist ()Mix (50+)29:17 Taste & Craft: A Conversation with Tuomas Artman, CTO Linear & Gergely Orosz, @The Pragmatic Engineer AI Engineer 2.4K views • 19 hours ago Live Playlist ()Mix (50+)1:01:41 This is not the AI we were promised | The Royal Society The Royal Society 157K views • Streamed 2 months ago Live Playlist ()Mix (50+)12:17 Shrimp Forever | Trump: I Would Have Won Vietnam | Jet Fuel Running Low The Late Show with Stephen Colbert 641K views • 6 hours ago Live Playlist ()Mix (50+)1:45:00 Yann LeCun: Special Lecture on AI and World Models Al-Khwarizmi Applied Mathematics Webinar 14K views • 7 days ago Live Playlist ()Mix (50+)18:46 The Future of MCP — David Soria Parra, Anthropic AI Engineer 91K views • 2 days ago Live Playlist ()Mix (50+)1:08:22 Ken Goldberg: Good Old-Fashioned Engineering Can Close the 100,000 Year “Data Gap” in Robotics MIT Robotics 19K views • 2 weeks ago Live Playlist ()Mix (50+)14:11 Trump Extends Iran Ceasefire Again, Reads from Bible & Tucker Carlson Apologizes for Supporting Him Jimmy Kimmel Live 1M views • 6 hours ago Live Playlist ()Mix (50+)26:46 Everything We Got Wrong About Research-Plan-Implement - Dexter Horthy MLOps.community 90K views • 4 weeks ago Live Playlist ()Mix (50+)15:26 Gemma, DeepMind's Family of Open Models —Omar Sanseviero, Google DeepMind AI Engineer 15K views • 1 day ago Live Playlist ()Mix (50+)39:58 Ex-Google Exec: How to Position Yourself Now Before the Next AI Phase (2026–2027) | Mo Gawdat Silicon Valley Girl and Mo Gawdat 573K views • 3 weeks ago Live Playlist ()Mix (50+)46:21 Harness Engineering: How to Build Software When Humans Steer, Agents Execute —Ryan Lopopolo, OpenAI AI Engineer 49K views • 5 days ago Live Playlist ()Mix (50+)
Sign in to confirm you’re not a bot This helps protect our community. Learn more
Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)

AI Engineer
411K subscribers
Subscribe
Subscribed
70
Share
Save
Download
Download
1,795 views 14 hours ago
1,795 views • Apr 21, 2026
https://sander.ai/2025/04/15/latents.... Speaker info:…...more
...more
How this was made
Auto-dubbed
Audio tracks for some languages were automatically generated. Learn more
Transcript
Follow along using the transcript.
Show transcript

Show less
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)
Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)
1,795 views
Apr 21, 2026
70
Share
Save
Download
Download
0 Comments
Sort comments
Sort by
Top Show featured commentsNewest Show recent comments, including potential spam
Add a comment...
Comments
Top Show featured commentsNewest Show recent comments, including potential spam
In this video
Transcript
Description
Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)
70 Likes
1,795 Views
14h Ago
https://sander.ai/2025/04/15/latents.... Speaker info:
- https://sander.ai/
- https://github.com/benanne
- 
- https://x.com/sedielem
Timestamps 0:00 Introduction 2:55 Data Curation 4:02 Representation 9:39 Modeling: Diffusion Mechanism 20:01 Network Architecture 22:25 Training at Scale 23:33 Sampling & Guidance 28:03 Distillation 30:03 Control Signals…...more
...more Show less
How this was made
Auto-dubbed
Audio tracks for some languages were automatically generated. Learn more
Transcript
Follow along using the transcript.
Show transcript

Transcript
NaN / NaN

[How Google DeepMind is researching the next Frontier of AI for Gemini —Raia Hadsell, VP of Research](http://www.youtube.com/watch?v=zZsTVBXcbow&pp=ugUEEgJlbg%3D%3D)
AI Engineer
67K views • 3 days ago
New

[Taste & Craft: A Conversation with Tuomas Artman, CTO Linear & Gergely Orosz, @The Pragmatic Engineer](http://www.youtube.com/watch?v=wjk0ulMAkbc)
AI Engineer
2.4K views • 19 hours ago
New

[This is not the AI we were promised | The Royal Society](http://www.youtube.com/watch?v=CyyL0yDhr7I)
The Royal Society
157K views • Streamed 2 months ago

[Shrimp Forever | Trump: I Would Have Won Vietnam | Jet Fuel Running Low](http://www.youtube.com/watch?v=b4LiQ-qhOcM)
The Late Show with Stephen Colbert
641K views • 6 hours ago
New

[Yann LeCun: Special Lecture on AI and World Models](http://www.youtube.com/watch?v=vJKC31YpA8c)
Al-Khwarizmi Applied Mathematics Webinar
14K views • 7 days ago

[The Future of MCP — David Soria Parra, Anthropic](http://www.youtube.com/watch?v=v3Fr2JR47KA&pp=ugUEEgJlbg%3D%3D)
AI Engineer
91K views • 2 days ago
New

[Ken Goldberg: Good Old-Fashioned Engineering Can Close the 100,000 Year “Data Gap” in Robotics](http://www.youtube.com/watch?v=EtK5es0jVM4&pp=0gcJCcMKAYcqIYzv)
MIT Robotics
19K views • 2 weeks ago

[Trump Extends Iran Ceasefire Again, Reads from Bible & Tucker Carlson Apologizes for Supporting Him](http://www.youtube.com/watch?v=TS6WWmzcCpA&pp=ugUHEgVlbi1VUw%3D%3D)
Jimmy Kimmel Live
1M views • 6 hours ago
New

[Everything We Got Wrong About Research-Plan-Implement - Dexter Horthy](http://www.youtube.com/watch?v=YwZR6tc7qYg&pp=ugUEEgJlbg%3D%3D)
MLOps.community
90K views • 4 weeks ago

[Gemma, DeepMind's Family of Open Models —Omar Sanseviero, Google DeepMind](http://www.youtube.com/watch?v=_gVFUEdhCyI&pp=ugUEEgJlbg%3D%3D)
AI Engineer
15K views • 1 day ago
New

[Ex-Google Exec: How to Position Yourself Now Before the Next AI Phase (2026–2027) | Mo Gawdat](http://www.youtube.com/watch?v=E0Q96IKXx6Q)
Silicon Valley Girl and Mo Gawdat
573K views • 3 weeks ago

[Harness Engineering: How to Build Software When Humans Steer, Agents Execute —Ryan Lopopolo, OpenAI](http://www.youtube.com/watch?v=am_oeAoUhew&pp=ugUEEgJlbg%3D%3D)
AI Engineer
49K views • 5 days ago
New

[The Decline of Trump Presidency and Revisioning of the United States | Explainer](http://www.youtube.com/watch?v=twLoOT3H9SY&pp=ugUHEgVlbi1VUw%3D%3D)
Heather Cox Richardson
17K views • 5 hours ago
New

[The New Application Layer - Malte Ubl, CTO Vercel](http://www.youtube.com/watch?v=XKup1pj-34M&pp=ugUEEgJlbg%3D%3D)
AI Engineer
11K views • 1 day ago
New

[Demis Hassabis: Why AGI is Bigger than the Industrial Revolution & Where Are The Bottlenecks in AI](http://www.youtube.com/watch?v=SSya123u9Yk&pp=ugUHEgVlbi1VUw%3D%3D)
20VC with Harry Stebbings
264K views • 2 weeks ago

[The Friction is Your Judgment — Armin Ronacher & Cristina Poncela Cubeiro, Earendil](http://www.youtube.com/watch?v=_Zcw_sVF6hU&pp=ugUEEgJlbg%3D%3D)
AI Engineer
17K views • 3 days ago
New

[Anthropic's CEO: ‘We Don’t Know if the Models Are Conscious’ | Interesting Times with Ross Douthat](http://www.youtube.com/watch?v=N5JDzS9MQYI)
Interesting Times with Ross Douthat and 2 more
436K views • 2 months ago

[Richard Sutton – Father of RL thinks LLMs are a dead end](http://www.youtube.com/watch?v=21EYKqUsPfg)
Dwarkesh Patel
712K views • 6 months ago

[Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi](http://www.youtube.com/watch?v=mYSRn6PC1mc&pp=ugUEEgJlbg%3D%3D)
AI Engineer
9.9K views • 1 day ago
New

[Practical Deep Learning for Coders: Lesson 1](http://www.youtube.com/watch?v=8SF_h3xF3cE)
Jeremy Howard
541K views • 3 years ago
Show more
[](http://www.youtube.com/watch?v=xOP1PM8fwnk)