AI Engineer

Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI

Score: 9.0
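AI key takeaways
  • Achieves high-performance inference at 40 tokens/s.
  • Covers the technical advantages and implementation details of the MLX framework.
  • Offers a new approach to building AI applications for mobile devices.
#LLM #Mobile #MLX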
Original video: http://www.youtube.com/watch?v=a2muGkT4WD4


4,211 views • Apr 20, 2026

See more: https://x.com/adrgrondin/status/20405... Speaker info:…


3 Comments

  • [@rayr268](http://www.youtube.com/@rayr268), 13 hours ago: This gives me so many ideas. Already have a good project in mind.
  • [@cedricmanouan2333](http://www.youtube.com/@cedricmanouan2333), 1 day ago: Insightful presentation!!!
  • [@jam_daniels](http://www.youtube.com/@jam_daniels), 1 day ago: great app!
