Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
[Journal club] Improved Mean Flows: On the Chal...
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
Semantic Machine Intelligence Lab., Keio Univ.
PRO
December 24, 2025
Technology
190
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
[Journal club] Improved Mean Flows: On the Challenges of Fastforward Generative Models
Semantic Machine Intelligence Lab., Keio Univ.
PRO
December 24, 2025
More Decks by Semantic Machine Intelligence Lab., Keio Univ.
See All by Semantic Machine Intelligence Lab., Keio Univ.
[Journal club ] PHyCLIP: ðð-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning
keio_smilab
PRO
0
36
[Journal club] ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation
keio_smilab
PRO
0
99
[Journal club] ReLaGS: Relational Language Gaussian Splatting
keio_smilab
PRO
0
94
[Journal club] Flow as the Cross-Domain Manipulation Interface
keio_smilab
PRO
0
87
Mobi-ð: Mobilizing Your Robot Learning Policy
keio_smilab
PRO
0
150
A Gentle Introduction to Transformers
keio_smilab
PRO
16
6.7k
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching
keio_smilab
PRO
0
58
[Journal club] VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
keio_smilab
PRO
0
130
[Journal club] MemER: Scaling Up Memory for Robot Control via Experience Retrieval
keio_smilab
PRO
0
140
Other Decks in Technology
See All in Technology
å®è£ ã¯éããªã£ããã¬ãã¥ãŒã¯ã©ãããïŒ â èªèº«ã®ã¬ãã¥ãŒãAIã§åçŸããããµãŒãŽã¡ã³ããšã³ãžãã¢ãªã³ã°ã®ããã / Implementation got faster. So what about reviews? â An invitation to Servant Engineering: Recreating your own code reviews with AI
nrslib
7
4.1k
LLMãã䞻圹ãã«ããªãããã® 3ã€ã®åå
techtekt
PRO
0
120
ãéãäœãããããæ£ããäœãããž â çæAIæä»£ã®éçºãããŒæ¹é©ã® ããŒãããããšå®è¡ â
starfish719
0
8.2k
noUncheckedIndexedAccessã3æéã1äžåã / noUncheckedIndexedAccess, 3 Hours, 10,000 JPY.
kaonavi
1
310
GoãšSIMDãšWasmã®ä»ã
askua
3
510
AIã«ãããŒãäœãããããšããŠæ«æãã話
hamatsutaichi
0
210
Sony_KMP_Journey_KotlinConf2026
sony
2
210
ã³ãŒãã¬ãã¥ãŒãå¶ããããŒã ããœãããŠã§ã¢ããªããªãŒã®ãããŒãå¶ã / Beyond Code Review: Distributing Its Responsibilities Across the SDLC
mtx2s
4
1.2k
éçšãèŠæ®ããAIãšãŒã·ãã§ã³ãèšèšå®è·µ
amacbee
1
3.1k
10åã®çç£æ§ãå®çŸããAIé§å䞊åãšãŒãžã§ã³ãã®ãã¹ãŠ
kumaiu
4
820
Claude CodeÃTerraform IaC ãã³ãã¬ãŒãé§åéçº
itouhi
1
400
Dario AmodiãPolicy on the AI Exponentialããçè§£ãã
nagatsu
0
200
Featured
See All Featured
Code Reviewing Like a Champion
maltzj
528
40k
Impact Scores and Hybrid Strategies: The future of link building
tamaranovitovic
0
300
Fireside Chat
paigeccino
42
3.9k
HTML-Aware ERB: The Path to Reactive Rendering @ RubyCon 2026, Rimini, Italy
marcoroth
1
160
sira's awesome portfolio website redesign presentation
elsirapls
0
270
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
220
HDC tutorial
michielstock
2
690
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
Gemini Prompt Engineering: Practical Techniques for Tangible AI Outcomes
mfonobong
2
420
Design in an AI World
tapps
1
220
SERP Conf. Vienna - Web Accessibility: Optimizing for Inclusivity and SEO
sarafernandez
2
1.5k
Learning to Love Humans: Emotional Interface Design
aarron
275
41k
Transcript
ææµŠåæç 究宀 効尟幞暹 Improved Mean Flows: On the Challenges of Fastforward
Generative Models Zhengyang Geng1,2,3,* Yiyang Lu4,2,â Zongze Wu3 Eli Shechtman3 J. Zico Kolter1 Kaiming He2 1CMU 2MIT 3Adobe 4THU Geng, Z., Lu, Y., Wu, Z., Shechtman, E., Kolter, J. Z., & He, K. (2025). Improved Mean Flows: On the Challenges of Fastforward Generative Models. arXiv preprint arXiv:2512.02012, 2025
æŠèŠ 2 âª èæ¯ïžMean Flows J 1-NFEã§âŸŒå質ãªâœ£æ L æ°åŒã«ç²ãè¿äŒŒãååš âª
ææ¡ïžImproved Mean Flows J Mean Flows ã«ãããæ°åŒçãªåé¡ãæ¹å J æè»ãª Classifier Free Guidance J in-context conditioning ã«ãã軜éå âª çµæ J 1-NFE ã§å€ãã® Multi-NFE ã¢ãã«ãäžåã
èæ¯ïžMean Flows ã®æ°åŒã¯äžæ£ç¢º 3 âª æ¡æ£ã¢ãã«ã Flow Matching ã¯âŸŒæ§èœã ãèšç®ã³ã¹ãã⟌ã âª
ODE ãè§£ãéã«å€ãã® NFE ãå¿ èŠ âª {1, few}-NFEã®ã¢ãã«ãå°é ⪠Mean Flows [Geng+, NeurIPS25] ⪠ç¬éé床ã§ã¯ãªãå¹³åé床ãäºæž¬ J 1-NFE ã§âŸŒå質ãªâœ£æãå¯èœ L GTã®èšç®ãå°é£ âª äžæ£ç¢ºãªè¿äŒŒïŒåŸè¿°ïŒ MoFlow [Fu+, CVPR25] Mean Flows [Geng+, NeurIPS25] Mean Flows [Geng+, NeurIPS25] ð§! ïžæå» ð ã«ããããã€ãºä»ãããŒã¿, ð¡, ðïžæå»
é¢é£ç ç©¶ïžMean Flows ã®æ¹å 4 âŒ¿æ³ ç¹åŸŽ AlphaFlow [Zhang+, 25] Flow
Matching ãã MeanFlow ãžæ®µéçã«ç§»âŸãã ã«ãªãã¥ã©ã åŠç¿âŒ¿æ³ Decoupled MeanFlow [Lee+, 25] äºååŠç¿æžã¿ Flow Matching ã¢ãã«ã fine-tuning ã㊠MeanFlow ã¢ãã«ãžå€æ CMT [Hu+, 25] äºååŠç¿ãšäºåŸåŠç¿ã®éã«âŒè²«æ§æå€±ã✀ããäžéåŠç¿ãå°âŒ äºåŸåŠç¿ã«ããã MeanFlow ã¢ãã«ã®æ§èœãåäž Decoupled Meanflow [Lee+, 25] AlphaFlow [Zhang+, 25]
⪠ã¢ãã«ã¯ä»»æã®æå» ð ããä»»æã®æå» ð¡ ãžã®å¹³åé床ãäºæž¬ 1ã¹ããã✣æãå¯èœïŒð = 0, ð¡
= 1ïŒ âª æå€±é¢æ° åæïŒ1/2ïŒ: Mean FlowsïŒæŠèŠïŒ 5 ð¡, ð â [0, 1] ð¥~ð!"#" ð~ð$%&'% (e.g. ã¬ãŠã¹ååž ) ð¢( ïžãã¥ãŒã©ã«ãããã¯ãŒã¯ sg ã» ïžstop gradient JVP ã» ïžJacobian Vector Product Mean Flows [Geng+, NeurIPS25]
åæïŒ2/2ïŒ: Mean FlowsïŒå°åºã»åé¡ç¹ïŒ 6 ð¢ ð§&, ð, ð¡ + ð¡
â ð ð ðð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& ð ðð¡ ð¡ â ð ð¢ ð§&, ð, ð¡ = ð ðð¡ - ' & ð£ ð§( ðð ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ , where ç©ã®åŸ®å ⹠⹠⹠⎠ð§" ð¥ ð ð¡ â ð ãæããåŸïŒð¡ ã§åŸ®å
åæïŒ2/2ïŒ: Mean FlowsïŒå°åºã»åé¡ç¹ïŒ 7 ð¢ ð§&, ð, ð¡ + ð¡
â ð ð ðð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& ð ðð¡ ð¡ â ð ð¢ ð§&, ð, ð¡ = ð ðð¡ - ' & ð£ ð§( ðð ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ , where ç©ã®åŸ®å = ð â ð¥, 0, 1 / ð# , ð$ , ð" ð¢ = ð â ð¥ / ð# ð¢ + 0 / ð$ ð¢ + 1 / ð" ð¢ J JVPã✀ããŠèšç®å¯èœ ð ðð¡ ð¢ ð§" , ð, ð¡ = ðð§" ðð¡ ð# ð¢ + ðð ðð¡ ð$ ð¢ + ðð¡ ðð¡ ð" ð¢ ⹠⹠⹠⎠ð§" ð¥ ð ð¡ â ð ãæããåŸïŒð¡ ã§åŸ®å
åæïŒ2/2ïŒ: Mean FlowsïŒå°åºã»åé¡ç¹ïŒ 8 ð¢ ð§&, ð, ð¡ + ð¡
â ð ð ðð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& ð ðð¡ ð¡ â ð ð¢ ð§&, ð, ð¡ = ð ðð¡ - ' & ð£ ð§( ðð ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ , where ç©ã®åŸ®å = ð â ð¥, 0, 1 / ð# , ð$ , ð" ð¢ = ð â ð¥ / ð# ð¢ + 0 / ð$ ð¢ + 1 / ð" ð¢ J JVPã✀ããŠèšç®å¯èœ ð ðð¡ ð¢ ð§" , ð, ð¡ = ðð§" ðð¡ ð# ð¢ + ðð ðð¡ ð$ ð¢ + ðð¡ ðð¡ ð" ð¢ ⹠⹠⹠⎠L åé¡â ð§" ð¥ ð ð¡ â ð ãæããåŸïŒð¡ ã§åŸ®å ð ðð ð â ð L åé¡â¡ L Marginal Velocity ã Conditional Velocity ã§è¿äŒŒ
ææ¡âŒ¿æ³ (1/4) : ð-ð¥ðšð¬ð¬ (å¹³åé床ã§ã¯ãªãç¬éé床ãååž°) 9 ⪠ð-ð¥ðšð¬ð¬ ⪠ð-ð¥ðšð¬ð¬
â , where L GTã®èšç®ãå°é£ J ç¬éé床ãååž° ð£ ð§& = ð¢ ð§& , ð, ð¡ + ð¡ â ð ð ðð¡ ð¢ ð§& , ð, ð¡ ð¢ ð§&, ð, ð¡ = ð£ ð§& â ð¡ â ð ð ðð¡ ð¢ ð§&, ð, ð¡ ⎠ð¢454
ææ¡âŒ¿æ³ (2/4) : JVPãžã®äžé©åãªâŒâŒãæ¹å 10 ⪠Mean Flows (MF) âª
Improved Mean Flows (iMF) = ð â ð¥, 0, 1 / ð# , ð$ , ð" ð¢ = ð â ð¥ / ð# ð¢ + 0 / ð$ ð¢ + 1 / ð" ð¢ åæ²: Marginal Velocity ã Conditional Velocity ã§è¿äŒŒ ð ðð¡ ð¢ ð§" , ð, ð¡ = ðð§" ðð¡ ð# ð¢ + ðð ðð¡ ð$ ð¢ + ðð¡ ðð¡ ð" ð¢ L äžé©åãªè¿äŒŒ (ð¢6 ; ð£6 ) , where ð£6 (ð§4 , ð¡) = ð¢6 (ð§4 , ð¡, ð¡) L MFã®æå€±ã¯å¢å
ææ¡âŒ¿æ³ (3/4) : æšè«æã«ãã©ã¡ãŒã¿ã決å®å¯èœãªCFG 11 ⪠åæ: Classifier Free Guidance
(CFG) J æšè«æã«æ¡ä»¶âŒâŒ{æ, ç¡}ã®ã¢ãã«ã®éã¿ã¥ãåã✀ããŠæ§èœåäž âª Mean Flowsã«ãããCFG ⪠Flexible Guidance (ð ãæ¡ä»¶ãšããŠâŒâŒ) J æšè«æã« ð ãæ±ºå®å¯èœãªæè»ãªèšèš & ð£# ð§$ , ð¡ ð) = 1 + ð ð£% ð§$ , ð¡| ð â ð ð£% ð§$ , ð¡ ð) & ð£$ = ð ð â ð¥ â 1 â ð ð¢ # &'( ð§, ð¡, ð¡ where ð¢)() &'( = & ð£$ â ð¡ â ð JVP(ð¢ # &'(; & ð£$ ) , â sg ð¢!"! #$" , ðïžæ¡ä»¶ ðïžç¡æ¡ä»¶ ðïžã¬ã€ãã³ã¹ã¹ã±ãŒã« L èšç·Žæã« ð ãèšå® < | ð, Ï ð§$ | ð, Ï
ææ¡âŒ¿æ³ (4/4) : In-context Conditioning ã«ãã軜éå 12 ⪠åæïžDiT ã¢ãŒããã¯ãã£
⪠æ¡ä»¶ã¯ AdaLN-zero ã§åŠç âª å šãŠã®æ¡ä»¶ãåã«âŸããŠãã L è€æ°æ¡ä»¶ãé©åã«æ±ããªã L ãã©ã¡ãŒã¿æ°ãå€ã n ä»ã®æ¡ä»¶ä»ãã¯æ§èœãäžâŒå ⪠Improved In-context Conditioning ⪠æ¡ä»¶ããã€ãºã« concat ã㊠DiT ã«âŒâŒ J æ¡ä»¶ã®ããŒã¯ã³æ°ãè€æ°åã«ããããšã§å®âœ€å¯èœã« (class token: 8, ãã®ä»: 4) J AdaLN-zero ãåãé€ããŠ33%軜éå (e.g. 133M â 89M) DiT [Peebles+, ICCV23]
ææ¡âŒ¿æ³ (4/4) : In-context Conditioning ã«ãã軜éå 13 ⪠åæïžDiT ã¢ãŒããã¯ãã£
⪠æ¡ä»¶ã¯ AdaLN-zero ã§åŠç âª å šãŠã®æ¡ä»¶ãåã«âŸããŠãã L è€æ°æ¡ä»¶ãé©åã«æ±ããªã L ãã©ã¡ãŒã¿æ°ãå€ã n ä»ã®æ¡ä»¶ä»ãã¯æ§èœãäžâŒå ⪠Improved In-context Conditioning ⪠æ¡ä»¶ããã€ãºã« concat ã㊠Transformer ã«âŒâŒ J æ¡ä»¶ã®ããŒã¯ã³æ°ãè€æ°åã«ããããšã§å®âœ€å¯èœã« (class token: 8, ãã®ä»: 4) J AdaLN-zero ãåãé€ããŠè»œéå (e.g. 133M â 89M) DiT [Peebles+, ICCV23] AdaLN-zero Cross-Attn In-context Conditioning
ææ¡âŒ¿æ³ (4/4) : In-context Conditioning ã«ãã軜éå 14 ⪠åæïžDiT ã¢ãŒããã¯ãã£
⪠æ¡ä»¶ã¯ AdaLN-zero ã§åŠç âª å šãŠã®æ¡ä»¶ãåã«âŸããŠãã L è€æ°æ¡ä»¶ãé©åã«æ±ããªã L ãã©ã¡ãŒã¿æ°ãå€ã n ä»ã®æ¡ä»¶ä»ãã¯æ§èœãäžâŒå ⪠Improved In-context Conditioning ⪠æ¡ä»¶ããã€ãºã« concat ã㊠DiT ã«âŒâŒ J æ¡ä»¶ã®ããŒã¯ã³æ°ãè€æ°åã«ããããšã§å®âœ€å¯èœã« (class token: 8, ãã®ä»: 4) J AdaLN-zero ãåãé€ããŠ33%軜éå (e.g. 133M â 89M) DiT [Peebles+, ICCV23]
å®éšèšå® 15 ⪠ã¿ã¹ã¯ïžã¯ã©ã¹ã©ãã«ã«åºã¥ãç»å✣æ ⪠ããŒã¿ã»ããïžImageNetïŒ256x256ïŒ âª âœ£æâœ æ³ âª æœåšç©ºéã§âœ£æ
(4x32x32) ⪠äºååŠç¿æžã¿ã®VAEã䜿✀ ⪠{1,2}-NFE ✣æ ⪠è©äŸ¡ææšïžFréchet Inception Distance (FID) ⪠ããŒããŠã§ã¢æ§æïžèšèŒãªã ⪠èšç·ŽæéïžèšèŒãªã
å®éççµæïž1-NFE ã§å€ãã® Multi-NFE ã¢ãã«ãäžåã 16 ⪠Multi-NFE ã¢ãã«ãšâœèŒããŠ500~2000åã®1ã®æšè«åæ°ã ãæ§èœã¯åç J scratch
ããèšç·Žå¯èœïŒ=èžçãªãïŒãªã®ã§æåž«ã¢ãã«ãäžèŠ âŸŒå質ã ã èšç®ã³ã¹ãã⟌ã L
Ablation studies 17 J CFG (Flexible Guidance) ã✀ããããšã§æ§èœåäž J in-context
conditioning ã«ãã軜éåãã€æ§èœåäž â» aux. head ⊠ð£% ✀㮠head ã远å
宿§ççµæïžâŸŒå質ãªç»åã✣æ 18
ãŸãšã 19 âª èæ¯ïžMean Flows J 1-NFEã§âŸŒå質ãªâœ£æ L æ°åŒã«ç²ãè¿äŒŒãååš âª
ææ¡ïžImproved Mean Flows J Mean Flows ã«ãããæ°åŒçãªåé¡ãæ¹å J æè»ãª Classifier Free Guidance J in-context conditioning ã«ãã軜éå âª çµæ J 1-NFE ã§å€ãã® Multi-NFE ã¢ãã«ãäžåã