Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Lux AI 34th Place Solution
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Kyohei Uto
December 28, 2021
Technology
430
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Lux AI 34th Place Solution
My 34th place solution in Lux AI competition @kaggle
https://www.kaggle.com/c/lux-ai-2021
Kyohei Uto
December 28, 2021
More Decks by Kyohei Uto
See All by Kyohei Uto
Kaggle過去コンペ上位解法をAIエージェントでレポートする
kuto5046
5
3.6k
Kaggle - Lux AI season3 9th place solution
kuto5046
0
210
kaggle Eedi solution
kuto5046
0
390
Kaggle Eediコンペ振り返り
kuto5046
7
2k
CMI 13th place solution
kuto5046
0
290
Kaggle H&Mコンペ振り返り
kuto5046
0
3.3k
Kaggleシミュレーションコンペで強化学習に取り組むときのTips
kuto5046
22
12k
タクシー予約を支えるMLモデルの継続的改善
kuto5046
1
3.9k
H&M 23th place solution
kuto5046
0
520
Other Decks in Technology
See All in Technology
非エンジニアがClaudeと挑んだ「1ヶ月間プロダクト30本ノック」
askokc
0
540
人材育成分科会.pdf
_awache
4
250
Bedrock AgentCore RuntimeでAuth0 Changelog調査AIをアップグレードした話
t5u8a5a
1
160
RSA暗号を手計算したくなること、ありますよね?? (20260615_orestudy6_rsa)
thousanda
0
430
FinOps × AIエージェントで実現する コストインシデントの自動調査
oasis1994liveforever
0
140
日本 Fintech 未来予測レポート 2027〜2028年(オリジナル版)
8maki
0
2.2k
機械学習を「社会実装」するということ 2026年夏版 / Social Implementation of Machine Learning June 2026 Version
moepy_stats
5
2.4k
Oracle AI Database@Azure:サービス概要のご紹介
oracle4engineer
PRO
6
2k
Android の公式 Skill / Android skills
yanzm
0
150
手塩にかけりゃいいってもんじゃない
ming_ayami
0
580
AIはどのように 組織のアジリティを変えるのか?
junki
3
840
【NRUG vol.18】なぜ多くのオブザーバビリティ導入は失敗するのか
nrug_member
0
130
Featured
See All Featured
The World Runs on Bad Software
bkeepers
PRO
72
12k
We Have a Design System, Now What?
morganepeng
55
8.2k
A designer walks into a library…
pauljervisheath
211
24k
Taking LLMs out of the black box: A practical guide to human-in-the-loop distillation
inesmontani
PRO
3
2.3k
WENDY [Excerpt]
tessaabrams
11
38k
Building Adaptive Systems
keathley
44
3.1k
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.9k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.2k
Abbi's Birthday
coloredviolet
2
8.1k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
A Tale of Four Properties
chriscoyier
163
24k
Ruling the World: When Life Gets Gamed
codingconduct
0
250
Transcript
Lux AI Challenge Copyright 2021 @kuto_bopro Meta Kaggle Collection of
episodes ・Team: Toad Brigade ・LB score > 1900 ・only win game ・about 1000 episodes(3 submissions) Unet Imitation Learning approach inspired by nosound(@zharch) obs horizontal flip vertical flip random roll(-5~5) TTA obs obs Global features (8ch,4,4) Observation map (17ch, 32, 32) Policy map (3ch,32,32) ・Units counts (×2) ・Citytiles counts (×2) ・Research points (×2) ・turn / cycle Data Sampling ・Random sampling up to 4 units actions in each turn ・Downsampling center actions Extract units policy from each units position Image reference: https://www.lux-ai.org/ ・Units position/cooldown/resource (×2) ・Citytiles position/cooldown/fuel-lightupkeep ratio (×2) ・Wood/Coal/Uranium positions ・Road level ・Effective map area Create 8 pattern policy maps and apply mean UNet model Decide citytile actions by simple rule Create 4 batch by rotation input (4 batches) Policy maps (4batch, 3ch, 32, 32) Final policy map (6ch, 32, 32) Hierarchize move actions (shared by nosound) output 3ch policy map (4 batches) 90° 180° 270° 90° 180° 270° 0ch: Center Action → batch mean 1ch: Move Action 1st batch: north 2nd batch: west 3rd batch: south 4th batch: east 2ch: Build City Action → batch mean kuto(@kuto0633) Final policy map (6ch,32,32) Observation maps(4batch, 17ch, 32,32) 0ch: Move Center 1ch: Move North 2ch: Move West 3ch: Move South 4ch: Move East 5ch: Build City Calculate 4 move actions as one direction State Value (for RL and MCTS but not work) 16 64 64 128 128 256 256 256 256 8 256 256 +8 256 256 128 + 256 128 128 64 + 128 64 64 3 32×32 32×32 32×32 16×16 16×16 16×16 8×8 8×8 8×8 4×4 4×4 4×4 32×32 32×32 32×32 16×16 16×16 8×8 8×8 FC BN ReLU FC 264→64 64→1 Conv2d BatchNorm2d, ReLU MaxPooling2d Upsample Concatenate Private LB: 34th (score 1570)