FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
AAAI 2024
Pooyan Jamshidi
February 29, 2024
Transcript
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
Shahriar Iqbal, Jianhai Su, Lars Kotthoff, Pooyan Jamshidi
AAAI, 24 February 2024
One Size Does Not Fit All
[Figure: Xception configurations plotted by energy consumption (mJ, ×10^4 scale, ticks 1 to 3.5) and prediction error (%, ticks 15 to 40). Energy consumption varies 4×; prediction error varies 3×.]
Heterogeneous Parameters
[Diagram: stack layers DNN Design, Compiler, Hardware, Deployment, annotated with parameter groups: Num of Filters, Filter Size, Learning Rate, Num of Epochs; Num of Active CPUs, CPU/GPU/EMC Frequency; Cloud, IoT, Edge; Num of Threads, GPU Threads, Memory Growth.]
Cost-Unaware Methods Waste Resources
[Figure: two 3-D panels, "Coupled Unaware" and "Decoupled Aware", each marking Pareto-optimal points. Axes: prediction error (%), log wall-clock time, energy consumption (mJ).]
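The "Pareto optimal" points marked in the panels above are configurations that no other configuration beats on every objective at once. The following is a minimal sketch of that filter, assuming every objective is minimized; the function names and the sample values are illustrative and do not come from the deck or the FlexiBO code.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b in every objective and strictly better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(points: List[Point]) -> List[Point]:
    """Keep the points that no other point dominates (all objectives minimized)."""
    return [p for p in points if not any(dominates(q, p) for q in points if q != p)]


# Made-up (energy in mJ, prediction error in %) pairs, for illustration only.
configs = [(3000.0, 40.0), (6000.0, 25.0), (9000.0, 30.0), (12000.0, 15.0)]
front = pareto_front(configs)
print(front)
```

Here the configuration at (9000, 30) is dropped because (6000, 25) uses less energy and has lower error; the other three remain as trade-offs.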
Proposed Method
▷ weight expected benefit of evaluation by cost
▷ choose which objective(s) to evaluate
▷ more efficient use of resources: lower cost, more evaluations
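The first two bullets can be read as a selection rule: score every pairing of a candidate configuration and a subset of objectives by expected benefit per unit cost, then evaluate the best pairing. The sketch below assumes that reading. The slide does not say how expected benefit is computed, so here it is passed in as a plain function; `select_evaluation`, the cost figures, and the benefit table are all hypothetical.

```python
from itertools import combinations
from typing import Callable, Dict, FrozenSet, Iterable, List, Optional, Tuple


def objective_subsets(objectives: List[str]) -> Iterable[FrozenSet[str]]:
    """Yield every non-empty subset of objectives (the decoupled choices)."""
    for size in range(1, len(objectives) + 1):
        for combo in combinations(objectives, size):
            yield frozenset(combo)


def select_evaluation(
    candidates: List[str],
    objectives: List[str],
    expected_benefit: Callable[[str, FrozenSet[str]], float],
    cost: Dict[str, float],
) -> Tuple[str, FrozenSet[str], float]:
    """Return the (candidate, objective subset, score) with the best benefit per cost."""
    best: Optional[Tuple[str, FrozenSet[str], float]] = None
    for config in candidates:
        for subset in objective_subsets(objectives):
            total_cost = sum(cost[name] for name in subset)
            score = expected_benefit(config, subset) / total_cost
            if best is None or score > best[2]:
                best = (config, subset, score)
    if best is None:
        raise ValueError("no candidates or objectives to choose from")
    return best


# Hypothetical numbers: measuring error (training) is costly, measuring energy is cheap.
cost = {"error": 100.0, "energy": 5.0}
benefit_table = {
    "A": {"error": 2.0, "energy": 0.5},
    "B": {"error": 1.0, "energy": 0.8},
}


def table_benefit(config: str, subset: FrozenSet[str]) -> float:
    """Placeholder benefit: sum of made-up per-objective gains."""
    return sum(benefit_table[config][name] for name in subset)


choice = select_evaluation(["A", "B"], ["error", "energy"], table_benefit, cost)
print(choice)
```

With these numbers the rule picks the cheap energy measurement of configuration B, even though evaluating the error of A has the largest raw benefit. That is the sense in which weighting by cost leaves budget for more evaluations.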
Results – Computer Vision
[Figure: Xception, two panels. One plots hypervolume error against cumulative log wall-clock time; the other plots prediction error (%) against energy consumption (mJ). Methods compared: PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]
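The results are reported as hypervolume error, which the slides do not define. One common reading is the shortfall in hypervolume of the front a method finds, relative to a reference front. The sketch below assumes that reading, two minimized objectives, and a relative (normalized) error; the fronts and the reference point are made up.

```python
from typing import List, Tuple

Point2D = Tuple[float, float]


def hypervolume_2d(points: List[Point2D], ref: Point2D) -> float:
    """Area dominated by the points and bounded by ref (both objectives minimized)."""
    # Only points strictly better than the reference point contribute area.
    inside = sorted(p for p in points if p[0] < ref[0] and p[1] < ref[1])
    area = 0.0
    best_y = ref[1]
    for x, y in inside:  # sweep in ascending order of the first objective
        if y < best_y:
            # New strip between the previous best y and this point's y.
            area += (ref[0] - x) * (best_y - y)
            best_y = y
    return area


# Made-up fronts in an arbitrary two-objective space, for illustration only.
reference_point = (5.0, 5.0)
true_front = [(1.0, 4.0), (2.0, 2.0), (4.0, 1.0)]
found_front = [(2.0, 4.0), (3.0, 2.0)]

hv_true = hypervolume_2d(true_front, reference_point)
hv_found = hypervolume_2d(found_front, reference_point)
hv_error = (hv_true - hv_found) / hv_true
print(hv_true, hv_found, hv_error)
```

Under this definition a lower value means the found front covers more of the region the reference front covers, so curves that drop faster over cumulative wall-clock time indicate cheaper progress.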
Results – NLP
[Figure: BERT-SQuAD, two panels. One plots hypervolume error against cumulative log wall-clock time; the other plots prediction error (%) against energy consumption (mJ). Methods compared: PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]
Results – Speech Recognition
[Figure: DeepSpeech, two panels. One plots hypervolume error against cumulative log wall-clock time; the other plots prediction error (%) against energy consumption (mJ). Methods compared: PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]
Results – Evaluations
[Figure: four panels (PAL, PESMO-DEC, CA-MOBO, FlexiBO), each spanning iterations 0 to 200, with a scale running from 2 to 8.]
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
▷ cost-aware acquisition function decreases cost and improves results
▷ code available at https://github.com/softsys4ai/FlexiBO
[Figure: the DeepSpeech results panels repeated: hypervolume error against cumulative log wall-clock time, and prediction error (%) against energy consumption (mJ), for PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]