Pooyan Jamshidi
February 29, 2024
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
AAAI 2024
Transcript
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
Shahriar Iqbal, Jianhai Su, Lars Kotthoff, Pooyan Jamshidi
AAAI, 24 February 2024
One Size Does Not Fit All
[Figure: Xception, prediction error (%) and energy consumption (mJ) across configurations. Energy consumption varies 4×; prediction error varies 3×.]
Heterogeneous Parameters
▷ DNN design: number of filters, filter size, learning rate, number of epochs
▷ Compiler: number of threads, GPU threads, memory growth
▷ Hardware: number of active CPUs, CPU/GPU/EMC frequency
▷ Deployment: cloud, IoT, edge
Cost-Unaware Methods Waste Resources
[Figure: two panels, "Coupled, Unaware" and "Decoupled, Aware", each marking the Pareto-optimal points. Axes: prediction error (%), energy consumption (mJ), log wall-clock time.]
Proposed Method
▷ weight the expected benefit of an evaluation by its cost
▷ choose which objective(s) to evaluate
▷ more efficient use of resources: lower cost, more evaluations
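The selection rule in the bullets above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the `Candidate` type, the benefit and cost numbers, and the use of a plain benefit-to-cost ratio as the weighting are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One possible evaluation: a configuration paired with a single objective."""
    config: str              # hypothetical configuration id
    objective: str           # objective that would be evaluated
    expected_benefit: float  # assumed estimate of how much the evaluation helps
    cost: float              # assumed estimate of the evaluation cost (> 0)


def select_next(candidates):
    """Return the candidate with the highest expected benefit per unit cost."""
    if not candidates:
        raise ValueError("no candidates to choose from")
    if any(c.cost <= 0 for c in candidates):
        raise ValueError("costs must be positive")
    return max(candidates, key=lambda c: c.expected_benefit / c.cost)


# Illustrative numbers only: measuring energy is assumed to be cheap,
# while measuring prediction error requires expensive training.
candidates = [
    Candidate("cfg-a", "prediction_error", 0.30, 100.0),
    Candidate("cfg-a", "energy", 0.10, 5.0),
    Candidate("cfg-b", "prediction_error", 0.50, 120.0),
    Candidate("cfg-b", "energy", 0.08, 5.0),
]

best = select_next(candidates)
print(best.config, best.objective)
```

Because each objective is a separate candidate (the decoupled part), a cheap measurement can be chosen over an expensive one even when its raw benefit is smaller.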
Results – Computer Vision
[Figure: Xception, two panels: hypervolume error against cumulative log wall-clock time, and prediction error (%) against energy consumption (mJ). Methods compared: PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]
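The results plots report hypervolume error. As a rough illustration of that kind of metric, the sketch below computes the 2-D hypervolume of a set of points when both objectives are minimized, then reports the normalized gap to a reference front. The reference point, the sample points, and this particular definition of the error are assumptions for the example, not taken from the slides.

```python
def pareto_front(points):
    """Return the non-dominated points (both objectives minimized), sorted by x."""
    front = []
    for p in points:
        dominated = any(
            q[0] <= p[0] and q[1] <= p[1] and q != p for q in points
        )
        if not dominated:
            front.append(p)
    return sorted(set(front))


def hypervolume_2d(points, ref):
    """Area dominated by the points and bounded above by the reference point."""
    front = [p for p in pareto_front(points) if p[0] < ref[0] and p[1] < ref[1]]
    area = 0.0
    prev_y = ref[1]
    # Sorted by x ascending, a Pareto front has strictly decreasing y,
    # so each point adds one horizontal slab of new area.
    for x, y in front:
        area += (ref[0] - x) * (prev_y - y)
        prev_y = y
    return area


ref = (4.0, 4.0)                                   # assumed reference point
found = [(1.0, 3.0), (2.0, 2.0), (3.0, 1.0)]       # illustrative found front
best_known = [(1.0, 2.0), (2.0, 1.0)]              # illustrative reference front

hv_found = hypervolume_2d(found, ref)
hv_best = hypervolume_2d(best_known, ref)
error = (hv_best - hv_found) / hv_best             # assumed normalization
print(hv_found, hv_best, error)
```

A lower error means the found front covers more of the area that the reference front dominates.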
Results – NLP
[Figure: BERT-SQuAD, two panels: hypervolume error against cumulative log wall-clock time, and prediction error (%) against energy consumption (mJ). Methods compared: PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]
Results – Speech Recognition
[Figure: DeepSpeech, two panels: hypervolume error against cumulative log wall-clock time, and prediction error (%) against energy consumption (mJ). Methods compared: PAL, PESMO, ParEGO, SMSEGO, CA-MOBO, PESMO-DEC, FLEXIBO-GPLC.]
Results – Evaluations
[Figure: four panels (PAL, PESMO-DEC, CA-MOBO, FlexiBO) showing evaluations over iterations 0 to 200.]
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
▷ cost-aware acquisition function decreases cost and improves results
▷ code available at https://github.com/softsys4ai/FlexiBO
[Figure: DeepSpeech results, hypervolume error against cumulative log wall-clock time and prediction error (%) against energy consumption (mJ).]