Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
HLT CPU Consumption
Search
Sasha Mazurov
February 06, 2012
Science
0
350
HLT CPU Consumption
Sasha Mazurov
February 06, 2012
Tweet
Share
More Decks by Sasha Mazurov
See All by Sasha Mazurov
L1Calo Offline Software Status
mazurov
0
77
Performance and Regression tests for Simulation
mazurov
0
92
About v2
mazurov
0
70
L1Calo Offline Software Status
mazurov
0
100
L1Calo Offline Software Status
mazurov
0
100
LHCbPR V2
mazurov
0
150
Paper approval
mazurov
0
78
Conventions' Publications
mazurov
0
64
Ph.D final exam
mazurov
0
130
Other Decks in Science
See All in Science
Algorithmic Aspects of Quiver Representations
tasusu
0
190
AIに仕事を奪われる 最初の医師たちへ
ikora128
0
1k
検索と推論タスクに関する論文の紹介
ynakano
1
140
NDCG is NOT All I Need
statditto
2
2.8k
(メタ)科学コミュニケーターからみたAI for Scienceの同床異夢
rmaruy
0
160
データマイニング - コミュニティ発見
trycycle
PRO
0
200
安心・効率的な医療現場の実現へ ~オンプレAI & ノーコードワークフローで進める業務改革~
siyoo
0
450
academist Prize 4期生 研究トーク延長戦!「美は世界を救う」っていうけど、どうやって?
jimpe_hitsuwari
0
470
KH Coderチュートリアル(スライド版)
koichih
1
58k
蔵本モデルが解き明かす同期と相転移の秘密 〜拍手のリズムはなぜ揃うのか?〜
syotasasaki593876
1
210
主成分分析に基づく教師なし特徴抽出法を用いたコラーゲン-グリコサミノグリカンメッシュの遺伝子発現への影響
tagtag
PRO
0
180
生成検索エンジン最適化に関する研究の紹介
ynakano
2
2k
Featured
See All Featured
Rails Girls Zürich Keynote
gr2m
96
14k
Jamie Indigo - Trashchat’s Guide to Black Boxes: Technical SEO Tactics for LLMs
techseoconnect
PRO
0
57
Sam Torres - BigQuery for SEOs
techseoconnect
PRO
0
180
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
450
4 Signs Your Business is Dying
shpigford
187
22k
Crafting Experiences
bethany
1
48
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
State of Search Keynote: SEO is Dead Long Live SEO
ryanjones
0
110
Building Flexible Design Systems
yeseniaperezcruz
330
40k
Speed Design
sergeychernyshev
33
1.5k
What the history of the web can teach us about the future of AI
inesmontani
PRO
1
430
The Illustrated Children's Guide to Kubernetes
chrisshort
51
51k
Transcript
HLT CPU Consumption Sasha Mazurov 6 Febrary 2012
Tool Gaudi Auditor & Intel® VTune™ Amplifier XE 2011 Can
be run on any lxplus node
Benefits ➔ Can focus on a specific sequence/algorithm(s). ➔ Skip
initialization & finalization phase. ➔ Report CPU consumption per algorithm / function / class / module. ➔ Perfect GUI & reports.
http://amazurov.ru/cern/intelprofiler/ - installation - documentation - screencasts $> intelprofiler -o
/where/to/store/profiler/output myJob.py
None
Profiler vs. HLT1 Lines (Offline )
https://github.com/mazurov/HltProfiling profiler = IntelProfilerAuditor() profiler.StartFromEventN = 5000 profiler.StopAtEventN = 15000
profiler.IncludeAlgorithms = ["Hlt1TrackAllL0", "Hlt1DiMuonHighMass", "Hlt1DiMuonLowMass"] Jop Options Moore v12r10
Hotspots
Top Hotspots
CPU/Per Function
CPU / Per Module
CPU/Per Algorithm
http://amazurov.ru/cern/hltprofilingresults/
CPU / Per Function In Algorithm
CPU / Per Source Code (debug mode)
TCMalloc vs. “new” Operator
Before: After: CPU: 238 s CPU: 222 s
Results ➔ tc_new is twice faster than “new” operator. ➔
5% total improvement for Hlt1 job.
GCC 4.3 vs. GCC 4.6
GCC 4.3 GCC 4.6 -O2 flag ~ 3.6% worth
Two profiles comparison
Result (preliminary) ➔ It's not evident, that GCC 4.6 optimize
better than GCC 4.3 (for HLT1 jobs).
Future plans ➔ Profile code compiled with GCC 4.6 and
-O3 flag. ➔ Profile code compiled with GCC 4.6's profile driven optimization. ➔ Create a web interface to display collected profiler results.
http://amazurov.ru/cern/hltprofilingpresentation