HLT CPU Consumption
Sasha Mazurov
February 06, 2012
Transcript
HLT CPU Consumption
Sasha Mazurov, 6 February 2012
Tool
Gaudi Auditor & Intel® VTune™ Amplifier XE 2011
Can be run on any lxplus node.
Benefits
➔ Can focus on a specific sequence or specific algorithm(s).
➔ Can skip the initialization & finalization phases.
➔ Reports CPU consumption per algorithm / function / class / module.
➔ Perfect GUI & reports.
http://amazurov.ru/cern/intelprofiler/
- installation
- documentation
- screencasts
$> intelprofiler -o /where/to/store/profiler/output myJob.py
Profiler vs. HLT1 Lines (Offline)
https://github.com/mazurov/HltProfiling
Job Options (Moore v12r10)
profiler = IntelProfilerAuditor()
profiler.StartFromEventN = 5000
profiler.StopAtEventN = 15000
profiler.IncludeAlgorithms = ["Hlt1TrackAllL0", "Hlt1DiMuonHighMass", "Hlt1DiMuonLowMass"]
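The three options above restrict profiling to a window of events and to a chosen set of algorithms. The following is a minimal plain-Python sketch of that gating idea, not the auditor's actual implementation. The class `ProfilerWindow`, the method `should_profile`, and the boundary convention (start inclusive, stop exclusive) are assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ProfilerWindow:
    """Hypothetical stand-in for the three auditor options on the slide."""

    start_from_event_n: int = 5000
    stop_at_event_n: int = 15000
    include_algorithms: List[str] = field(
        default_factory=lambda: [
            "Hlt1TrackAllL0",
            "Hlt1DiMuonHighMass",
            "Hlt1DiMuonLowMass",
        ]
    )

    def should_profile(self, event_number: int, algorithm: str) -> bool:
        # Assumption: the window is [start, stop); the real auditor may
        # treat the boundaries differently.
        in_window = self.start_from_event_n <= event_number < self.stop_at_event_n
        return in_window and algorithm in self.include_algorithms


window = ProfilerWindow()
for event_number, algorithm in [
    (100, "Hlt1TrackAllL0"),      # before the window (initialization, warm-up)
    (7500, "Hlt1DiMuonLowMass"),  # inside the window, selected algorithm
    (7500, "SomeOtherAlgorithm"), # inside the window, algorithm not selected
]:
    print(event_number, algorithm, window.should_profile(event_number, algorithm))
```

Skipping the first events keeps one-off startup costs out of the measurement, which is the point of the "skip initialization & finalization" benefit listed earlier.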
Hotspots
Top Hotspots
CPU / Per Function
CPU / Per Module
CPU / Per Algorithm
http://amazurov.ru/cern/hltprofilingresults/
CPU / Per Function In Algorithm
CPU / Per Source Code (debug mode)
TCMalloc vs. “new” Operator
Before: CPU: 238 s
After: CPU: 222 s
Results
➔ tc_new is twice as fast as the “new” operator.
➔ 5% total improvement for the Hlt1 job.
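As a rough cross-check, the two CPU timings quoted above can be turned into a relative saving. This sketch only shows the arithmetic for 238 s and 222 s; the helper name `relative_saving` is hypothetical, and the 5% figure on the slide may come from a different measurement or from rounding.

```python
def relative_saving(before_s: float, after_s: float) -> float:
    """Fraction of CPU time saved, relative to the 'before' timing."""
    return (before_s - after_s) / before_s


# Timings taken from the "Before" / "After" slide.
saving = relative_saving(238.0, 222.0)
print(f"CPU time saved: {saving:.1%}")
```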
GCC 4.3 vs. GCC 4.6
GCC 4.3 vs. GCC 4.6, -O2 flag: ~3.6% worse
Two profiles comparison
Result (preliminary)
➔ It is not evident that GCC 4.6 optimizes better than GCC 4.3 (for HLT1 jobs).
Future plans
➔ Profile code compiled with GCC 4.6 and the -O3 flag.
➔ Profile code compiled with GCC 4.6's profile-driven optimization.
➔ Create a web interface to display collected profiler results.
http://amazurov.ru/cern/hltprofilingpresentation