Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
HLT CPU Consumption
Search
Sasha Mazurov
February 06, 2012
Science
0
310
HLT CPU Consumption
Sasha Mazurov
February 06, 2012
Tweet
Share
More Decks by Sasha Mazurov
See All by Sasha Mazurov
L1Calo Offline Software Status
mazurov
0
56
Performance and Regression tests for Simulation
mazurov
0
51
About v2
mazurov
0
53
L1Calo Offline Software Status
mazurov
0
77
L1Calo Offline Software Status
mazurov
0
81
LHCbPR V2
mazurov
0
110
Paper approval
mazurov
0
45
Conventions' Publications
mazurov
0
46
Ph.D final exam
mazurov
0
81
Other Decks in Science
See All in Science
大規模画像テキストデータのフィルタリング手法の紹介
lyakaap
5
1.1k
「国と音楽」 ~spotifyrを用いて~ #muana
bob3bob3
2
330
同じデータでもP値が変わる話/key_considerations_in_NHST
florets1
1
1.1k
Ph.D. defense "Convex Manifold Approximation for Tensors"
gkazunii
0
180
MIKAMI Koichi
genomethica
0
180
History towards Universal Neural Network Potential for Material Discovery
matlantis
0
150
Machine Learning for Materials (Lecture 4)
aronwalsh
0
670
【論文紹介】DocTr_ Document Transformer for Structured Information Extraction in Documents / iccv2023-doctr
yuya4
3
580
論文輪読会 第16回 "NeRF:Representing Scenes as Neural"
academix
0
460
「みんなの自然災害伝承碑」ワークショップ 2023|日本地図学会
fullfull
0
210
研究・教育・産学連携の循環の実践
sshimizu2006
0
220
構造活性フォーラム2023-山﨑担当分
yamasakih
0
310
Featured
See All Featured
Building a Modern Day E-commerce SEO Strategy
aleyda
17
6.4k
Typedesign – Prime Four
hannesfritz
36
2.1k
How to train your dragon (web standard)
notwaldorf
73
5.2k
It's Worth the Effort
3n
180
27k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
14
1.5k
How To Stay Up To Date on Web Technology
chriscoyier
782
250k
jQuery: Nuts, Bolts and Bling
dougneiner
59
7.1k
Clear Off the Table
cherdarchuk
84
310k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
19
1.7k
GraphQLとの向き合い方2022年版
quramy
32
12k
Agile that works and the tools we love
rasmusluckow
325
20k
Optimizing for Happiness
mojombo
370
69k
Transcript
HLT CPU Consumption Sasha Mazurov 6 Febrary 2012
Tool Gaudi Auditor & Intel® VTune™ Amplifier XE 2011 Can
be run on any lxplus node
Benefits ➔ Can focus on a specific sequence/algorithm(s). ➔ Skip
initialization & finalization phase. ➔ Report CPU consumption per algorithm / function / class / module. ➔ Perfect GUI & reports.
http://amazurov.ru/cern/intelprofiler/ - installation - documentation - screencasts $> intelprofiler -o
/where/to/store/profiler/output myJob.py
None
Profiler vs. HLT1 Lines (Offline )
https://github.com/mazurov/HltProfiling profiler = IntelProfilerAuditor() profiler.StartFromEventN = 5000 profiler.StopAtEventN = 15000
profiler.IncludeAlgorithms = ["Hlt1TrackAllL0", "Hlt1DiMuonHighMass", "Hlt1DiMuonLowMass"] Jop Options Moore v12r10
Hotspots
Top Hotspots
CPU/Per Function
CPU / Per Module
CPU/Per Algorithm
http://amazurov.ru/cern/hltprofilingresults/
CPU / Per Function In Algorithm
CPU / Per Source Code (debug mode)
TCMalloc vs. “new” Operator
Before: After: CPU: 238 s CPU: 222 s
Results ➔ tc_new is twice faster than “new” operator. ➔
5% total improvement for Hlt1 job.
GCC 4.3 vs. GCC 4.6
GCC 4.3 GCC 4.6 -O2 flag ~ 3.6% worth
Two profiles comparison
Result (preliminary) ➔ It's not evident, that GCC 4.6 optimize
better than GCC 4.3 (for HLT1 jobs).
Future plans ➔ Profile code compiled with GCC 4.6 and
-O3 flag. ➔ Profile code compiled with GCC 4.6's profile driven optimization. ➔ Create a web interface to display collected profiler results.
http://amazurov.ru/cern/hltprofilingpresentation