HLT CPU Consumption
Sasha Mazurov
February 06, 2012
Transcript
HLT CPU Consumption, Sasha Mazurov, 6 February 2012
Tool
Gaudi Auditor & Intel® VTune™ Amplifier XE 2011
Can be run on any lxplus node.
Benefits
➔ Can focus on specific sequences or algorithms.
➔ Skips the initialization and finalization phases.
➔ Reports CPU consumption per algorithm / function / class / module.
➔ Perfect GUI & reports.
http://amazurov.ru/cern/intelprofiler/
- installation
- documentation
- screencasts
$> intelprofiler -o /where/to/store/profiler/output myJob.py
Profiler vs. HLT1 Lines (Offline)
https://github.com/mazurov/HltProfiling
Job Options (Moore v12r10)
profiler = IntelProfilerAuditor()
profiler.StartFromEventN = 5000
profiler.StopAtEventN = 15000
profiler.IncludeAlgorithms = ["Hlt1TrackAllL0", "Hlt1DiMuonHighMass", "Hlt1DiMuonLowMass"]
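The three options above restrict profiling to an event window and a list of algorithms. A minimal pure-Python sketch of that selection logic, independent of Gaudi, might look like the following. The class `ProfilingWindow` and its method `is_active` are hypothetical names invented for illustration, and the boundary handling (start inclusive, stop exclusive) is an assumption rather than the auditor's documented behaviour.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ProfilingWindow:
    """Hypothetical stand-in mirroring the three auditor options shown above."""
    start_from_event_n: int = 5000
    stop_at_event_n: int = 15000
    include_algorithms: List[str] = field(default_factory=lambda: [
        "Hlt1TrackAllL0", "Hlt1DiMuonHighMass", "Hlt1DiMuonLowMass"])

    def is_active(self, event_number: int, algorithm: str) -> bool:
        # Assumption: the start event is inclusive and the stop event is exclusive.
        in_window = self.start_from_event_n <= event_number < self.stop_at_event_n
        return in_window and algorithm in self.include_algorithms


window = ProfilingWindow()
print(window.is_active(4999, "Hlt1TrackAllL0"))  # before the window
print(window.is_active(5000, "Hlt1TrackAllL0"))  # inside the window, listed algorithm
print(window.is_active(5000, "SomeOtherAlg"))    # inside the window, unlisted algorithm
```

Under these assumptions only the second call reports an active profiler, which is how skipping initialization and unrelated algorithms keeps the collected profile focused on the HLT1 lines of interest.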
Hotspots
Top Hotspots
CPU / Per Function
CPU / Per Module
CPU / Per Algorithm
http://amazurov.ru/cern/hltprofilingresults/
CPU / Per Function In Algorithm
CPU / Per Source Code (debug mode)
TCMalloc vs. “new” Operator
Before: CPU: 238 s
After: CPU: 222 s
Results
➔ tc_new is twice as fast as the “new” operator.
➔ 5% total improvement for the Hlt1 job.
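A before/after pair of CPU totals can be turned into a relative change with one line of arithmetic. The sketch below is illustrative only; `relative_improvement` is a hypothetical helper, not part of the profiler. Applied to the two totals quoted above it gives roughly 6.7%. The 5% figure is the author's own and is not rederived here.

```python
def relative_improvement(before_s: float, after_s: float) -> float:
    """Return the fractional CPU-time reduction going from before_s to after_s."""
    if before_s <= 0:
        raise ValueError("before_s must be positive")
    return (before_s - after_s) / before_s


# CPU totals quoted on the slide (seconds).
gain = relative_improvement(238.0, 222.0)
print(f"{gain:.1%}")
```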
GCC 4.3 vs. GCC 4.6
GCC 4.3 GCC 4.6 -O2 flag ~ 3.6% worth
Two profiles comparison
Result (preliminary)
➔ It is not evident that GCC 4.6 optimizes better than GCC 4.3 (for HLT1 jobs).
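Comparing two profiles amounts to differencing the CPU time attributed to each function. The sketch below shows one way to do that outside the VTune GUI, assuming each profile has been exported to a name-to-seconds mapping. The function `compare_profiles`, the function names and all numbers are placeholders invented to exercise the code, not measurements from the talk.

```python
from typing import Dict, List, Tuple


def compare_profiles(base: Dict[str, float],
                     other: Dict[str, float]) -> List[Tuple[str, float]]:
    """Return (name, other - base) CPU-time differences, largest regression first."""
    names = set(base) | set(other)
    # A function missing from one profile is treated as 0 s in that profile.
    diffs = [(n, other.get(n, 0.0) - base.get(n, 0.0)) for n in names]
    return sorted(diffs, key=lambda item: (-item[1], item[0]))


# Placeholder data, not real results.
profile_a = {"func_a": 10.0, "func_b": 4.0}
profile_b = {"func_a": 9.0, "func_b": 6.0, "func_c": 1.0}
for name, delta in compare_profiles(profile_a, profile_b):
    print(f"{name}: {delta:+.1f} s")
```

Sorting by the signed difference puts regressions at the top, which is usually the first thing to inspect when two compiler builds are compared.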
Future plans
➔ Profile code compiled with GCC 4.6 and the -O3 flag.
➔ Profile code compiled with GCC 4.6's profile-driven optimization.
➔ Create a web interface to display collected profiler results.
http://amazurov.ru/cern/hltprofilingpresentation