Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Beyond top: Command-Line Monitoring on the JVM ...
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
Colin Jones
October 29, 2015
Programming
1
690
Beyond top: Command-Line Monitoring on the JVM (JavaOne 2015)
A session from JavaOne 2015
Colin Jones
October 29, 2015
Tweet
Share
More Decks by Colin Jones
See All by Colin Jones
A Bug's Life: What if `select` is Broken After All?
trptcolin
0
170
Underestimated costs of microservice architectures
trptcolin
3
1.6k
FP vs. OOP: Beyond the Bikeshed
trptcolin
0
450
Diving into the Details with DTrace! (RubyConf 2016 edition)
trptcolin
2
520
Diving into the Details with DTrace
trptcolin
3
530
Adopting FP: the good, the familiar, and the unknown
trptcolin
0
270
Finding out what's *really* going on, with DTrace!
trptcolin
1
390
Beyond top: Command-Line Monitoring on the JVM (ClojureRemote)
trptcolin
0
160
ZooKeeper: Wait-free coordination for Internet-scale systems
trptcolin
2
210
Other Decks in Programming
See All in Programming
SourceGeneratorのマーカー属性問題について
htkym
0
180
ポーリング処理廃止によるイベント駆動アーキテクチャへの移行
seitarof
3
1k
AI Assistants for Your Angular Solutions
manfredsteyer
PRO
0
130
API Platformを活用したPHPによる本格的なWeb API開発 / api-platform-book-intro
ttskch
1
130
The Ralph Wiggum Loop: First Principles of Autonomous Development
sembayui
0
3.7k
15年目のiOSアプリを1から作り直す技術
teakun
1
620
new(1.26) ← これすき / kamakura.go #8
utgwkk
0
2.2k
Ruby x Terminal
a_matsuda
7
590
社内規程RAGの精度を73.3% → 100%に改善した話
oharu121
13
7.9k
Rで始めるML・LLM活用入門
wakamatsu_takumu
0
170
最初からAWS CDKで技術検証してもいいんじゃない?
akihisaikeda
4
130
メタプログラミングで実現する「コードを仕様にする」仕組み/nikkei-tech-talk43
nikkei_engineer_recruiting
0
170
Featured
See All Featured
Building a Modern Day E-commerce SEO Strategy
aleyda
45
8.8k
How to optimise 3,500 product descriptions for ecommerce in one day using ChatGPT
katarinadahlin
PRO
1
3.5k
The Cost Of JavaScript in 2023
addyosmani
55
9.8k
Leading Effective Engineering Teams in the AI Era
addyosmani
9
1.7k
Thoughts on Productivity
jonyablonski
75
5.1k
Hiding What from Whom? A Critical Review of the History of Programming languages for Music
tomoyanonymous
2
540
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
133
19k
The agentic SEO stack - context over prompts
schlessera
0
690
The SEO Collaboration Effect
kristinabergwall1
0
390
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
67
37k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
49
9.9k
Crafting Experiences
bethany
1
82
Transcript
Beyond top(1) Command-Line Monitoring on the JVM Colin Jones @trptcolin
8th Light
None
What to expect
command-line tooling
on the JVM
introspection & serviceability
--all-flags=false
war stories
real-life usage (well, re-enacted anyway)
A long time ago in a software shop far, far
away…
Things are going pretty well
What does this thing look like? app-architecture Postgres Web /
API Application Server Load Balancer Periodic Job Application Server 3rd-party Service A 3rd-party Service B Monitored email account End users: native mobile app Admin users: desktop browsers
But strange things are afoot
the server sometimes gets really slow
the team has to manually restart the application server
incident response time is ~5 minutes
Yes, strange things are afoot
Pain, frustration, anger
Just the facts
sometimes, things get slow
all requests seem to be affected
the JVM stays up
restart the JVM and everything is fine
What could it be?
Demo
More facts, please!
constant full GCs
what’s in the heap
what application code was running
The right tools for the job
vmstat system-level: CPU, memory, disk, context switching
top per-process: CPU & memory
jps what’s our PID?
jstack status of all threads (right now-ish!)
jcmd what can’t it do?! jcmd [PID] help (sorry, JVM
6 users: see jinfo/jmap/jstack)
jstat GC classloader compiler
Mystery solved!
Now “just” fix it
idea 1: eliminate the leak
idea 2: eliminate the cache altogether?
idea 3: delete the feature
idea 4: full-text search engine
So we’re good now… until the next incident
Lessons
“it’s slow” could mean lots of things
“high CPU” could mean lots of things
collecting data is crucial in a crisis
reproducing the issue helps me sleep at night
Other “right tools for the job”
Heap analyzers
Profilers
Constant monitoring & alerting
Dynamic tracing
Learning more
Books
operators are standing by! man jstat man jstack jcmd [PID]
help [COMMAND] etc.
Thank you! Colin Jones @trptcolin 8th Light