Tuning Linux for Databases
Alexey Lesovsky
October 15, 2016
Slides from my talk at HDConf 2016 Minsk, Belarus
Transcript
About
PostgreSQL DBA. Linux system administrator.
PostgreSQL-Consulting.com:
• 24/7 support.
• Audit, performance optimizations.
• Consulting and Training.
• Monitoring and Emergency.
• Capacity planning.
Slides: https://goo.gl/awmZ2H

Agenda
• RDBMS on Linux, why?
• Databases and Resources.
• OS subsystems:
• CPU, Process scheduling, Power saving policies.
• Memory, VM, NUMA, Huge pages.
• Storage, File Systems, Input/Output.
• Other misc.

Why Linux?
Linux is a good choice:
• Active development & Community support.
• A lot of features & Fast implementation.
• Stable & Mature & Durable.

Databases & Resources
[Diagram: database needs mapped to hardware resources]
• CPU: concurrency, query speed, sort/group/hash, ...
• Memory: OS page cache, DB buffer pool, local process cache.
• Storage: DB data files, transaction log, cold start.

Databases & Resources
[Diagram: OS subsystems mapped to hardware resources]
• CPU: scheduling, NUMA, power saving.
• Memory: virtual memory, NUMA, huge pages.
• Storage: file systems, storage I/O.

Resources
• CPU scheduler.
• Virtual memory and NUMA.
• Huge pages.
• File systems.
• Storage IO.
• Power saving policy.
• Others.

CPU scheduling
The CPU scheduler is responsible for scheduling processes properly.
Sysctl:
• kernel.sched_migration_cost_ns = 5000000 (default: 500000).
• kernel.sched_autogroup_enabled = 0 (default: 1).
http://www.postgresql.org/message-id/[email protected]
http://kernelnewbies.org/Linux_2_6_38#head-59575a6aeafa38490226a560ee02de89829a5b20
Beware on Ubuntu: bugs #1055222 (12.04) and #1422016 (14.04). Use the noautogroup kernel parameter instead of sysctl.conf.

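A read-only check against /proc/sys is one way to see whether a host already matches the scheduler values above. This is a minimal sketch, not something from the talk: the helper name check_sysctl and the PROC_SYS override are hypothetical, and some newer kernels may no longer expose kernel.sched_migration_cost_ns as a sysctl, in which case the helper reports it as missing.

```shell
#!/bin/sh
# Read-only comparison of wanted sysctl values with the running kernel.
# PROC_SYS is a hypothetical override so the helper can be pointed at a copy.
PROC_SYS="${PROC_SYS:-/proc/sys}"

# check_sysctl KEY WANTED: report whether KEY currently equals WANTED.
check_sysctl() {
    path="$PROC_SYS/$(printf '%s' "$1" | tr . /)"
    if [ ! -r "$path" ]; then
        echo "$1: missing"
        return 0
    fi
    current=$(cat "$path")
    if [ "$current" = "$2" ]; then
        echo "$1: ok ($current)"
    else
        echo "$1: differs (current=$current, wanted=$2)"
    fi
}

# Values taken from the slide.
check_sysctl kernel.sched_migration_cost_ns 5000000
check_sysctl kernel.sched_autogroup_enabled 0
```

The helper never writes anything, so it can be run as an unprivileged user before deciding what to change.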
Virtual Memory
What is it? Allocator, caching, dirty pages and writeback.

Virtual Memory
Sysctl:
• vm.dirty_background_ratio & vm.dirty_ratio: do not use them (writing the *_bytes counterparts resets them to 0).
• vm.dirty_background_bytes & vm.dirty_bytes: depends on ... RAID cache size, 64MB/128MB otherwise.

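The *_bytes settings take plain byte counts, so the megabyte figures need converting. The sketch below prints a sysctl.conf fragment; the helper name dirty_conf is hypothetical, and reading "64MB/128MB" as 64MB for the background threshold and 128MB for the foreground threshold is an assumption about what the slide means.

```shell
#!/bin/sh
# dirty_conf BG_MB FG_MB: print sysctl.conf lines with the sizes in bytes.
dirty_conf() {
    echo "vm.dirty_background_bytes = $(( $1 * 1024 * 1024 ))"
    echo "vm.dirty_bytes = $(( $2 * 1024 * 1024 ))"
}

# Assumed reading of the slide: 64MB background, 128MB foreground.
dirty_conf 64 128
# prints:
# vm.dirty_background_bytes = 67108864
# vm.dirty_bytes = 134217728
```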
Virtual Memory
Out-of-memory & OOM-Killer.
Sysctl:
• vm.swappiness = 1 (default: 60).

NUMA
[Diagram legend] S: socket. C: CPU core. M: memory bank.

NUMA
BIOS: enable memory node interleaving.
Kernel boot: numa=off.
numactl utility.
Sysctl:
• vm.zone_reclaim_mode = 0 (default: 0).
• kernel.numa_balancing = 0 (default: 0).

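Whether any of the NUMA settings matter depends on how many memory nodes the kernel sees. As a rough sketch, the nodes can be counted from sysfs; the helper name numa_node_count is hypothetical, and the optional directory argument exists only so the function can be tried against a test tree.

```shell
#!/bin/sh
# numa_node_count [DIR]: count nodeN directories (DIR defaults to sysfs).
numa_node_count() {
    root="${1:-/sys/devices/system/node}"
    count=0
    for d in "$root"/node[0-9]*; do
        if [ -d "$d" ]; then count=$((count + 1)); fi
    done
    echo "$count"
}

nodes=$(numa_node_count)
if [ "$nodes" -gt 1 ]; then
    echo "$nodes NUMA nodes: review interleaving and vm.zone_reclaim_mode"
else
    echo "one node or no NUMA information exposed"
fi
```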
Huge Pages
Huge pages vs. Transparent huge pages.
Huge pages are supported by many RDBMS.
Always disable transparent huge pages.
/etc/rc.local:
• echo never > /sys/kernel/mm/transparent_hugepage/enabled
• echo never > /sys/kernel/mm/transparent_hugepage/defrag

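The two sysfs files list all possible modes and mark the active one in square brackets, for example "always madvise [never]". To confirm that the rc.local lines took effect, the bracketed word can be extracted; the helper name active_choice below is hypothetical.

```shell
#!/bin/sh
# active_choice "always madvise [never]" prints: never
active_choice() {
    printf '%s\n' "$1" | sed -n 's/.*\[\([^]]*\)].*/\1/p'
}

# Report the active mode for both files if the kernel exposes them.
for f in enabled defrag; do
    p="/sys/kernel/mm/transparent_hugepage/$f"
    if [ -r "$p" ]; then
        echo "THP $f: $(active_choice "$(cat "$p")")"
    fi
done
```

After the settings from the slide are applied, both lines should report never.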
Filesystems
Ext3 vs Ext4 vs XFS: what is better?
Filesystem barriers.
Disable write cache:
• hdparm -W0 /dev/device
• MegaCli64 -LDSetProp -DisDskCache -Lall -aALL
Barrier settings:
• Hardware RAID + BBU = barrier=0 (disable).
• Software RAID = barrier=1 (enable).
• Enterprise SSD with Power Loss Protection = barrier=0 (disable).

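Before changing barrier settings it helps to know what is currently mounted with which option. The sketch below classifies a mount-option string; the helper name barrier_state is hypothetical, and it only recognises the spellings barrier=0, nobarrier, barrier=1 and barrier. Option names and defaults vary by filesystem and kernel version, so "default" here means only that no barrier option appears in the mount options.

```shell
#!/bin/sh
# barrier_state OPTS: classify a comma-separated mount option string.
barrier_state() {
    case ",$1," in
        *,nobarrier,*|*,barrier=0,*) echo "disabled" ;;
        *,barrier,*|*,barrier=1,*)   echo "enabled" ;;
        *)                           echo "default" ;;
    esac
}

# List ext3/ext4/XFS mounts with their barrier state (Linux only).
if [ -r /proc/mounts ]; then
    while read -r dev mnt fstype opts rest; do
        case "$fstype" in
            ext3|ext4|xfs)
                echo "$mnt ($fstype): barriers $(barrier_state "$opts")" ;;
        esac
    done < /proc/mounts
fi
```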
Storage IO
SATA/SAS vs SSD.
IO elevators:
• noop: SSD, PCIe SSD, high-end storage.
• deadline: RAID, SATA/SAS.
• cfq: good default.
• none (multi-queue block IO): SSD, PCIe SSD.
Runtime: # echo 'elevator_name' > /sys/block/<device>/queue/scheduler
Kernel boot: elevator=<name>
/sys/block/*/queue/: rotational, rq_affinity, read_ahead_kb

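The mapping on this slide can be turned into a small read-only report. This is a sketch under stated assumptions: the helper name pick_elevator is hypothetical, non-rotational devices are mapped to noop or none and rotational ones to deadline (one reading of the list above), and only exact scheduler names are matched, so a device offering only mq-deadline gets no suggestion.

```shell
#!/bin/sh
# pick_elevator ROTATIONAL "AVAILABLE": print the first wanted elevator
# that the device offers; return 1 if none of them is available.
pick_elevator() {
    if [ "$1" = "0" ]; then wanted="noop none"; else wanted="deadline"; fi
    for w in $wanted; do
        # Strip the brackets that mark the currently active elevator.
        for a in $(printf '%s' "$2" | tr -d '[]'); do
            if [ "$a" = "$w" ]; then echo "$w"; return 0; fi
        done
    done
    return 1
}

# Report every block device; nothing is written to sysfs.
for q in /sys/block/*/queue; do
    [ -r "$q/scheduler" ] || continue
    rot=$(cat "$q/rotational")
    avail=$(cat "$q/scheduler")
    choice=$(pick_elevator "$rot" "$avail") || choice="no match"
    echo "$q: rotational=$rot available='$avail' suggested=$choice"
done
```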
Power Saving Policy
Drivers: acpi_cpufreq vs. intel_pstate.
scaling_governor:
• /sys/devices/system/cpu/cpuX/cpufreq/scaling_available_governors
• /sys/devices/system/cpu/cpuX/cpufreq/scaling_governor
acpi_cpufreq + performance.
intel_pstate + powersave.

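Since the governor is set per CPU, a quick summary across all cores shows whether the policy is applied consistently. A minimal sketch follows; the helper name governor_summary is hypothetical, and the optional directory argument exists only for trying it against a test tree.

```shell
#!/bin/sh
# governor_summary [DIR]: count CPUs per active scaling governor.
governor_summary() {
    root="${1:-/sys/devices/system/cpu}"
    cat "$root"/cpu[0-9]*/cpufreq/scaling_governor 2>/dev/null \
        | sort | uniq -c | awk '{print $2 ": " $1 " CPU(s)"}'
}

# Prints nothing when cpufreq is not exposed (e.g. in some virtual machines).
governor_summary
```

More than one line of output means the cores run with different governors.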
Misc: Clocksources
What is a clocksource?
acpi_pm vs. hpet vs. tsc.
• /sys/devices/system/clocksource/clocksource0/available_clocksource
• /sys/devices/system/clocksource/clocksource0/current_clocksource

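Both files are plain text, so a short report is enough to see which clocksource is in use and what the alternatives are. The helper name clocksource_report below is hypothetical, and the optional directory argument exists only for testing.

```shell
#!/bin/sh
# clocksource_report [DIR]: print the current and available clocksources.
clocksource_report() {
    dir="${1:-/sys/devices/system/clocksource/clocksource0}"
    if [ ! -r "$dir/current_clocksource" ]; then
        echo "clocksource information not available"
        return 1
    fi
    echo "current: $(cat "$dir/current_clocksource")"
    echo "available: $(cat "$dir/available_clocksource")"
}

clocksource_report || true
```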
Summary
Linux is a good choice for RDBMS: Modern, Universal, Flexible, Stable.
Adapt Linux for your workloads.
Test → Change → Test → Commit/Rollback.

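The rollback step is easier if the old values are recorded before the change. One possible sketch saves the current sysctl values as "key = value" lines that `sysctl -p FILE` can load back; the helper name snapshot_sysctl, the PROC_SYS override, and the choice of keys are illustrative assumptions.

```shell
#!/bin/sh
# PROC_SYS is a hypothetical override so the helper can be pointed at a copy.
PROC_SYS="${PROC_SYS:-/proc/sys}"

# snapshot_sysctl KEY...: print "key = value" for each readable key.
snapshot_sysctl() {
    for key in "$@"; do
        path="$PROC_SYS/$(printf '%s' "$key" | tr . /)"
        if [ -r "$path" ]; then
            printf '%s = %s\n' "$key" "$(cat "$path")"
        fi
    done
}

# 1. Record the current state (redirect to a file to keep it).
snapshot_sysctl vm.swappiness kernel.sched_autogroup_enabled
# 2. Test, apply the change (e.g. sysctl -w vm.swappiness=1), test again.
# 3. Commit by adding the new value to sysctl.conf, or roll back as root
#    with: sysctl -p <saved file>
```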
Questions? Alexey Lesovsky
[email protected]
PostgreSQL-Consulting.com: Data maintenance at its best
https://postgresql-consulting.com