Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Tuning Linux for Databases
Search
Alexey Lesovsky
October 15, 2016
Education
0
58
Tuning Linux for Databases
Slides from my talk at HDConf 2016 Minsk, Belarus
Alexey Lesovsky
October 15, 2016
Tweet
Share
More Decks by Alexey Lesovsky
See All by Alexey Lesovsky
PostgreSQL Scaling Usecases
lesovsky
0
110
Patroni failure stories, or How to crash yout PostgreSQL cluster
lesovsky
0
440
PostgreSQL High Availability in 2019
lesovsky
0
450
Top developer's mistakes when working with PostgreSQL
lesovsky
1
160
Let's Pull the Plug On the Autovacuum (EN)
lesovsky
0
1.2k
Troubleshooting PostgreSQL for Developers (RU)
lesovsky
2
360
Troubleshooting PostgreSQL Streaming Replication
lesovsky
0
1k
Call of Postgres: Advanced Operations. Part I.
lesovsky
0
70
Call of Postgres: Advanced Operations. Part II.
lesovsky
0
95
Other Decks in Education
See All in Education
卒論の書き方 / Happy Writing
kaityo256
PRO
54
28k
TeXで変える教育現場
doratex
1
12k
心理学を学び活用することで偉大なスクラムマスターを目指す − 大学とコミュニティを組み合わせた学びの循環 / Becoming a great Scrum Master by learning and using psychology
psj59129
1
1.7k
栃木にいても「だいじ」だっぺ〜! 栃木&全国アジャイルコミュニティへの参加・運営の魅力
sasakendayo
1
140
国際卓越研究大学計画|Science Tokyo(東京科学大学)
sciencetokyo
PRO
0
47k
【ZEPホスト用メタバース校舎操作ガイド】
ainischool
0
170
HCI Research Methods - Lecture 7 - Human-Computer Interaction (1023841ANR)
signer
PRO
0
1.3k
✅ レポート採点基準 / How Your Reports Are Assessed
yasslab
PRO
0
280
都市の形成要因と 「都市の余白」のあり方
sakamon
0
150
KBS新事業創造体験2025_科目説明会
yasuchikawakayama
0
160
【旧:ZEPメタバース校舎操作ガイド】
ainischool
0
790
東大1年生にJulia教えてみた
matsui_528
7
12k
Featured
See All Featured
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
84
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
200
Measuring & Analyzing Core Web Vitals
bluesmoon
9
750
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
64
Utilizing Notion as your number one productivity tool
mfonobong
3
220
Jamie Indigo - Trashchat’s Guide to Black Boxes: Technical SEO Tactics for LLMs
techseoconnect
PRO
0
57
Speed Design
sergeychernyshev
33
1.5k
Bioeconomy Workshop: Dr. Julius Ecuru, Opportunities for a Bioeconomy in West Africa
akademiya2063
PRO
1
54
技術選定の審美眼(2025年版) / Understanding the Spiral of Technologies 2025 edition
twada
PRO
117
110k
Getting science done with accelerated Python computing platforms
jacobtomlinson
2
110
ラッコキーワード サービス紹介資料
rakko
1
2.2M
We Have a Design System, Now What?
morganepeng
54
8k
Transcript
None
About PostgreSQL DBA. Linux system administrator. PostgreSQL-Consulting.com: • 24/7 support.
• Audit, performance optimizations. • Consulting and Training. • Monitoring and Emergency. • Capacity planning. Slides: https://goo.gl/awmZ2H
Agenda RDBMS on Linux, why? Databases and Resources. OS subsystems.
CPU, Process scheduling, Power saving policies. Memory, VM, NUMA, Huge pages. Storage, File Systems, Input/Output. Other misc.
Why Linux? Linux is a good choice: • Active development
& Community support. • A lot of features & Fast implementation. • Stable & Mature & Durable.
Databases & Resources Concurrency Query speed Sort, group, hash,... OS
page cache DB buffer pool Local process cache DB data files Transaction Log Cold start CPU Memory Storage
Databases & Resources CPU Scheduling NUMA Power Saving Virtual Memory
NUMA Huge Pages File Systems Storage I/O CPU Memory Storage
Resources CPU scheduler. Virtual memory and NUMA. Huge pages. File
systems. Storage IO. Power saving policy. Others.
CPU scheduling CPU scheduler responsible for proper processes planning: Sysctl:
• kernel.sched_migration_cost_ns = 5000000 (default: 500000). • kernel.sched_autogroup_enabled = 0 (default: 1). http://www.postgresql.org/message-id/
[email protected]
http://kernelnewbies.org/Linux_2_6_38#head-59575a6aeafa38490226a560ee02de89829a5b20
CPU scheduling CPU scheduler responsible for proper processes planning: Sysctl:
• kernel.sched_migration_cost_ns = 5000000 (default: 500000). • kernel.sched_autogroup_enabled = 0 (default: 1). http://www.postgresql.org/message-id/
[email protected]
http://kernelnewbies.org/Linux_2_6_38#head-59575a6aeafa38490226a560ee02de89829a5b20 Be aware on Ubuntu: 12.04 #1055222 and 14.04 #1422016. Use noautogroup kernel param instead of sysctl.conf.
Virtual Memory What is it? Allocator, Caching, Dirty pages and
Writeback.
Virtual Memory
Virtual Memory Sysctl: vm.dirty_background_ratio & vm.dirty_ratio = disable it. vm.dirty_background_bytes
& vm.dirty_bytes = depends on ... RAID cache size, 64MB/128MB otherwise
Virtual Memory Out-of-memory & OOM-Killer Sysctl: vm.swappiness = 1 (default:
60)
NUMA S — Socket C — CPU core M —
Memory bank
NUMA BIOS: enable memory node interleaving. Kernel boot: numa=off. numactl
utility. Sysctl: • vm.zone_reclaim_mode = 0 (default: 0). • kernel.numa_balancing = 0 (default: 0).
Huge Pages Huge pages vs. Transparent huge pages. Huge pages
are supported by many RDBMS. Always disable transparent huge pages.
Huge Pages Huge pages vs. Transparent huge pages. Huge pages
are supported by many RDBMS. Always disable transparent huge pages. /etc/rc.local: • echo never > /sys/kernel/mm/transparent_hugepage/enabled • echo never > /sys/kernel/mm/transparent_hugepage/defrag
Filesystems Ext3 vs Ext4 vs XFS: what is better? Filesystem
Barriers.
Filesystems Ext3 vs Ext4 vs XFS: what is better? Filesystem
Barriers. Disable Write Cache: • hdparm -W0 /dev/device • MegaCli64 -LDSetProp -DisDskCache -Lall -aALL
Filesystems Ext3 vs Ext4 vs XFS: what is better? Filesystem
Barriers. Disable Write Cache: • hdparm -W0 /dev/device • MegaCli64 -LDSetProp -DisDskCache -Lall -aALL Hardware RAID + BBU = barrier=0 (disable). Software RAID = barrier=1 (enable).
Filesystems Ext3 vs Ext4 vs XFS: what is better? Filesystem
Barriers. Disable Write Cache: • hdparm -W0 /dev/device • MegaCli64 -LDSetProp -DisDskCache -Lall -aALL Hardware RAID + BBU = barrier=0 (disable). Software RAID = barrier=1 (enable). Enterprise SSD with Power Loss Protection = barrier=0 (disable).
Storage IO SATA/SAS vs SSD. IO elevators.
Storage IO SATA/SAS vs SSD. IO elevators: • noop: SSD,
PCIe SSD, hi-end storages. • deadline: RAID, SATA/SAS. • cfq: good default. • none (multi-queue block IO): SSD, PCIe SSD.
Storage IO SATA/SAS vs SSD. IO elevators: • noop: SSD,
PCIe SSD, hi-end storages. • deadline: RAID, SATA/SAS. • cfq: good default. • none (multi-queue block IO): SSD, PCIe SSD. # echo 'elevator_name' > /sys/block/<device>/queue/scheduler kernel boot: elevator=<name> /sys/block/*/queue/: rotational, rq_affinity, read_ahead_kb
Power Saving Policy Drivers: acpi_cpufreq vs. intel_pstate. scaling_governor.
Power Saving Policy Drivers: acpi_cpufreq vs. intel_pstate. scaling_governor: • /sys/devices/system/cpu/cpuX/cpufreq/scaling_available_governors
• /sys/devices/system/cpu/cpuX/cpufreq/scaling_governor
Power Saving Policy Drivers: acpi_cpufreq vs. intel_pstate. scaling_governor: • /sys/devices/system/cpu/cpuX/cpufreq/scaling_available_governors
• /sys/devices/system/cpu/cpuX/cpufreq/scaling_governor acpi_cpufreq + performance. intel_pstate + powersave.
Misc: Clocksources What is clocksource? acpi_pm vs. hpet vs. tsc.
/sys/devices/system/clocksource/clocksource0/available_clocksource. /sys/devices/system/clocksource/clocksource0/current_clocksource.
Summary Linux is a good choice for RDBMS: Modern, Universal,
Flexible, Stable. Adapt Linux for your workloads. Test → Change → Test → Commit/Rollback.
Questions? Alexey Lesovsky
[email protected]
PostgreSQL-Consulting.com: Data maintenance at its best
https://postgresql-consulting.com