The Wonders of NUMA

The Wonders of NUMA (Or Why Your High-Performance Application Doesn't
Perform) Stephen Finucane OpenStack Software Developer 23rd May 2018

What is NUMA?

INSERT DESIGNATOR, IF NEEDED 3 What is NUMA? Non-Uniform Memory
Architecture

INSERT DESIGNATOR, IF NEEDED 4 What is NUMA? UMA (Uniform
Memory Access) Historically, all memory on x86 systems is equally accessible by all CPUs. Known as Uniform Memory Access (UMA), access times are the same no matter which CPU performs the operation. NUMA (Non-Uniform Memory Access) This behavior is no longer the case with recent x86 processors. In Non-Uniform Memory Access (NUMA), system memory is divided into zones (called nodes), which are allocated to particular CPUs or sockets. Access to memory that is local to a CPU is faster than memory connected to remote CPUs on that system.

INSERT DESIGNATOR, IF NEEDED 5 What is NUMA? node A
node B Local Access Remote Access Memory Channel Interconnect Memory Channel

node B node C node D

INSERT DESIGNATOR, IF NEEDED 8

NUMA in OpenStack

INSERT DESIGNATOR, IF NEEDED 10 NUMA in OpenStack • NUMA
Guest Topologies • Guest vCPU Placement • PCI NUMA Affinity • vGPU, Neutron Network NUMA Affinity

INSERT DESIGNATOR, IF NEEDED $ openstack flavor create --vcpus 6
--ram 6144 --disk 20 test.numa 12 NUMA Guest Topologies

--ram 6144 --disk 20 test.numa $ openstack flavor set test.numa \ --property hw:numa_nodes=2 13 NUMA Guest Topologies

--ram 6144 --disk 20 test.numa $ openstack flavor set test.numa \ --property hw:numa_nodes=2 \ --property hw:numa_cpus.0=0-3 \ --property hw:numa_cpus.1=4,5 \ --property hw:numa_mem.0=4096 \ --property hw:numa_mem.1=2048 14 NUMA Guest Topologies

--ram 6144 --disk 20 test.numa $ openstack flavor set test.numa \ --property hw:numa_nodes=2 \ --property hw:numa_cpus.0=0-3 \ --property hw:numa_cpus.1=4,5 \ # guest vCPUs - not host CPUs --property hw:numa_mem.0=4096 \ --property hw:numa_mem.1=2048 15 NUMA Guest Topologies

--ram 4096 --disk 20 test.pinned 18 Guest vCPU Placement

--ram 4096 --disk 20 test.pinned $ openstack flavor set test.pinned \ --property hw:cpu_policy=dedicated 19 Guest vCPU Placement

INSERT DESIGNATOR, IF NEEDED 20 Guest vCPU Placement node #0
core #0 core #1 core #3 core #2 node #1 core #0 core #1 core #3 core #2

--ram 4096 --disk 20 test.pinned $ openstack flavor set test.pinned \ --property hw:cpu_policy=dedicated --property hw:numa_nodes=2 28 Guest vCPU Placement

INSERT DESIGNATOR, IF NEEDED 34 PCI NUMA Affinity node #0
core #1 core #2 core #5 core #4 core #0 core #3 node #1 core #1 core #2 core #5 core #4 core #0 core #3

INSERT DESIGNATOR, IF NEEDED [pci] alias = '{ "name": "QuickAssist",
"product_id": "0443", "vendor_id": "8086", "device_type": "type-PCI" }' 35 PCI NUMA Affinity

--ram 4096 --disk 20 test.pci $ openstack flavor set test.pci \ --property pci_passthrough:alias=QuickAssist:1 36 PCI NUMA Affinity

"product_id": "0443", "vendor_id": "8086", "device_type": "type-PCI" }' 40 PCI NUMA Affinity

"product_id": "0443", "vendor_id": "8086", "device_type": "type-PCI", "numa_policy": "preferred" # or 'legacy' or 'required' }' 41 PCI NUMA Affinity

Guest Topologies • Guest vCPU Placement • PCI NUMA Affinity • vGPU, Neutron Network NUMA Affinity *coming soon*

Common Questions

INSERT DESIGNATOR, IF NEEDED 47 Common Questions • Can I
choose what host NUMA nodes my guest runs on?

choose what host NUMA nodes my guest runs on? ◦ We don’t support this by design

choose what host NUMA nodes my guest runs on? ◦ We don’t support this by design • Why would I want a multi-node guest?

choose what host NUMA nodes my guest runs on? ◦ We don’t support this by design • Why would I want a multi-node guest? ◦ By necessity ▪ Large core counts ▪ Multiple PCI devices with different NUMA affinities ◦ Application requirements

choose what host NUMA nodes my guest runs on? ◦ We don’t support this by design • Why would I want a multi-node guest? ◦ By necessity ▪ Large core counts ▪ Multiple PCI devices with different NUMA affinities ◦ Application requirements • Can a guest’s NUMA nodes share the same host node?

choose what host NUMA nodes my guest runs on? ◦ We don’t support this by design • Why would I want a multi-node guest? ◦ By necessity ▪ Large core counts ▪ Multiple PCI devices with different NUMA affinities ◦ Application requirements • Can a guest’s NUMA nodes share the same host node? ◦ Not at the moment

INSERT DESIGNATOR, IF NEEDED 53 Common Misconceptions

INSERT DESIGNATOR, IF NEEDED 54 Common Misconceptions • Host NUMA
node selection ◦ You can’t dictate what node is used - nova must decide

node selection ◦ You can’t dictate what node is used - nova must decide • Host sockets != NUMA nodes ◦ Cluster-on-Die is a thing

node selection ◦ You can’t dictate what node is used - nova must decide • Host sockets != NUMA nodes ◦ Cluster-on-Die is a thing • Guest sockets != NUMA nodes ◦ You can specify hw:numa_nodes and hw:cpu_sockets

node selection ◦ You can’t dictate what node is used - nova must decide • Host sockets != NUMA nodes ◦ Cluster-on-Die is a thing • Guest sockets != NUMA nodes ◦ You can specify hw:numa_nodes and hw:cpu_sockets • CPU pinning isn’t a requirement ◦ It’s just common in these scenarios

Questions?

THANK YOU plus.google.com/+RedHat linkedin.com/company/red-hat youtube.com/user/RedHatVideos facebook.com/redhatinc twitter.com/RedHatNews

INSERT DESIGNATOR, IF NEEDED 60 Resources You might want to
know about these... • RHEL NUMA Tuning Guide • Attaching physical PCI devices to guests • Nova Flavors Guide

The Wonders of NUMA

The Wonders of NUMA

More Decks by Stephen Finucane

Other Decks in Technology

Featured

Transcript