yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #46968
[Bug 1550269] [NEW] 'hw:cpu_thread_policy=require' does not function correctly if NUMATopologyFilter is disabled
Public bug reported:
The 'require' policy is supposed to restrict instances to hosts that
support hardware threads, e.g. HyperThreading. However, this filtering
is done as part of the NUMATopologyFilter. If this filter is disabled,
the host boots just fine. This should not be the case and needs to be
fixed. Findings below.
---
Testing was conducted on a single-node, Fedora 23-based (4.3.5-300.fc23.x86_64)
OpenStack instance (built with devstack). The system is a dual-socket, ten core,
HT-enabled system (2 sockets * 10 cores * 2 threads = 40 "pCPUs".
0-9,20-29 = node0, 10-19,30-39 = node1).
Commit '8bafc9' of Nova was used.
# Steps
## Create flavors
$ openstack flavor create pinned.require \
--id 102 --ram 2048 --disk 0 --vcpus 4
$ openstack flavor set pinned.require \
--property "hw:cpu_policy=dedicated" \
--property "hw:cpu_thread_policy=require"
## Validate a HT-enabled node
The 'require' case is a stricter version of the `prefer` case, in that it
should fail if we have HyperThreading disabled, do not have enough free
sibling sets, or have no HyperThreading support at all. However, since we're
not hitting any of these conditions on this host, things should function just
like they do for the `prefer` case. Therefore, the guest should see a two
sockets with one core per socket and two threads per core.
$ openstack server create --flavor=pinned.require \
--image=cirros-0.3.4-x86_64-uec --wait test1
$ sudo virsh list
Id Name State
----------------------------------------------------
2 instance-00000002 running
$ sudo virsh dumpxml 2
<domain type='kvm' id='2'>
<name>instance-00000002</name>
...
<vcpu placement='static'>4</vcpu>
<cputune>
<shares>4096</shares>
<vcpupin vcpu='0' cpuset='1'/>
<vcpupin vcpu='1' cpuset='21'/>
<vcpupin vcpu='2' cpuset='0'/>
<vcpupin vcpu='3' cpuset='20'/>
<emulatorpin cpuset='0-1,20-21'/>
</cputune>
<numatune>
<memory mode='strict' nodeset='0'/>
<memnode cellid='0' mode='strict' nodeset='0'/>
</numatune>
...
<cpu>
<topology sockets='2' cores='1' threads='2'/>
<numa>
<cell id='0' cpus='0-3' memory='2097152' unit='KiB'/>
</numa>
</cpu>
...
</domain>
$ openstack server delete test1
No issues here.
## Validate a HT-disabled node
This policy "requires" HyperThreading or similar on the host, so it shouldn't
work here.
$ openstack server create --flavor=pinned.require \
--image=cirros-0.3.4-x86_64-uec --wait test1
$ sudo virsh list
Id Name State
----------------------------------------------------
2 instance-00000002 running
$ sudo virsh dumpxml 2
<domain type='kvm' id='2'>
<name>instance-00000002</name>
...
<vcpu placement='static'>4</vcpu>
<cputune>
<shares>4096</shares>
<vcpupin vcpu='0' cpuset='0'/>
<vcpupin vcpu='1' cpuset='1'/>
<vcpupin vcpu='2' cpuset='2'/>
<vcpupin vcpu='3' cpuset='3'/>
<emulatorpin cpuset='0-3'/>
</cputune>
<numatune>
<memory mode='strict' nodeset='0'/>
<memnode cellid='0' mode='strict' nodeset='0'/>
</numatune>
...
<cpu>
<topology sockets='2' cores='1' threads='2'/>
<numa>
<cell id='0' cpus='0-3' memory='2097152' unit='KiB'/>
</numa>
</cpu>
...
</domain>
$ openstack server delete test1
This is a problem, but we do not currently have the filter activated:
$ cat /etc/nova/nova.conf | grep scheduler_default_filters
scheduler_default_filters = RetryFilter,AvailabilityZoneFilter,RamFilter,\
DiskFilter,ComputeFilter,ComputeCapabilitiesFilter,ImagePropertiesFilter,\
ServerGroupAntiAffinityFilter,ServerGroupAffinityFilter,SameHostFilter,\
DifferentHostFilter
Let's activate this:
$ cat /etc/nova/nova.conf | grep scheduler_default_filters
scheduler_default_filters = RetryFilter,AvailabilityZoneFilter,RamFilter,\
DiskFilter,ComputeFilter,ComputeCapabilitiesFilter,ImagePropertiesFilter,\
ServerGroupAntiAffinityFilter,ServerGroupAffinityFilter,SameHostFilter,\
DifferentHostFilter,NUMATopologyFilter
And try again:
$ openstack server create --flavor=pinned.require \
--image=cirros-0.3.4-x86_64-uec --wait test1
Error creating server: test1
Error creating server
That's more like it, but it shouldn't be necessary.
** Affects: nova
Importance: Undecided
Status: New
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1550269
Title:
'hw:cpu_thread_policy=require' does not function correctly if
NUMATopologyFilter is disabled
Status in OpenStack Compute (nova):
New
Bug description:
The 'require' policy is supposed to restrict instances to hosts that
support hardware threads, e.g. HyperThreading. However, this filtering
is done as part of the NUMATopologyFilter. If this filter is disabled,
the host boots just fine. This should not be the case and needs to be
fixed. Findings below.
---
Testing was conducted on a single-node, Fedora 23-based (4.3.5-300.fc23.x86_64)
OpenStack instance (built with devstack). The system is a dual-socket, ten core,
HT-enabled system (2 sockets * 10 cores * 2 threads = 40 "pCPUs".
0-9,20-29 = node0, 10-19,30-39 = node1).
Commit '8bafc9' of Nova was used.
# Steps
## Create flavors
$ openstack flavor create pinned.require \
--id 102 --ram 2048 --disk 0 --vcpus 4
$ openstack flavor set pinned.require \
--property "hw:cpu_policy=dedicated" \
--property "hw:cpu_thread_policy=require"
## Validate a HT-enabled node
The 'require' case is a stricter version of the `prefer` case, in that it
should fail if we have HyperThreading disabled, do not have enough free
sibling sets, or have no HyperThreading support at all. However, since we're
not hitting any of these conditions on this host, things should function just
like they do for the `prefer` case. Therefore, the guest should see a two
sockets with one core per socket and two threads per core.
$ openstack server create --flavor=pinned.require \
--image=cirros-0.3.4-x86_64-uec --wait test1
$ sudo virsh list
Id Name State
----------------------------------------------------
2 instance-00000002 running
$ sudo virsh dumpxml 2
<domain type='kvm' id='2'>
<name>instance-00000002</name>
...
<vcpu placement='static'>4</vcpu>
<cputune>
<shares>4096</shares>
<vcpupin vcpu='0' cpuset='1'/>
<vcpupin vcpu='1' cpuset='21'/>
<vcpupin vcpu='2' cpuset='0'/>
<vcpupin vcpu='3' cpuset='20'/>
<emulatorpin cpuset='0-1,20-21'/>
</cputune>
<numatune>
<memory mode='strict' nodeset='0'/>
<memnode cellid='0' mode='strict' nodeset='0'/>
</numatune>
...
<cpu>
<topology sockets='2' cores='1' threads='2'/>
<numa>
<cell id='0' cpus='0-3' memory='2097152' unit='KiB'/>
</numa>
</cpu>
...
</domain>
$ openstack server delete test1
No issues here.
## Validate a HT-disabled node
This policy "requires" HyperThreading or similar on the host, so it shouldn't
work here.
$ openstack server create --flavor=pinned.require \
--image=cirros-0.3.4-x86_64-uec --wait test1
$ sudo virsh list
Id Name State
----------------------------------------------------
2 instance-00000002 running
$ sudo virsh dumpxml 2
<domain type='kvm' id='2'>
<name>instance-00000002</name>
...
<vcpu placement='static'>4</vcpu>
<cputune>
<shares>4096</shares>
<vcpupin vcpu='0' cpuset='0'/>
<vcpupin vcpu='1' cpuset='1'/>
<vcpupin vcpu='2' cpuset='2'/>
<vcpupin vcpu='3' cpuset='3'/>
<emulatorpin cpuset='0-3'/>
</cputune>
<numatune>
<memory mode='strict' nodeset='0'/>
<memnode cellid='0' mode='strict' nodeset='0'/>
</numatune>
...
<cpu>
<topology sockets='2' cores='1' threads='2'/>
<numa>
<cell id='0' cpus='0-3' memory='2097152' unit='KiB'/>
</numa>
</cpu>
...
</domain>
$ openstack server delete test1
This is a problem, but we do not currently have the filter activated:
$ cat /etc/nova/nova.conf | grep scheduler_default_filters
scheduler_default_filters = RetryFilter,AvailabilityZoneFilter,RamFilter,\
DiskFilter,ComputeFilter,ComputeCapabilitiesFilter,ImagePropertiesFilter,\
ServerGroupAntiAffinityFilter,ServerGroupAffinityFilter,SameHostFilter,\
DifferentHostFilter
Let's activate this:
$ cat /etc/nova/nova.conf | grep scheduler_default_filters
scheduler_default_filters = RetryFilter,AvailabilityZoneFilter,RamFilter,\
DiskFilter,ComputeFilter,ComputeCapabilitiesFilter,ImagePropertiesFilter,\
ServerGroupAntiAffinityFilter,ServerGroupAffinityFilter,SameHostFilter,\
DifferentHostFilter,NUMATopologyFilter
And try again:
$ openstack server create --flavor=pinned.require \
--image=cirros-0.3.4-x86_64-uec --wait test1
Error creating server: test1
Error creating server
That's more like it, but it shouldn't be necessary.
To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1550269/+subscriptions
Follow ups