OpenStack - Ironic Race Condition during Deployment
With RDO Manager/RH-OSP, I encountered a problem with Ironic & Nova during a deployment of ~30 physical servers.
By default, Nova is able to launch 10 instances builds to run concurrently but actually, Ironic can’t deal with and it cause a race condition…
The problem is Ironic try to attach a wrong profile/instance to a node.
Error example: Compute profile on a Controller server.
The workaround consists to:
- Decrease the number of max concurrent builds from
10
to 2
- Set the number of nodes you want to deploy on
scheduler_max_attempts
parameter
- Reduce size of the RPC thread pool from
64
to 4
/etc/nova/nova.conf
/etc/ironic/ironic.conf
Don’t forget to restart Ironic & Nova services.
This is the minimum values, you can increase them to find a better adjustment.
More information here