Configuring Resource Relocation Modes
You can configure how the cluster handles the failure of a Hardware Node. Three modes are available:
- DRS (default). In this mode, virtual machines and Containers which were running on a failed Hardware Node are relocated to healthy Nodes based on available RAM and license capacity. This mode can be used for Nodes on which the pdrs service is running.
Note: If CPU pools are used, virtual machines and Containers can only be relocated to other Nodes in the same CPU pool. For details, see Managing CPU Pools.
The DRS mode works as follows. The master DRS continuously collects the following data from each healthy Node in the cluster via SNMP:
- total Node RAM,
- total RAM used by virtual machines,
- total RAM used by Containers,
- maximum running virtual machines allowed,
- maximum running Containers allowed,
- maximum running virtual machines and Containers allowed.
If a Node fails, the shaman service sends a list of virtual machines and Containers which were running on that Node to the master DRS, which sorts the list by required RAM in descending order. Using the collected data on Node RAM and licenses, the master DRS then attempts to find a Node with the most available RAM and a suitable license for the virtual environment at the top of the list (the one requiring the most RAM). If such a Node exists, the master DRS marks the virtual environment for relocation to that Node. Otherwise, it marks the virtual environment as 'broken'. The master DRS then processes the next virtual environment on the list, first adjusting the collected Node data by the requirements of the previous virtual environment. Having processed all virtual environments on the list, the master DRS sends the list back to the shaman service for actual relocation.
- Spare. In this mode, virtual machines and Containers from a failed Hardware Node are relocated to a target backup Node—an empty Hardware Node with enough resources and a license to host all virtual environments from any given Node in the cluster. This Node is required for High Availability to work in this mode. To switch to this mode, use this command:
# shaman set-config RESOURCE_RELOCATION_MODE=spare
- Round-robin (default fallback). In this mode, virtual machines, Containers, and iSCSI targets from a failed Hardware Node are relocated to healthy Nodes in a round-robin manner. To switch to this mode, use this command:
# shaman set-config RESOURCE_RELOCATION_MODE=round-robin
Additionally, you can set a fallback relocation mode in case the chosen relocation mode fails. For example:
# shaman set-config RESOURCE_RELOCATION_MODE=drs,spare
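The placement procedure described for the DRS mode can be illustrated with a short sketch. This is not the actual pdrs implementation, only a minimal model of the steps as described above. The names Node, VirtualEnvironment, fits, and plan_relocation are hypothetical, license limits are modeled as simple counters of remaining slots, RAM is given in megabytes, and the sketch assumes that a Node qualifies only if its available RAM covers the requirement of the virtual environment.

```python
from dataclasses import dataclass, replace


@dataclass
class Node:
    # Hypothetical model of the per-Node data collected by the master DRS.
    name: str
    free_ram: int     # MB: total RAM minus RAM used by VMs and Containers
    vm_slots: int     # running virtual machines still allowed by the license
    ct_slots: int     # running Containers still allowed by the license
    total_slots: int  # running VMs and Containers still allowed combined


@dataclass
class VirtualEnvironment:
    name: str
    kind: str  # "vm" or "ct"
    ram: int   # MB of RAM required


def fits(node, ve):
    # Assumption: "suitable license" means a free slot of the right kind
    # and a free combined slot; the Node must also have enough free RAM.
    kind_slots = node.vm_slots if ve.kind == "vm" else node.ct_slots
    return node.free_ram >= ve.ram and kind_slots > 0 and node.total_slots > 0


def plan_relocation(nodes, ves):
    # Work on copies so the caller's Node data is left untouched.
    nodes = [replace(n) for n in nodes]
    plan = {}
    # Process virtual environments from the most to the least RAM required.
    for ve in sorted(ves, key=lambda v: v.ram, reverse=True):
        candidates = [n for n in nodes if fits(n, ve)]
        if not candidates:
            plan[ve.name] = "broken"
            continue
        # Pick the candidate Node with the most available RAM.
        target = max(candidates, key=lambda n: n.free_ram)
        plan[ve.name] = target.name
        # Adjust the Node data by the requirements of this environment.
        target.free_ram -= ve.ram
        target.total_slots -= 1
        if ve.kind == "vm":
            target.vm_slots -= 1
        else:
            target.ct_slots -= 1
    return plan


healthy = [Node("A", 8192, 10, 10, 10), Node("B", 4096, 10, 10, 10)]
failed = [
    VirtualEnvironment("vm1", "vm", 6144),
    VirtualEnvironment("ct1", "ct", 3072),
    VirtualEnvironment("ct2", "ct", 4096),
]
print(plan_relocation(healthy, failed))
# {'vm1': 'A', 'ct2': 'B', 'ct1': 'broken'}
```

In this example, ct1 is marked as 'broken' because neither Node has 3072 MB left once the two larger virtual environments have been placed.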