Insufficient resources to satisfy HA failover level on cluster

I had this question yesterday where the error “Insufficient resources to satisfy HA failover level on cluster” comes from. And although it is hopefully clear to all of my regular readers this is caused by something that is called vSphere HA Admission Control, I figured I would reemphasize it and make sure people can easily find it when they do a search on my website.

When vSphere HA Admission Control is enabled vCenter Server validates if enough resources are available to guarantee all virtual machines can be restarted. If this is not the case the error around the HA failover level will appear. So what could cause this to happen and how do you solve it?

Are all hosts in your cluster still available (any hosts down )?
- If a host is down it could be insufficient resource are available to guarantee restarts
Check which admission control policy has been selected
- Depending on which policy has been selected a single large reservation could skew the admission control algorithm (primarily “host failures” policy is impacted by this)
Admission Control was recently enabled
- Could be that the cluster was overcommitted, or various reservations are used, causing the policy to be violated directly when enabled

In most cases when this error pops up it is caused by a large reservation on memory or CPU and that should always be the first thing to check. There are probably a million scripts out there to check this, but I prefer to use either the CloudPhysics appliance (cloud based flexible solution with new reports weekly), or RVTools which is a nice Windows based utility that produces quick reports. If you are interested in more in-depth info on admission control I suggest reading this section of my vSphere HA deepdive page.

Comments

ben @ geekswing says

4 June, 2013 at 23:08

The host failures policy definitely impacted us. We had the default of “1” set and couldn’t turn any VMs on even though we were barely using 20% of our resources. Turns out the calcluations for the host failures is pretty conservative. Good idea to go through all your VMs to check reservations or change to the percentage policy. Took me awhile to get through it

Nice post.
ben @ geekswing says

4 June, 2013 at 23:40

Just went through a bit briefly on your deep dives. HOLY MOLY! Will have to check those out when I have more time!
- Duncan Epping says
  
  5 June, 2013 at 09:01
  
  It is called deepdive for a reason right 🙂

Related

Reader Interactions

Comments