[op5-users] check_esx3.pl high CPU load.

Mattias Bergsten mattias at westbahr.com
Fri Feb 19 18:06:39 CET 2010


On 2010-02-19 14:57, Otto Wincrantz wrote:
> The script takes about 5-7 seconds to execute in my environment, with 15
> virtual hosts, 60 VMs and a 5 minute check schedule I am looking at
> close to a constant 100% CPU load on my monitoring server.

Make sure you're using the latest version of the plugin, there have been 
many performance improvements since it was written. It can be found in 
git at op5.org.

Unfortunately the VMware Perl API is a little strange, and there's not 
always much choice in requesting data. Sometimes all you can do is say 
"give me all the data" and then filter it clientside, which isn't all 
that efficient.

Having said that, there are definitely optimizations to be done to the 
script, and it is something that is on the roadmap (Peter, do you have 
more info on this? We've talked about it but I can't remember what 
timeframe we said except "post 5.0 release"..) but it might take a while 
more until someone at op5 can look at it. If anyone is conversant with 
Perl optimization and would like to take a look and submit some patches, 
by all means feel free - it would be appreciated by many.

Until then, I suggest running as few checks as possible, i.e., if you 
have a VirtualCenter, run _one_ VMFS check against the VirtualCenter to 
get the data for all hosts and all filesystems. I realize it's not a 
perfect solution but at least it's a workable one. :)

(hälsa Steve!)

/mattias, who does not work at op5 but helped create check_esx3



More information about the op5-users mailing list