[one-users] Host hook failure

Carlos Jiménez cjimenez at eneotecnologia.com
Mon Dec 10 02:54:57 PST 2012


Hi all,

I have one computer running OpenNebula and two hosts with KVM.
I have configured "HOST_HOOK" and "VM_HOOK" in /etc/one/oned.conf:

HOST_MONITORING_INTERVAL = 600
VM_POLLING_INTERVAL      = 600
HOST_HOOK = [
    name      = "error",
    on        = "ERROR",
    command   = "ft/host_error.rb",
    arguments = "$ID -r",
    remote    = "no" ]
VM_HOOK = [
    name      = "on_failure_resubmit",
    on        = "FAILED",
    command   = "/usr/bin/env onevm resubmit",
    arguments = "$ID" ]

I powered off one of the hosts while one VM was running on it. The VM 
polling and host monitoring intervals are set to their defaults. After 
10-15 minutes, the host is detected as failed (ERROR state), but the VM 
has not been resubmitted. According to the logs, the problem seems to be 
in the execution of the host hook script (ft/host_error.rb).
The failed host ID is 5 and the affected VM ID is 36.

This is the relevant part of the log:

Mon Dec 10 11:47:34 2012 [InM][I]: Monitoring host host2 (5)
Mon Dec 10 11:47:34 2012 [InM][I]: Monitoring host host1 (6)
Mon Dec 10 11:47:34 2012 [ReM][D]: Req:4576 UID:0 AclInfo invoked
Mon Dec 10 11:47:34 2012 [ReM][D]: Req:4576 UID:0 AclInfo result SUCCESS, "<ACL_POOL><ACL><ID>0..."
Mon Dec 10 11:47:37 2012 [InM][I]: Command execution fail: 'if [ -x "/var/tmp/one/im/run_probes" ]; then /var/tmp/one/im/run_probes kvm 5 host2; else                              exit 42; fi'
Mon Dec 10 11:47:37 2012 [InM][I]: ssh: connect to host host2 port 22: No route to host
Mon Dec 10 11:47:37 2012 [InM][I]: ExitCode: 255
Mon Dec 10 11:47:37 2012 [InM][E]: Error monitoring host 5 : MONITOR FAILURE 5 -
Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: LOG I 5 Command execution fail: /var/lib/one/remotes//hooks/ft/host_error.rb 5 -r
Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: LOG I 5 ExitCode: 255
Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: EXECUTE FAILURE 5 error: -
Mon Dec 10 11:47:37 2012 [InM][I]: ExitCode: 0
Mon Dec 10 11:47:37 2012 [InM][D]: Host 6 successfully monitored.
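
In case it helps to separate the hook manager output from the monitoring noise, here is a minimal sketch of a filter that keeps only the [HKM] lines. It assumes the "timestamp [module][level]: message" layout seen in the log above; the names hook_messages, LOG_LINE and sample are made up for illustration and are not part of OpenNebula.

```python
import re

# Matches oned.log lines of the form seen above, for example:
#   Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: ...
# The layout is inferred from the log excerpt, not from OpenNebula docs.
LOG_LINE = re.compile(
    r"(?P<ts>\w{3} \w{3} +\d+ \d{2}:\d{2}:\d{2} \d{4}) "
    r"\[(?P<module>\w+)\]\[(?P<level>\w)\]: (?P<msg>.*)"
)


def hook_messages(log_text):
    """Return (timestamp, message) pairs logged by the hook manager (HKM)."""
    found = []
    for line in log_text.splitlines():
        m = LOG_LINE.search(line)
        if m and m.group("module") == "HKM":
            found.append((m.group("ts"), m.group("msg").strip()))
    return found


# Hypothetical sample built from three lines of the log above.
sample = """\
Mon Dec 10 11:47:37 2012 [InM][I]: ExitCode: 255
Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: EXECUTE FAILURE 5 error: -
Mon Dec 10 11:47:37 2012 [InM][D]: Host 6 successfully monitored.
"""

for ts, msg in hook_messages(sample):
    print(ts, "|", msg)
```

On the excerpt above this leaves only the three HKM messages, which show the hook being invoked as "host_error.rb 5 -r" and exiting with code 255.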


Could you give advice on how to solve this issue?
Let me know if you need more information.


Thanks in advance.

Carlos.

