[one-users] Host hook failure
Carlos Jiménez
cjimenez at eneotecnologia.com
Mon Dec 10 04:18:02 PST 2012
Hi all,
Problem solved. It was a bug previously reported and solved by Tino
Vázquez. After uncommenting several lines in host_error.rb all works fine.
Kind Regards,
Carlos.
On 12/10/2012 11:54 AM, Carlos Jiménez wrote:
> Hi all,
>
> I have one computer running OpenNebula and two hosts with KVM.
> I have configured "HOST_HOOK" and "VM_HOOK" on /etc/one/oned.conf:
> /
> //HOST_MONITORING_INTERVAL = 600//
> //VM_POLLING_INTERVAL = 600//
> //HOST_HOOK = [//
> // name = "error",//
> // on = "ERROR",//
> // command = "ft/host_error.rb",//
> // arguments = "$ID -r",//
> // remote = "no" ]//
> //VM_HOOK = [//
> // name = "on_failure_resubmit",//
> // on = "FAILED",//
> // command = "/usr/bin/env onevm resubmit",//
> // arguments = "$ID" ]/
>
> I've powered off one of the hosts, with one VM running. The VM polling
> and Host monitoring intervals are set to default. After 10-15 minutes,
> the host is detected as failed (ERROR state) but the VM is not
> resubmitted yet. According to the logs, it seems to be a problem in
> the execution of the "recovery" script.
> The failed host ID is 5 and the affected VM ID is 36.
>
> This is the relevant part of the log:
> /
> //Mon Dec 10 11:47:34 2012 [InM][I]: Monitoring host host2 (5)//
> //Mon Dec 10 11:47:34 2012 [InM][I]: Monitoring host host1 (6)//
> //Mon Dec 10 11:47:34 2012 [ReM][D]: Req:4576 UID:0 AclInfo invoked//
> //Mon Dec 10 11:47:34 2012 [ReM][D]: Req:4576 UID:0 AclInfo result
> SUCCESS, "<ACL_POOL><ACL><ID>0..."//
> //Mon Dec 10 11:47:37 2012 [InM][I]: Command execution fail: 'if [ -x
> "/var/tmp/one/im/run_probes" ]; then /var/tmp/one/im/run_probes kvm 5
> host2; else exit 42; fi'//
> //Mon Dec 10 11:47:37 2012 [InM][I]: ssh: connect to host host2 port
> 22: No route to host//
> //Mon Dec 10 11:47:37 2012 [InM][I]: ExitCode: 255//
> //Mon Dec 10 11:47:37 2012 [InM][E]: Error monitoring host 5 : MONITOR
> FAILURE 5 -//
> //
> //Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: LOG I 5 Command
> execution fail: /var/lib/one/remotes//hooks/ft/host_error.rb 5 -r//
> //
> //Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: LOG I 5
> ExitCode: 255//
> //
> //Mon Dec 10 11:47:37 2012 [HKM][D]: Message received: EXECUTE FAILURE
> 5 error: -//
> //
> //Mon Dec 10 11:47:37 2012 [InM][I]: ExitCode: 0//
> //Mon Dec 10 11:47:37 2012 [InM][D]: Host 6 successfully monitored./
>
>
> Could you give advice on how to solve this issue?
> Let me know if you need more information.
>
>
> Thanks in advance.
>
> Carlos.
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opennebula.org/pipermail/users-opennebula.org/attachments/20121210/85dcaa17/attachment-0001.htm>
More information about the Users
mailing list