[one-users] error monitoring host - even though logs indicate success

Ruben S. Montero rsmontero at opennebula.org
Thu Aug 2 13:17:21 PDT 2012


Hi,

No, the host is not treated as being in an error state; its STATE is
MONITORED, as your output shows. The ERROR message (including its
TIMESTAMP) is left in the host template for informational purposes only.

Once you have verified the issue, you can safely remove the message by
running

onehost update 1

and deleting the ERROR attribute in the editor that opens.
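
If you would rather check this from a script than by eye, here is a minimal
sketch in Python. It assumes the text layout of 'onehost show' matches the
output quoted below; the function name summarize_host and the SAMPLE string
are made up for illustration and are not part of OpenNebula. It reads the
STATE field, which tells you how the host is currently treated, separately
from the leftover ERROR attribute.

```python
import re

# Trimmed copy of the 'onehost show 1' output quoted in this thread,
# without the "> " mail quoting. The layout is assumed, not guaranteed.
SAMPLE = '''\
STATE                 : MONITORED
LAST MONITORING TIME  : 1343923509
ERROR=[
  MESSAGE="Error monitoring host 1 : MONITOR FAILURE 1 -
",
  TIMESTAMP="Tue Jul 31 12:30:31 2012" ]
FREECPU=2395.2
'''


def summarize_host(show_output):
    """Return the host STATE and whether an ERROR attribute is present."""
    # STATE is what reflects the current status of the host.
    state = re.search(r"^STATE\s*:\s*(\S+)", show_output, re.MULTILINE)
    # ERROR=[ ... ] may span several lines, so let '.' match newlines too.
    error = re.search(r'^ERROR=\[.*?TIMESTAMP="([^"]*)"',
                      show_output, re.MULTILINE | re.DOTALL)
    return {
        "state": state.group(1) if state else None,
        "has_error_attr": error is not None,
        "error_timestamp": error.group(1) if error else None,
    }


if __name__ == "__main__":
    print(summarize_host(SAMPLE))
```

A host whose state is MONITORED but which still carries an ERROR attribute
with an old timestamp is the situation described here: the message is only a
record of the earlier failure.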

Cheers

Ruben

On Thu, Aug 2, 2012 at 8:34 PM, Shantanu Pavgi <pavgi at uab.edu> wrote:

>
> I am getting the following monitoring error on one of the hosts, as shown below:
>
> {{{
>
> $ onehost show 1
> HOST 1 INFORMATION
> ID                    : 1
> NAME                  : kvm-04
> STATE                 : MONITORED
> IM_MAD                : im_kvm
> VM_MAD                : vmm_kvm
> VN_MAD                : dummy
> TM_MAD                : tm_shared
> LAST MONITORING TIME  : 1343923509
>
> HOST SHARES
> MAX MEM               : 49409948
> USED MEM (REAL)       : 9608032
> USED MEM (ALLOCATED)  : 16777216
> MAX CPU               : 2400
> USED CPU (REAL)       : 4
> USED CPU (ALLOCATED)  : 600
> MAX DISK              : 0
> USED DISK (REAL)      : 0
> USED DISK (ALLOCATED) : 0
> RUNNING VMS           : 7
>
> MONITORING INFORMATION
> ARCH=x86_64
> CPUSPEED=2660
> ERROR=[
>   MESSAGE="Error monitoring host 1 : MONITOR FAILURE 1 -
> ",
>   TIMESTAMP="Tue Jul 31 12:30:31 2012" ]
> FREECPU=2395.2
> FREEMEMORY=39801916
> HOSTNAME=kvm-04.uabgrid.uab.edu
> HYPERVISOR=kvm
> MODELNAME="Intel(R) Xeon(R) CPU X5650 @ 2.67GHz"
> NETRX=0
> NETTX=0
> TOTALCPU=2400
> TOTALMEMORY=49409948
> USEDCPU=4.80000000000018
> USEDMEMORY=9608032
>
> }}}
>
> OpenNebula had problems monitoring this host on Jul 31st. However, it is
> now able to monitor the host successfully, as indicated in the oned.log
> excerpt below:
>
> {{{
>
> Thu Aug  2 11:04:39 2012 [InM][D]: Host 0 successfully monitored.
> Thu Aug  2 11:05:05 2012 [ReM][D]: HostPoolInfo method invoked
> Thu Aug  2 11:05:05 2012 [InM][I]: Monitoring host kvm-04 (1)
> Thu Aug  2 11:05:05 2012 [ReM][D]: VirtualMachinePoolInfo method invoked
> Thu Aug  2 11:05:05 2012 [ReM][D]: AclInfo method invoked
> Thu Aug  2 11:05:05 2012 [ReM][D]: HostInfo method invoked
> Thu Aug  2 11:05:09 2012 [InM][I]: ExitCode: 0
> Thu Aug  2 11:05:09 2012 [InM][D]: Host 1 successfully monitored.
> Thu Aug  2 11:05:35 2012 [ReM][D]: HostPoolInfo method invoked
> Thu Aug  2 11:05:35 2012 [ReM][D]: HostInfo method invoked
> Thu Aug  2 11:05:35 2012 [ReM][D]: VirtualMachinePoolInfo method invoked
> Thu Aug  2 11:05:35 2012 [ReM][D]: AclInfo method invoked
> Thu Aug  2 11:05:50 2012 [InM][I]: Monitoring host kvm-03 (0)
> Thu Aug  2 11:05:50 2012 [InM][I]: Monitoring host kvm-02 (3)
> Thu Aug  2 11:05:54 2012 [InM][I]: ExitCode: 0
> Thu Aug  2 11:05:54 2012 [InM][D]: Host 3 successfully monitored.
> Thu Aug  2 11:05:54 2012 [InM][I]: ExitCode: 0
> Thu Aug  2 11:05:54 2012 [InM][D]: Host 0 successfully monitored.
>
> }}}
>
>
> Although the 'onehost show' command displays monitoring information, it also
> displays 'ERROR  = Error monitoring host 1'. So is it an error message or an
> informative message about the last monitoring error? Is the host still being
> treated as being in an error state? Any help?
>
>
> --
> Thanks,
> Shantanu
>
>
> _______________________________________________
> Users mailing list
> Users at lists.opennebula.org
> http://lists.opennebula.org/listinfo.cgi/users-opennebula.org
>



-- 
Ruben S. Montero, PhD
Project co-Lead and Chief Architect
OpenNebula - The Open Source Solution for Data Center Virtualization
www.OpenNebula.org | rsmontero at opennebula.org | @OpenNebula