[one-users] error monitoring host - even though logs indicate success
Shantanu Pavgi
pavgi at uab.edu
Thu Aug 2 11:34:28 PDT 2012
I am getting the following monitoring error on one of my hosts, as shown below:
{{{
$ onehost show 1
HOST 1 INFORMATION
ID : 1
NAME : kvm-04
STATE : MONITORED
IM_MAD : im_kvm
VM_MAD : vmm_kvm
VN_MAD : dummy
TM_MAD : tm_shared
LAST MONITORING TIME : 1343923509
HOST SHARES
MAX MEM : 49409948
USED MEM (REAL) : 9608032
USED MEM (ALLOCATED) : 16777216
MAX CPU : 2400
USED CPU (REAL) : 4
USED CPU (ALLOCATED) : 600
MAX DISK : 0
USED DISK (REAL) : 0
USED DISK (ALLOCATED) : 0
RUNNING VMS : 7
MONITORING INFORMATION
ARCH=x86_64
CPUSPEED=2660
ERROR=[
MESSAGE="Error monitoring host 1 : MONITOR FAILURE 1 -
",
TIMESTAMP="Tue Jul 31 12:30:31 2012" ]
FREECPU=2395.2
FREEMEMORY=39801916
HOSTNAME=kvm-04.uabgrid.uab.edu
HYPERVISOR=kvm
MODELNAME="Intel(R) Xeon(R) CPU X5650 @ 2.67GHz"
NETRX=0
NETTX=0
TOTALCPU=2400
TOTALMEMORY=49409948
USEDCPU=4.80000000000018
USEDMEMORY=9608032
}}}
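The LAST MONITORING TIME and the ERROR timestamp in the output above use different formats, which makes them hard to compare by eye. Here is a minimal sketch of the comparison, assuming LAST MONITORING TIME is seconds since the Unix epoch and that the ERROR timestamp (which carries no time zone) can be read as UTC. The variable names are illustrative only. The two values are more than two days apart, so the time zone assumption does not change the result.

```python
from datetime import datetime, timezone

# Values copied from the 'onehost show 1' output.
last_monitoring_epoch = 1343923509
error_timestamp = "Tue Jul 31 12:30:31 2012"

# Assumption: LAST MONITORING TIME is seconds since the Unix epoch.
last_monitored = datetime.fromtimestamp(last_monitoring_epoch, tz=timezone.utc)

# Assumption: the ERROR timestamp has no zone information, so it is read as UTC.
error_time = datetime.strptime(
    error_timestamp, "%a %b %d %H:%M:%S %Y"
).replace(tzinfo=timezone.utc)

# True when the recorded error is older than the most recent monitoring pass.
error_is_stale = error_time < last_monitored

print("last monitored (UTC):", last_monitored.strftime("%Y-%m-%d %H:%M:%S"))
print("error recorded      :", error_time.strftime("%Y-%m-%d %H:%M:%S"))
print("error predates last monitoring:", error_is_stale)
```

With these values the error predates the last monitoring pass, which suggests the ERROR attribute may be a leftover from the earlier failure rather than a current condition.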
OpenNebula had problems monitoring this host on Jul 31st. However, it is now able to monitor the host successfully, as indicated in the oned.log excerpt below:
{{{
Thu Aug 2 11:04:39 2012 [InM][D]: Host 0 successfully monitored.
Thu Aug 2 11:05:05 2012 [ReM][D]: HostPoolInfo method invoked
Thu Aug 2 11:05:05 2012 [InM][I]: Monitoring host kvm-04 (1)
Thu Aug 2 11:05:05 2012 [ReM][D]: VirtualMachinePoolInfo method invoked
Thu Aug 2 11:05:05 2012 [ReM][D]: AclInfo method invoked
Thu Aug 2 11:05:05 2012 [ReM][D]: HostInfo method invoked
Thu Aug 2 11:05:09 2012 [InM][I]: ExitCode: 0
Thu Aug 2 11:05:09 2012 [InM][D]: Host 1 successfully monitored.
Thu Aug 2 11:05:35 2012 [ReM][D]: HostPoolInfo method invoked
Thu Aug 2 11:05:35 2012 [ReM][D]: HostInfo method invoked
Thu Aug 2 11:05:35 2012 [ReM][D]: VirtualMachinePoolInfo method invoked
Thu Aug 2 11:05:35 2012 [ReM][D]: AclInfo method invoked
Thu Aug 2 11:05:50 2012 [InM][I]: Monitoring host kvm-03 (0)
Thu Aug 2 11:05:50 2012 [InM][I]: Monitoring host kvm-02 (3)
Thu Aug 2 11:05:54 2012 [InM][I]: ExitCode: 0
Thu Aug 2 11:05:54 2012 [InM][D]: Host 3 successfully monitored.
Thu Aug 2 11:05:54 2012 [InM][I]: ExitCode: 0
Thu Aug 2 11:05:54 2012 [InM][D]: Host 0 successfully monitored.
}}}
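In a longer oned.log, the most recent successful monitoring pass for each host can be picked out with a short script. This is a minimal sketch that only matches the "successfully monitored" line format seen in the excerpt above. The function name and the sample list are illustrative, and the format of failure lines is not assumed.

```python
import re

# A few lines copied from the oned.log excerpt.
log_lines = [
    "Thu Aug 2 11:05:05 2012 [InM][I]: Monitoring host kvm-04 (1)",
    "Thu Aug 2 11:05:09 2012 [InM][I]: ExitCode: 0",
    "Thu Aug 2 11:05:09 2012 [InM][D]: Host 1 successfully monitored.",
    "Thu Aug 2 11:05:54 2012 [InM][D]: Host 3 successfully monitored.",
    "Thu Aug 2 11:05:54 2012 [InM][D]: Host 0 successfully monitored.",
]

# Matches only the success message format shown in the excerpt.
SUCCESS = re.compile(
    r"^(?P<when>.+?) \[InM\]\[D\]: Host (?P<host>\d+) successfully monitored\.$"
)

def last_success_by_host(lines):
    """Return {host_id: timestamp string} for the latest success line per host."""
    latest = {}
    for line in lines:
        match = SUCCESS.match(line)
        if match:
            # Later lines overwrite earlier ones, so the last success wins.
            latest[int(match.group("host"))] = match.group("when")
    return latest

result = last_success_by_host(log_lines)
for host_id in sorted(result):
    print(host_id, result[host_id])
```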
Although the 'onehost show' command displays current monitoring information, it also displays 'ERROR = Error monitoring host 1'. Is this an active error, or an informational message about the last monitoring failure? Is the host still being treated as being in an error state? Any help would be appreciated.
--
Thanks,
Shantanu