[one-users] error monitoring host - even though logs indicate success

Shantanu Pavgi pavgi at uab.edu
Thu Aug 2 11:34:28 PDT 2012


I am getting the following monitoring error on one of my hosts, as shown below:

{{{

$ onehost show 1
HOST 1 INFORMATION                                                              
ID                    : 1                   
NAME                  : kvm-04            
STATE                 : MONITORED           
IM_MAD                : im_kvm              
VM_MAD                : vmm_kvm             
VN_MAD                : dummy               
TM_MAD                : tm_shared           
LAST MONITORING TIME  : 1343923509          

HOST SHARES                                                                     
MAX MEM               : 49409948            
USED MEM (REAL)       : 9608032             
USED MEM (ALLOCATED)  : 16777216            
MAX CPU               : 2400                
USED CPU (REAL)       : 4                   
USED CPU (ALLOCATED)  : 600                 
MAX DISK              : 0                   
USED DISK (REAL)      : 0                   
USED DISK (ALLOCATED) : 0                   
RUNNING VMS           : 7                   

MONITORING INFORMATION                                                          
ARCH=x86_64
CPUSPEED=2660
ERROR=[
  MESSAGE="Error monitoring host 1 : MONITOR FAILURE 1 -
",
  TIMESTAMP="Tue Jul 31 12:30:31 2012" ]
FREECPU=2395.2
FREEMEMORY=39801916
HOSTNAME=kvm-04.uabgrid.uab.edu
HYPERVISOR=kvm
MODELNAME="Intel(R) Xeon(R) CPU X5650 @ 2.67GHz"
NETRX=0
NETTX=0
TOTALCPU=2400
TOTALMEMORY=49409948
USEDCPU=4.80000000000018
USEDMEMORY=9608032

}}}
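As a quick sanity check on the output above, here is a minimal sketch that converts the LAST MONITORING TIME value (a Unix epoch) to a readable date and compares it with the TIMESTAMP inside the ERROR attribute. The time zone is an assumption on my part (US Central Daylight Time, UTC-5); the output itself does not state one, so adjust `LOCAL_TZ` as needed.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the front-end clock is US Central Daylight Time (UTC-5).
# The 'onehost show' output does not state a time zone.
LOCAL_TZ = timezone(timedelta(hours=-5))

# Values copied from the 'onehost show 1' output.
last_monitoring_epoch = 1343923509            # LAST MONITORING TIME
error_timestamp = "Tue Jul 31 12:30:31 2012"  # TIMESTAMP in the ERROR attribute

# Convert the epoch value to a time-zone-aware datetime.
last_monitored = datetime.fromtimestamp(last_monitoring_epoch, tz=LOCAL_TZ)

# Parse the ctime-style error timestamp and attach the same assumed zone.
error_time = datetime.strptime(
    error_timestamp, "%a %b %d %H:%M:%S %Y"
).replace(tzinfo=LOCAL_TZ)

# True when the recorded error is older than the most recent monitoring pass.
error_predates_last_poll = error_time < last_monitored

print("last monitored:", last_monitored.strftime("%a %b %d %H:%M:%S %Y"))
print("error recorded:", error_time.strftime("%a %b %d %H:%M:%S %Y"))
print("error predates last poll:", error_predates_last_poll)
```

Under that time zone assumption, the recorded error is roughly two days older than the last monitoring time, which would suggest the ERROR attribute is a leftover from the earlier failure. That is only my reading of the timestamps, not confirmed OpenNebula behavior.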

OpenNebula had problems monitoring this host on July 31st. However, it is now able to monitor the host successfully, as indicated in the oned.log excerpt below:

{{{

Thu Aug  2 11:04:39 2012 [InM][D]: Host 0 successfully monitored.
Thu Aug  2 11:05:05 2012 [ReM][D]: HostPoolInfo method invoked
Thu Aug  2 11:05:05 2012 [InM][I]: Monitoring host kvm-04 (1)
Thu Aug  2 11:05:05 2012 [ReM][D]: VirtualMachinePoolInfo method invoked
Thu Aug  2 11:05:05 2012 [ReM][D]: AclInfo method invoked
Thu Aug  2 11:05:05 2012 [ReM][D]: HostInfo method invoked
Thu Aug  2 11:05:09 2012 [InM][I]: ExitCode: 0
Thu Aug  2 11:05:09 2012 [InM][D]: Host 1 successfully monitored.
Thu Aug  2 11:05:35 2012 [ReM][D]: HostPoolInfo method invoked
Thu Aug  2 11:05:35 2012 [ReM][D]: HostInfo method invoked
Thu Aug  2 11:05:35 2012 [ReM][D]: VirtualMachinePoolInfo method invoked
Thu Aug  2 11:05:35 2012 [ReM][D]: AclInfo method invoked
Thu Aug  2 11:05:50 2012 [InM][I]: Monitoring host kvm-03 (0)
Thu Aug  2 11:05:50 2012 [InM][I]: Monitoring host kvm-02 (3)
Thu Aug  2 11:05:54 2012 [InM][I]: ExitCode: 0
Thu Aug  2 11:05:54 2012 [InM][D]: Host 3 successfully monitored.
Thu Aug  2 11:05:54 2012 [InM][I]: ExitCode: 0
Thu Aug  2 11:05:54 2012 [InM][D]: Host 0 successfully monitored.

}}}
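To pull the most recent successful poll per host out of a log like the one above, a small script along these lines could be used. This is only a sketch: the regular expression assumes the line format shown in this excerpt (a 24-character ctime-style timestamp followed by the `[InM][D]` tag), and `last_success_by_host` is a helper name I made up, not part of OpenNebula.

```python
import re

# Sample lines copied from the oned.log excerpt above.
LOG = """\
Thu Aug  2 11:04:39 2012 [InM][D]: Host 0 successfully monitored.
Thu Aug  2 11:05:05 2012 [InM][I]: Monitoring host kvm-04 (1)
Thu Aug  2 11:05:09 2012 [InM][I]: ExitCode: 0
Thu Aug  2 11:05:09 2012 [InM][D]: Host 1 successfully monitored.
Thu Aug  2 11:05:54 2012 [InM][D]: Host 3 successfully monitored.
Thu Aug  2 11:05:54 2012 [InM][D]: Host 0 successfully monitored.
"""

# Assumed line format: 24-character timestamp, then the InM debug tag.
SUCCESS = re.compile(
    r"^(?P<ts>.{24}) \[InM\]\[D\]: Host (?P<id>\d+) successfully monitored\.$"
)


def last_success_by_host(log_text):
    """Return {host_id: timestamp string} for the latest success per host."""
    latest = {}
    for line in log_text.splitlines():
        match = SUCCESS.match(line)
        if match:
            # Later lines overwrite earlier ones, so the last match wins.
            latest[int(match.group("id"))] = match.group("ts")
    return latest


for host_id, ts in sorted(last_success_by_host(LOG).items()):
    print(f"host {host_id}: last successful monitoring at {ts}")
```

For a real check, the contents of the actual oned.log file would be passed in instead of the sample string.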


Although the 'onehost show' command displays current monitoring information, it also displays 'ERROR  = Error monitoring host 1'. Is this an active error, or just an informational record of the last monitoring failure? Is the host still being treated as if it were in an error state? Any help would be appreciated.


--
Thanks,
Shantanu




