[one-users] Error monitoring VM, Error executing sudo /usr/sbin/xentop -bi2
María Pérez
maria at caton.es
Wed Jun 22 00:50:51 PDT 2011
Hi Carlos,
I have checked the directory /var/tmp/one/vmm/xen in the nodes and
/$ONE_LOCATION/var/remotes/vmm/xen in the front-end, the content of both
is the same. This means that the update has been made correctly in the
nodes.
I have looked in log file the error before the update and it is
different. The messages are the following, in case they can help.
*BEFORE*
Error executing sudo /usr/sbin/xentop -bi2
/var/tmp/one/vmm/xen/poll:186:in `[]': can't convert String into Integer
(TypeError)
from /var/tmp/one/vmm/xen/poll:186:in `get_vm_info'
from /var/tmp/one/vmm/xen/poll:314:in `print_one_vm_info'
from /var/tmp/one/vmm/xen/poll:348
ExitCode: 1
Error monitoring VM, Error executing sudo /usr/sbin/xentop -bi2
*AFTER*
Error executing sudo /usr/sbin/xentop -bi2
/var/tmp/one/vmm/xen/poll:186:in `get_vm_info': undefined method `[]'
for nil:NilClass (NoMethodError)
from /var/tmp/one/vmm/xen/poll:314:in `print_one_vm_info'
from /var/tmp/one/vmm/xen/poll:348
ExitCode: 1
Error monitoring VM, Error executing sudo /usr/sbin/xentop -bi2
You can say to me if I can test some to help in this.
Thank you very much in advance.
Regards,
María
El 21/06/11 13:54, Carlos Martín Sánchez escribió:
> Hi,
>
> Maybe the problem is that OpenNebula copies updated files when the
> monitorization takes place.
> Could you please check that /var/tmp/one/vmm/xen/poll in your remote
> host has the updated code?
>
> Anyway, we are going to test this issue in our local infrastructure,
> hopefully we'll come up with a fix soon.
>
> Regards.
> --
> Carlos Martín, MSc
> Project Major Contributor
> OpenNebula - The Open Source Toolkit for Cloud Computing
> www.OpenNebula.org <http://www.opennebula.org/> |
> cmartin at opennebula.org <mailto:cmartin at opennebula.org>
>
>
> 2011/6/20 María Pérez <maria at caton.es <mailto:maria at caton.es>>
>
>
> Hi Carlos,
>
> Before anything else, sorry for the delay in my answer.I have been
> very busy this past month and I haven't been able to test the
> patch that you prepared to solve the monitoring problem.
>
> Finally last week I tested the patch but the monitoring problems
> continue although the machines no longer happen to unknown state.
> I have executed from line of commands but the messages are the
> same that they appear registered in logs.
>
> [oneadmin at dn07 ~]$ /var/tmp/one/vmm/xen/poll one-28
>
> Error executing sudo /usr/sbin/xentop -bi2
> /var/tmp/one/vmm/xen/poll:186:in `get_vm_info': undefined method
> `[]' for nil:NilClass (NoMethodError)
> from /var/tmp/one/vmm/xen/poll:314:in `print_one_vm_info'
> from /var/tmp/one/vmm/xen/poll:348
>
>
> Can you think of any other thing that I could test?
>
> Thank you very much in advance.
> BR,
>
> María
>
>
>
> El 20/05/11 15:52, Carlos Martín Sánchez escribió:
>> Hi again,
>>
>> The script was discarding the first lines, because other versions
>> include some information about the system before the list of domains.
>> Please try the patch attached in this new ticket:
>> http://dev.opennebula.org/issues/656
>>
>> Thank you,
>> Carlos.
>>
>> --
>> Carlos Martín, MSc
>> Project Major Contributor
>> OpenNebula - The Open Source Toolkit for Cloud Computing
>> www.OpenNebula.org <http://www.opennebula.org/> |
>> cmartin at opennebula.org <mailto:cmartin at opennebula.org>
>>
>>
>> 2011/5/20 María Pérez <maria at caton.es <mailto:maria at caton.es>>
>>
>>
>> Hi Carlos,
>>
>> The output of 'sudo /usr/sbin/xentop -bi2' is:
>>
>> [oneadmin at dn08 ~]$ sudo /usr/sbin/xentop -bi2
>> NAME STATE CPU(sec) CPU(%) MEM(k) MEM(%)
>> MAXMEM(k) MAXMEM(%) VCPUS NETS NETTX(k) NETRX(k) VBDS
>> VBD_OO VBD_RD VBD_WR SSID
>> Domain-0 -----r 51 0.0 15532060 92.6 no
>> limit n/a 8 5 0 0 0
>> 0 0 0 0
>> one-44 --b--- 1 0.0 1048396 6.2
>> 1048576 6.3 1 1 1 15680 2
>> 0 68 253 0
>> NAME STATE CPU(sec) CPU(%) MEM(k) MEM(%)
>> MAXMEM(k) MAXMEM(%) VCPUS NETS NETTX(k) NETRX(k) VBDS
>> VBD_OO VBD_RD VBD_WR SSID
>> Domain-0 -----r 51 0.5 15532276 92.6 no
>> limit n/a 8 5 0 0 0
>> 0 0 0 0
>> one-44 --b--- 1 0.1 1048404 6.2
>> 1048576 6.3 1 1 1 15705 2
>> 0 68 253 0
>>
>>
>> This error causes that the VM is not monitored correctly,
>> also the machines happen to unknown state. Do you think that
>> this error will have something to do with the unknown state
>> of the VM?
>>
>>
>> Thank you very much.
>> Regards,
>> María
>>
>> El 20/05/11 14:34, Carlos Martín Sánchez escribió:
>>> Hi María,
>>>
>>> It looks like the script is not getting the expected output
>>> format and fails to parse it.
>>> Could you please copy the output of 'sudo /usr/sbin/xentop
>>> -bi2', logged as oneadmin in the Host?
>>>
>>> Regards,
>>> Carlos.
>>> --
>>> Carlos Martín, MSc
>>> Project Major Contributor
>>> OpenNebula - The Open Source Toolkit for Cloud Computing
>>> www.OpenNebula.org <http://www.opennebula.org/> |
>>> cmartin at opennebula.org <mailto:cmartin at opennebula.org>
>>>
>>>
>>> On Wed, May 18, 2011 at 3:07 PM, María Pérez <maria at caton.es
>>> <mailto:maria at caton.es>> wrote:
>>>
>>> Hi again,
>>>
>>> I have another problem with the monitoring of the
>>> machines and the execution of /var/tmp/one/vmm/xen/poll.
>>>
>>> The log file contains the following lines with the error:
>>>
>>>
>>> Command execution fail: 'if [ -x
>>> "/var/tmp/one/vmm/xen/poll" ]; then
>>> /var/tmp/one/vmm/xen/poll one-28;
>>> else exit 42; fi'
>>> Wed May 18 14:48:56 2011 [VMM][I]: STDERR follows.
>>> Wed May 18 14:48:56 2011 [VMM][I]: Error executing sudo
>>> /usr/sbin/xentop -bi2
>>> Wed May 18 14:48:56 2011 [VMM][I]:
>>> /var/tmp/one/vmm/xen/poll:186:in `[]': can't convert
>>> String into Integer (TypeError)
>>> Wed May 18 14:48:56 2011 [VMM][I]: from
>>> /var/tmp/one/vmm/xen/poll:186:in `get_vm_info'
>>> Wed May 18 14:48:56 2011 [VMM][I]: from
>>> /var/tmp/one/vmm/xen/poll:310:in `print_one_vm_info'
>>> Wed May 18 14:48:56 2011 [VMM][I]: from
>>> /var/tmp/one/vmm/xen/poll:344
>>> Wed May 18 14:48:56 2011 [VMM][I]: ExitCode: 1
>>> Wed May 18 14:48:56 2011 [VMM][E]: Error monitoring VM,
>>> Error executing sudo /usr/sbin/xentop -bi2
>>>
>>>
>>> I have tried to execute the commando from the own node
>>> to discard problems of ssh, but the error message is the
>>> same.
>>>
>>> I have found the following information
>>>
>>> http://dev.opennebula.org/issues/567
>>>
>>> but this patches no solved this problem. I have
>>> installed opennebula 2,2 and the nodes are xen 3.4.3
>>>
>>> Any idea? Thank you very much by your attention.
>>>
>>> Regards,
>>>
>>> María
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> Users mailing list
>>> Users at lists.opennebula.org
>>> <mailto:Users at lists.opennebula.org>
>>> http://lists.opennebula.org/listinfo.cgi/users-opennebula.org
>>>
>>>
>>
>>
>>
>>
>
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opennebula.org/pipermail/users-opennebula.org/attachments/20110622/95d0ad62/attachment-0003.htm>
More information about the Users
mailing list