[one-users] Opennebula node (CentOS, Xen host) saying Error executing collectd-client.rb when creating a host.

Chris Gerard Chris at hubu.eu
Tue Jul 15 04:50:46 PDT 2014


Hello, I am facing a problem while creating an host on opennebula.
I followed the official documentation and achieved it in the past, on 
the very same machine 
(http://docs.opennebula.org/4.6/design_and_installation/quick_starts/qs_centos_xen.html#qs-centos-xen) 
without having this problem.

After following this doc, I created ssh keys on the frontend and copied 
them on the node (yes, I can connect from the frontend to the node via 
ssh using oneadmin's login without having a password prompt), then I 
edited Xen's configuration in order to enable NAT (my server provider 
has MAC filtering routers).

Now, more about this error, this is what /var/log/one/oned.log says on 
the frontend's server:

    [oneadmin at GeorgeOrwell .ssh]$ tail -f /var/log/one/oned.log
    Tue Jul 15 13:34:16 2014 [InM][D]: Monitoring datastore files (2)
    Tue Jul 15 13:34:16 2014 [ImM][D]: Datastore default (1)
    successfully monitored.
    Tue Jul 15 13:34:16 2014 [ImM][D]: Datastore files (2) successfully
    monitored.
    Tue Jul 15 13:34:28 2014 [ReM][D]: Req:6544 UID:0 HostAllocate
    invoked, "MarieCurie.hubu.eu", "xen", "xen", "dummy", -1
    Tue Jul 15 13:34:28 2014 [ReM][D]: Req:6544 UID:0 HostAllocate
    result SUCCESS, 6
    Tue Jul 15 13:34:31 2014 [InM][D]: Monitoring host
    MarieCurie.hubu.eu (6)
    Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
    VirtualMachinePoolInfo invoked, -2, -1, -1, -1
    Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
    VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
    Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
    VirtualMachinePoolInfo invoked, -2, -1, -1, -1
    Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
    VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
    Tue Jul 15 13:34:41 2014 [AuM][D]: Message received: AUTHENTICATE
    SUCCESS 14 -

    Tue Jul 15 13:34:41 2014 [ReM][D]: Req:3744 UID:0 HostPoolInfo invoked
    Tue Jul 15 13:34:41 2014 [ReM][D]: Req:3744 UID:0 HostPoolInfo
    result SUCCESS, "<HOST_POOL><HOST><ID..."
    Tue Jul 15 13:34:50 2014 [InM][I]: Command execution fail: 'if [ -x
    "/var/tmp/one/im/run_probes" ]; then /var/tmp/one/im/run_probes xen4
    /var/lib/one//datastores 4124 20 6 MarieCurie.hubu.eu;
    else                              exit 42; fi'
    Tue Jul 15 13:34:50 2014 [InM][I]: cat:
    /tmp/one-collectd-client.pid: No such file or directory
    Tue Jul 15 13:34:50 2014 [InM][I]: /var/tmp/one/im/run_probes: line
    34:  4369 Aborted                 ./$i $ARGUMENTS
    Tue Jul 15 13:34:50 2014 [InM][E]: Error executing collectd-client.rb
    Tue Jul 15 13:34:50 2014 [InM][I]: ExitCode: 134


After facing this, first yesterday (then I formatted today, just in case 
and faced the same problem on a fresh install), I asked on IRC and 
jfontan asked me to try this:

    bash -x /var/tmp/one/im/run_probes xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu

Which gave this output:

    [oneadmin at MarieCurie /]$ bash -x /var/tmp/one/im/run_probes xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu
    ++ dirname /var/tmp/one/im/run_probes
    + source /var/tmp/one/im/../scripts_common.sh
    ++ export LANG=C
    ++ LANG=C
    ++ export PATH=/bin:/sbin:/usr/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin
    ++ PATH=/bin:/sbin:/usr/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin
    ++ AWK=awk
    ++ BASH=bash
    ++ CUT=cut
    ++ DATE=date
    ++ DD=dd
    ++ DF=df
    ++ DU=du
    ++ GREP=grep
    ++ ISCSIADM=iscsiadm
    ++ LVCREATE=lvcreate
    ++ LVREMOVE=lvremove
    ++ LVRENAME=lvrename
    ++ LVS=lvs
    ++ LN=ln
    ++ MD5SUM=md5sum
    ++ MKFS=mkfs
    ++ MKISOFS=genisoimage
    ++ MKSWAP=mkswap
    ++ QEMU_IMG=qemu-img
    ++ RADOS=rados
    ++ RBD=rbd
    ++ READLINK=readlink
    ++ RM=rm
    ++ SCP=scp
    ++ SED=sed
    ++ SSH=ssh
    ++ SUDO=sudo
    ++ SYNC=sync
    ++ TAR=tar
    ++ TGTADM=tgtadm
    ++ TGTADMIN=tgt-admin
    ++ TGTSETUPLUN=tgt-setup-lun-one
    ++ TR=tr
    ++ VGDISPLAY=vgdisplay
    ++ VMKFSTOOLS=vmkfstools
    ++ WGET=wget
    +++ uname -s
    ++ '[' xLinux = xLinux ']'
    ++ SED='sed -r'
    +++ basename /var/tmp/one/im/run_probes
    ++ SCRIPT_NAME=run_probes
    + export LANG=C
    + LANG=C
    + HYPERVISOR_DIR=xen4.d
    + ARGUMENTS='xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu'
    ++ dirname /var/tmp/one/im/run_probes
    + SCRIPTS_DIR=/var/tmp/one/im
    + cd /var/tmp/one/im
    ++ '[' -d xen4.d ']'
    ++ run_dir xen4.d
    ++ cd xen4.d
    +++ ls collectd-client.rb collectd-client_control.sh
    ++ for i in '`ls *`'
    ++ '[' -x collectd-client.rb ']'
    ++ ./collectd-client.rb xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu
    /var/tmp/one/im/run_probes: line 34: 12948 Aborted                 ./$i $ARGUMENTS
    ++ EXIT_CODE=134
    ++ '[' x134 '!=' x0 ']'
    ++ error_message 'Error executing collectd-client.rb'
    ++ echo 'ERROR MESSAGE --8<------'
    ERROR MESSAGE --8<------
    ++ echo 'Error executing collectd-client.rb'
    Error executing collectd-client.rb
    ++ echo 'ERROR MESSAGE ------>8--'
    ERROR MESSAGE ------>8--
    ++ exit 134
    + data=
    + EXIT_CODE=134
    + echo ''

    + exit 134

I must admit that I'm quite lost here (and frustrated, since it worked 
on this very same machine last week but I decided to format since I 
played around a bit with it...) and have no idea about what could cause it.

Any help would be appreciated, thanks in advance and have a nice day.



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opennebula.org/pipermail/users-opennebula.org/attachments/20140715/571894f8/attachment.htm>


More information about the Users mailing list