[one-users] Opennebula node (CentOS, Xen host) saying Error executing collectd-client.rb when creating a host.
Chris Gerard
Chris at hubu.eu
Tue Jul 15 04:50:46 PDT 2014
Hello, I am facing a problem while creating an host on opennebula.
I followed the official documentation and achieved it in the past, on
the very same machine
(http://docs.opennebula.org/4.6/design_and_installation/quick_starts/qs_centos_xen.html#qs-centos-xen)
without having this problem.
After following this doc, I created ssh keys on the frontend and copied
them on the node (yes, I can connect from the frontend to the node via
ssh using oneadmin's login without having a password prompt), then I
edited Xen's configuration in order to enable NAT (my server provider
has MAC filtering routers).
Now, more about this error, this is what /var/log/one/oned.log says on
the frontend's server:
[oneadmin at GeorgeOrwell .ssh]$ tail -f /var/log/one/oned.log
Tue Jul 15 13:34:16 2014 [InM][D]: Monitoring datastore files (2)
Tue Jul 15 13:34:16 2014 [ImM][D]: Datastore default (1)
successfully monitored.
Tue Jul 15 13:34:16 2014 [ImM][D]: Datastore files (2) successfully
monitored.
Tue Jul 15 13:34:28 2014 [ReM][D]: Req:6544 UID:0 HostAllocate
invoked, "MarieCurie.hubu.eu", "xen", "xen", "dummy", -1
Tue Jul 15 13:34:28 2014 [ReM][D]: Req:6544 UID:0 HostAllocate
result SUCCESS, 6
Tue Jul 15 13:34:31 2014 [InM][D]: Monitoring host
MarieCurie.hubu.eu (6)
Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
VirtualMachinePoolInfo invoked, -2, -1, -1, -1
Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
VirtualMachinePoolInfo invoked, -2, -1, -1, -1
Tue Jul 15 13:34:33 2014 [ReM][D]: Req:1888 UID:0
VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
Tue Jul 15 13:34:41 2014 [AuM][D]: Message received: AUTHENTICATE
SUCCESS 14 -
Tue Jul 15 13:34:41 2014 [ReM][D]: Req:3744 UID:0 HostPoolInfo invoked
Tue Jul 15 13:34:41 2014 [ReM][D]: Req:3744 UID:0 HostPoolInfo
result SUCCESS, "<HOST_POOL><HOST><ID..."
Tue Jul 15 13:34:50 2014 [InM][I]: Command execution fail: 'if [ -x
"/var/tmp/one/im/run_probes" ]; then /var/tmp/one/im/run_probes xen4
/var/lib/one//datastores 4124 20 6 MarieCurie.hubu.eu;
else exit 42; fi'
Tue Jul 15 13:34:50 2014 [InM][I]: cat:
/tmp/one-collectd-client.pid: No such file or directory
Tue Jul 15 13:34:50 2014 [InM][I]: /var/tmp/one/im/run_probes: line
34: 4369 Aborted ./$i $ARGUMENTS
Tue Jul 15 13:34:50 2014 [InM][E]: Error executing collectd-client.rb
Tue Jul 15 13:34:50 2014 [InM][I]: ExitCode: 134
After facing this, first yesterday (then I formatted today, just in case
and faced the same problem on a fresh install), I asked on IRC and
jfontan asked me to try this:
bash -x /var/tmp/one/im/run_probes xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu
Which gave this output:
[oneadmin at MarieCurie /]$ bash -x /var/tmp/one/im/run_probes xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu
++ dirname /var/tmp/one/im/run_probes
+ source /var/tmp/one/im/../scripts_common.sh
++ export LANG=C
++ LANG=C
++ export PATH=/bin:/sbin:/usr/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin
++ PATH=/bin:/sbin:/usr/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin
++ AWK=awk
++ BASH=bash
++ CUT=cut
++ DATE=date
++ DD=dd
++ DF=df
++ DU=du
++ GREP=grep
++ ISCSIADM=iscsiadm
++ LVCREATE=lvcreate
++ LVREMOVE=lvremove
++ LVRENAME=lvrename
++ LVS=lvs
++ LN=ln
++ MD5SUM=md5sum
++ MKFS=mkfs
++ MKISOFS=genisoimage
++ MKSWAP=mkswap
++ QEMU_IMG=qemu-img
++ RADOS=rados
++ RBD=rbd
++ READLINK=readlink
++ RM=rm
++ SCP=scp
++ SED=sed
++ SSH=ssh
++ SUDO=sudo
++ SYNC=sync
++ TAR=tar
++ TGTADM=tgtadm
++ TGTADMIN=tgt-admin
++ TGTSETUPLUN=tgt-setup-lun-one
++ TR=tr
++ VGDISPLAY=vgdisplay
++ VMKFSTOOLS=vmkfstools
++ WGET=wget
+++ uname -s
++ '[' xLinux = xLinux ']'
++ SED='sed -r'
+++ basename /var/tmp/one/im/run_probes
++ SCRIPT_NAME=run_probes
+ export LANG=C
+ LANG=C
+ HYPERVISOR_DIR=xen4.d
+ ARGUMENTS='xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu'
++ dirname /var/tmp/one/im/run_probes
+ SCRIPTS_DIR=/var/tmp/one/im
+ cd /var/tmp/one/im
++ '[' -d xen4.d ']'
++ run_dir xen4.d
++ cd xen4.d
+++ ls collectd-client.rb collectd-client_control.sh
++ for i in '`ls *`'
++ '[' -x collectd-client.rb ']'
++ ./collectd-client.rb xen4 /var/lib/one//datastores 4124 20 4 MarieCurie.hubu.eu
/var/tmp/one/im/run_probes: line 34: 12948 Aborted ./$i $ARGUMENTS
++ EXIT_CODE=134
++ '[' x134 '!=' x0 ']'
++ error_message 'Error executing collectd-client.rb'
++ echo 'ERROR MESSAGE --8<------'
ERROR MESSAGE --8<------
++ echo 'Error executing collectd-client.rb'
Error executing collectd-client.rb
++ echo 'ERROR MESSAGE ------>8--'
ERROR MESSAGE ------>8--
++ exit 134
+ data=
+ EXIT_CODE=134
+ echo ''
+ exit 134
I must admit that I'm quite lost here (and frustrated, since it worked
on this very same machine last week but I decided to format since I
played around a bit with it...) and have no idea about what could cause it.
Any help would be appreciated, thanks in advance and have a nice day.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opennebula.org/pipermail/users-opennebula.org/attachments/20140715/571894f8/attachment.htm>
More information about the Users
mailing list