[one-users] Breakage when upgrading from 4.38 to 4.4 ... [Resolved]
Gareth Bult
gareth at linux.co.uk
Thu Dec 5 09:29:40 PST 2013
OK, I'm not sure what the difference is between the versions, but this is a fix:
kvm-probes.d/monitor_ds.sh
--- monitor_ds.sh.orig 2013-12-05 17:06:35.251763742 +0000
+++ monitor_ds.sh 2013-12-05 16:55:11.699743037 +0000
@@ -31,6 +31,10 @@
     TOTAL_MB=$(df -B1M -P $dir 2>/dev/null | tail -n 1 | awk '{print $2}')
     FREE_MB=$(df -B1M -P $dir 2>/dev/null | tail -n 1 | awk '{print $4}')
+    USED_MB=${USED_MB:-"0"}
+    TOTAL_MB=${TOTAL_MB:-"0"}
+    FREE_MB=${FREE_MB:-"0"}
+
     if [ -n "$LVM_SIZE_CMD" ]; then
         LVM_SIZE=$($LVM_SIZE_CMD ${LVM_VG_PREFIX}${ds} 2>/dev/null)
         LVM_STATUS=$?
The host monitor seems to be calling a generic monitor which also gathers information about datastores.
In an environment (such as VDC) where the data lives remotely, the standard monitor (which uses 'du') isn't
going to provide any useful results. In this instance it errors and the script returns blanks, which breaks
the host monitoring. Returning '0' on failure fixes the issue ...
Any chance this could be inserted in the next release, please?
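For anyone wanting to try the idea outside the probe, here is a minimal sketch of the same fallback. The function name print_ds_block and the exact du invocation are assumptions made for illustration; only the df lines and the three defaults come from the diff itself.

```shell
#!/bin/sh
# Sketch only: print_ds_block is a hypothetical helper, not part of OpenNebula.
# It prints a DS = [ ... ] block for one directory and falls back to 0 when
# du/df produce no output (for example, the path does not exist on this host).
print_ds_block() {
    ds_id=$1
    dir=$2

    # Assumed du call; the two df calls mirror the context lines of the diff.
    USED_MB=$(du -sLm "$dir" 2>/dev/null | cut -f1)
    TOTAL_MB=$(df -B1M -P "$dir" 2>/dev/null | tail -n 1 | awk '{print $2}')
    FREE_MB=$(df -B1M -P "$dir" 2>/dev/null | tail -n 1 | awk '{print $4}')

    # The fix: never emit an empty value.
    USED_MB=${USED_MB:-"0"}
    TOTAL_MB=${TOTAL_MB:-"0"}
    FREE_MB=${FREE_MB:-"0"}

    echo "DS = ["
    echo "  ID = $ds_id,"
    echo "  USED_MB = $USED_MB,"
    echo "  TOTAL_MB = $TOTAL_MB,"
    echo "  FREE_MB = $FREE_MB"
    echo "]"
}

# A directory that does not exist locally stands in for a remote datastore.
print_ds_block 107 /nonexistent/datastore/107
```

With a missing directory, all three values come out as 0 rather than blank, so every assignment in the block has a right-hand side.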
--
Gareth Bult
“The odds of hitting your target go up dramatically when you aim at it.”
----- Original Message -----
From: "Gareth Bult" <gareth at linux.co.uk>
To: users at lists.opennebula.org
Sent: Thursday, 5 December, 2013 3:47:12 PM
Subject: [one-users] Breakage when upgrading from 4.38 to 4.4 ...
Hi, I've run the upgrade just to see if a number of issues I was having were fixed, and I seem to have
ended up with a fairly terminal issue whose cause I can't spot.
Host monitoring is now failing for hosts that previously seemed to be working. The reason seems to be
that the host monitor is somehow looking at the datastores, and this in turn is failing.
The datastore monitoring itself is correct.
The error in oned.log is as follows:
Thu Dec 5 15:24:47 2013 [ONE][E]: Error parsing host information: syntax error, unexpected EQUAL, expecting COMMA or CBRACKET at line 30, columns 494:497. Monitoring information:
ARCH=x86_64
MODELNAME="AMD Phenom(tm) II X6 1100T Processor"
HYPERVISOR=kvm
TOTALCPU=600
CPUSPEED=3300
TOTALMEMORY=16175216
USEDMEMORY=399320
FREEMEMORY=15775896
FREECPU=600.0
USEDCPU=0.0
NETRX=33791417902
NETTX=25290741468
DS_LOCATION_USED_MB=2637
DS_LOCATION_TOTAL_MB=40190
DS_LOCATION_FREE_MB=28202
DS = [
ID = 0,
USED_MB = 29,
TOTAL_MB = 40190,
FREE_MB = 28202
]
DS = [
ID = 1,
USED_MB = 2609,
TOTAL_MB = 40190,
FREE_MB = 28202
]
DS = [
ID = 107,
USED_MB = ,
TOTAL_MB = ,
FREE_MB =
]
DS = [
ID = 109,
USED_MB = ,
TOTAL_MB = ,
FREE_MB =
]
DS = [
ID = 114,
USED_MB = 1,
TOTAL_MB = 40190,
FREE_MB = 28202
]
DS = [
ID = 2,
USED_MB = 1,
TOTAL_MB = 40190,
FREE_MB = 28202
]
HOSTNAME=node1
VM_POLL=YES
VERSION="4.4.0"
So it would appear that it's calling datastore/monitor, and it's failing. However, the script does work and is reporting
the correct size in the datastore monitor. I've added syslog to the monitor script; it is being called and it is returning
the correct result ...
If I call the script on the host from the command line, with the base64 payload pasted in, this too gives the correct results:
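The parse error lines up with the DS blocks for IDs 107 and 109, where every value is blank. A rough way to spot such entries in a monitoring dump is sketched below; find_empty_values is a hypothetical helper written for this note, not an OpenNebula tool, and the sample text is copied from the dump above.

```shell
#!/bin/sh
# Sketch only: find_empty_values is a hypothetical helper.
# It reads monitoring output on stdin and prints, with line numbers, every
# assignment whose right-hand side is empty ("KEY = ," or "KEY =").
find_empty_values() {
    grep -nE '=[[:space:]]*,?[[:space:]]*$'
}

# One of the failing blocks from the dump.
sample='DS = [
  ID = 107,
  USED_MB = ,
  TOTAL_MB = ,
  FREE_MB =
]'

# Prints the three empty assignments (USED_MB, TOTAL_MB, FREE_MB).
printf '%s\n' "$sample" | find_empty_values
```

Any line this reports is one the parser in oned is likely to choke on, since it expects a value between the "=" and the following comma or closing bracket.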
oneadmin at nebula:~/remotes$ ssh node1 /var/lib/one/remotes/datastore/vdc/monitor "PERTX0RSSVZFUl9BQ1RJT05fREFUQT48REFUQVNUT1JFPjxJRD4xMDk8L0lEPjxVSUQ+MDwvVUlEPjxHSUQ+MTwvR0lEPjxVTkFNRT5vbmVhZG1pbjwvVU5BTUU+PEdOQU1FPnVzZXJzPC9HTkFNRT48TkFNRT5kYXRhMjwvTkFNRT48UEVSTUlTU0lPTlM+PE9XTkVSX1U+MTwvT1dORVJfVT48T1dORVJfTT4xPC9PV05FUl9NPjxPV05FUl9BPjA8L09XTkVSX0E+PEdST1VQX1U+MTwvR1JPVVBfVT48R1JPVVBfTT4wPC9HUk9VUF9NPjxHUk9VUF9BPjA8L0dST1VQX0E+PE9USEVSX1U+MTwvT1RIRVJfVT48T1RIRVJfTT4wPC9PVEhFUl9NPjxPVEhFUl9BPjA8L09USEVSX0E+PC9QRVJNSVNTSU9OUz48RFNfTUFEPnZkYzwvRFNfTUFEPjxUTV9NQUQ+dmRjPC9UTV9NQUQ+PEJBU0VfUEFUSD4vdmFyL2xpYi9vbmUvL2RhdGFzdG9yZXMvMTA5PC9CQVNFX1BBVEg+PFRZUEU+MDwvVFlQRT48RElTS19UWVBFPjA8L0RJU0tfVFlQRT48Q0xVU1RFUl9JRD4tMTwvQ0xVU1RFUl9JRD48Q0xVU1RFUj48L0NMVVNURVI+PFRPVEFMX01CPjE1MDAxMDE8L1RPVEFMX01CPjxGUkVFX01CPjk2MzIzMDwvRlJFRV9NQj48VVNFRF9NQj41MzY4NzE8L1VTRURfTUI+PElNQUdFUz48SUQ+MzE8L0lEPjxJRD4zMjwvSUQ+PElEPjYyPC9JRD48SUQ+ODQ8L0lEPjwvSU1BR0VTPjxURU1QTEFURT48Q0xPTkVfVEFSR0VUPjwhW0NEQVRBW1NZU1RFTV1dPjwvQ0xPTkVfVEFSR0VUPjxESVNLX1RZUEU+PCFbQ0RBVEFbRklMRV1dPjwvRElTS19UWVBFPjxEU19NQUQ+PCFbQ0RBVEFbdmRjXV0+PC9EU19NQUQ+PExOX1RBUkdFVD48IVtDREFUQVtTWVNURU1dXT48L0xOX1RBUkdFVD48TU9VTlRQT0lOVD48IVtDREFUQVsvdm9scy92bXNdXT48L01PVU5UUE9JTlQ+PFNBRkVfRElSUz48IVtDREFUQVsvdmFyL2xpYi9vbmUvaW1hZ2VzXV0+PC9TQUZFX0RJUlM+PFRNX01BRD48IVtDREFUQVt2ZGNdXT48L1RNX01BRD48VFlQRT48IVtDREFUQVtJTUFHRV9EU11dPjwvVFlQRT48VkdfTkFNRT48IVtDREFUQVt2b2xzXV0+PC9WR19OQU1FPjwvVEVNUExBVEU+PC9EQVRBU1RPUkU+PC9EU19EUklWRVJfQUNUSU9OX0RBVEE+" 109
TOTAL_MB=1500101.21
FREE_MB=963230.30
USED_MB=536871
Short of trying to dig through oned's source code I'm stuck. Can anyone help?
My datastore driver type is "vdc". Even if I insert a static script (I've also tried the monitor driver from "fs")
that just prints out "TOTAL_MB=0" etc., I still get the same issue ...
(And it was working fine before the upgrade.)
--
Gareth Bult
“The odds of hitting your target go up dramatically when you aim at it.”