[one-users] Help with VM launch

Robert Schweikert rschweikert at novell.com
Thu Jun 9 08:44:49 PDT 2011


Hi,

Still working on packaging things properly for openSUSE and trying to 
get OpenNebula running. I've managed to get most things up and running 
but am now stuck with a failure when I try to launch a VM.

I have one host registered:
# onhost list
   ID NAME              CLUSTER  RVM   TCPU   FCPU   ACPU    TMEM 
FMEM STAT
    0 192.168.1.5       default    0      0      0    100      0K 
0K  err

Not sure what to make of the "err" state/status

After I run

# onevm create testVM.vmd

The VM is created and shows as pending (sits there for quite a long 
time, thus I suspect there is something going on that I am not aware 
of). When I force the deployment

# onevm deploy 0 0

I end up with the fail state

# onevm list
    ID     USER     NAME STAT CPU     MEM        HOSTNAME        TIME
     0 oneadmin suse-tes fail   0      0K     192.168.1.5 00 00:22:47

The log contains the following information:

Thu Jun  9 10:58:23 2011 [ReM][D]: VirtualMachineDeploy invoked
Thu Jun  9 10:58:23 2011 [ReM][E]: [VirtualMachineDeploy] Error getting 
HOST [0].
....
Thu Jun  9 11:15:37 2011 [InM][I]: Command execution fail: scp -r 
/var/lib/one/remotes/. 192.168.1.5:/var/tmp/one
Thu Jun  9 11:15:37 2011 [InM][I]: STDERR follows.
Thu Jun  9 11:15:37 2011 [InM][I]: Warning: Permanently added 
'192.168.1.5' (RSA) to the list of known hosts.
Thu Jun  9 11:15:37 2011 [InM][I]: Connection closed by 192.168.1.5
Thu Jun  9 11:15:37 2011 [InM][I]: lost connection
Thu Jun  9 11:15:37 2011 [InM][I]: ExitCode: 1
Thu Jun  9 11:15:47 2011 [InM][I]: Command execution fail: 'if [ -x 
"/var/tmp/one/im/run_probes" ]; then /var/tmp/one/im/run_probes kvm 
192.168.1.5; else                              exit 42; fi'
Thu Jun  9 11:15:47 2011 [InM][I]: STDERR follows.
Thu Jun  9 11:15:47 2011 [InM][I]: Connection closed by 192.168.1.5
Thu Jun  9 11:15:47 2011 [InM][I]: ExitCode: 255
Thu Jun  9 11:15:47 2011 [InM][E]: Error monitoring host 0 : MONITOR 
FAILURE 0 Could not monitor host 192.168.1.5.
Thu Jun  9 11:16:20 2011 [DiM][D]: Deploying VM 0
Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
SUSENebula-Head:/var/lib/one//images/44833e18c42179d7012a9229ade7aad049399aef 
192.168.1.5:/var/lib/one//0/images/disk.0

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
DST: /var/lib/one//0/images/disk.0

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
Creating directory /var/lib/one//0/images

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
Executed "mkdir -p /var/lib/one//0/images".

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
Executed "chmod a+w /var/lib/one//0/images".

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
Cloning /var/lib/one//images/44833e18c42179d7012a9229ade7aad049399aef

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
Executed "cp -r 
/var/lib/one//images/44833e18c42179d7012a9229ade7aad049399aef 
/var/lib/one//0/images/disk.0".

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: LOG - 0 tm_clone.sh: 
Executed "chmod a+rw /var/lib/one//0/images/disk.0".

Thu Jun  9 11:16:23 2011 [TM][D]: Message received: TRANSFER SUCCESS 0 -

Thu Jun  9 11:16:26 2011 [ReM][D]: HostPoolInfo method invoked
Thu Jun  9 11:16:26 2011 [ReM][D]: VirtualMachinePoolInfo method invoked
Thu Jun  9 11:16:28 2011 [ReM][D]: VirtualMachinePoolInfo method invoked
Thu Jun  9 11:16:38 2011 [VMM][D]: Message received: LOG - 0 Command 
execution fail: 'if [ -x "/var/tmp/one/vmm/kvm/deploy" ]; then 
/var/tmp/one/vmm/kvm/deploy /var/lib/one//0/images/deployment.0; else 
                            exit 42; fi'

Thu Jun  9 11:16:38 2011 [VMM][D]: Message received: LOG - 0 STDERR follows.

Thu Jun  9 11:16:38 2011 [VMM][D]: Message received: LOG - 0 Connection 
closed by 192.168.1.5

Thu Jun  9 11:16:38 2011 [VMM][D]: Message received: LOG - 0 ExitCode: 255

Thu Jun  9 11:16:38 2011 [VMM][D]: Message received: DEPLOY FAILURE 0 
Connection closed by 192.168.1.5

Thu Jun  9 11:16:38 2011 [VMM][D]: Message received: ExitCode: 255

Thu Jun  9 11:16:39 2011 [TM][D]: Message received: LOG - 0 
tm_delete.sh: Deleting /var/lib/one//0/images

Thu Jun  9 11:16:39 2011 [TM][D]: Message received: LOG - 0 
tm_delete.sh: Executed "rm -rf /var/lib/one//0/images".

Thu Jun  9 11:16:39 2011 [TM][D]: Message received: TRANSFER SUCCESS 0 -

This doesn't tell me a hole lot, but I am hoping this has some meaning 
to someone on the list.

The only thing I can suspect is that the oneadmin user is not allowed 
for interactive login, which is in accordance to the documentation given 
here:

http://opennebula.org/documentation:rel2.2:plan

This page does not state that the oneadmin user needs a password for 
interactive login, and the user is distinctly created without a password:

# useradd -d /srv/cloud/one -g cloud -m oneadmin

The /var/lib/one home directory is NFS mounted on the node and the ssh 
keys have the proper permissions. Based on the ssh warning from the log 
(see above) about adding the host to known_hosts, I would say ssh 
connection is working as expected.

What am I missing?

Help is appreciated.

Thanks,
Robert

-- 
Robert Schweikert                           MAY THE SOURCE BE WITH YOU
SUSE-IBM Software Integration Center                   LINUX
Tech Lead
rschweikert at novell.com
rschweikert at ca.ibm.com
781-464-8147



More information about the Users mailing list