[one-users] wrong restart -> delete disk image!

samuel samu60 at gmail.com
Tue Sep 6 08:28:55 PDT 2011


Hi folks,

Recently there was a network problem and one instance became unreachable. We
tried to restart it with stop and resume actions but there's been a problem
and the disk has been deleted. The main concern is why, after trying to
restart and an error happened, the directory where the disk image resides
has been deleted? There was no sensible data on it but I just don't get why
there has been a rm -rf of the directory.

Details:

The configuration is KVM with shared storage using open nebula 2.2.

output of virsh version
    Compilado contra la biblioteca: libvir 0.8.8
    Utilizando la biblioteca: libvir 0.8.8
    Utilizando API: QEMU 0.8.8
    Ejecutando hypervisor: QEMU 0.14.0

related logs:

Tue Sep  6 12:37:49 2011 [VMM][D]: Message received: SAVE SUCCESS 22 Domain
one-22 saved to /srv/cloud/one/var//22/images/checkpoint
Tue Sep  6 12:37:49 2011 [VMM][D]: Message received:
Tue Sep  6 12:37:49 2011 [TM][D]: Message received: LOG - 22 tm_mv.sh: Will
not move, is not saving image
Tue Sep  6 12:37:49 2011 [TM][D]: Message received: TRANSFER SUCCESS 22 -

Tue Sep  6 12:38:12 2011 [DiM][D]: Restarting VM 22
Tue Sep  6 12:38:12 2011 [DiM][E]: Could not restart VM 22, wrong state.
Tue Sep  6 12:38:12 2011 [ReM][E]: Wrong state to perform action

Tue Sep  6 12:38:18 2011 [ReM][D]: VirtualMachineAction invoked
Tue Sep  6 12:38:18 2011 [DiM][D]: Resuming VM 22
Tue Sep  6 12:38:47 2011 [DiM][D]: Deploying VM 22

Tue Sep  6 12:38:47 2011 [ReM][D]: VirtualMachineInfo method invoked
Tue Sep  6 12:38:47 2011 [TM][D]: Message received: LOG - 22 tm_mv.sh: Will
not move, is not saving image

Tue Sep  6 12:38:47 2011 [TM][D]: Message received: TRANSFER SUCCESS 22 -

Tue Sep  6 12:38:48 2011 [ReM][D]: VirtualMachineInfo method invoked
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: LOG - 22 Command
execution fail: 'if [ -x "/var/tmp/one/vmm/kvm/restore" ]; then
/var/tmp/one/vmm/kvm/restore /srv/cloud/one/var//22/images/checkpoint;
else                              exit 42; fi'
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: LOG - 22 STDERR
follows.
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: LOG - 22 error: Failed
to restore domain from /srv/cloud/one/var//22/images/checkpoint
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: LOG - 22 error: cannot
close file: Bad file descriptor
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: LOG - 22 ExitCode: 1
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: RESTORE FAILURE 22
error: Failed to restore domain from
/srv/cloud/one/var//22/images/checkpoint
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: error: cannot close
file: Bad file descriptor
Tue Sep  6 12:38:49 2011 [VMM][D]: Message received: ExitCode: 1

Tue Sep  6 12:38:50 2011 [TM][D]: Message received: LOG - 22 tm_delete.sh:
Deleting /srv/cloud/one/var//22/images
Tue Sep  6 12:38:50 2011 [TM][D]: Message received: LOG - 22 tm_delete.sh:
Executed "rm -rf /srv/cloud/one/var//22/images".
Tue Sep  6 12:38:50 2011 [TM][D]: Message received: TRANSFER SUCCESS 22 -


Thank you in advance for any hint!
Samuel.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.opennebula.org/pipermail/users-opennebula.org/attachments/20110906/440eb46d/attachment-0001.htm>


More information about the Users mailing list