[one-users] Opennebula migration error

Vlad Mirel vlad.mirel at outlook.com
Tue Jan 14 03:00:40 PST 2014


Hello again!

Trying to suspend + resume a VM had the same result.

However, I have managed to fix it in the meantime.

It seems my system datastore was configured as shared. After changing its TM_MAD to ssh in Sunstone, everything worked fine.
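
For anyone who wants to check for the same condition from the command line, here is a minimal sketch that filters the output of onedatastore list and prints every datastore whose transfer manager is not ssh. It assumes that the first line of the output is a header, that ID is the first column and that TM is the last column; the function name non_ssh_datastores is made up for illustration, and the column layout may differ between OpenNebula versions.

```shell
# Print "<ID> <TM>" for every datastore whose TM column is not "ssh".
# Assumption: header on line 1, ID in the first field, TM in the last field.
non_ssh_datastores() {
    awk 'NR > 1 && NF > 0 && $NF != "ssh" { print $1, $NF }'
}

# Intended use on the front-end (not run here):
#   onedatastore list | non_ssh_datastores
```

No output means every listed datastore already uses ssh. In my case the system datastore (ID 0) would have been reported.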

Hope it will help others too!

From: cmartin at opennebula.org
Date: Mon, 13 Jan 2014 11:16:19 +0100
Subject: Re: [one-users] Opennebula migration error
To: vlad.mirel at outlook.com
CC: users at lists.opennebula.org

Hi,
On Fri, Jan 10, 2014 at 3:43 PM, Vlad Mirel <vlad.mirel at outlook.com> wrote:


Hello!

I have installed the latest stable release of the OpenNebula 4 series.
I have 2 worker nodes and 1 front-end. I can deploy virtual machines on both worker nodes.
The problem occurs when I try to migrate a virtual machine from one worker node to the other. I am trying to perform a cold migration, and I am not using shared storage, since I am relying on scp.



So, for example I try to do this operation: Migrate from host A to host B.

In my opinion, a POSSIBLE CAUSE might be the following:
-the checkpoint is not copied to the host B => no such file or directory


Just to be sure, can you paste the output of onedatastore list? All the DS should have 'ssh' as the TM (transfer manager).


Do you have the same problem if you try to suspend + resume a VM?
Regards

--
Carlos Martín, MSc
Project Engineer
OpenNebula - Flexible Enterprise Cloud Made Simple
www.OpenNebula.org | cmartin at opennebula.org | @OpenNebula




On Fri, Jan 10, 2014 at 3:43 PM, Vlad Mirel <vlad.mirel at outlook.com> wrote:





Hello!

I have installed the latest stable release of the OpenNebula 4 series.
I have 2 worker nodes and 1 front-end. I can deploy virtual machines on both worker nodes.
The problem occurs when I try to migrate a virtual machine from one worker node to the other. I am trying to perform a cold migration, and I am not using shared storage, since I am relying on scp.



So, for example I try to do this operation: Migrate from host A to host B.

Here are the logs containing the error:

Thu Jan  9 17:43:04 2014 [LCM][I]: New VM state is SAVE_MIGRATE
Thu Jan  9 17:43:10 2014 [VMM][I]: ExitCode: 0


Thu Jan  9 17:43:10 2014 [VMM][I]: Successfully execute virtualization driver operation: save.
Thu Jan  9 17:43:10 2014 [VMM][I]: ExitCode: 0
Thu Jan  9 17:43:10 2014 [VMM][I]: Successfully execute network driver operation: clean.


Thu Jan  9 17:43:11 2014 [LCM][I]: New VM state is PROLOG_MIGRATE
Thu Jan  9 17:43:11 2014 [LCM][I]: New VM state is BOOT
Thu Jan  9 17:43:12 2014 [VMM][I]: ExitCode: 0
Thu Jan  9 17:43:12 2014 [VMM][I]: Successfully execute network driver operation: pre.


Thu Jan  9 17:43:12 2014 [VMM][I]: Command execution fail: /var/tmp/one/vmm/kvm/restore '/var/lib/one/datastores/0/230/checkpoint' '192.168.1.100' 230 192.168.1.100
Thu Jan  9 17:43:12 2014 [VMM][E]: restore: Command "virsh --connect qemu:///system restore /var/lib/one/datastores/0/230/checkpoint" failed: error: Failed to restore domain from /var/lib/one/datastores/0/230/checkpoint
Thu Jan  9 17:43:12 2014 [VMM][I]: error: Failed to create file '/var/lib/one/datastores/0/230/checkpoint': No such file or directory
Thu Jan  9 17:43:12 2014 [VMM][E]: Could not restore from /var/lib/one/datastores/0/230/checkpoint
Thu Jan  9 17:43:12 2014 [VMM][I]: ExitCode: 1


Thu Jan  9 17:43:12 2014 [VMM][I]: Failed to execute virtualization driver operation: restore.
Thu Jan  9 17:43:12 2014 [VMM][E]: Error restoring VM: Could not restore from /var/lib/one/datastores/0/230/checkpoint


Thu Jan  9 17:43:12 2014 [DiM][I]: New VM state is FAILED

When I run the same restore command manually on host A, where the checkpoint exists, I get the following output:

Last login: Thu Jan  9 15:51:59 2014 from utcn.local

$ 

$ virsh --connect qemu:///system restore /var/lib/one/datastores/0/228/checkpoint
Domain restored from /var/lib/one/datastores/0/228/checkpoint

When run on host B, I get the following output:

Last login: Thu Jan  9 15:26:11 2014 from utcn.local


$ virsh --connect qemu:///system restore /var/lib/one/datastores/0/228/checkpoint
error: Failed to restore domain from /var/lib/one/datastores/0/228/checkpoint
error: Failed to create file '/var/lib/one/datastores/0/228/checkpoint': No such file or directory



---------> these errors are identical to the ones shown in Sunstone

In my opinion, a POSSIBLE CAUSE might be the following:
-the checkpoint is not copied to the host B => no such file or directory
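
One way to confirm or rule out this cause is to check for the checkpoint file on each host before the restore is attempted. The sketch below is only an illustration: the function name check_checkpoint is hypothetical, and the directory layout (<datastore dir>/<VM ID>/checkpoint) is taken from the paths in the logs above.

```shell
# Report whether the checkpoint file of a VM exists under a datastore directory.
# Arguments: $1 = datastore base directory, $2 = VM ID.
# Returns 0 if the file is present, 1 if it is missing.
check_checkpoint() {
    ckpt="$1/$2/checkpoint"
    if [ -f "$ckpt" ]; then
        echo "present: $ckpt"
    else
        echo "missing: $ckpt"
        return 1
    fi
}

# Intended use, run locally on host A and on host B (not run here):
#   check_checkpoint /var/lib/one/datastores/0 230
```

If the file is reported as present on host A but missing on host B after PROLOG_MIGRATE, the checkpoint was never transferred, which would match the "No such file or directory" error.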




I would appreciate any help!
 		 	   		  

_______________________________________________

Users mailing list

Users at lists.opennebula.org

http://lists.opennebula.org/listinfo.cgi/users-opennebula.org



 		 	   		  