[one-users] unable to cold migrate

Ruben S. Montero rsmontero at opennebula.org
Wed Feb 6 01:31:23 PST 2013


Hi

So you have solved your restore issue, then? The permission issue is usually
caused by a misconfigured qemu.conf (dynamic_ownership [1]) or by a problem
accessing the datastore from the hosts (mount points, mismatched oneadmin
IDs...).
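
A rough way to check both causes on each host is sketched below. It assumes
the libvirt configuration lives at /etc/libvirt/qemu.conf (the usual Debian
location), and the helper name qemu_conf_value is made up for illustration.
It only reads settings and changes nothing.

```shell
#!/bin/sh
# Hypothetical helper: print the last uncommented value of KEY in a
# qemu.conf-style file (KEY = value), with surrounding quotes removed.
qemu_conf_value() {
    key=$1
    file=$2
    sed -n "s/^[[:space:]]*${key}[[:space:]]*=[[:space:]]*//p" "$file" |
        tail -n 1 | tr -d '"'
}

# Assumed path; adjust if libvirt is configured elsewhere.
conf=/etc/libvirt/qemu.conf
if [ -r "$conf" ]; then
    for key in user group dynamic_ownership; do
        echo "$key = $(qemu_conf_value "$key" "$conf")"
    done
fi

# The numeric uid/gid of oneadmin should be identical on every host that
# shares the datastore; run this on each host and compare the output.
if id oneadmin >/dev/null 2>&1; then
    echo "oneadmin uid=$(id -u oneadmin) gid=$(id -g oneadmin)"
fi
```

The KVM guide [1] is the place to confirm the expected values; commonly user
and group are set to oneadmin and dynamic_ownership to 0. If the uid or gid
printed on MASTER and HOST differ, files on a shared datastore will appear
to belong to a different user on one side.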

Regarding the network problem, you have to create bridge1 on the hosts and
bind it to the target physical interface for the dummy configuration. However,
if you are using other modes like 802.1Q, the bridge is created for you.
This depends on how the host was defined. Information about networking
modes and configuration is here [2].
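
For the dummy case, creating the bridge by hand could look like the sketch
below. It assumes bridge-utils (brctl) and iproute2 (ip) are installed;
bridge1 is the name from the error message, while eth0 and the helper name
make_bridge are assumptions for illustration. By default it only prints the
commands; set APPLY=1 and run it as root to execute them.

```shell
#!/bin/sh
# Hypothetical helper: print the commands that create a bridge and attach a
# physical NIC to it. With APPLY=1 (and root privileges) it runs them instead.
make_bridge() {
    bridge=$1
    nic=$2
    for cmd in "brctl addbr $bridge" \
               "brctl addif $bridge $nic" \
               "ip link set $bridge up"; do
        if [ "${APPLY:-0}" = 1 ]; then
            $cmd    # unquoted on purpose so the string splits into words
        else
            echo "$cmd"
        fi
    done
}

# bridge1 is the name from the error message; eth0 is an assumed NIC name.
make_bridge bridge1 eth0
```

A bridge created this way does not survive a reboot; on Debian a persistent
definition would normally go in /etc/network/interfaces. Attaching a NIC that
currently carries the host's own IP address can interrupt connectivity, so it
is safer to try this from a console than over SSH.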

Cheers

Ruben

[1] http://opennebula.org/documentation:rel3.8:kvmg
[2] http://opennebula.org/documentation:rel3.8:nm


On Wed, Feb 6, 2013 at 7:30 AM, Jürgen Weber
<jurgen.weber at theiconic.com.au> wrote:

> Some more on this:
>
> I installed kvm (this was obvious) as well as bridge-utils;
> "apt-get install kvm bridge-utils -y" completed.
> /var/lib/one/remotes does not appear to get copied, so I copied it manually
> to HOST and ran chmod -R 755 /var/lib/one/remotes on MASTER and HOST.
>
> Ran the restore manually on the master:
> oneadmin@MASTER:~$ virsh --connect qemu:///system restore
> /var/lib/one//datastores/0/164/checkpoint
> Domain restored from /var/lib/one//datastores/0/164/checkpoint <-- works
>
> http://dev.opennebula.org/issues/1324;
> hashed out == if, pointless I have a later version anyway.
>
> oneadmin@chaos:~$ /var/lib/one/remotes/vmm/kvm/restore
> /var/lib/one//datastores/0/166/checkpoint <-- works as well.
>
> Now I am just trying to deploy a VM on the machine, let's just instantiate
> one here, and I get:
>
> Wed Feb  6 17:26:10 2013 [TM][D]: Message received: LOG I 172 ln: Cloning
> chaos:/var/lib/one/datastores/106/c24ad9c213f28d634b6192e6b52d216f
> in /var/lib/one/datastores/0/172/disk.0
> Wed Feb  6 17:26:10 2013 [TM][D]: Message received: LOG I 172 ExitCode: 0
> Wed Feb  6 17:26:11 2013 [TM][D]: Message received: LOG I 172 context:
> Generating context block device at
> tartarus.matrix:/var/lib/one//datastores/0/172/disk.1
> Wed Feb  6 17:26:11 2013 [TM][D]: Message received: LOG I 172 ExitCode: 0
> Wed Feb  6 17:26:11 2013 [TM][D]: Message received: TRANSFER SUCCESS 172 -
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172 ExitCode: 0
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172
> Successfully execute network driver operation: pre.
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172 Command
> execution fail: cat << EOT | /var/tmp/one/vmm/kvm/deploy
> /var/lib/one//datastores/0/172/deployment.0 tartarus.matrix 172
> tartarus.matrix
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172 error:
> Failed to create domain from /var/lib/one//datastores/0/172/deployment.0
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172 error:
> Cannot get interface MTU on 'bridge1': No such device
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG E 172 Could not
> create domain from /var/lib/one//datastores/0/172/deployment.0
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172 ExitCode:
> 255
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: LOG I 172 Failed to
> execute virtualization driver operation: deploy.
> Wed Feb  6 17:26:11 2013 [VMM][D]: Message received: DEPLOY FAILURE 172
> Could not create domain from /var/lib/one//datastores/0/172/deployment.0
>
> Now I have no bridge configuration on the MASTER, and I have observed from
> deleting and creating a VM that OpenNebula is doing all the interface
> configuration itself, which I vaguely remember getting working last time,
> so I am not sure why it's not working here.
>
> Any thoughts?
>
> On 06/02/13 10:47, Jürgen Weber wrote:
>
>> Hi
>>
>> Right now I have an OpenNebula server running the front end (Sunstone)
>> and a host all on one machine. It works great. Let's refer to this as the
>> master.
>>
>> I am trying to add a new host (referred to as the host) to run more VMs.
>> I am attempting to migrate from the master to the host without success.
>> Firstly, what is installed:
>>
>> MASTER
>> uname -a
>> Linux MASTER 3.2.0-4-amd64 #1 SMP Debian 3.2.35-2 x86_64 GNU/Linux
>>
>> cat /etc/debian_version
>> 7.0
>>
>> ii  opennebula           3.4.1-3.1       amd64  controller which executes the OpenNebula cluster services
>> ii  opennebula-common    3.4.1-3.1       all    empty package to create OpenNebula users and directories
>> ii  opennebula-sunstone  3.4.1-3.1       all    web interface to which executes the OpenNebula cluster services
>> ii  opennebula-tools     3.4.1-3.1       all    Command-line tools for OpenNebula Cloud
>> ii  ruby-opennebula      3.4.1-3.1       all    Ruby bindings for OpenNebula Cloud API (OCA)
>> ii  kvm                  1:1.1.2+dfsg-5  amd64  dummy transitional package from kvm to qemu-kvm
>> ii  qemu-kvm             1.1.2+dfsg-5    amd64  Full virtualization on x86 hardware
>> ii  libvirt-bin          0.9.12-5        amd64  programs for the libvirt library
>> ii  libvirt0             0.9.12-5        amd64  library for interfacing with different virtualization systems
>> ii  python-libvirt       0.9.12-5        amd64  libvirt Python bindings
>>
>> HOST
>> uname -a
>> Linux HOST 3.2.0-4-amd64 #1 SMP Debian 3.2.35-2 x86_64 GNU/Linux
>>
>> cat /etc/debian_version
>> 7.0
>>
>> i   opennebula-common    3.4.1-3.1       all    empty package to create OpenNebula users and directories
>> ii  opennebula-node      3.4.1-3.1       all    empty package to prepare a machine as OpenNebula Node
>> ii  libvirt-bin          0.9.12-5        amd64  programs for the libvirt library
>> ii  libvirt0             0.9.12-5        amd64  library for interfacing with different virtualization systems
>>
>> Reading /usr/share/doc/opennebula-node/README.Debian tells you what to
>> do. I followed those instructions and everything completed successfully.
>> One thing I find odd: the kvm/qemu-kvm packages are not installed on the
>> HOST, but that README file says to ensure that the kernel modules are
>> loaded, which they are.
>>
>> oneadmin@MASTER:~$ onehost list
>>   ID NAME          CLUSTER   RVM  TCPU  FCPU  ACPU    TMEM   FMEM   AMEM STAT
>>    2 MASTER.matrix Internal   16  3200  2509   200  126.1G  27.2G  22.1G on
>>    4 HOST.mat      -           0  1600  1600  1600   31.5G  31.1G  31.5G on
>>
>> But that said, here is what happens in the VM's log when I try to migrate:
>>
>> Wed Feb  6 09:31:46 2013 [VMM][D]: Message received: LOG I 161 ExitCode: 0
>> Wed Feb  6 09:31:46 2013 [VMM][D]: Message received: LOG I 161
>> Successfully execute virtualization driver operation: save.
>> Wed Feb  6 09:31:46 2013 [VMM][D]: Message received: LOG I 161 ExitCode: 0
>> Wed Feb  6 09:31:46 2013 [VMM][D]: Message received: LOG I 161
>> Successfully execute network driver operation: clean.
>> Wed Feb  6 09:31:46 2013 [VMM][D]: Message received: SAVE SUCCESS 161 -
>> Wed Feb  6 09:31:47 2013 [TM][D]: Message received: LOG I 161 ExitCode: 0
>> Wed Feb  6 09:31:47 2013 [TM][D]: Message received: LOG I 161 ExitCode: 0
>> Wed Feb  6 09:31:47 2013 [TM][D]: Message received: TRANSFER SUCCESS 161 -
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG I 161 ExitCode: 0
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG I 161
>> Successfully execute network driver operation: pre.
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG I 161 Command
>> execution fail: /var/tmp/one/vmm/kvm/restore
>> /var/lib/one//datastores/0/161/checkpoint HOST 161 HOST
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG E 161 restore:
>> Command "virsh --connect qemu:///system restore
>> /var/lib/one//datastores/0/161/checkpoint" failed: error: Failed to restore
>> domain from /var/lib/one//datastores/0/161/checkpoint
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG I 161 error:
>> Failed to create file '/var/lib/one//datastores/0/161/checkpoint':
>> Operation not permitted
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG E 161 Could not
>> restore from /var/lib/one//datastores/0/161/checkpoint
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG I 161 ExitCode: 1
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: LOG I 161 Failed to
>> execute virtualization driver operation: restore.
>> Wed Feb  6 09:31:47 2013 [VMM][D]: Message received: RESTORE FAILURE 161
>> Could not restore from /var/lib/one//datastores/0/161/checkpoint
>> Wed Feb  6 09:31:49 2013 [InM][I]: ExitCode: 0
>>
>> I cannot work out which operation is not permitted. Is it a permissions
>> issue? Which host is this error originating from: is it saying the MASTER
>> does not have access to that file, or the HOST? I have no idea.
>>
>> Any help appreciated.
>>
>> Thanks
>>
>> Jurgen
>>
>>
> --
> Jürgen Weber
>
> Systems Engineer
> IT Infrastructure Team Leader
>
> THE ICONIC | E jurgen.weber at theiconic.com.au | www.theiconic.com.au
>
> --
> Ruben S. Montero, PhD
> Project co-Lead and Chief Architect
> OpenNebula - The Open Source Solution for Data Center Virtualization
> www.OpenNebula.org | rsmontero at opennebula.org | @OpenNebula
>