<html>

  <head>

    <meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    Hello everybody,<br>

    <br>

    I have 4 physical computers: 1 as a FrontEnd with CentOS 6.2 and

    OpenNebula 3.6, two hosts with CentOS 6.2 and qemu-kvm and another

    as an nfs server with a debian based distro.<br>

    Each time I try to create a new VM, it waits for a long time in

    PROLOG status and then it fails. According to the log, it seems to

    be a problem connected to NFS.<br>

    In the NFS server side there are no relevant information in the

    logs, just a few entries like this:<br>

    <font face="Helvetica, Arial, sans-serif"><small>Aug 29 14:56:37

        localhost kernel: svc: unknown version (4)<br>

        Aug 29 14:56:37 localhost rpc.mountd: authenticated mount

        request from host2:751 for /opt/virtual/one (/opt/virtual/one)</small></font><br>

    <br>

    <br>

    This is the output of the VM log:<br>

    <br>

    <font face="Helvetica, Arial, sans-serif"><small>Wed Aug 29 14:58:28

        2012 [DiM][I]: New VM state is ACTIVE.<br>

        Wed Aug 29 14:58:29 2012 [LCM][I]: New VM state is PROLOG.<br>

        Wed Aug 29 14:58:29 2012 [VM][I]: Virtual Machine has no context<br>

        Wed Aug 29 15:01:12 2012 [TM][I]: Command execution fail:

        /var/lib/one/remotes/tm/shared/clone

        frontend:/var/lib/one/datastores/1/5c0455eb494fd43b5c7c576e0c642fbe

        Host2:/var/lib/one//datastores/0/32/disk.0 32 1<br>

        Wed Aug 29 15:01:12 2012 [TM][I]: clone: Cloning

        ../../1/5c0455eb494fd43b5c7c576e0c642fbe in

        Host2:/var/lib/one//datastores/0/32/disk.0<br>

        Wed Aug 29 15:01:12 2012 [TM][E]: clone: Command "cd

        /var/lib/one/datastores/0/32; cp -r

        ../../1/5c0455eb494fd43b5c7c576e0c642fbe

        /var/lib/one/datastores/0/32/disk.0" failed: cp: reading

        `../../1/5c0455eb494fd43b5c7c576e0c642fbe': Stale NFS file

        handle<br>

        Wed Aug 29 15:01:12 2012 [TM][E]: Error copying

        frontend:/var/lib/one/datastores/1/5c0455eb494fd43b5c7c576e0c642fbe

        to Host2:/var/lib/one//datastores/0/32/disk.0<br>

        Wed Aug 29 15:01:12 2012 [TM][I]: ExitCode: 1<br>

        Wed Aug 29 15:01:12 2012 [TM][E]: Error executing image transfer

        script: Error copying

        frontend:/var/lib/one/datastores/1/5c0455eb494fd43b5c7c576e0c642fbe

        to Host2:/var/lib/one//datastores/0/32/disk.0<br>

        Wed Aug 29 15:01:12 2012 [DiM][I]: New VM state is FAILED</small></font><br>

    <br>

    <br>

    I've checked that path and filenames, and both are correct. The file

    itself is a 300MG sized and the error appears in this case just a

    few minutes later but in other cases aprox. 30 minutes after the

    start of the copy, so it doesn't finish the file copy.<br>

    At first I managed to solve it unmounting and remounting the shared

    directory and it was possible to fully copy the file (after aprox.

    30 minutes) and run the VM. But today, the problem persists after

    doing it (even after rebooting the clients).<br>

    Additionally, I've tried doing the same operation (to copy the same

    file from a directory to another) locally in the NFS Server and it

    takes just a minute (aprox.) to accomplish it.<br>

    <br>

    May it be related to OpenNebula? Could you give me advice on

    troubleshooting it?<br>

    <br>

    <br>

    Thanks in advance.<br>

    <br>

    Carlos.

  </body>

</html>