Dears,<br><br>I 'm running Squid Web Cache Proxy server on Ubuntu Server 12.04 VMs (kernel 3.2.0-23-generic-pae), OpenNebula 3.4.<br>My private cloud is composed by one frontend and three nodes. VMs are running on that 3 nodes, initially one by node.<br>
Outside cloud, there are 2 hosts, one working as web clients and another as web server, using Web Polygraph Benchmakring Tool.<br><br>The goal of tests is stress Squid cache running on VMs.<br>When same test is executed outside the cloud, using the three nodes as Physical Machines, there are 100% of cache service availability.<br>
Nevertheless, when cache service is provided by VMs, nothing better than 45% of service availability is reached.<br>Web clients do not receive responses from squid when it is running on VMs in 55% of the time.<br><br>I have monitored load average of VMs and PMs where VMs are been executed. First load average field reaches 15 after some hours of tests on VMs, and 3 on physical machines.<br>
<span id="result_box" class="short_text" lang="en"><span class="hps">Furthermore, t</span></span>here is a set of processes, called migration/X, that are champions in CPU TIME when VMs are in execution. A sample:<br><br><span style="font-family:courier new,monospace">top - 20:01:38 up 1 day, 3:36, 1 user, load average: 5.50, 5.47, 4.20</span><br style="font-family:courier new,monospace">
<br style="font-family:courier new,monospace"><span style="font-family:courier new,monospace"> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ TIME COMMAND </span><br style="font-family:courier new,monospace">
<span style="font-family:courier new,monospace"> 13 root RT 0 0 0 0 S 0 0.0 408:27.25 408:27 migration/2 </span><br style="font-family:courier new,monospace"><span style="font-family:courier new,monospace"> 8 root RT 0 0 0 0 S 0 0.0 404:13.63 404:13 migration/1 </span><br style="font-family:courier new,monospace">
<span style="font-family:courier new,monospace"> 6 root RT 0 0 0 0 S 0 0.0 401:36.78 401:36 migration/0 </span><br style="font-family:courier new,monospace"><span style="font-family:courier new,monospace"> 17 root RT 0 0 0 0 S 0 0.0 400:59.10 400:59 migration/3 </span><br style="font-family:courier new,monospace">
<br><br><span id="result_box" class="" lang="en"><span class="hps">It isn't possible to </span></span><span id="result_box" class="" lang="en"><span class="hps">offer</span> web cache service <span class="hps">via</span> <span class="hps">VMs in t</span></span><span id="result_box" class="" lang="en"><span class="hps">he way</span> <span class="hps">the service</span> <span class="hps">is behaving</span><span class="">, with so small availability.<br>
<br>So, my questions:<br><br>1</span></span>. Does anybody has experienced a similar problem of unresponsive service? (Whatever service).<br><span id="result_box" class="" lang="en"><span class="">2. How to state the bootleneck that is overloading the system, so that it can be </span></span><span id="result_box" class="short_text" lang="en"><span class="hps">minimized?</span></span><br>
<br>Thanks a lot,<br><br>Erico.<br>