<?xml version="1.0" encoding="utf-8"?>

<rss version="2.0">
	<channel>
		<title>Anand Lal Shimpi's Weblog</title>
	    <link>http://www.anandtech.com/weblog</link> 
	    <description>The personal weblog for Anand Lal Shimpi</description>
    	<language>en-us</language>
		<copyright>Copyright 2010 - Anand Lal Shimpi</copyright>
		<pubDate>Mon, 15 Mar 2010 02:47:21 EDT</pubDate>
		
		<item>
			<title>Databases and power management, not a perfect fit</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=670</link>
			<description><![CDATA[ <div>In <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3722">our last article</a>, I showed that the current power management <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3722&amp;p=8">does not seem</a> to <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3722&amp;p=11">work well with the Windows Scheduler</a>. We got tons of interesting suggestions and superb feedback. Also several excellent academic papers from two universities in Germany which confirm our findings and offer a lot of new insights. More about that later.The thing that is really haunting me once again is that our follow up article is long overdue. And it is urgent, because some people feel that the benchmark we used undermines all our findings. We disagree as we chose the Fritz benchmark not because it was realworld, but because it let us control the amount of CPU load and threads so easily. But the fact remains of course that the benchmark is hardly relevant for any server. Pleading guilty as charged.<br />
</div>
<div>&nbsp; <br />
</div>
<div>So how about SQL Server 2008&nbsp; Enterprise x64 on Windows 2008 x64? That should interest a lot more IT professionals.We used our "Nieuws.be" SQL Server test, you can read about <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3653&amp;p=5">our testing methods here</a>. That is the great thing about the blog, you do not have to spend pages on benchmark configuration details :-). Hardware configuration details: a single Opteron 2435 2.6 GHz running in the server <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3722&amp;p=3">we described here</a>. This test&nbsp; is as real life as it gets: we test with 25, 50, 100 and so on users which fire off queries with an average rate of one per second. Our <a href="http://www.anandtech.com/IT/showdoc.aspx?i=3567&amp;p=5">vApus stresstest</a> makes sure that all those queries are not sent at the same time but within a certain time delta, just like real users. So this is much better than putting the CPU under 100% load and measuring maximum throughput. Remember in our first article, we stated that the real challenge of a server was to offer a certain number of users a decent responsetime, and this preferably at the lowest cost. And the lowest cost includes the lowest power consumption of course. &nbsp; <br />
</div>
<div>&nbsp;</div>
<div>While I keep some of the data for the article, I like to draw your attention to a few very particular findings when comparing the "balanced" and "performance" power plan of Windows 2008. Remember the balanced performance plan is the one that should be the best one: in theory it adapts the frequency and voltage of your CPU to the demanded performance with only a small performance hit. And when we looked at the <span style="font-weight: bold;">throughput</span> or queries per second figures, this was absolutely accurate. But throughtput is just throughput. Response time is the one we care about. </div>
<div>&nbsp;</div>
<div>Let us take a look at the graph below. The response time and power usage of the server when set to performance (maximum clock all the time) is equal to one. The balanced power and response time are thus relative to the numbers we saw in performance.&nbsp; Response time is represented by the columns and the first Y-axis (on the left), Power consumption is represented by the line and by the second Y-axis (on your right). <br />
</div>
<div>&nbsp;</div>
<div>&nbsp;</div>
<div><img alt="" src="http://images.anandtech.com/reviews/IT/2010//februaryblog/powerrelative.png" /></div>
<div>&nbsp;</div>
<div>&nbsp;The interesting&nbsp; thing is that reducing the frequency and voltage never delivers more than 10% of power savings. One reason is that we are testing with only six-core CPU. The power savings would be obviously better when you look at a dual or even quad CPU system. Still, as the number of core per CPU increases, systems with less CPUs become more popular. If you have been paying attention to what AMD and Intel <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3681">are planning in the next month(s)</a>, you'll notice that they are adapting to that trend. You'll see even more evidence next month. <br />
</div>
<div>&nbsp;</div>
<div>What is really remarkable is that our SQL Server 2008 server took twice as much time to respond when the CPU is using DVFS (Dynamic Voltage Frequency Scaling) than when not. It clearly shows that in many cases, heavy queries were scheduled on cores which were running at a low frequency (0.8 - 1.4 GHz).&nbsp; <br />
</div>
<div>&nbsp; <br />
</div>
<div>I am not completely sure whether or not CPU load measurements are completely accurate when you use DVFS (Powernow!), but the CPU load numbers tell the same story. <br />
</div>
&nbsp;
&nbsp;
<div><img alt="" src="http://images.anandtech.com/reviews/IT/2010//februaryblog/powerCPUload.png" /></div>
<div>&nbsp;</div>
<div>The CPU load on the "balanced" server is clearly much higher. Only when the CPU load was approaching 90%, was the "balanced" server capable of delivering the same kind of performance as when running in "performance" mode. But then of course the power savings are insignificant. So while power management makes no difference for the number of users you can serve, the response time they experience might be quite different. Considering that most servers run at CPU loads much lower than 90%, that is an interesting thing to note. <br />
</div> 
             ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Wed, 17 Feb 2010 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>Setting up a high performance OpenVZ container</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=667</link>
			<description><![CDATA[ <div>As promised for a long time, we've been working on pitting Xen and OpenVZ against eachother in a little "battle of the free virtualization solutions". (If you can't quite recall what this OpenVZ business is all about, we suggest you go read our article on <a href="http://www.anandtech.com/IT/showdoc.aspx?i=3349">container-based virtualization</a>)</div>
<div><br />
</div>
<div>Though development of our vApus FOS benchmark suite is moving on quite diligently, it takes time to create both a realistic testing setup that will prove useful and relevant for a while in a world where cores are multiplying like a pair of rabbits.&nbsp;As it turns out, our test client is up for a thorough rewrite and optimization as well in the face of the upcoming Magny-Cours and 64-core Nehalem systems, so we definitely have our work cut out for us.</div>
<div><br />
</div>
<div>In preparation for the "official" rollout of vApus FOS, we have been using our beta versions to test both the performances of CentOS 5.4 Xen and OpenVZ, meanwhile figuring out just how easy it is to set up a large scale realistic testing environment in OpenVZ.</div>
<div><br />
</div>
<div>As with many extensive open source software packages, OpenVZ comes with quite a few hefty man-pages and very minimal basic configuration, making the learning curve quite steep.&nbsp;</div>
<div><br />
</div>
<div>Having a repeatable test ready, however, helps quite a lot in tracking down possible bottlenecks in your container setup, and because our greatest issues came up when trying to configure a container for a relatively heavily queried MySQL database, here's some pointers for our readers out there trying to do the same.</div>
<div><br />
</div>
<div>
<ul>
    <li>While testing, keep a very close look on <em>/proc/user_beancounters</em>. The very last column of this table displays the failcount of a certain resource in the container. When you start noticing problems, check <em>user_beancounters</em> first to get a better idea of what's going wrong.</li>
</ul>
</div>
<div>
<ul>
    <li>Problematic resource counters to look out for are the following:</li>
</ul>
</div>
<div><em><strong>numproc</strong></em> - This is the number of processes the container is allowed to create. In MySQL, every connection will get its own process, so make sure you allow for at least the the value you entered for <em>max_connections</em> in my.cnf, plus the usual amount of processes in a container. For a test with 900 users, we just set this to 1000 to be sure.</div>
<div><br />
</div>
<div><em><strong>numtcpsock</strong></em> - Same as above, you need to increase this to at least the amount of users you want to allow at the same time. Each of them will need a TCP Socket.</div>
<div><br />
</div>
<div><em><strong>kmemsize</strong></em> - When allowing a container access to a certain amount of memory, not all of it will be used in the same way. kmemsize is the amount of bytes that will be used for kernel activity of that specific container. Creating a large amount of processes requires quite some kernel intervention, so make sure it gets the memory it needs to keep track of the processes' data structures. Though it's best to experiment somewhat to figure out which setting is optimal, a good starting point is to look at your number of processes and multiply it by 50kb, then downscale or upscale as necessary. This is something you can easily keep track of by watching <em>/proc/user_beancounters</em>.</div>
<div><br />
</div>
<div><em><strong>numfile</strong></em>&nbsp;- Again, this parameter depends on the type of application you use, how many users use it, how many tables they access (in the case of MySQL) and even which storage engine you use. Giving pointers here can become quite complex, but what worked for us was simply multiplying the base value by two to start with and examining the maxheld column in <em>/proc/user_beancounters</em> to downsize the amount to what we required.</div>
<div><br />
</div>
<div><em><strong>tcpsndbuf &amp; tcprcvbuf</strong></em> - These two buffers can be a little tricky, and confusing to notice while not paying attention. When the difference between the barrier setting and the limit of these buffers are too small, some connections can in fact be made, but some of them simply won't send or receive anything, and keep silent. This was very confusing to vApus, which opens its full amount of connections before starting the test, in the assumption that the successful creation of all connections would allow transmission of data, however slow. Instead, quite a few of its connections simply stalled indefinitely, for no apparent reason. The rule of thumb in this case is that, no matter the amount of memory you want to allow the container for networking purposes, the difference between the barrier and limit for these buffers should always allow for 2.5kB per connection, e.g. the amount filled in for <em><strong>numtcpsock</strong></em>. For our environment, this came down to 2500kB. As such, you can set the barrier value for these buffers as low as you like, but the limit should be set at <em>barrier + numtcpsock * 2.5kB</em>.</div>
<div><br />
</div>
<div>The easiest way to tweak these settings is by simply updating your containers' config files. In our case, they were located at <em>/etc/vz/conf/[containerid].conf</em> in the host container's filesystem.</div>
<div><br />
</div>
<div>Well, it's back to the grindstone for me, time to show these multicore monsters what we're made of.&nbsp;</div> ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Fri, 22 Jan 2010 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>Internet Served TV versus Cable and Satellite TV</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=665</link>
			<description><![CDATA[ <div> I've been using Dish Network for quite a few years now. Recently, I went through a forced upgrade to their latest ViP 722 high definition DVR. (I say "forced" because the older ViP 622 I had died, and Dish no longer supported the older unit. I didn't have to extend my contract, though.)</div>
<div><br />
</div>
<div>I haven't paid a great deal of attention to how rapidly IPTV services have been coming to the living room, built into consumer electronics devices. I've certainly used Hulu, plus the dedicated streaming services from individual "legacy" networks -- NBC and the like. I've also watched shows on Revision 3 and others of the new generation of Internet-only video.<br />
</div>
<div>&nbsp;</div>
<div>About the only regular IPTV viewing we do here at the Case House as a family is the Netflix Watch Instantly service through the Xbox 360. Overall, that's been a pretty positive experience. We did have a couple of burbs, however. A few months ago, we transitioned from Comcast consumer broadband to Comcast Business. I mostly wanted faster upstream bandwidth, but we also encountered the dreaded bandwidth cap when using the Consumer service. What happened when we hit the cap was watching videos through Netflix in highly compressed, worse-than-standard def mode. Ugh.<br />
</div>
<div>&nbsp;</div>
<div>But most of my internet TV viewing has been through the PC. Watching videos on a high performance PC is necessarily different than watching on a TV in the living room. PC users tend to be more forgiving than your average TV watcher. If you get a momentary pause as more data is buffered on the PC, you'll tend to accept it as routine. When that happens in the living room, there's usually a chorus of groans.</div>
<div><br />
</div>
<div>Nevertheless, we've seen a whole bunch of IPTV services integrated into consumer electronics devices in the last 18 months or so. Netflix Watch Instantly and Youtube have been the most common, but Amazon.com's service has garnered a few wins.&nbsp;</div>
<div><br />
</div>
<div>At the recent CES 2010 show, even more devices had Internet video services integrated -- even networks, like CBS, CNN, ESPN and others were integrated directly into devices. Companies like Panasonic, Sony, Samsung, Sherwood and others now have IPTV right in the box.<br />
</div>
<div><br />
</div>
<div>From what I can see, users will encounter a number of different problems. Network configuration issues will probably become a major problem. Most of these devices purport to work wirelessly, over 802.11n. My brother-in-law can't keep his run-of-the-mill Linksys router working. I can just imagine him struggling with streaming services on his TV.</div>
<div><br />
</div>
<div>There will also be the inevitable security issues, though no one seems to know what form that will take.</div>
<div>&nbsp;</div>
<div>Internet TV services are also struggling with their business models. Hulu is already poised to start charging for their service. Will a TV owner with Hulu built in pony up the subscription fee? <br />
</div>
<div><br />
</div>
<div>On the other hand, these are very early services, and as the infrastructure becomes more robust, delivery and networking issues will gradually subside, though I suspect that will take years. What will happen to the cable and satellite delivery services then? One thing they do offer is content aggregation -- users pay one company for access to a variety of networks. Will customers want to manage a variety of different payments to different services?</div>
<div><br />
</div>
<div>Nevertheless, delivering video services over the Internet will gradually become one of the accepted delivery vehicles. Whether the cables and satellite companies can adapt will be interesting to watch.<br />
</div> 
 ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Wed, 20 Jan 2010 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>Cloud computing in 2010: let us get practical</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=663</link>
			<description><![CDATA[ <div>Cloud Computing was probably the most popular buzzword of 2009. There was a lot of hype, but basically, cloud computing is about using the large datacenters of the Internet to your advantage. Either by copying the methods they use to be very scalable and available and applying them in your own datacenter (what VMware is partly trying to do with their "private Cloud", "vCloud"), by outsourcing your infrastructure (PaaS, SaaS) to an external datacenter via the Internet or most likely some hybrid form.&nbsp; <br />
</div>
<div>&nbsp;</div>
<div>In 2010, all the hype and buzz should materialize. Will you use a form of cloud computing?</div>
<div>&nbsp; <br />
</div>
<div> </div>
<div>{poll 167:550}</div> 
          ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Wed, 23 Dec 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>the x86 instruction proprietary extensions: a waste of time, money and energy</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=661</link>
			<description><![CDATA[ Agner Fog, a Danish expert in software optimization is <a href="http://www.agner.org/optimize/blog/read.php?i=25">making a plea</a> for an open and standarized procedure for x86 instruction set extensions. Af first sight, this may seem a discussion that does not concern most of us. After all, the poor souls that have to program the insanely complex x86 compilers will take care of the complete chaos called "the x86 ISA", right? Why should the average the developer, system administrator or hardware enthusiast care?<br />
<br />
Agner goes in great detail why the incompatible SSE-x.x additions and other ISA extensions were and are a pretty bad idea, but let me summarize it in a few quotes:<br />
<ul>
      <li>"The total number of x86 instructions is <strong>well above one thousand</strong>" (!!)<br />
      </li>
      <li>"CPU dispatching ... makes the code bigger, and it is so costly in terms of development time and maintenance costs that it is almost never done in a way that adequately optimizes for all brands of CPUs."</li>
      <li>"the decoding of instructions can be a serious bottleneck, and it becomes worse the more complicated the instruction codes are"</li>
      <li>The costs of supporting obsolete instructions is not negligible. You need <strong>large execution units</strong> to support a large number of instructions. This means more silicon space, longer data paths, more power consumption, and slower execution.</li>
</ul>
<div>Summarized: Intel and AMD's proprietary x86 additions cost us all money. How much is hard to calculate, but our CPUs are consuming extra energy and underperform as decoders and execution units are unnecessary complicated. The software industry is wasting quite a bit of time and effort supporting different extensions. </div>
<div>&nbsp;</div>
<div>Not convinced, still thinking that this only concerns the HPC crowd? The virtualization platforms contain<a href="http://www.anandtech.com/IT/showdoc.aspx?i=3263&amp;p=11"> up to 8% more code</a> just to support the incompatible virtualization instructions which are offering almost exactly the same features. Each VMM <a href="http://www.anandtech.com/IT/showdoc.aspx?i=3263&amp;p=11">is 4% bigger</a> because of this. So whether you are running Hyper-V, VMware ESX or Xen, you are wasting valuable RAM space. It is not dramatic of course, but it unnecessary waste. Much worse is that this unstandarized x86 extention mess has made it a lot harder for datacenters to make the step towards a really dynamic environment where you can load balance VMs and thus move applications from one server to another on the fly. It is impossible to move (vmotion, live migrate) a VM from Intel to AMD servers, from newer to (some) older ones, and you need to fiddle with CPU masks in some situations just to make it work (and <a href="http://www.vmware.com/files/pdf/vmotion_info_guide.pdf">read complex tech documents</a>). <strong>Should 99% of market lose money and flexibility because 1% of the market might get a performance boost? </strong><br />
</div>
<div><br />
</div>
<div>The reason why Intel and AMD still continue with this is that some people inside feel that can create a "competitive edge". I believe this "competitive edge" is neglible: how many people have bought an Intel "Nehalem" CPU because it has the new SSE 4.2 instructions? How much software is supporting yet another x86 instruction addition? <br />
</div>
<div>&nbsp;</div>
So I fully support <a href="http://www.agner.org/optimize/blog/read.php?i=25">Agner Fog in his quest</a> to a (slightly) less chaotic and more standarized x86 instruction set. 
    ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Sun, 06 Dec 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>vApus for Open Source: Creating a virtualized stress test</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=656</link>
			<description><![CDATA[ <div>If you've been keeping up with our articles for a while, you might have picked up on vApus Mark I: the virtualized stress test we created for internal use at the Sizing Servers testlab.</div>
<div><br />
</div>
<div>As detailed in Johan's article, this bench consists of 3 separate applications, all of which we are very familiar with due to extensive optimization and stress testing efforts. Although we believe the results published based on this bench speak for themselves, the problem remained that it was impossible for anyone outside our lab to verify the results, seeming as how two out of three of the applications used were owned by private companies and were entrusted to our lab under rather strict conditions (distributing them to the rest of the world sadly not being one of them).</div>
<div><br />
</div>
<div>Secondly, vApus M1 being a bench that focuses on fairly heavy VM's, we feel the need to create another point of reference. One that will back up the results of the original, but with a completely different mix of VM's.</div>
<div><br />
</div>
<div>Thus began the process of creating vApus For Open Source, or vApus FOS, as we like to call it in the lab.</div>
<div><br />
</div>
<div>The idea behind vApus FOS is that the VM's can be freely distributed to any vendors that wish to verify our results, and our lab can provide a version of the actual in-house developed vApus benching software to generate the load.</div>
<div><br />
</div>
<div>I am happy to say that the preliminary 1-tile testing for this new benchmark has just completed, and so far everything has been running quite smoothly. The results are reproducible, the VM's stable... looks like our 4-tile (16 VM's in total) testing can begin!</div>
<div><br />
</div>
<div>The fun part is that a lot of the ideas we incorporated into the new setup we owe to you, our readers! Thanks to the feedback we got on vApus M1, we were able to combine some new workloads into an interesting mix:</div>
<div><br />
</div>
<div>As it stands, one tile consists of 4 VM's, all of which run a basic, minimal CentOS 5.4.</div>
<div><br />
</div>
<div>VM1 runs an Apache webserver and MySQL database, hosting a phpbb3-forum. The VM is given 2 vCPU's and 1GB RAM.</div>
<div><br />
</div>
<div>VM2 runs the same setup as VM1, but is only given 1 vCPU.</div>
<div><br />
</div>
<div>VM3 runs a fully configured mailserver using Postfix, Courier and a Squirrelmail frontend. This VM is assigned 2 vCPU's and 1GB RAM.</div>
<div><br />
</div>
<div>VM4 runs a separate MySQL OLAP database, using InnoDB as its storage engine. This machine is also assigned 2 vCPU's and 1GB RAM.</div>
<div><br />
</div>
<div>The goal is currently to get a 4-tile test going on a 16-core machine, meaning that the hypervisor will have to account for 28 vCPU's in total. This should prove to be a very interesting exercise for the scheduler. Of course, this VM setup can be made to work perfectly fine in an OpenVZ environment as well, meaning we can finally do some real world testing on alternative Linux-based virtualization solutions as well.</div>
<div><br />
</div>
<div>We thought we'd keep you updated on the progress of our research. As any experienced IT professional will know, well thought-out server technology testing takes time, and it's important to realize the amount of steps required to produce results that can immediately be applied in the real world.</div>
<div><br />
</div>
<div>Stay tuned for our first testing results, they should be rolling in very soon now!</div>
<div><br />
</div>
<img src="http://adminnet.anandtech.com/iblog/ITgeneral/Liz/vApusFOS.png" width="550" height="341" alt="" />
<div><br />
</div>
          ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Tue, 17 Nov 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>Choosing the right foundation: which hypervisor do you evaluate? </title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=653</link>
			<description><![CDATA[ First of all, we were pretty excited to see so many comments and votes (5000!) on <a href="http://it.anandtech.com/weblog/showpost.aspx?i=649">our last IT poll</a>. It is good to see that professional IT is so much alive at Anandtech.com. So yes, we should have updated this blog quicker, to keep the momentum going. The reason why this update comes rather late is -once again - that we are working on <strong>the much delayed hypervisor comparison</strong>. Hundreds of tests have already been done, but we have added more tests to check important I/O performance factors such as VMDq and iSCSI performance.
<div>&nbsp;</div>
<div>And of course, the virtualization market is evolving fast. There is a new kid on the block: KVM. Two of the three most important Linux vendors, Red Hat and Canonical, have ripped Xen out of their distributions in favor of KVM. KVM has an interesting philosophy: it simply <a href="http://www.linux-kvm.org/page/Main_Page">adds two kernel modules</a> to the Linux kernel to turn the latter into a hypervisor. As a result, KVM can leverage the huge amount of Linux drivers and the Linux kernel improvements such as power management. Still, a virtualization solution needs to mature quite a bit before it is ready. And that is more than a cliche. Xen's support for Windows VMs was for example supposed to work at the beginning of 2007, as Xen introduced support for Hardware Virtual Machines at the end of 2006. But only around in the middle of 2008, we felt confident enough to say that Windows virtual machines work well on Xen. <a href="http://it.anandtech.com/weblog/showpost.aspx?i=467">We reported</a>:&nbsp; </div>
<div>&nbsp;</div>
<blockquote>
<div>"Xen 3.2.0 which can be found in the newest Novell SLES 10 SP2, is capable of running Windows 2003 R2 under heavy stress." <br />
</div>
</blockquote>
<div>So it took Xen several major revisions to really get it right. It is unlikely that KVM will do this much quicker. We will be giving KVM some heavy stresstesting so we can tell you more than just hearsay. <br />
</div>
<div>&nbsp;</div>
<div>In the mean time, a <a href="http://www.centrify.com/blogs/tomkemp/virtualization_market_survey.asp">new survey by Centrify</a> shows a still dominant VMware, but it also tell us that Hyper-V and Xen are making a lot of progress, growing strong enough to be dangerous opponents in the near future. I have been talking to tens of Small and Medium Enterprises (SME) in Belgium and the Netherlands. Our own tests show that VMware ESX is still the most robust hypervisor and most people concur. However VMware's half-hearted attempts to make <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3550&amp;p=4">vSphere</a> more attractive to the SME does not create&nbsp; a lot of enthousiasm. If VMware does not create a more budgetfriendly solution for SMEs (and VMware, newsflash: most SME <a href="http://www.vmware.com/products/vsphere/small-business/buy.html">have more than 3 servers</a>), we have the impression it may lose the server virtualization battle in the SME world, where everything is still possible. But those are my personal impressions. At the end of the day, what will happen in your working environment determines who will prevail. So let us know what you are planning... <br />
</div>
<div>&nbsp;</div>
<div> {poll 158:300}</div>
<div> {poll 159:300}</div>
<div> {poll 160:300}</div>
<div> {poll 161:300}</div>
<div><br />
</div> 
 ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Tue, 03 Nov 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>The basic "Server Building Block" for your virtual infrastructure</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=649</link>
			<description><![CDATA[ <p>If you read <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3653">our last article</a>, it is clear that when your applications are virtualized, you have a lot more options to choose from in order to build your server infrastructure . Let us know how you would build up your "dynamic datacenter" and why!</p>

{poll 157:440}
             ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Wed, 07 Oct 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>Intel talking about the 16-thread RISC killer </title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=604</link>
			<description><![CDATA[ <div>Take two Nehalem dies, turn them&nbsp; 90 degrees, add a lot of system interface logic and 8 MB extra of L3-cache and you get - very oversimplified - the impressive Nehalem EX, alias "Beckton". The new Xeon MP is an impressive monster, just like it's <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3414">predecessor Dunnington</a>. Dunnington consisted of <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3414">1.9 Billion transistors</a>, the Xeon MP based on the "Nehalem" architecture will feature up to 2.3 Billion transistors. <br />
</div>
<div>&nbsp;</div>
<img alt="" src="http://images.anandtech.com/iblog/exdie.png" width="550" height="250" />
<div>&nbsp;</div>
<div>Those 2.3 Bilion transistors are needed for&nbsp; <br />
</div>
<div>
<ul>
    <li>Up to eight cores, 16 threads thanks to SMT</li>
    <li>Up to 24MB of shared L3 cache</li>
    <li>four QuickPath links</li>
    <li>four memory channels which support for up to 16 memory modules per socket&nbsp;</li>
</ul>
<div>Intel calls the chips to drive the DDR-3 modules "Scalable Memory Buffer" chips, which means that Intel figured out that it is best to move the power gobbling AMB chip from the FBDIMMs to the systemboard. As you need only one chip to drive several registered DDR-3 modules, it consumes a lot less power than placing an AMB chip on each DIMM. </div>
<div>&nbsp;</div>
<div>&nbsp;<img alt="" src="http://images.anandtech.com/iblog/quadsocket.png" width="550" height="453" /></div>
<div>&nbsp;</div>
<div>&nbsp;</div>
<div>In the second of half of this year, Intel will have a IBM Power 6 killer and a server platform to match. The irony is that when it comes to "Intel Scalable Memory Buffers", IBM has the right to say "what to took you so long to figure out that FB-DIMMs were a pretty bad idea?" Back in 2005, IBM's X3 chipset already featured a solution that allowed large memory capacities with lower latency and much lower power consumption than FBDIMMs. <br />
</div>
<img alt="" src="http://images.anandtech.com/iblog/x3.png" width="443" height="506" />
<div>&nbsp;</div>
</div>
<div>It will be interesting to see what IBM's respons to the Nehalem EX will be, as Intel's first octal core is going to enter the last market where RISC CPUs still hold their ground: 8 sockets and more.There have been previous attempts, but this time it is for real:more than 15 8+ socket designs are being readied. More irony: IBM will probably design the servers with the highest socket counts which really give the Power servers a run for their money... <br />
</div>
<div>&nbsp;</div>
<div>As Intel gave its octal core CPU RAS features (MCA) that once belonged to the RISC and Itanium families only, it seems that the last stronghold of the non-x86 servers is going to fall..."mainframe slowly"&nbsp; but steadily. Only the Ultrasparc T2 with its radically different architecture may survive this assault. <br />
</div>
<div>&nbsp;</div>
<div>The Machine Check Architecture is of course ultra important for the future Xeon MP systems. Even a quad socket system will contain 32 cores and probably up to 512 GB of RAM. That kind of machine simply cries out for large databases and virtualization consolidation. In the latter case, MCA should allow hypervisors such as ESX to overcome critical errors in one of the VMs, instead of shutting down tens of VMs.&nbsp;</div>
<div>&nbsp;</div>
<div>In a different note, Intel claims that by August 2009 50% of it's DP server processors sold&nbsp; will be "Nehalem" based. So even though AMD is executing very well and introducing the hex-core "Istanbul" soon, it is not a minute too soon as the Opterons are under heavy attack. <br />
</div>
<div>&nbsp;</div>
<div><strong>Update:&nbsp;</strong> Anand also talked about Nehalem EX <a href="http://www.anandtech.com/weblog/showpost.aspx?i=603">in his lab update here</a>. <br />
</div>
<div>&nbsp;</div> ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Wed, 27 May 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>quick update from the "Professional IT" lab</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=599</link>
			<description><![CDATA[ <div> We promised you a new datapoint, <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3560&amp;p=5">a new independent virtualization benchmark in "a few days"</a>. Those "few days" have become a week in good "IT at Anandtech" tradition. :-) But this wednesday, unless Murphy strikes us hard, the article will be online. It will offer a refreshing look at the virtualization performance, the result of months of work.&nbsp; Liz will follow up quickly with a "performance optimization for virtualization" article. <br />
</div>
<div><br />
</div>
<div>Until then, we have updated two articles. We told you in <a href="http://it.anandtech.com/weblog/showpost.aspx?i=563">one of our "Intel Nehalem vs AMD Istanbul"</a> blogs, that you will have to wait for ESX 4.0 for EPT support. However, we found that "forcing hardware VMMU" (= EPT) improves performance tangible, so we wrote that ESX 3.5 update 4 has support for EPT. That is not true, at least not officially. EPT is only officially supported on ESX 4.0 (the hypervisor of <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3550">vSphere</a> 4.0).&nbsp; Check <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3560&amp;p=3">out the updates</a> that we did <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3560">to the last article</a>, as it clarifies some of the VMmark benchmarking. Our thanks goes to Scott Drummonds of VMware for the excellent info. <br />
</div>
<div>&nbsp;</div>
<div>The last update can be found in our "<a href="http://it.anandtech.com/IT/showdoc.aspx?i=3536">The Best Server CPUs part 2"</a> article. We solved the problems with our Shanghai "exchange" server and managed to get some Opteron numbers. The newest quadcore "Shanghai" opterons are clock for clock as fast as the quadcore "Harpertown Xeons. In other words, Microsoft exchange runs faster on the Xeons 54xx thanks to their clockspeed advantage, and the Xeon 55XX is still by far the MS Exchange champion. You can <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3536&amp;p=10">find the benchmarks here</a>.So expect a lot of new content soon... New CPUs, new servers, new storage. The second part of May and June should be fun. <br />
</div>
<div>&nbsp;</div>
<div>&nbsp;<br />
</div>
<div>&nbsp;</div>
<div>&nbsp;</div> ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Tue, 19 May 2009 00:00:00 EDT</pubDate>
			
		</item>
		
		<item>
			<title>The million dollar question: how do you upgrade your datacenter</title>
			<link>http://it.anandtech.com/weblog/showpost.aspx?i=585</link>
			<description><![CDATA[ <div>
</div>
<div>In <a href="http://it.anandtech.com/IT/showdoc.aspx?i=3536 ">our last article about server CPUs</a>, I wrote:&nbsp;</div>
<div>&nbsp;</div>
<blockquote>
<div><em>"the challenge for AMD and Intel is to convince the rest of the market - that is 95% or so - that the new platforms provide a compelling ROI (Return On Investment). The most productive or intensively used servers in general get replaced every 3 to 5 years. Based on Intel's own inquiries, Intel estimates that the current installed base consists of 40% dual-core CPU servers and 40% servers with single-core CPUs."</em></div>
<div>&nbsp;</div>
</blockquote>
<div>At the end of <a href="http://download.intel.com/pressroom/kits/xeon/5500series/pdf/2009_03_Xeon5500Launch-Gelsinger.pdf ">the presentation of Pat Gelsinger</a> (Intel) makes the point that replacing nine servers based on the old single core Xeons with one Xeon X5570 based server will result in a quick payback. Your lower energy bill will pay back&nbsp; your investment back in 8 months according to Intel.</div>
<div>&nbsp;</div>
<div>Why these calculations are quite optimistic is beyond the scope of this blogpost, but suffice to say that Specjbb is a pretty bad benchmark to perform ROI calculations (it can be "inflated" too easiliy) and that Intel did not consider the amount of work it takes to install and configure those servers. However, Intel does have a point that replacing the old power hungry Xeons (irony...) will deliver a good return on investment. <br />
</div>
<div>&nbsp;</div>
<div>In contrast, <a href="http://blogs.amd.com/work/2009/02/11/a-socket-full-of-growth/">John Fruehe (AMD) is pointing out</a> that you could <strong>upgrade</strong> dualcore Opteron based servers (the ones with four numbers in their modelnumbers and DDR-2) with hex-core AMD "Istanbul" CPUs. I must say that I encountered few companies who would actually bother upgrading CPUs, but his arguments make some sense as the CPU will still use the same kind of memory: DDR-2. As long as your motherboard supports it, you might just as well upgrade the BIOS, pull out your server, replace the 1 GB DIMMs with 4 GB DIMMs and replace the dual cores with hex-cores instead of replacing everything. It seems more cost effective than redo the cabling, reconfigure a new server and so on...<br />
</div>
<div>&nbsp;</div>
<div>There were two reasons why few professional IT people bothered with CPU upgrades:<br />
</div>
<div>
<ol>
     <li><strong>You could only upgrade to a slightly faster CPU</strong>. Upgrading a CPU to a higher clocked, but similar CPU rarely gave any decent performance increase that was worth the time. For example, the Opteron was launched at 1.8 GHz, and most servers you could buy at the end of 2003 were not upgradeable beyond 2.4 GHz. <br />
     </li>
     <li><strong>You could not make use of more CPU performance.</strong> With the exception of the HPC people, higher CPU performance rarely delivered anything more than even lower CPU percentage usage. So why bother?<br />
     </li>
</ol>
</div>
<div> AMD has also a point that both things have changed. The first reason may not be valid anymore if hex-cores do indeed work in a dualcore motherboard. The second reason is no longer valid as virtualization allows you to use the extra CPU horse power to consolidate more virtual servers on one physical machine. On the condition of course that the older server allows you to replace those old 1 GB DIMMs with a lot of 4 GB ones. I checked for example the HP DL585G2 and it does allow up to 128 GB of DDR-2. <br />
</div>
<div>&nbsp;</div>
<div>So what is your opinion? Will replacing CPUs and adding memory to extend the lifetime of servers become more common? Or should we stick to replacing servers anyway? <br />
</div>
<div>&nbsp;</div>
<div>{poll 124:400}
</div> 
 ]]></description>
			<author>anand@anandtech.com (Anand Lal Shimpi)</author>
			<category></category>
			<pubDate>Tue, 07 Apr 2009 00:00:00 EDT</pubDate>
			
		</item>
		
	</channel>
</rss>

