September 16, 2008

VMware FT – The Top Four Reasons it’s Kinda Sorta Fault Tolerance

Posted by: Brian Mullins

Marathon’s team at VMworld got to sit in on the Tuesday morning session that covered VMware’s newly announced technology for fault tolerant VMs. While not bad for FT rookies, from what we saw, it’s a less-than-perfect solution for a lot of companies that want to run business critical and mission critical applications in VMs. Let’s look at the four primary reasons why.

1. No component-level fault tolerance. The most common failures that result in unplanned downtime are component failures such as storage, NIC or controller failures. Yet VMware Fault Tolerance doesn’t do anything to protect against I/O, storage or network failures. By not addressing these primary sources of failure, VMware appears to be saying that you, the customer, are on your own to figure out how to protect your storage and network connections. This may be okay for the very largest IT staffs in the world, but for the other 98%, it will not be sufficient.

2. Complexity on top of complexity. In order to use VMware Fault Tolerance, you’ll first have to install both VMware HA and DRS, no small feat in and of themselves. Then, because VMware FT requires NIC teaming, you’ll also have to manually install paired NICs. Then you’ll need to manually set up dual storage controllers (with the software to manage them) because VMware FT requires multi-pathing. And to top it all off, you’re required to use an expensive, and often complicated, SAN.

3. Limited CPU fault tolerance. With VMware FT, you’ll need to set up what VMware refers to as a “record/replay” capability on both a primary and a secondary server. If something happens to the primary server, the record is stored on the SAN and then restarted on the secondary server. Two things to point out here. First, the whole thing depends on the quality of the SAN. Second, in the words of the VMware engineer who presented at VMworld, “this can take a couple of seconds.” So what happens to your application state in those couple of seconds?

4. For VMware virtual environments only. VMware FT will only work in VMware environments. It won’t work with other hypervisors, and most importantly, you can’t use it for business critical and mission critical applications that you want to keep on physical server platforms (i.e., non-virtualized environments). Oh well, only the vast majority of critical applications run in physical environments anyway.

It’s great to see VMware recognizing the need for fault tolerance, but we’re puzzled why they decided not to address the biggest source of failures: component failure. And we wonder how many mid-market companies will be able to justify the cost and complexity of getting VMware FT set up and keeping it running.

If you had a chance to attend the session, what were your thoughts?

6 Responses to “VMware FT – The Top Four Reasons it’s Kinda Sorta Fault Tolerance”

  1. Mike DiPetrillo Says:

    I’m afraid you’ve missed several major points here and don’t understand basic VMware features. There’s too much to list here, so you can read the full blog response at http://mikedatl.typepad.com/mikedvirtualization/2008/09/marathon-and-vm.html.

  2. sjaak Says:

    Check this response:
    http://mikedatl.typepad.com/mikedvirtualization/2008/09/marathon-and-vm.html

  3. Gabrie van Zanten Says:

    Hi,
    I was wondering if you have ever installed a VMware Infrastructure or something like it? Because NIC failover, HBA failover, and shared storage are all things that are configured in VMware ESX by default. Just a few clicks to get it up and running.

    HA difficult?? Come on.

    Please consult someone who can demo a normal VMware installation for you so that you at least have some clue about what you’re writing about.

    Gabrie

  4. KW Says:

    Yes, this write-up is based entirely on a lack of understanding of the basic features VMware ESX has enjoyed since version 1. I’d suggest going and doing some research, or perhaps even implementing VMware, before you write such crap!

  5. Scott Bowe Says:

    What has been said in the comments above sums it up.

    VMware Virtual Infrastructure is very simple to set up, configure, and manage - and guess what, it’s becoming even simpler. In fact, I also believe VMware is planning to release Virtual Center as a “plug and play” type appliance in the not so far off future. On top of that, ESXi is OEM shipped on many pieces of HW: just turn it on, answer a few questions, and you are set.

    VMware in and of itself solves many of the issues you raise. HW failure? Great, move the VM to a new system.

    In the future, please gather and understand the facts before writing a disparaging opinion.

  6. Virtualization Master Says:

    Hi,

    I disagree with you, man. VMware FT is definitely a plus. I am not sure why you have to disgrace it in here. Check out the sneak preview & video of VMware FT at http://www.virtualizationteam.com/virtualization-vmware/vmware-esx-40-ft-fault-tolerant-sneak-peek.html
    which I am pretty sure will make you feel better about it. You also need to remember, as an earlier poster mentioned, that VMware FT is not the only protection feature VMware ESX is offering; it’s an enhancement to the many operational continuity features VMware is offering today.

    Best Regards,
    Virtualization Master
