I could visit just one session on Tuesday so I decided I would attend a session which is on a topic that I love “Software Defined Storage”, aka vCloud Distributed Storage. This session was hosted by Christos Karamanolis and Kiran Madnani. Christos is one of the lead engineers on the project and Kiran is the product manager, both VMware employees.
Before I lay down what their session was about, note that this was a technology preview. No commitments have been made when or even IF this will ever be released… but as the demo looked so promising I figured it was worth sharing.
[Edit: Cormac Hogan published the demo, worth watching!]
Kiran started out with explaining the journey to Software Defined Storage. Awesome features he mentioned were Storage vMotion, Storage DRS, SIOC, Profile Driven Storage, vSphere Replication and more. Kieran made a statement which I can only agree with….
The world is changing, compute resource keep growing, data keeps on growing!
Data management is a challenge in today’s world. In many environments virtual machines are provisioned on tier 1 storage just to ensure requirements are met, while in many cases that is overkill. High storage costs, inconsistency and operational complexity are the direct results of that.
So if VMware had to solve this problem and create a new flexible / agile storage platform, what would the requirements be? Customers asked us: “Why don’t you virtualize local storage the same way you virtualized CPU and memory?”
These are the requirements Kiran mentioned:
- Converged platform (compute + storage using commodity hardware)
- Simple to manage just like CPU and memory
- Ability to scale on demand, in lock-step with application needs, but also in a flexible way
- Provide per-VM automated SLA management
Now Christos entered the stage. Christos being an engineer, I knew this was going to get more technical. Christos explained that he is working on what is now known as “VMware Distributed Storage” but also sometimes referred to as vSAN.
This new solution/product is a software based distributed storage solution which is built directly in the hypervisor. I guess this is also the main differentiator compared to competitors, it is not a virtual machine / appliance! Enabling it will be simple, it should be just two clicks. And it will be a property of a vSphere Cluster. When enabled a shared datastore is presented. This datastore grows as the cluster grows. This datastore is formed by using local storage (SSD / SATA / SAS) of the host.
Note that not all hosts need to be identical. Some hosts can have storage others might not. Christos demoed this during the presentation, he had a virtual machine running on a host without local storage while running on a “remote” VMware Distributed Storage datastore. This virtual machine was then vMotioned to a different host which also did not have local storage, but still had access to the VMware Distributed Storage shared datastore.
VMware Distributed Storage heavily relies on SSD for caching / performance… eventually though all writes will go to magnetic disks (SATA) eventually. For resiliency “objects” will be replicated across multiple hosts, how many hosts/disks will depend on the level of availability selected for the object.
This in my opinion one of the biggest advantage of a software defined storage solution, you can define an SLA per virtual machine. Yes you read that correctly, policies are on a virtual machine level and not on a datastore level. You need 99.99% uptime or 99.999% uptime, VMware Distributed Storage can do that for you on the same datastore. Not only can you specify availability but also IOPs (reservations and limits) or capacity. Note that VMware Distributed Storage doesn’t use a typical RAID solution, it uses JBOD (just a bunch of disks) and the software takes care of the distribution / replication of the data.
The one thing that stood out for me during this session was the demo. It was so simple, I wonder if people caught on to that. Enabling VMware Distributed Storage was only 2 or 3 clicks and that was it. Expanding the datastore was just a matter of adding a new host (or adding disks to a host for that matter).
I noted down the specs of the systems used for the demo. This is no guarantee of what will eventually be supported or part of the requirements but it gives a nice idea around what a VMware Distributed Storage environment could look like, I am guessing though you will want more local disks though.
- 7 ESX hosts, each with:
- Quad-Core Intel Xeon E5620 2.4GHz processor
- 24 GB DDR3 1333MHz memory module
- 10 Gb/s Dual-channel, 10GBase-CR, PCI-Express 2.0
- 4 ESX hosts have local storage:
- 1 x 500GB SAS 6.0Gb/s HDD
- 1 x 256GB SAS SSD
So now what would be the usecase for this? During the session 4 usecases were mentioned explicitly:
- Virtual desktops
- Scale out model, using predictive (performance etc) repeatable infrastructure blocks lowers costs and simplifies operations
- Test & Dev
- Avoids acquisition of expensive storage (lowers TCO), fast time to provision
- Big Data
- Scale out model with high bandwidth capabilities
- Disaster recovery target
- Cheap DR solution, enabled through a feature like vSphere Replication that allows to replicate to any storage platform
It is funny though, as I would have probably added cloud on top of that… and potentially just production server workloads. Kiran ended summarizing vSAN / VMware Distributed Storage as:
- Software-driven – use industry standard hardware
- Flexible – scale as needed, when needed
- Simple – ridiculously easy to manage
- Automated – per-VM policy-based management
The simplicity and the flexibility of this product is what stood out the most, I personally believe this has huge potential. This will be big!
A couple of articles I would like to refer to:
- Massimo Re’Ferre just posted his article about Distributed Storage, make sure to read it!
- Christos Karamanolis posted a great article on the CTO blog about the Distributed Storage project.
- William (Bill) Earl wrote a nice piece about storage futures for the CTO blog.
Frank says
thanks for blogging about this, hopefully this sessions runs in Barcelona
Duncan Epping says
https://vmworldeurope2012.activeevents.com/connect/sessionDetail.ww?SESSION_ID=2149
Tom says
Now that seems such an obvious move, there is so much wasted space around with local storage, yes you could configure local storage, yes you could use a VSA but the ability to share this without the need for an appliance and being built into the hypervisor is an excellent idea, I really hope this come to fruition.
Rob Bergin says
Isn’t this just Nutanix? Or Simplivity? or Scale?
Duncan Epping says
VMware doesn’t ship HW. This will be a software only offering. On top of that it is fully VM-aware and policies like availability and performance can be selected per VM.
Other than that, yes it is a virtual storage solution that uses local disks.
Tom says
logically yes, physically no, this will hopefully be Hypervisor based. rather than appliance based
Michael Ryom (@MichaelRyom) says
so how is the VM protected again disk failure ?
Duncan says
As mentioned by replicating objects to different locations.
paul says
Great feature.
This can make capacity management a lot easier.
It will make the storagearray vendors a bit nerveous.
I just Love where this is going , it can become big indeed.
Great read, thanks.
Mike Foley says
When it replicates objects to different locations, does it replicate to multiple locations? (kinda like Google just making copies of stuff all over the place) If it does to that then does the I/O from the VM get sent to these multiple locations and the first one to return satisfies the I/O?
That could be interesting for performance and backup…
mike
Fred Damstra says
I hope this becomes real, too, and is suitable for production workloads. Storage is the most expensive part of our deployment, when it should be one of the cheapest.
That said, EMC owns VMWare, and they have a vested interested in keeping the SAN alive.
J says
EMC owns 80% of VMware. VMware is a subsidiary of EMC.
ARRITDOR says
All I can say is – it’s about time!
From my experience, the simplification of infrastructure that VMware has brought to table has caused improved performance and availability in datacenters everywhere. Compute and network have been there since the beginning. Storage, not-so-much. I never understood why – I just blame EMC for wanting to sell more/bigger storage boxes(!).
I agree that this will be big and for many of the same reasons Nutanix and SimpliVity are getting a lot of attention right now.