• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar

Yellow Bricks

by Duncan Epping

  • Home
  • ESXTOP
  • Stickers/Shirts
  • Privacy Policy
  • About
  • Show Search
Hide Search

datrium

Datrium DVX goes GA

Duncan Epping · Jan 29, 2016 ·

I wrote an article about Datrium DVX when Datrium came out of stealth about 6 months ago. I am not going to rehash that as I think it was pretty clear around what Datrium DVX is. Datrium DVX is a combination of “host local data services” and network accessed storage capacity. This basically means that you utilize local flash resources for read caching (deduped before stored on the flash tier), writes will go directly to the “NetShelf” as Datrium calls it, which is basically your persistent layer. The write is compressed by the way before it is send to the NetShelf, and on theNetShelf itself there is a also a dedupe process that runs periodically to “collect garbage” and optimize the system for capacity.

I had a chat with the team from Datrium and we spoke about what was new with GA and stuff that I hadn’t covered yet in my previous article. Some of the facts from that call:

  • The DVX list price is $125,000. This includes a NetShelf D12X4 appliance and unlimited licenses for DVX Hyperdriver software on vSphere hosts
    • Servers and SSDs must be purchased and supported independently
    • There currently is a limit of 32 hosts connected to a NetShelf
    • With 32 hosts the expected aggregate is 1m IOPS
  • The NetShelf D12X4 appliance comes with 48TB raw, or 60TB – 180TB effective capacity at common deduplication and compression rates for Enterprise data (2x-6x)
    • Expected effective capacity on average is around 125TB
  • The NetShelf comes in an HA configuration with 2 controllers and mirrored NVRAM, no single point of failure
  • Datrium DVX takes up to 20% of the host CPU resources, but this is the most it will take and only when running at max speed, so typically CPU resources used will be much lower
  • Has “data locality”, meaning that when a VM is migrated between hosts than the cache will be read from the “source” host to the “destination” so that the “destination” warms up

I guess that says it all. Well there’s a bit around support for hypervisors and things like VVols and VAAI. Today Datrium only supports vSphere, and it wasn’t clear if and when other hypervisors would be supported. When it comes to VAAI it was mentioned that VAAI clones are fully supported as it stands today. On top of that Datrium is also working hard on enabling a VVol capable solution, they have joined the VVol program and are looking forward to release their VVol capable software, I am sure that when they do I will talk to them again.

A couple of things are missing in todays offering, which would definitely be nice to have, or in some cases even a must. Native replication and array based snapshots for instance. Today Datrium partners with Zerto, and of course VMware customers can use vSphere Replication for free. Snapshots rely on vSphere today, but there are plans to have them natively on the array at some point. Also, when you run out of capacity and you add a NetShelf, this will become a new mountpoint, it would be nice if this was scale-out, and the Datrium folks agreed but didn’t commit to anything (of course). Overall I think it is an interesting solution, and with 30 install sites for the beta I think it is safe to say that they will be a player to consider, but only time will tell.

Thanks guys for taking the time to brief me.

Datrium finally out of stealth… Welcome Datrium DVX!

Duncan Epping · Jul 28, 2015 ·

Before I get started, I have not been briefed by Datrium so I am also still learning as I type this and it is purely based on the somewhat limited info on their website. Datrium’s name has been in the press a couple of times as it was the company that was often associated with Diane Greene. The rumours back then were that Diane Greene was the founder and was going to take on EMC, that was just a rumour as Diane Greene is actually an investor in Datrium. Not just her of course, Datrium is also backed by NEA (Venture Capitalist) and various other well known people like Ed Bugnion, Mendel Rosenblum, Frank Slootman and Kai Li. Yes, a big buy in from some of the original VMware founders. Knowing that two of the Datrium founders (Boris Weissman and Ganesh Venkitachalam) are former VMware Principal Engineers (and old-timers) that makes sense. (Source) This morning a tweet was send out, and it seems today they are officially out of stealth.

As the sun rises this morning, so does a new dominant #datastorage player #Datrium #stealthmode

— Datrium Storage (@DatriumStorage) July 28, 2015

So what is Datrium about? Well Datrium delives a new type of storage system which they call DVX. Datrium DVX is a hybrid solution comprised of host local data services and a network accessed capacity shelf called “netshelf”. I think this quote from their website says it all what their intention is… Move all functionality to the host and let the “shelf” just take care of storing bits. I included a diagram that I found on their website as it makes it more clear.

On the host, DiESL manages in-use data in massive deduplicated and compressed caches on BYO (bring your own) commodity SSDs locally, so reads don’t need a network hop. Hosts operate locally, not as a pool with other hosts.

datrium dvx

It seems that from a host perspective the data services (caching, compression, raid, cloning etc) are implemented through the installation of a VIB. So not VM/Appliance based but rather kernel based. The NetShelf is accessible via 10GbE and Datrium uses a proprietary protocol to connect to it. From a host side (ESXi) they connect locally over NFS, which means they have implemented an NFS Server within the host. The NFS connection is also terminated within the host and they included their own protocol/driver on the host to be able to connect to the NetShelf. It is a bit of an awkward architecture, or better said … at first it is difficult to wrap your head around it. This is the reason I used the word “hybrid” but maybe I should have used unique. Hybrid, not because of the mixture of flash and HDD but rather because it is a hybrid of hyper-converged / host local caching and more traditional storage but done in a truly unique way. What does that look like? Something like this I guess:

datrium dvx

So what does this look like from a storage perspective? Well each NetShelf will come with 29TB of usable capacity. Expected deduplication and compression rate for enterprise companies is between 2-6x which means you will have between 58TB and 175TB to your disposal. In order to ensure your data is high available the NetShelf is a dual controller setup with dual port drives (Which means the drives are connected to both controllers and used in an “active/standby” fashion). Each controller has NVRAM which is used for write caching, and a write will be acknowledge to the VM when it has been written to the NVRAM of both controllers. In other words, if a controller fails there should be no data loss.

Talking about availability, what if a host fails? If I read their website correctly then there is no write caching from a host point of view as it is states that each host operates independently from a caching point of view (no mirroring of writes to other hosts). This also means that all the data services need to be inline –> dedupe / compress / raid. When those actions complete the result will be stored on the NetShelf and then it is accessible by other hosts when needed. It makes me wonder what happens when DRS is enabled and a VM is migrated from one host to another. Will the read cache migrate with it to the other host? And what about very write intensive workloads, how will those perform when all data services are inline? What kind of overhead can/will it have on the host? How will it scale out? What if I need more than 1 Netshelf? Those are some of the questions that popup immediately. Considering the brain-power within Datrium I am assuming they have a simple answer to those questions… (Former VMware, Data Domain, NetApp, EMC etc) I will try to ask them these questions at VMworld or during a briefing and write a follow up.

From an operational aspect it is an interesting solution as it should lower the effort involved with managing storage almost to zero. There is the NFS connection and you have your VMs and VMDKS at the front end, at the back-end you have a blackbox or better said a shelf dedicated to storing bits. This should be dead easy to manage and deploy. It shouldn’t require a dedicated storage administrator but the VMware admin should be able to manage it. Some of you may ask, well what if I want to connect anything other than a VMware host to it? For now Datrium appears to be mainly targeting VMware environments (which makes sense considering their dna) but I guess they could implement this for various platforms in a similar fashion.

Again, I was not briefed by Datrium and I accidentally saw their tweet this morning but their solution is so intriguing I figured I would share it anyway. Hope it was useful.

Interested? More info here:

  • Datasheet – http://www.datrium.com/datasheet/DVX_DataSheet.pdf
  • Host side implementation info – http://www.datrium.com/dvx-overview/diesl-software/
  • DVX Netshelf – http://www.datrium.com/dvx-overview/datrium-netshelf/
  • Twitter: http://www.twitter.com/datriumstorage

Primary Sidebar

About the author

Duncan Epping is a Chief Technologist in the Office of CTO of the Cloud Platform BU at VMware. He is a VCDX (# 007), the author of the "vSAN Deep Dive", the “vSphere Clustering Technical Deep Dive” series, and the host of the "Unexplored Territory" podcast.

Upcoming Events

Feb 9th – Irish VMUG
Feb 23rd – Swiss VMUG
March 7th – Dutch VMUG
May 24th – VMUG Poland
June 1st – VMUG Belgium

Recommended Reads

Sponsors

Want to support Yellow-Bricks? Buy an advert!

Advertisements

Copyright Yellow-Bricks.com © 2023 · Log in