At the Italian VMUG I was part of the “Expert panel” at the end of the event. One of the questions was around innovation in the world of IT, what should be next. I knew immediately what I was going to answer: backup/recovery >> data copy management. My key reason for it being is that we haven’t seen much innovation in this space.
And yes before some of my community friends will go nuts and point at Veeam and some of the great stuff they have introduced over the last 10 years, I am talking more broadly here. Many of my customers are still using the same backup solution they used 10-15 years ago, yes it is a different version probably, but all the same concepts apply. Well maybe tapes have been replaced by virtual tape libraries stored on a disk system somewhere, but that is about it. The world of backup/recovery hasn’t evolved really.
Over the last years though we’ve been seeing a shift in the industry. This shift started with companies like Veeam and then continued with companies like Actifio, and this is now accelerated by companies like Cohesity and Rubrik. What is different from what these guys offer versus the more traditional backup solution… well the difference is that all of these are more than backup solutions, they don’t focus on a single use case. They “simply” took a step back and looked at what kind of solutions are using your data today, who is using it, how and of course what for. On top of that, where the data is stored is also a critical part of it of the puzzle.
In my mind Rubrik and Cohesity are leading the pack when it comes to this new wave of, they’ve developed a solution which is a convergence of different products (Backup / Scale-out storage / Analytics / etc). I used “convergence” on purpose, as this is what it is to me “converged data (copy) management”. Although not all use cases may have reached the full potential yet, the vision is pretty clear, and multiple layers have already converged, even if we would just consider backup and scale-out storage. I said pretty clear as the various startups have taken different messaging approaches. This is something that became obvious during the last Storage Field Day where Cohesity presented. Different story than for instance Rubrik had during Virtualization Field Day. Just as an example, Rubrik typical;y leads with data protection and management, where Cohesity’s messaging appears to be more around being a “secondary storage platform”. This in the case of Cohesity lead to discussions (during SFD) around what secondary storage is, how you get data on the platform and finally then what you can do with it.
To me, and the folks at these startups may have completely different ideas around this, there are a couple of use cases which stand out for a converged data management platform, use cases which I would expect to be the first target and I will explain why in a second.
- Backup and Recovery (long retention capabilities)
- Disaster Recovery using point in time snapshots/replication (relatively short retention capabilities and low RPO)
Why are these the two use cases to go after first? Well it is the easiest way to suck data in to your system and make your system sticky. It is also the market where innovation is needed, on top of that you need to have the data in your system first before you can do anything with it. Before some of the other use cases start to make sense like “data analytics”, or creating clones for “test / dev” purposes, or spinning up DR instances whether that is in your remote site or somewhere in the cloud.
The first use case (backup and recovery) is something which all of them are targeting, the second one not so much at this point. In my opinion a shame, as it could definitely be very compelling for customers to have these two data availability concepts combined. Especially when some form of integration with an orchestration layer can be included (think Site Recovery Manager here) and protection of workloads is enabled through policy. Policy in this case allowing you to specify SLA for data recovery in the form of recovery point, recovery time and retention. And then when needed, you as a customer have the choice of how you want to make your data available again: VM fail-over, VM recovery, live/instant recovery, file granular or application/database object level recovery and so on and so forth. Not just that, from that point on you should be capable of using your data for other use cases, the use cases I mentioned earlier like analytics, test/dev copies etc.
We aren’t there yet, better said we are far from there, but I do feel this is where we are headed towards… and some are closing in faster than others. I can’t wait for all of this to materialize and we start making those next steps and see what kind of new use cases can be made possible on converged data management platforms.
Michael White says
Hi Duncan, I agree with your thinking – I too know of people doing backup the same way today as 15 years ago. i would like to add that I think we will see more expectations around that data too over time. Our customers at DataGravity (where as you know I work) like knowing there is no PII type info in the backups. They like knowing that there is no ‘secrets’ in the backups. And our array helps them with that. So they can think differently about backup. But also, it means that they can think differently about their data in general, not just around backups but what is on their array and who has created / updated what for example. So I agree that backup is changing, but I also think a big thing related to that is how people think about backups and data. All changes for the better and all helpful to customers.
Michael
Duncan Epping says
Majority of these solutions will also be capable of telling you just that over time Michael. Analytics is one of their major use cases….
Michael White says
Quite right Duncan, that is why we continue to innovate and iterate.
Michael
Michael Troiano (@miketrap) says
Thanks, Duncan. I would point out that resiliency use cases are only half the story, though. What about the copies being produced for compliance purposes, or analytics, or as test data for new application development? Isn’t sound copy data management a critical enabler of the whole DevOps movement?
We would say yes. Having coined the term “copy data management” and been at this the longest, Actifio is the world’s first and ONLY enterprise class copy data management platform… meaning architected to cover all the applications enterprises care about (Oracle, SQL, VMware, Linux, SAP…,) through the full lifecycle of application data (production > deletion), across all infrastructure options (virtual/physical + data center/cloud), and all copy data use cases (resiliency + agility.)
We architected VDP to provide a comprehensive copy data virtualization solution, and spent the last 6 years building a portfolio of 65 patents to solve that problem at scale in the real world. The newer entrants you mentioned may in time become very good at a corner of this problem, but I don’t see how any can catch up to that breadth of capability.
Mike Troiano, Actifio CMO
Duncan Epping says
I am pretty sure I mention some of those additional service Mike in my article. Analytics, Test-Dev work flows etc.
I respect that you see your solution as leading, and I agree that you are definitely one of the leading in this space, my article may not have reflected that to that extend. Now, I also personally feel that what Cohesity and Rubrik do is the next step. It feels less disruptive to introduce, but maybe it is my relative unfamiliarity with your platform.
Kyle Hailey says
Hi Duncan,
Yes Cohesity and Rubrik are joining a tightening market along with Datos.io, Actifio and Delphix. Of these companies, Delphix was the first in the domain starting in 2008 and dominating the market with almost a third of the Fortune 100 using Delphix. Delphix is a leader in the Gartner magic quadrant on structured data archiving where as Actifio was a niche player. Delphix is also the only copy data solution that I know of to come integrated with data masking and a Challenger in Gartner’s magic quadrant for Masking. Delphix is partners with VMware and VMware is using Delphix for database backup in vCloud Air. Delphix is an SAP Endorsed Business Solution and again AFAIK the only solution that supports Sybase. Delphix concentrates on fully automated database support as databases are the most complex data to manage, but Delphix supports any kind of data.
On top of that, it’s the only solution in this domain, AFAIK, that you can download and run on your laptop for free:
http://datavirtualizer.com/delphix-express-installation/
Best
Kyle
Technical Evangelist Delphix
Duncan says
I am not sure I necessarily agree, Cohesity and Rubrik provide a scale-out storage at the same time, which also happens to be highly deduplicated and compressed while leveraging flash at the same time. As far as I know most other solutions you mention don’t offer that capability, that is why I talk about “converged” as it is combining what are usually different offerings in to one platform. Again, I could be wrong here as I am no expert on Delphix.
Don’t get me wrong though, I do really like the solution Delphix offers but it seems the primary focus is database backup, where as other solutions appear more versatile. In the end it all depends on what a customer is looking for I guess.