Primary Storage -- The Origin of a Life Cycle of Storage Challenges
The most complex and costly storage challenges are entirely related to storing and managing data over a life cycle that must be longer than reasonable on rapidly evolving digital systems. CES will have resounding and lasting impact on the practice of storage because that life cycle begins when data is created on primary storage. The integration of cloud and primary storage systems for the first time enables primary storage to deliver features and data management well beyond physically limited devices, systems or data centers. Such storage can eradicate or at least massively reduce the need to move and manipulate stored data.
The lifecycle of data and information management within the business is tremendously long, complex and costly because data must be moved and manipulated. Such a life cycle, which starts with primary storage systems, can involve many different specialized, siloed storage systems and technologies, and create entire stand-alone domains of IT practice within the business (e.g., backup, archive and disaster recovery). Within each of these practices and storage silos, the cloud has so far delivered innovative possibilities for reducing complexity and cost of ownership. But each of these practices, even with new technologies, remains an isolated silo and each carries enormous cost. This makes the life cycle of data management tremendously expensive (see Figure 3).
Pairing the cloud with this primary source of data creation presents the opportunity to touch nearly every piece of business data at the very point in time when it is created, and this can put an end to these separate practices. The consequent cost savings will drive cloud integration to the forestage of the primary storage market. What complex data problems and practices could such a solution take on? A few examples include:
- Retention: Data could be captured for archive and compliance as soon as it is created, and then transferred into the cloud when the time is right, without requiring separate applications, storage appliances, and IT processes. Meanwhile, all data was kept visible and accessible in one consolidated point of access.
- Protection: Copies of data for protection -– snapshots and clones -– could be migrated into the cloud over time, and transform high speed primary protection. By leveraging the ability to automatically place snapshots on-premise, in the cloud or both, a CES array could deliver long-term, low cost, globally dispersed, and on-premise accessible backup that does away with the need for traditional backup and offsite tape. Moreover, based on the solutions available today, such a solution could perform at vastly faster speeds than traditional onsite and offsite backup, irrespective of whether data is recovered from on-premise copies or off-premise copies.
- Disaster: Meanwhile, primary data could still be synchronized with the cloud periodically to create offsite replicas that could, if done right, forever replace complex disaster recovery technologies, dispense with tape and physical transport, and provide a master repository from which recovery data could be accessed, from anywhere. This could unlock highly virtualized, hosted DR on-demand, and do away with wasted standby conventional infrastructure.
These are not small challenges to tackle, but the potential improvements we've highlighted in these examples can create a massive transformation in the practice and costs of data storage and management and reduce a surfeit of technologies to one single data storage and management array.
Core Requirements -- Cloud-Integrated Enterprise Storage
While there is vast room for considerable innovation and differentiation, we can boil the fundamental requirements for delivering CES down to a few capabilities. While each may seem simple or common place on its own, it is the synergistic integration of these capabilities that will create an enormous breakthrough in how we store data and deliver a comprehensive set of features such as those in Figure 4.
Built for Primary Storage.
CES must be engineered for primary storage performance and availability. As usual, primary storage must include the best clustered, active-active controllers designed for non-stop operation, enterprise components, and redundancy throughout. Moreover, today performance is a hotbed of innovation, and customers are steadily coming to expect uniquely integrated and always optimized high IO media (solid state) alongside exceptional approaches to caching that can constantly deliver maximized performance but simultaneously auto-tier data to rotational disk to minimize the cost of capacity.
Built with Primary Storage Features.
Just as importantly, CES requires state of the art primary storage features, that are entirely array-integrated. Such features include internal array virtualization that reduces complexity and harnesses all disks into a single pool of easy to use storage, thin-provisioning, a full set of snapshot and cloning technology, flexible replication, and enterprise caliber management and visibility. There can be absolutely no compromise in the performance and use of these features. First-generation cloud-attached storage devices were too often dependent upon a cloud connection for the use of their features; such systems are not ready to delivery primary storage that must be always-on.
Innovative beyond the Array.
Finally, delivering CES is more than simply turning to the cloud as a parking lot for old data. The cloud requires unique integration, and vendors that fail to innovate may serve up additional capacity, but they will fall far short of enhancing and extending an array to tackle complex lifecycle problems. Innovative integration will reach beyond capacity expansion and provide effortless, always-on backup, seamless offsite data protection, instant DR from anywhere, cloud provider lock-in elimination, and total performance and capacity optimization irrespective of where data is located. The combination of those features will enable a single physical device to reach far beyond physical boundaries and handle all data storage lifecycle needs.
CES will be the most impactful storage technology to emerge from the cloud. It stands to eliminate the most complex storage tasks in the data center, without any more infrastructure alteration than the replacement or addition of a primary storage array. Such potential can yield a massive transformation in the total cost of ownership for storage, and it will foment rapid innovation in this market. But a vendor's attention to integrating both on-premise and off-premise data, and using the combination to address a lifecycle of data management pain points, will differentiate vendors over the long run.
Only a few vendors have broken through in delivering what we consider true CES, although there are several who have begun steadfastly moving in the direction of CES. Among major vendors, HP's 3PAR deserves credit for taking first steps here. Some time ago the 3PAR team began crafting innovative cloud partnerships under the banner of their CloudAgile program. Hitachi has more recently moved toward CES by linking their BlueArc (NAS) and Panasas (scale-out file storage) acquisitions to offsite, Hitachi Content Platform storage. But for major vendors, much work and opportunity remains. In each case these traditional legacy vendors have taken the first step by integrating their primary storage with their own "private cloud." We expect the next step will be to integrate with the pubic cloud. But political considerations remain.
Among new innovators, there are many cloud-attached appliances serving up various incarnations of product similar to CES. Vendors include Avere, CTERA, Nasuni, Panzura, Riverbed, TwinStrata, Nasuni, StorSimple, TwinStrata and others. At their forefront today, is clearly StorSimple, and they are the only one yet delivering true CES for core business data. Many vendors still fall short in xyz dimensions (HA, clustering, snapshots that can function locally as well as deliver protection in the cloud, etc. etc.). Many of these vendors are trying to move up the "enterprise primary storage" flagpole, and we expect each to either continue reaching for more primary capabilities as they innovate around cloud attachment, or else rapidly choose a more selective path that targets single storage challenges.
Such solutions will no doubt continue to deliver significant value, but if selecting a vendor on the promise of transforming the entire lifecycle of data storage, pay close attention to their current product and the practicality of their promised roadmap. Cloud-integrated Enterprise Storage stands to be transformative across all data storage practices, and will alter the cost equation of enterprise storage. The question is when, not if, it will someday cross the threshold of your business place.