DSstar Logo Providing News & Information For Data Intensive
Storage Solutions For The Enterprise

  |  Table of Contents  |  

Features - Enterprise Data Insights:

A RADICAL VIEW ON VIRTUALIZATION
By Dick Benton, Senior Consultant, GlassHouse Technologies

Not so long ago, nearly every industry publication carried an article or two in each issue on virtualization's promise. What is the current state of the art and what problem does this technology really solve?

The first challenge is one of definition. The word "virtualization" has had its meaning defined and redefined by various vendors. A definition not so far from common acceptance describes virtualization provides an ability to view, allocate, access, manage and protect storage without needing to understand or know of the underlying physical storage structure. The goal of virtualization appears to be to treat storage as a utility, just like the electric or phone utility. We don't need to know about, nor understand the electrical grid to use an appliance. We just plug it in. We don't need to know how the telephone companies interconnect worldwide; we just pick up a phone and use it. With virtualization of storage, the theory is that we don't need to know where the data is nor do we need to know on what it resides, we simply consume it as required.

Certainly, the ability to view enterprise storage and its utilization is of significant benefit in understanding the needs of the business. Many virtualization solutions provide this excellent capability. The problem comes when allocation is attempted without cognizance of the underlying storage technology. Imagine an Oracle database supporting SAP. Tables need to be expanded or space made available for new tables, so we are lead to believe that virtualization can be used to allocate additional storage from the available pool.

Alas, the virtualized storage pool, by definition, includes all storage devices in the enterprise and de facto these will include devices with widely diverse metrics on performance, availability, and reliability. In our example above, the database table may conceivably end up on a large disk or a small disk, in a Raid 5 array or on mirrored disk, on a high availability box or on a low end device, on a fibre fabric or an Ethernet fabric, on a block level controller or a file level controller. Do we really want our Oracle database spread out over EMC Symmetrix, IBM's FastT and a bunch of direct attached raid devices on multiple NT file servers?

Does it make sense to utilize such a diverse range of storage classes? Perhaps so if we are simply looking for some available scratch space. Or perhaps if we are looking for additional (end user unique) file space for office productivity applications. But how can this possibly be a solution for high-end enterprise applications, for database structures, for mission critical applications that require a very specific class of service allocation?

De facto, a SAN storage device is a virtualized storage pool (available to fabric-attached servers and their applications). A NAS device is also, de facto, virtualized storage serving its LAN-attached clientele. Expanding the storage pool in this environment is done simply by adding additional storage to existing arrays. What is the real benefit of removing the "box boundary" that currently limits SAN and NAS virtualization to a single frame?

The virtualization promise of ease of allocation and deployment again fails the test when we consider the wildly disparate performance characteristics of the various components of a virtualized storage pool. While it may be possible to allocate LUNS at a logical level, we still face the issue that this is an inherently invalid approach given the need to match storage characteristics (classes of storage) to application requirements.

The virtualization promise of ease of management also founders on the proprietary nature and very unique interaction of each vendor's storage frame management software. Add to this the additional complexities of fabric zone management and the promise of virtualization seems a long way off in the future.

Expanding the virtualization of storage beyond the frame boundaries of existing SAN and NAS devices appears to add very little real functionality and this at the expense of placing intrusive and immature software/firmware/hardware components into mission critical environments. In band or out of band, NT backplane, Unix or Linux, virtualization inevitably adds complexity and additional failure points into a carefully balanced environment of mission critical applications.

Finally, an enterprise virtual solution should, by definition, be a heterogeneous solution capable of utilizing storage no matter what it is connected to. This should include mainframe storage. Most of today's virtualization products are limited substantially to the open systems side, thus making inaccessible perhaps the largest and most highly available mass storage in the enterprise.

A significant benefit delivered by current virtualization products is their ability to provide a replication capability and sometimes a snapshot capability too. This can be advantageous to those organizations where the allocated class of service would not normally provide this level of functionality and sophistication.

Nonetheless, I believe the prudent IT executive will contain virtualization solutions to a File and Print NT environment where they may indeed provide some benefit in optimizing direct attached storage across a number of file servers. Even here, this solution will soon be made obsolete by the imminent arrival of iSCSI, with TCP encapsulation on NICs and the use of NAS storage devices. This solution will be a far more elegant and a more appropriate "open systems" solution than current proprietary virtualization solutions.

Much has been written about the virtues of in-band solutions (virtualization engines in the data path itself) versus out of band solutions (virtualization engines outside the data path); however the real issue is what happens when the virtualization engine is removed from the environment. Will hosts still be able to access their data? Until recently, many virtualization products operated in such a manner that, should they be removed suddenly from the environment, hosts would not be able to find their data. This additional failure point was often of itself sufficient concern to militate against use of virtualization.

More recently, some virtualization engines can indeed be turned off. Such engines can provide the ability to look at and perhaps even manage the storage environment through an abstracted layer that removes the complexities of heterogeneous management from tedious day-to-day management and metrics gathering. Nonetheless, vendor-specific complexities, lack of industry implemented standards at this time, and a drive to "own" the storage environment are leading individual storage vendors to develop heterogeneous management platforms based on an API philosophy. (Application Program interface to competitors storage frames). While this may be less elegant than a virtualized management solution, it has the giant advantage of being supported by the storage vendor itself.

Subject to publication of industry standards and the embracing of those standards by mainstream vendors, it is the ability of virtualization to enhance the management function, rather than its original premise of abstracting physical data from the application, that we may see as the key driver of the evolution of virtualization.

Should this scenario come to pass, the industry may well benefit from a number of small startups offering "faster, better, cheaper" solutions to managing the complexities of the storage environment.

About The Author

Dick Benton is a senior consultant with GlassHouse Technologies Inc. He works with clients to analyze their needs and the options available for cost effective storage and disaster recovery infrastructures. Dick focuses on pragmatic solutions, leveraging existing infrastructure where possible, to develop solutions that are appropriate for the particular business requirements and phased to suit organizational budgets. He holds a M.Ed. from Cambridge College and is in the process of being certified by the Disaster Recovery Institute in business continuity planning.


Top of Page


  |  Table of Contents  |