Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory

White Paper
Cisco UCS Invicta OS:
Data Reduction for
Scalable Flash Memory
Arrays
What You Will Learn
Flexible and effective data reduction helps you get the best performance, capacity, and value from scalable flash memory arrays.
Deduplication and thin provisioning both offer ways to limit space consumption and extend longevity for flash memory devices.
Cisco UCS Invicta™ OS offers innovative, optional inline deduplication for Cisco UCS Invicta Series Solid State Systems. With
deduplication rates of up to 10:1, this technology can greatly reduce the cost of flash memory arrays, while extending their
useful lifetime. And Cisco® data reduction technology doesn’t negatively affect performance for those applications that don’t
benefit from deduplication.
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
Contents
Introduction......................................................................................................3
Data Reduction and Flash Technology..............................................................4
Unique Considerations for Flash Technology.........................................................4
Traditional Deduplication........................................................................................4
Cisco’s Unique Approach to Deduplication............................................................5
Cisco UCS Invicta Data Reduction Overview.....................................................6
Deduplication in Cisco UCS Invicta OS..................................................................6
Error Detection and Correction..............................................................................8
Thin Provisioning and Space Management............................................................8
Conclusion.......................................................................................................9
For More Information........................................................................................9
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Page 2
Cisco UCS Invicta OS:
Data Reduction for Scalable Flash
Memory Arrays
White Paper
February 2015
Introduction
Powered by Cisco UCS Invicta OS, Cisco UCS Invicta Series Solid State Systems
deliver powerful and easy-to-manage application acceleration for I/O- and
bandwidth-intensive workloads. Designed to work with the industry-leading Cisco
Unified Computing System™ (Cisco UCS®), the solution represents a unique and
scalable application of flash memory technology. Applications ranging from email
and databases to virtual desktop infrastructure (VDI) can all benefit from Cisco UCS
Invicta Series Solid State Systems.
Although flash memory technology can accelerate workloads, you need to use care
to apply effective data reduction techniques to make the most of flash capabilities:
• Capacity must be used effectively without needless waste to help ensure value.
• The longevity of flash devices must be enhanced with data reduction techniques.
• Data integrity must be preserved through effective error correction.
• Architecture must capitalize on the speed of flash memory without needlessly
encumbering it.
Cisco UCS Invicta OS was designed from the outset to implement effective data
reduction strategies in concert with the unique properties of flash memory. Unlike
with other solutions that mandate deduplication, however, organizations deploying
Cisco UCS Invicta Solid State Systems have a choice of assigning workloads to
either performance or deduplication nodes within a Cisco UCS Invicta Scaling
System. Alternatively, organizations can choose whether to focus an individual Cisco
UCS Invicta C3124SA Appliance on performance or deduplication according to
their application needs. Cisco’s deduplication technology also enables data integrity
through automatic error correction, and Cisco’s thin provisioning lets organizations
get the most from their flash memory investment.
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information. Page 3
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
Data Reduction and Flash Technology
Cisco UCS Invicta OS provides data reduction technologies that extend beyond
those typically provided for flash storage technology.
Unique Considerations for Flash Technology
Data reduction technologies are particularly important for flash-based arrays.
Beyond reducing the amount of storage required, data reduction ultimately results
in a lower number of write operations for the media, and correspondingly greater
longevity for flash devices.
• Deduplication: Through deduplication, only unique segments are written to
the media, and write requests for duplicate data result only in the storage of a
pointer. This approach eliminates data redundancy at the block level and reduces
the amount of data written to the disk. Physical storage capacity is effectively
increased, allowing commitment of more logically mapped data than is physically
available on the device. In addition, the lower number of write operations
increases the longevity of the flash media.
• Thin provisioning: Thin provisioning is a data reduction approach that provides
the logical space requested on the device using only the physical space that
is actually required. By not consuming requested unused space in a volume,
thin provisioning allows better utilization of flash media. Storage can be
overcommitted so that more logically mapped data can be committed than is
physically available on the media.
Both of these data reduction approaches require special care when applying them
to flash technology.
Traditional Deduplication
Traditional deduplication can take a variety of forms, and it is implemented differently
by different vendors. Data can be deduplicated at the source (where the data is
created) or at the target (where the data is stored). It can be deduplicated as it
enters the device (inline) or after it is stored (postprocessing). Data can also be
deduplicated in fixed data segments (fixed block) or variable-length segments
determined by an algorithm (variable block).
Mainly associated with backup storage, traditional deduplication is often seen as a
significant performance detractor. The hashing algorithms used by deduplication
can be very CPU intensive, and many systems cannot hold in memory the metadata
used by the data reduction operations used to manage the underlying media. In
addition, locality issues with hard-disk-drive (HDD) media and deduplication can
result in significant I/O overhead for the system.
Moreover, deduplication is not a benefit for every workload. It is most effective for
cases in which redundant data is likely, such as persistent virtual desktops, system
volumes for virtual servers, and unstructured data stores. Deduplication is least
effective for cases in which relatively unique data is stored, such as databases,
video, encrypted volumes, and swap spaces. Poor deduplication rates needlessly
encumber application performance and should be avoided (Table 1).
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Page 4
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
Table 1. Good and Poor Candidates for Deduplication
Good Deduplication Candidates
Poor Deduplication Candidates
• VDI (persistent virtual desktops)
• Server virtualization (system volumes)
• Unstructured data stores
•
•
•
•
Databases
Video
Encrypted volumes
Swap space
Cisco’s Unique Approach to Deduplication
Unfortunately, many vendors support an all-or-nothing approach to data
deduplication for flash memory, requiring that it be enabled for an entire system if it
is enabled at all. This limitation makes it difficult for organizations to support multiple
applications within the same storage system in a flexible way without reducing
performance for applications for which deduplication is not as effective.
To address this challenge, Cisco UCS Invicta OS supports a flexible combination
of deduplication and performance nodes within the Cisco UCS Invicta Scaling
System. In addition, an individual Cisco UCS Invicta appliance can be configured
as either a deduplication or a performance appliance, according to the capability of
deduplication to benefit the application.
With Cisco UCS Invicta OS, deduplication is licensed on a per-node or perappliance basis. Organizations can choose to deploy a specific mix of performance
and deduplication nodes within the Cisco UCS Invicta Scaling System cluster to
meet their actual application requirements. In this manner, Cisco UCS Invicta OS
doesn’t force a performance penalty on applications with data that doesn’t benefit
from deduplication. Organizations also don’t incur unnecessary licensing costs for
that data.
Cisco UCS Invicta OS deduplication is performed inline and doesn’t require
computing or memory resources on the flash drives. Not only does this approach
preserve the performance characteristics of flash memory, but the hash table is
not constrained by relatively meager flash-drive memory resources. The resulting
deduplication rate provided by Cisco UCS Invicta OS can be up to 10:1, resulting
in up to a tenfold cost-per-bit reduction (assuming that the data can be reduced at
that rate).
The deduplication ratio is the ratio of logical write operations to physical write
operations, and it is visible in the GUI (Release 5.0.1.2; Figure 1). Through the GUI,
storage administrators can determine the effectiveness of deduplication for their
data and make appropriate decisions about storage strategies. The GUI also shows
physical space consumption, which is critical in managing flash-based storage.
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Page 5
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
Figure 1. Cisco UCS Invicta OS GUI Shows Administrators How Well Data Is Being
Deduplicated
Cisco UCS Invicta Data Reduction Overview
Cisco UCS Invicta OS data reduction is designed for performance, efficiency, and
data integrity in the context of scalable flash memory array technology.
Deduplication in Cisco UCS Invicta OS
Cisco UCS Invicta OS deduplication is performed inline, before data is even placed
into the write block, employing a very fast CPU-efficient hash algorithm. A fixedblock hash is run using a 4-KB block size for a balance between the number of I/O
operations per second (IOPS) and latency. The 4-KB size means that deduplication
is applied to even small changes to large files. Larger block sizes would increase the
minimum latency per I/O operation. The hash physical block address (PBA) is stored
in system memory, eliminating locality issues.
Collision handling is an important innovation in Cisco UCS Invicta OS, implemented
to achieve data integrity. Collisions occur when two distinct pieces of data generate
the same hash value. To cope with this issue, the system doesn’t trust a mere hash
match as absolute. Instead, it completely reads the on-disk block to confirm every
hash match and verify that duplicate data actually exists. Read operations from
the media have little impact on overall performance due to the system’s high read
speeds, and this approach achieves nearly 100 percent data integrity.
Cisco UCS Invicta OS employs a very high-performance Murmur3 hash function.
The block translation layer (BTL) of Cisco UCS Invicta OS calculates a 128-bit hash
for data. Thirty-one bits of that hash are stored in memory tables, with the complete
128-bit hash stored on disk. Collision lists are maintained for 31 bits of the 128-bit
hashes. Data is always checked byte by byte before deduplication is declared.
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Page 6
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
When deduplication is enabled, the BTL experiences a performance impact; only
write operations are affected, with read operations unaffected. The smaller hash
boundary presents an effective trade-off, because more collisions will occur
and more time will be required to identify duplicate data. However, the overall
performance of the deduplication engine is improved with smaller hash scale.
Figure 2 illustrates the Cisco UCS Invicta OS deduplication process, which is
designed to efficiently handle original, duplicate, and updated data. Every inbound
write is hashed to a 4-KB boundary. The hash is confirmed as either new or
duplicate or as a collision using the hash array index, resulting in the following
assessments of the data itself:
• Original data: In the case of new or original data, the hash position is null, and the
hash engine determines that there is no match in the hash table. All the data is
then written to the media.
Original File
Duplicate
Solution Brief
Updated
Solution Brief
January 2015
March
2014
Solution Brief
January 2015
March
2014
Highlights
January 2015
March
2014
Highlights
Change the Way You Do Business
• We change the way you do business.
Integrating Cisco UCS Invicta™
Series Solid-State Systems with the
Cisco Unified Computing System™
(Cisco UCS®) makes virtual desktops
perform better and delivers a better
user experience.
Simplified and Scalable Solutions
• Cisco’s simplified and scalable
solutions provide broad architectural
support for virtual desktops.
High Performance
• Achieve fast performance for virtual
desktops and applications with lower
costs.
Outstanding Capacity
• Get outstanding capacity and
economy with up to 10x deduplication
efficiency.
Integrated into Cisco UCS
• Gain all the value of Cisco UCS with
accelerated virtual desktop access.
Comprehensive Infrastructure
Management
• Simplify and manage the entire
solution with Cisco UCS Director.
Users are more and more demanding. They expect technology to respond
immediately and flawlessly to their every action and encounter. Using traditional
architecture to provide a high level of user access and efficiency, virtual desktop
infrastructure (VDI) deployments have been slowed by several major obstacles. High
acquisition costs, management complexity, improper planning and sizing: all of these
challenges get in the way.
We deliver better performance at affordable costs. We integrate Cisco UCS Invicta™
Series Solid-State Systems with Cisco Unified Computing System™ (Cisco UCS®)
for greater user satisfaction and business efficiency. Your users are no longer
frustrated—and you exceed business expectations while operating within budget.
In virtual desktop deployments, storage is the most challenging element. Small and
random write requests push traditional hard-disk-based storage to the limit—even
during normal operations. This phenomenon is exacerbated during startup or login
storms, which occur at the beginning of the workday. With traditional storage, virtual
desktop users often start their day frustrated with the lack of responsiveness.
Inbound
Data
While solid-state memory performance is commonly acknowledged to be
measurably superior to that of spinning disks, the importance of properly integrating
solid-state memory into the overall virtual desktop solution is less well known.
Cisco UCS with the Cisco UCS Invicta Series accelerates VDI to efficiently handle
high-demand virtual desktops and applications with reduced capital and operating
expenses.
© 2014-2015 Cisco Systems, Inc. All rights reserved. This document is Cisco Public Information.
Highlights
Change the Way You Do Business
• We change the way you do business.
Integrating Cisco UCS Invicta™
Series Solid-State Systems with the
Cisco Unified Computing System™
(Cisco UCS®) makes virtual desktops
perform better and delivers a better
user experience.
Simplified and Scalable Solutions
• Cisco’s simplified and scalable
solutions provide broad architectural
support for virtual desktops.
High Performance
• Achieve fast performance for virtual
desktops and applications with lower
costs.
Outstanding Capacity
• Get outstanding capacity and
economy with up to 10x deduplication
efficiency.
Integrated into Cisco UCS
• Gain all the value of Cisco UCS with
accelerated virtual desktop access.
Comprehensive Infrastructure
Management
• Simplify and manage the entire
solution with Cisco UCS Director.
Change the Way You Do Business
• We change the way you do business.
Integrating Cisco UCS Invicta™
Series Solid-State Systems with the
Cisco Unified Computing System™
(Cisco UCS®) makes virtual desktops
perform better and delivers a better
user experience.
Users are more and more demanding. They expect technology to respond
immediately and flawlessly to their every action and encounter. Using traditional
architecture to provide a high level of user access and efficiency, virtual desktop
infrastructure (VDI) deployments have been slowed by several major obstacles. High
acquisition costs, management complexity, improper planning and sizing: all of these
challenges get in the way.
Simplified and Scalable Solutions
• Cisco’s simplified and scalable
solutions provide broad architectural
support for virtual desktops.
We deliver better performance at affordable costs. We integrate Cisco UCS Invicta™
Series Solid-State Systems with Cisco Unified Computing System™ (Cisco UCS®)
for greater user satisfaction and business efficiency. Your users are no longer
frustrated—and you exceed business expectations while operating within budget.
High Performance
• Achieve fast performance for virtual
desktops and applications with lower
costs.
In virtual desktop deployments, storage is the most challenging element. Small and
random write requests push traditional hard-disk-based storage to the limit—even
during normal operations. This phenomenon is exacerbated during startup or login
storms, which occur at the beginning of the workday. With traditional storage, virtual
desktop users often start their day frustrated with the lack of responsiveness.
4-KB
Write Operations
While solid-state memory performance is commonly acknowledged to be
measurably superior to that of spinning disks, the importance of properly integrating
solid-state memory into the overall virtual desktop solution is less well known.
Cisco UCS with the Cisco UCS Invicta Series accelerates VDI to efficiently handle
high-demand virtual desktops and applications with reduced capital and operating
expenses.
© 2014-2015 Cisco Systems, Inc. All rights reserved. This document is Cisco Public Information.
Outstanding Capacity
• Get outstanding capacity and
economy with up to 10x deduplication
efficiency.
Integrated into Cisco UCS
• Gain all the value of Cisco UCS with
accelerated virtual desktop access.
Comprehensive Infrastructure
Management
• Simplify and manage the entire
solution with Cisco UCS Director.
Users are more and more demanding. They expect technology to respond
immediately and flawlessly to their every action and encounter. Using traditional
architecture to provide a high level of user access and efficiency, virtual desktop
infrastructure (VDI) deployments have been slowed by several major obstacles. High
acquisition costs, management complexity, improper planning and sizing: all of these
challenges get in the way.
We deliver better performance at affordable costs. We integrate Cisco UCS Invicta™
Series Solid-State Systems with Cisco Unified Computing System™ (Cisco UCS®)
for greater user satisfaction and business efficiency. Your users are no longer
frustrated—and you exceed business expectations while operating within budget.
In virtual desktop deployments, storage is the most challenging element. Small and
random write requests push traditional hard-disk-based storage to the limit—even
during normal operations. This phenomenon is exacerbated during startup or login
storms, which occur at the beginning of the workday. With traditional storage, virtual
desktop users often start their day frustrated with the lack of responsiveness.
While solid-state memory performance is commonly acknowledged to be
measurably superior to that of spinning disks, the importance of properly integrating
solid-state memory into the overall virtual desktop solution is less well known.
Cisco UCS with the Cisco UCS Invicta Series accelerates VDI to efficiently handle
high-demand virtual desktops and applications with reduced capital and operating
expenses.
© 2014-2015 Cisco Systems, Inc. All rights reserved. This document is Cisco Public Information.
HASH ALGORITHM
1 2
3 4
Hash Data
1 2
1 5
3 4
3 4
DEDUP
ENGINE
Determines whether the hash already exists in the in-memory hash table
NO MATCH
MATCH
Existing data
is read to
confirm
duplication
MATCH
NO
MATCH
Pointers to the existing data
are stored
All Data
Written
Pointers
Written
Update
Written
Data is sorted into write blocks and sent to media
Figure 2. The Cisco UCS Invicta OS Data Deduplication Process Provides Fine-Grained
Deduplication with Effective Collision Detection and Data Integrity
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Page 7
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
• Duplicate data: If the hash array positions show a possible duplicate, a further
evaluation is made to determine whether the physical location shows the same
data (duplicate) or different data (collision). Upon confirmation of duplicate data,
pointers to the existing data are stored. Upon a collision, a mismatch forces a
unique store of incoming data.
• Updated data: The 4-KB hash boundary allows highly specific updates to existing
data. As shown in the example in Figure 2, a portion of a webpage can be
updated while retaining deduplication on the remainder of the page.
Error Detection and Correction
The deduplication algorithm in Cisco UCS Invicta OS is also used for enhanced error
detection and correction. If an optional read hash check is turned on, data is read
and hashed. The hash is then compared to the expected hash from the logical block
address (LBA) table. If the hashes don’t match, then an error-correction flow is
performed to correct the data. The error correction flow repairs the RAID 6 algorithm
that manages the flash devices, as follows:
• The process begins with an N-1 read of the RAID set while calculating parity.
• The resulting read data is hashed and compared to the expected hash.
• The system continues excluding a different RAID element until a hash match is
found.
• The excluded element contains the error.
Thin Provisioning and Space Management
Traditional provisioning (also known as thick provisioning) is inefficient and generally
not well suited for flash storage. In traditional provisioning, storage is preallocated,
and fixed physical space is allocated whether it is used or not. This approach
often traps significant unused space, resulting in poor utilization of the storage
media. Making matters worse, when a logical unit number (LUN) runs out of space,
applications can encounter performance problems and need to be brought down. It
is often difficult to add storage to the original LUN. Typically, you must create a new
LUN with more space and then restore the original LUN contents to it.
Thin provisioning generally addresses these concerns by not preallocating storage,
instead waiting until the storage is actually needed. By allocating physical space only
as it is needed, this approach allows overprovisioning, and more logical space can
be committed than the actual physical space available. For example, you can create
a 4-terabyte (TB) LUN even though only 500 GB of physical space is set aside. In
this manner, thin provisioning does not consume unused space in the volume.
Cisco UCS Invicta OS provides thin provisioning at the volume-group level. This
approach differs from traditional thin provisioning in that it does not allow the
addition of more storage capacity to the volume group if it runs out of physical
space. Running out of space can have dire effects on applications. To address this
concern, Cisco UCS Invicta OS holds space in reserve to prevent the file system
from being completely filled. If the device becomes full, the reserved storage is
unmapped and used as a workspace to move data from other LUNs. The physical
space consumption should still be monitored closely through the GUI (see Figure 1)
to help ensure that applications do not run out of physical space. To ameliorate
growing storage needs, additional volume groups can proactively be created in the
same or additional storage media and made available to the applications.
© 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public information.
Page 8
Cisco UCS Invicta OS: Data Reduction for Scalable Flash Memory Arrays
February 2015
Conclusion
Effective data reduction is critical for flash storage such as that embodied in the
Cisco UCS Invicta Scaling System and Cisco UCS C3124SA Appliances. Cisco
understands that deduplication isn’t right for all data, so Cisco UCS Invicta OS
provides deduplication as a per-node option within a Cisco UCS Invicta Scaling
System. Individual Cisco UCS Invicta appliances can also be configured for
performance or deduplication. The resulting modularity enables the Cisco UCS
Invicta Scaling System to be built from a combination of performance- and
deduplication-enabled nodes, letting organizations tune their deployment to the
needs of their specific applications.
Cisco’s efficient and high-performance inline approach to deduplication provides
up to a 10:1 deduplication ratio with very low overhead, offering space savings,
error correction, and media longevity. The GUI provided by Cisco UCS Invicta OS
lets storage administrators see to what degree their data is being deduplicated—
feedback that can help them decide whether deduplication is appropriate and
cost effective for specific applications and data. Further enhancing flash memory
management, Cisco UCS Invicta OS thin provisioning offers good utilization of
storage media, and system checks set aside storage to avoid costly and disruptive
storage overruns. Together these data reduction technologies result in a system
that can deliver significant performance, efficiency, and data integrity for demanding
applications that need fast data access.
For More Information
For more information on Cisco UCS Invicta Solid State Systems, visit: http://www.
cisco.com/go/ucsinvicta.
Americas Headquarters
Cisco Systems, Inc.
San Jose, CA
Asia Pacific Headquarters
Cisco Systems (USA) Pte. Ltd.
Singapore
Europe Headquarters
Cisco Systems International BV Amsterdam,
The Netherlands
Cisco has more than 200 offices worldwide. Addresses, phone numbers, and fax numbers are listed on the Cisco Website at www.cisco.com/go/offices.
Cisco and the Cisco logo are trademarks or registered trademarks of Cisco and/or its affiliates in the U.S. and other countries. To view a list of Cisco trademarks, go to this
URL: www.cisco.com/go/trademarks. Third party trademarks mentioned are the property of their respective owners. The use of the word partner does not imply a partnership
relationship between Cisco and any other company. (1110R)
LE-44402-00 02/15