Hitachi Data Systems Object Storage Solutions
Transcription
Hitachi Data Systems Object Storage Solutions
SOLUTION PROFILE Hitachi Data Systems Object Storage Solutions An Evolution in Storage and Data Mobility: The Object Store As unstructured data continues to grow faster than IT budgets, organizations are looking for ways to support growth while reducing complexity and easing the pressure on IT budgets. In addition, the rise of hybrid cloud, bringyour-own-device (BYOD) policies, and file synchronization and sharing pose new challenges for IT. Hitachi Data Systems provides intelligent, object-based storage solutions and applications that support diverse use cases like file synchronization and sharing, cloud storage and archiving, from a single cluster, simultaneously. These solutions enable more efficient operations, help secure and protect data assets, and help IT stay agile as organizations and business needs evolve. Unstructured Data Challenges The challenge with unstructured data (file data) is that it is unstructured. Many of the technologies to manage it were implemented when this data was a small fraction of the total compared to structured data. As unstructured data began to grow more quickly, the fundamental differences between structured and unstructured data began to impact the IT environment in significant ways. In response, organizations deployed specialized technology to support the vast quantity of data being created. The technology of choice was network attached storage or NAS. Easy deployment and compelling cost led to storage sprawl, which created new challenges in managing, governing, protecting and searching content. In response, many organizations are now considering cloud storage due to its perceived lower cost and ease of scale. However, the loss of control over those data assets is troubling to many IT organizations: Who has the encryption keys? What kind of service levels can be expected? What if I want to change service providers? Object storage brings structure to unstructured file data, making it easier to store, protect, secure, manage, organize, search, sync and share file data. The great scale and rich features of these solutions help organizations leverage a single storage investment for a variety of workloads. Such workloads SOLUTION PROFILE include cloud-based file synchronization and sharing, providing efficient file services to remote sites and mobile users. They also include storage for Web 2.0 applications, backup-free file storage, organizational archives and much more. The Hitachi Data Systems object store, Hitachi Content Platform (HCP), provides solutions to these challenges through a single platform. It brings cloud management in-house and provides intelligent automation that frees up IT staff from day-to-day, hands-on administration. Bring Structure to Unstructured File Data Hitachi Data Systems object storage solutions bring structure to unstructured data. They avoid the limitations of traditional file systems by intelligently storing content in far larger quantities and in a much more efficient and economical manner. These solutions provide for new demands imposed by the explosion of unstructured data and its growing importance to organizations, their partners, their customers, their governments and their shareholders. The Hitachi Data Systems object storage solutions treat file data, file metadata and custom metadata as a single object that is tracked and stored among a variety of storage tiers. With secure multitenancy and configurable attributes for each logical partition, the object store can be divided into a number of smaller virtual object stores that present configurable attributes to support different service levels. In this way, the object store can support a wide range of workloads, such as content preservation, data protection, content tiering and distribution, and even hybrid cloud from a single physical infrastructure. One infrastructure is far easier to manage than disparate silos of technology for each application or set of end users. The Content Cloud Object storage is fundamental to the Hitachi cloud strategy. Hitachi Data Systems object storage, Hitachi Content Platform, serves as the core of the content cloud. Layers of additional applications and media technologies around HCP extend the reach of the Figure 1. Hitachi Data Systems Object Storage Solution object store to open source environments, the cloud and beyond. With object storage at the core, data center operations for file data can be automated. They can be made more efficient by archiving fixed content and eliminating tape backups for data in the object store. Once data has been consolidated on HCP, business insights can then be developed by leveraging the built-in HCP metadata search engine. The result is consolidation of the highest performance: Expensive storage can be right-sized for the workload with the bulk of the data in the object store until high-performance access is required. Seldom-used or sensitive data can also be tiered to Hitachi Content Platform S series nodes. The HCP S series node is a costoptimized, massively scalable, local, onpremises Tier 3 storage target for HCP that is built on commodity hardware. It is ideal for storage and protection of large data sets, such as those used for big data. The S series node delivers a plug-and-play-based streamlined implementation process. It also provides erasure coding for data protection. With this in mind, content that must be stored on-premises can be tiered to HCP S series nodes using the HCP “service plan” feature. A flexible data management strategy can also be implemented so that the data residing on HCP can be protected appropriately with RAID while data on the HCP S series nodes is protected with erasure coding. Erasure coding provides faster rebuilds for large data sets compared to RAID. However, there is literally Hitachi Data an entire world outside the Ingestor data center. File services at Capabilities remote and branch offices LEARN MORE can be easily provisioned and managed to the data center’s object store using Hitachi Data Ingestor (HDI). This connection allows them to enjoy elastic, backup-free file serving from a device that acts a lot like a NAS device. The difference is, the device stores most of the data in the content cloud instead of in local storage. The files that get used regularly can be “pinned” to that site. HDI also enables elastic scale by using the over 450PB of capacity available in HCP and by growing and shrinking remote file systems. Now all data is available at all sites, at all times, without the burden of replication, and with a much smaller IT footprint. In addition, object storage provides the platform for similar functions on end-user devices, otherwise known as file synchronization and sharing. File Synchronization and Sharing The traditional means of sharing files are breaking down, giving way to email attachments, content management systems, copies on user devices, copies in backups, and copies on file servers. This development leads to inefficient storage and network utilization, owing to massive content duplication and high cost for 3 storage, backup and data management. The limitations of the old methods have led to the popularity of consumer cloud-based file synchronization and sharing tools. It’s not just file sharing, though. The rise of BYOD means end users want their work data on multiple devices; getting work data onto a smartphone or tablet would require individuals to use file-sharing techniques to move that data where they want it, making matters worse. These trends are causing problems for IT. End users are generating and sharing more and more copies of data, exacerbating storage and network inefficiencies and storing them in unsanctioned devices, applications and clouds. These actions put the data outside the control and governance of corporate IT. The answer is not ruthless enforcement of strict policies, as end users will just find another workaround. The answer is not to simply give up and turn data over to consumer clouds. The true solution is to deliver file synchronization and sharing from within IT. The true solution enables end users to access data and collaborate on any device, from any location, at any time. And it allows them to do so safely, securely and with corporate oversight, using a private object-storage-based cloud. The Hitachi Data Systems solution combines Hitachi Content Platform object store and Hitachi Content Platform Anywhere (HCP Anywhere). A file synchronization and sharing application, HCP Anywhere was designed and built for enterprise IT, unlike most solutions, which are built for consumer use. Intelligent Objects In most storage systems, Hitachi the intelligence resides Content Platform within the storage, itself, Anywhere which limits service to hunLEARN MORE dreds of millions to about a billion files. This volume, unheard of less than a decade ago, is now becoming more and more common. To make the significant next jump in scale requires some intelligence to reside in the objects, themselves. In such a model, individual objects would have the “DNA” to know when to create clones of them and how to adjust to changes in environment. For example, in the case of a rush of read requests in a particular geography, objects would be cloned and migrated to the hot spot to service requests locally. Once read activity subsided, objects would know to die off, as there would no longer be a need for such a large population. As a means of comparison, consider the human organism, which contains tens of trillions of cells. If it were solely governed by conscious control, the human organism couldn’t operate. Instead, the human organism is controlled by a set of autonomic functions. These functions operate independently of conscious thought and thus can perform the myriad functions necessary to keep such a complex group of cells operating as a single unit. To achieve extreme scales in the tens of trillions of objects, intelligent object stores will likewise need to push down some of the intelligence to the objects themselves. This action will create “intelligent objects” capable of responding to changes in the environment. To do this, object storage uses metadata, or information about a file, to intelligently automate the management of file data. All files have metadata: their file name, file type, size, last access date and so on. Hitachi object storage goes several steps further. It provides multiple fields for metadata so that different end users and applications can use their own metadata and tags without disturbing others’ metadata. It also provides a built-in metadata query tool. The tool enables fast search. It also offers more complex queries to help select sets of data for further analysis or create smart policies around how content should be stored, retained, protected, accessed and more. Cloud Enabled Consider these attributes: ■■ ■■ ■■ ■■ The security and integrity of an archive. The protection of RAID-6 erasure coding, advanced replication and failover capabilities. Massive scale. Support for thousands of tenants and namespaces. ■■ Built-in chargeback capabilities. ■■ OpenStack Swift API compatibility. ■■ A management API. ■■ ■■ ■■ ■■ A REST-based http interface that works with a variety of http dialects. Simultaneous IPv4 and IPv6 support. De facto standards, such as Amazon’s S3 API. Built-in hybrid cloud capability to automatically tier data to a choice of one or multiple leading public cloud services based on user-defined service plans. With these benefits, and more, Hitachi Data Systems object storage solutions compose an ideal platform. From this platform, organizations can build the core of a private, public or hybrid cloud that delivers secure data mobility throughout, plus file synchronization and sharing. It provides an on-premises storage solution for cloudbased applications. SOLUTION PROFILE Key to the economics of cloud is virtualization and secure sharing of a common set of physical resources. Hitachi Data Systems object storage solutions provide multitenancy that allows IT to securely provision a portion of the infrastructure and turn control of that storage and its capabilities to the end users of that storage. By imposing quotas on those tenants and charging based on their measurable usage, IT can better influence the behavior of end users by showing them the cost of their storage practices. Also important for cloud is the ability to easily adapt new storage models to current user and application behavior. With an integrated “on-ramp” or “edge” device that connects applications and end users at distributed sites to centralized object stores, the power of Hitachi is available to distributed consumers. This device enables private organizations to reduce storage and data protection costs at remote or branch offices, and control the distribution of content to different geographies, lines of business and other appropriate audiences. Cloud service providers can deliver an edge device that integrates directly with their core infrastructure, providing their customers with greater control and security for data in the cloud. In both cases, IT organizations can gain simplicity, focus on the business and speed return on investment. Content Preservation Many organizations want to ensure that digital content is preserved for the long term. Some of the reasons are regulatory, but others are to ensure content is preserved and protected for the future as an asset to the organization. Many times these assets can then provide a competitive advantage for an organization, driving value from the content assets. Many organizations want to continue using their preferred software provider to interface the content source to the object storage infrastructure and remove their historical “islands of information.” This stance allows IT to shift its focus to implementing an enterprise-wide strategy with a common repository for long-term management, preservation, protection and search of content and its metadata. It allows IT to take the first steps toward “big data.” IT can reduce the cost and risk associated with managing different “islands,” as well as optimize the return on investment and provide a longterm corporate repository. IT can improve the cost-effectiveness of the organization’s IT strategy by leveraging a variety of media as part of the object store. Such media includes the economically priced and highly scalable Hitachi Content Platform S series nodes, tape media and even a choice of leading public cloud services. IT can also establish a solid platform for future compliance or information governance requirements. These solutions provide an infrastructure that can be provisioned and configured to serve a wide range of use cases from a single infrastructure that provides key functionality, such as: ■■ ■■ ■■ ■■ ■■ ■■ ■■ “Write once, read many” (WORM) and content authenticity service for data integrity. Encryption and access control for privacy and security. Index and search for e-discovery. Object tracking and event logging for audit support. Metadata mining and full content search, which help gather metrics, look for trends and find relationships among data. Multiple protocols, that can access advanced features to support multiple applications. Retention and disposal management services to automatically govern how long content is kept and how it is deleted. Back Up Less … or Backup-Less? The growth in unstructured data stresses traditional, tape-based backup and restore operations. Numerous, disparate systems with large numbers of files and duplicate copies of data increase backup and restore times and impact the performance and availability of production systems. They drive up cost and complexity with the handling of increasing numbers of tapes, the management of off-site storage and the possibility of a compliance or legal action needing information stored in tape-based backups. Hitachi Data Systems object storage solutions attack the problem in four ways that reduce the amount of data to be written to tape and streamline recovery processes. First, the object store proObject vides a target to offload Storage data from primary systems Solutions to the object store as an active archive. While LEARN MORE archives used to be considered only the end of the line for content, Hitachi Data Systems object storage solutions provide an environment that supports multiple versions of the same content. Multiple versions of less frequently used content can be in the object store and be accessed directly by end users and applications: No special tools or custom applications are required to view and access the archive. By moving less 5 used and static content to an object store, IT vastly reduces the amount of data on expensive, heavily used primary systems. This approach reduces the amount of time spent backing up and, more importantly, restoring critical systems, and it basically eliminates the hassle over less-critical content. Furthermore, it effectively reduces the buying frequency for expensive software capacity license upgrades. Second, data deduplication and compression are used to control data size by eliminating unnecessary copies and shrinking the amount of storage used for a given piece of content. As new objects are written to the object store, the content is compared with similar objects and unnecessary, duplicate data is eliminated or compressed to save space. This capability combines with selective replication (where administrators can decide what data to replicate) to reduce the amount of data at replica sites and conserve precious replication bandwidth. Controlling the overall amount of storage consumed on the object store and any of its replica systems streamlines failover to secondary systems and recovery of primary systems once the failure is repaired. Third, because of its content preservation capabilities, the object store already ensures data integrity with WORM, encryption and more. By adding services such as data protection levels, advanced replication, version awareness and the ability to browse the environment, the object store ensures objects are well protected and easily recoverable. As the data is on-site and on disk that can be easily browsed, content can be recovered quickly, on demand, at a particular point in time, and in a selfservice manner. This approach reduces help desk costs and avoids the hassle of finding the right tape, mounting it (assuming it is on-site), reading the catalog, and spinning to the right point of the tape only to learn that another version is needed. Fourth, the object store provides data retention and disposition services that automatically keep content for the prescribed duration. Barring a retention hold, it automatically deletes expired content so the capacity can be reclaimed and recycled back into available storage. These deletions can be logged and annotated to provide an audit trail of what content was removed, when, by whom and why. These technologies are key as the traditional methods of keeping every file forever and backing all files up every week are too costly and risky in today’s economic, regulatory and legal climates. By putting policies in place and adhering to them with automated tools that log important events, organizations can greatly reduce the risk of failing an audit or facing a fine due to rogue data in long-forgotten tape. Using the object store as a platform for file synchronization and sharing can have significant effects on data protection, as well. Consider all the copies of a file that get created: the original on a user device, a copy on a file server, a copy on the Web, a copy on the mail server, a copy in each recipient’s inbox. The list goes on and on. Despite these all being the same file, they all get backed up, some get replicated, and some aren’t even known to the IT organization. Now, with the object store, image links instead of files are moving through all these systems. Rather than a 2MB file, there is a hyperlink that refers to the latest version of a single instanced, compressed and well-protected file that can be accessed anywhere. No more full inboxes. No more version confusion. No more unnecessary duplication of data. Hitachi Data Systems object storage solutions combine the capabilities of an object store with file synchronization and sharing technologies and key attributes of data protection. This combination gives IT organizations the ability to deploy a single, intelligent, object-based storage infrastructure that protects data in place. This enables them to back up less data to tape without sacrificing recoverability or scrapping existing investments in backup infrastructure. In addition, Hitachi Data Systems object storage solutions position IT to pursue a backup-less strategy that provides greater protection and faster recovery, and is more reliable as well as easier to use and manage than competitive solutions. And, by making use of highly scalable and economically priced storage tiers, and even removable media, this approach rivals the cost of traditional tape-based backup. Summary Unstructured data has surpassed structured data in total volume and given rise to a new set of challenges for IT. Rather than continually deploying more capacity and suffering the effects of sprawl, or handing over control, security and protection of corporate data assets to a consumer cloud, the time has come for a change in how content is stored and managed. Hitachi Data Systems object storage solutions are the product of customer and partner input. They are designed to address the challenges of fast growing file data, increasingly diverse data types and access methods, and storing content for years, decades, centuries and beyond. By integrating many key technologies in a single storage platform, Hitachi Data Systems object storage solutions provide a path to short-term return on investment and significant long-term efficiency improvements: Intelligent metadata, the tools to search and analyze that information, support for open source environments, as well as support for legacy, current and emerging storage protocols combine to meet IT challenges. Hitachi Data Systems object storage solutions not only ensure that IT can address the challenges faced today, but they also set IT up for what’s next. IT can evolve to meet new challenges, stay agile over the long term, and address future change and growth. For More Information To learn more about how Hitachi Data Systems can help you with your unstructured data and to read more about our solutions, please visit www.HDS.com/ solutions, contact your local sales representative or solutions consultant, or call Hitachi Data Systems at 888-234-5601. Corporate Headquarters 2845 Lafayette Street Santa Clara, CA 95050-2639 USA www.HDS.com community.HDS.com Regional Contact Information Americas: +1 866 374 5822 or [email protected] Europe, Middle East and Africa: +44 (0) 1753 618000 or [email protected] Asia Pacific: +852 3189 7900 or [email protected] HITACHI is a trademark or registered trademark of Hitachi, Ltd. All other trademarks, service marks, and company names are properties of their respective owners. SP-012-I DG October 2015