IT Business Continuity Planning and Management with
Transcription
IT Business Continuity Planning and Management with
PlateSpin Forge Guide Book IT Business Continuity Planning and Management with PlateSpin Forge IT Business Continuity Planning and Management with PlateSpin Forge Introduction Information technology, or IT, is used on a day-to-day basis in many fields of business, and it is no exaggeration to describe IT as a key infrastructure without which corporate activities cannot continue. Improving the usability of IT systems while maintaining an environment that ensures stability is surely one of the most central roles of any IT department. Engagement in business continuity is a make-or-break issue for many corporations. Strategies can be created from a number of different perspectives, by planning business continuity in terms of management of corporate resources such as personnel, assets, capital and information. In this document we will specifically focus on IT business continuity planning, or IT-BCP. A range of IT-BCP methods can be adopted to suit the importance of the data being stored or the importance of the system. A luxurious environment with unlimited resources is a rare thing indeed. So the main goal is to maintain an environment that ensures business continuity while at the same time costs are being minimized as much as possible. In this document we introduce the PlateSpin Forge hardware appliance as the optimal strategy for such a business environment. Of all the approaches available in the field, PlateSpin Forge represents the business continuity solution which is both the most practical and most economical to implement. The central feature of PlateSpin Forge is its full utilization of virtualization technology. It permits fast and straightforward failover, by switching a new system in as the production server with a simple click in the event of a disaster. Moreover, once the original unit has been repaired or replaced, PlateSpin Forge allows straightforward failback, by switching back to the fully recovered original environment. This document casts a fresh eye on various approaches to business continuity being taken by corporations, and explains their distinct methodologies. In doing so, it will become clear why PlateSpin Forge is the optimal solution. We hope this document will help you plan appropriately for IT business continuity in your corporation. Approaches to business continuity Contents 2 The cost of business continuity Introduction……………………………………………………………………… chapter 01 Approaches to business continuity……………………………………… 4 The cost of business continuity…………………………………………… 16 Merits and demerits of IT-BCP solutions chapter 02 chapter 03 chapter 04 PlateSpin Forge as IT-BCP solution……………………………………… 40 PlateSpin Forge as IT-BCP solution Merits and demerits of IT-BCP solutions………………………………… 26 chapter 05 chapter 06 PlateSpin Forge GUI overview…………………………………………… 70 PlateSpin Forge usage overview PlateSpin Forge usage overview………………………………………… 52 PlateSpin Forge GUI overview chapter 01 4 Approaches to business continuity Approaches to business continuity The cost of business continuity Merits and demerits of IT-BCP solutions The importance of business continuity ……………………………………………… 6 Required approaches for business continuity………………………………………… 8 Historical and current guidelines for business continuity…………………………… 10 The power of virtualization technology for business continuity…………………… 14 PlateSpin Forge as IT-BCP solution Tuning IT systems components……………………………………………………… 12 PlateSpin Forge usage overview PlateSpin Forge GUI overview 5 chapter 01 Approaches to business continuity The importance of business continuity Creating an environment capable of enabling Information technology (IT) systems are already business to continue despite disaster or breakdown extensively deployed by corporations, and the is an essential part of risk management and importance of IT systems as business infrastructure cannot be ignored by corporations. Even where is increasing. For this reason, IT infrastructure an interruption to operations is inevitable, it is presents a very real risk to the continuous operation essential that they are back up and running as of corporations and to the provisioning of services. quickly as possible. This is why we think of IT business continuity planning (IT-BCP) as a separate field of BCP, focused For this reason, corporations world-wide invest on IT. Ordinarily, BCP initiatives include measures in business continuity planning (BCP), which seeks such as maintaining backup systems and ensuring to protect the corporation from reduced market a backup office, accelerating safety confirmation share or reduced corporate value by recovering procedures, ensuring staff and arranging alternative from interruptions to operations, or avoiding them production facilities. This document however all together. A number of approaches can be taken focuses particularly on issues within IT-BCP, to define BCP, but in its document PAS56 Guide concerned with maintaining backup systems. to Business Continuity Management, the British Standards Institution (BSI) defines it as "continuity When considering BCP initiatives, it is necessary planning which recognizes impacts due to poten- to think about the recovery point objective (RPO), tial losses, establishes and implements an enforce- the point in the data record and the age thereof able continuity strategy, and ensures business at which recovery is acceptable, the recovery time continuity in the event of an emergency. Business objective (RTO), the amount of time it takes to continuity outcomes which document procedures restore services, as well as the recovery level objec- and information, developed, compiled and main- tive (RLO), the level of operations to be recovered. tained for use in the event of an emergency." Efforts Additionally, day-to-day training for business conti- to establish, operate, simulate and continuously nuity should consider the test time objective (TTO) improve BCP are generally known as business con- metric, which asks how easy the system is to test. tinuity management (BCM). 6 guide book ② Enables restoration of operations within a set time frame After (initial and BCP response) 100% Restoration Merits and demerits of IT-BCP solutions Target Threshold Target Current expected recovery curve Recovery curve after implementing BCP Threshold Time PlateSpin Forge as IT-BCP solution Level of operation (quantity of product supplied, etc.) Before Enables business to continue at a level above a set threshold The cost of business continuity ③ Approaches to business continuity PlateSpin Forge ① Difference between target and actual recovery time Ref: http://www.bousai.go.jp/kyoiku/kigyou/keizoku/hajimete.html RPO: Recovery Point Objective → How recent should the recovered data be? RTO: Recovery Time Objective → How much time should be allowed for recovery? TTO: Test Time Objective → How easily can testing be carried out? Conceptual diagram of BCP measures Service level 100% under normal conditions Full recovery Disaster occurs Status of data at time of provisional recovery Provisional recovery RLO RPO RTO Time PlateSpin Forge GUI overview RLO: Recovery Level Objective → What level of granularity should be recovered? PlateSpin Forge usage overview Keywords for business continuity Ref: Definition of BCP: http://www.meti.go.jp/policy/netsecurity/downloadfiles/BCPguide.pdf 7 chapter 01 Approaches to business continuity Required approaches for business continuity When considering business risk as a factor in busi- By precautionary, we mean measures to avoid ness continuity, several perspectives must be incor- encountering damage. Specifically, these would porated. These perspectives cover a broad range of be measures such as seeking to improve reliability topics including exchange fluctuation risks, recall of systems such as through load distribution or risks associated with quality assurance, litigation redundancy, or migrating to a robust and secure risks and production equipment related risks such data center capable of handling disasters. Recov- as plant fires. Some of these could be included in ery means providing an environment capable of IT related business continuity risk. But in particular, implementing rapid failover, e.g. by implementing both breakdown risks and disaster risks must be backups and having alternative servers as an emer- considered as factors in IT risk. gency precaution. Providing day-to-day training in preparation for an emergency is another effective Breakdown risks are risks arising primarily due to machine, equipment or human errors, and include hardware breakdown, primarily concerning server or network equipment, and communication services breakdown such as services provided by contracted telecommunication operators or data center services. Disaster risks are risks that are likely to arise due to a natural disaster such as an earthquake or typhoon, and include large-scale systems breakdowns such as loss of or interruption to communication lines to data centers. Handling these risks requires consideration of BCP measures from both a precautionary and a recovery perspective. 8 means of achieving rapid recovery. guide book The cost of business continuity Thejeopardize very real ITbusiness risk to business continuity, and countermeasures IT risks that continuity and countermeasures IT risk Internal factors Recovery countermeasures Minimizing risk (threat avoidance and reduction of vulnerability) Loss/damage minimization on realization of risk Hardware failure Failure of communication services Software bugs Improving system reliability Redundancy, load balancing, QA, SPOF avoidance, etc. Operational error Backup and recovery Computer crime Local disaster External factors (unavoidable) Widespread disaster Power failure/ restrictions Loss of data services Favorable location, seismic isolation, redundancy in equipment / circuits, etc. PlateSpin Forge GUI overview Tsunami/flood Data center ruggedization Remote site control Earthquakes PlateSpin Forge usage overview Disaster risk Backup acquisition, preparation of alternative devices, failover, training, etc. PlateSpin Forge as IT-BCP solution Primarily on-site control Processing overload Merits and demerits of IT-BCP solutions Failure risk Preventative countermeasures Approaches to business continuity PlateSpin Forge * QA: Quality assurance, SPOF: Single point of failure, Exhibit: ITR 9 chapter 01 Approaches to business continuity Historical and current guidelines for business continuity Originally, BCP existed as a concept from the in March 2005, and the Business Continuity Guide- 1960s called disaster recovery (DR) and set out lines Ver. 1 by the Central Disaster Prevention Coun- disaster recovery countermeasures to cope with cil of the Cabinet Office in August of the same year. host computer breakdown. The term BCM emerged In February 2006 the Small and Medium Enterprise as a management concept in the 1999 British Agency published an introductory examination Standards Institution (BSI) standard on Information and a range of templates for BCP policy formation Security Management Systems (BS7799). This under the title Small and Medium Enterprise BCP: was later developed into Business Continuity Guidelines for Formulation and Application. Management Systems (BS25999) to handle a wide Then, in November 2009 the document Business range of organizations, and BCM is now applied Continuity Guidelines Ver. 2 was published to sim- by an increasing number of corporations at the plify the application of Business Continuity Guide- management level. lines Ver. 1. The requirements of BS25999 were later turned In terms of guidelines on the subject of IT-specific into an ISO standard, the International Standard for issues (IT-BCP), METI provided the IT Service Con- Business Continuity Management Systems (BCMS) tinuity Guidelines in September 2008, and the ISO 22301, issued in May 2012. These standards are Information-technology Promotion Agency (IPA) applied to all organizations, including corporations, provided documents including the well known public bodies, not-for-profit organizations and Highly Resilient System Infrastructure Guide- NGOs, whatever the scale or activities of the orga- lines (Summary, Planning) in May 2012. Finally, nization. They are set out as standard requirements the National Information Security Center is due to explicitly identifying the control items that need to release guidelines looking at IT-BCP including the be implemented. Operational Continuity Planning Guidelines for Information Systems in Central Government and In contrast, the process of establishing guidelines in Japan involved the publication of the Business Continuity Policy Formation Guidelines by METI 10 the supplementary IT-BCP Formulation Model. guide book Key Points 1. What are business continuity initiatives?……………………1 2.2.3 Estimating damage to key activities……………… 15 2. Features of business continuity initiatives…………………3 2.2.4 Extracting key elements…………………………… 17 3. Features of these guidelines………………………………3 2.2.5 Establishing a business continuity plan…………… 18 2.2.5.1 Defining the command structure……………… 18 5. Standpoint of these guidelines……………………………5 2.2.5.2 Ensuring functionality at key corporate sites…… 19 6. Adopting a checklist………………………………………5 2.2.5.3 External data transmission and data sharing…… 19 2.2.5.4 IT system backup……………………………… 20 I. Basic approach to business continuity requirements…………6 2.2.5.5 Product-service supply relationship…………… 20 2.2.6 Issues to be derived in conjunction with 1.1 Requirements and key points in business continuity……6 1.1.1 The need to make business continuity work in times of emergency…………………………………6 1.1.2 Key points in the approach to business continuity……6 1.1.3 Features of business continuity planning in Japan: 2.2.6.1 Safety assurance and safety confirmation……… 21 2.2.6.2 Disaster damage mitigation for offices/places of work and equipment……………………… 21 2.2.6.3 Prevention of secondary disasters…………… 22 2.2.6.4 Community cooperation and contribution…… 23 2.2.6.5 Cooperation and mutual aid…………………… 24 1.2 Basic approach……………………………………………7 2.2.6.6 Other matters to consider……………………… 24 1.2.1 Hypothetical disaster risk………………………………7 1.2.2 Issues to be derived in conjunction with business continuity………………………………………………8 1.2.3 Standpoint of each item provided in these guidelines………………………………………………9 2.3 Implementation and operation………………………… 24 2.3.1 Implementation of measures under business PlateSpin Forge as IT-BCP solution being ready for widespread natural disasters…………7 business continuity………………………………… 21 Merits and demerits of IT-BCP solutions 4. Aims of and issues promoted by initiatives…………………4 The cost of business continuity Business Continuity Guidelines Version 2 Contents Approaches to business continuity PlateSpin Forge continuity plan……………………………………… 24 2.3.2 Preparation of documents………………………… 25 2.3.2.1 Preparation of protocols and manuals………… 25 2.3.2.2 Preparation of checklists……………………… 25 2.3.3 Financial allowances……………………………… 25 II. Content of business continuity plans and initiatives……… 11 2.3.4 Confirming the plan works in practice…………… 26 2.3.5 Requirement for business decisions in event of 2.1 Policy…………………………………………………… 12 emergency………………………………………… 26 2.2 Planning………………………………………………… 12 2.4 Implementation of education and training…………… 26 PlateSpin Forge usage overview 1.3 Continuous improvement……………………………… 10 2.2.1 Identifying relevant disasters……………………… 12 2.2.2 Evaluation of impact……………………………… 13 2.5 Inspections and corrective action……………………… 27 2.2.2.1 Estimating stoppage time and 2.6 Management level estimates…………………………… 27 2.2.2.3 Setting RPO and RLO…………………………… 14 III. Proposals for managers, the economy and society……… 28 PlateSpin Forge GUI overview responsiveness………………………………… 13 2.2.2.2 Determining key activities……………………… 13 IT systems operation continuity planning guidelines for central government: http://www.nisc.go.jp/active/general/itbcp-guideline.html 11 chapter 01 Approaches to business continuity Tuning IT systems components When thinking about IT-BCP, a major prerequisite Taking hardware as an example, for hardware with is that we understand the resources required to a low recovery priority, the countermeasure objec- continuously operate IT systems. It is essential for tive may be considered as entering into contracts business continuity that a strategy is developed for for hardware maintenance and having the ability every such resource. to obtain repaired or replacement parts. As the recovery priority increases, the objective might be Specifically, required resources might be hardware keeping a backup unit on hand at a remote site to run systems, system-based resources such as away from the system operations site, and ensuring applications or system configurations, data-based hardware is available on cold standby for setup in resources including management of backup data, the event of an emergency. For hardware with the facilities to house systems, and an uninterruptable highest levels of recovery priority, the objective communication network. In practice, additional could be to provide an exclusive replacement unit resources are required such as a non-emergency at a remote site with synchronized data, and ensur- testing system and an on-site operational system ing an environment capable of rapid recovery by to investigate IT-BCP, and even business partners switching to the replacement unit in the event of who also have business continuity capabilities and an emergency. alternative procedures to restart activities, should a disaster happen. We clearly see that a vast number It is crucial to have a clear idea of the recovery pri- of resources are needed to achieve operational orities and countermeasure objectives for each IT continuity of IT systems. system resource. Setting a priority for recovery of each of these resources enables us to understand practical recovery level objectives. Understanding recovery level objectives provides information on where current measures are lacking, and helps to define future measures. 12 guide book The cost of business continuity Example IT system components Component Explanation of component 1 Hardware 2 System-related 3 Data-related 4 Facilities 5 Data communication network required for use of IT systems (carrier, type, route diversification, etc.) 6 IT-BCP analysis system (* Item added in this study) System for constant analysis of IT-BCP content 7 System operational structure Ensuring operational human resource and role allocation to achieve rapid confirmation of damage to systems and 8 Business continuity capabilities of vendors 9 Fallback system Role, quantity and location of hardware such as servers (including for replacement units) Location and status of data required for recovery of IT systems, such as applications and system configuration data (external storage of backup media, etc.) Environmental aspects for IT system equipment (location, robustness, presence of on-site generator and presence of backup center within buildings or at data center, redundancy of electricity system, water supply, etc.) Arrangement of data communication networks (within buildings and externally such as between sites) appropriate responses, and ensuring maintenance of standard operating procedures and means of communication. (* Item added in this study) Alternative procedures for recovery of operations where IT systems are suspended in case of an emergency Countermeasure Objective Ensuring hardware for hot standby A • Recovery is performed through backup systems by switching in an replacement unit in case of an emergency. • Provision of exclusive replacement units in locations which will not be simultaneously subject to the same disasters as production units. 3 • Provision of alternative client PCs for use in case of emergency. Ensuring hardware for cold standby • Provision of a backup unit in a state requiring setup at a location which will not be simultaneously subject to the same disasters as the current location. Elimination of end-of-maintenance parts through hardware maintenance contracts (recovery at damaged sites) • Ensuring that hardware which cannot be confirmed after sale is not used. • Enter into maintenance contracts to ensure parts repair and replacement parts are available when damage is E 2 • Utilize thin clients to avoid impact on client environments due to damage to the working environment. incurred to hardware. • Regular saving of data on client PC operating systems. 1 PlateSpin Forge GUI overview D PlateSpin Forge usage overview S Countermeasure Level PlateSpin Forge as IT-BCP solution Support and collaboration structures with IT system vendors in case of an emergency (capturing of vendors' business continuity capabilities, maintenance of service assurance contracts, etc.) Recovery Priority Rank C Merits and demerits of IT-BCP solutions Location and status of key data (external storage of backup media, etc.) Hardware Countermeasure Objectives Level B Approaches to business continuity PlateSpin Forge Ref: http://www.nisc.go.jp/active/general/pdf/IT-BCP.pdf 13 chapter 01 Approaches to business continuity The power of virtualization technology for business continuity Let us now think about specific strategies for This is also where virtualization technology business continuity in the IT field, i.e. IT-BCP comes in. Virtualization technology enables measures. We of ten hear about ef for ts to physical environments to be deployed as virtual increase redundancy in IT systems and improving machines, and allows systems to be designed and fault tolerance through the use of clusters. For built freely using a hypervisor, with no hardware high priority tasks, it is obvious that a backup platform dependence. This technology not only environment is required at a different location enables migration to different environments, but is to the production site, but it is important above also ideal when used as a system infrastructure for all to create a highly flexible system that allows BCP, enabling the automation of the operational operations to be rapidly redeployed. management of system continuity. The key features of virtualization technology are increased reliability The key benefit of providing a remote environ- and availability, while making efficient use of com- ment is that even if the production site is knocked puting resources. offline by a natural disaster, there is no loss of data. Additionally, in the event of a large-scale natural disaster that renders the production site inoperable, it provides an additional service as a backup site for re-initiating the systems in a short period of time. Considering the prerequisite that the IT environment should be fully recoverable, it is clear that systems that are dependent on a specific hardware environment should be avoided as far as possible. Where possible, systems should be built to be quickly recoverable, without depending on infrastructure such as hardware. This is why it is so important to make sure that operating systems and applications are not platform-dependent and are structured with flexibility. 14 guide book Vir tual Machine Application Application Application Application Guest OS Guest OS Guest OS Guest OS The cost of business continuity Difference between physical environments and virtualized environments Approaches to business continuity PlateSpin Forge Physical Machine Physical Machine Operating System Vir tualization platform x86 Hardware x86 Hardware Memory Disk NIC CPU Memory Disk Virtualized platform • One OS on one ser ver • Vir tualized hyper visor • Inefficient use of resources • High per formance • Close integration of software and hardware • Flexible resource management • Independent from software and hardware virtualization platform. This enables the virtual machine to operate flexibly, with no specific platform dependence. PlateSpin Forge GUI overview Unlike physical environments, virtualization technology transfers the close connection to the hardware to the PlateSpin Forge usage overview Conventional IA ser ver NIC PlateSpin Forge as IT-BCP solution CPU Application Merits and demerits of IT-BCP solutions Application 15 chapter 02 16 The cost of business continuity Approaches to business continuity The cost of business continuity Merits and demerits of IT-BCP solutions Cost considerations for business continuity: hardware……………………………… 18 Cost considerations for business continuity: software……………………………… 20 Cost considerations for business continuity: personnel……………………………… 22 PlateSpin Forge as IT-BCP solution Cost considerations for business continuity: other…………………………………… 24 PlateSpin Forge usage overview PlateSpin Forge GUI overview 17 chapter 02 The cost of business continuity Cost considerations for business continuity: hardware In this chapter we will look in more detail at the provider, or another alternative would be to build a costs of IT-BCP. Firstly, we have to ask what hard- customized environment using robust remote data ware elements are required. center facilities. The first thing to understand is that it is some- For low priority systems allocated a medium or what difficult to generally predict what hardware long RTO and RPO of several days to one month, is required, as the recovery time objectives (RTOs) the general method would be to restore data once and recovery point objectives (RPOs) differ depend- the new environment would be ready, having pre- ing on the type and priority of system. The biggest pared a backup of data and system parameters on differences in hardware configurations arise from portable media such as tape. In this case we use a the necessity for rapid recovery and the necessity tape device such as an autoloader or tape library for recovered data to be perfectly up-to-date. for backup of production environment data, requiring storage on multiple volumes of tape media For example, if the RTO and RPO are set to a short such as LTO tape drives. However, it must not be period or time, then system redundancy is the forgotten that to finally recover the original state, key premise. In this case, the backup site has to be the hardware for operating the production environ- ready with the same system environment as the ment must be redeployed or repaired. production environment. In practice, this requires new storage such as a storage server or SAN, network devices such as routers and switches, load balancers and application delivery controllers (ADCs) as required to distribute load, and web application firewalls (WAFs) to handle security. One option would be to use a public cloud rather than setting up the environment internally, but this raises the difficult issue of configuring it in exactly the same way as the production environment. One method would be to use a private cloud environment outsourced to a systems integrator or service 18 guide book The cost of business continuity DB ser ver Approaches to business continuity PlateSpin Forge AP ser ver LAN switch If the RTO and RPO are relatively short Router With a relatively RTO and RPOhot or periods, less thanshor 24 thours, either of less than 24 hours, a mirror production cold standby ismethods beonused environment installed may locally hot in a or cold installation standby remote mirroring the producADC/ WAF tion environment. PlateSpin Forge as IT-BCP solution SAN storage Merits and demerits of IT-BCP solutions SAN switch periods, a backup production With a relatively longofRTO and RPO, enviproduction network data andparameters system ronment data and system is PlateSpin Forge usage overview If the RTO and RPO are relatively long settings are backed up on por table Tape media Tape librar y made a portable mediato such as tape media such as tape. PlateSpin Forge GUI overview 19 chapter 02 The cost of business continuity Cost considerations for business continuity: software The software that is required will naturally differ est backup tools, a growing number of products depending on the hardware configuration. If the support virtual environments, and incorporate the RTO and RPO are set to short periods, under 24 functions necessary for IT-BCP. hours, some sort of replication tool will be required to make regular copies of data to a backup site. Some backup tools include options with replica- When a method using an intrinsic replication func- tion functionality, but as many backup appliance tion of SAN storage or some similar technology is products have their own internal software function- adopted, then this requires advanced hardware. ality, this software may not be required, depending on the environment. Additionally, in order to maintain the backup site as an equivalent environment, this method generates a requirement for software licenses that mirror the production server, including server operating systems and applications, antivirus software, and server virtualization software to act as the hypervisor needed for the virtual environment. As some vendors have license terms allowing use of software at backup sites, the calculation is not as simple as just doubling the cost. Systems for which a one week to a one month recovery timescale is appropriate do not require real-time data copying. A data backup may be made according to a schedule to portable media such as tape, and restored once the hardware is ready for recovery. The software required in this case is a backup tool to be used when performing backup and restoration to tape. Amongst the lat- 20 guide book • Complete backup The cost of business continuity Key functions of legacy backup software Approaches to business continuity PlateSpin Forge • Incremental backup Data backup • Differential backup • Single file backup • Deduplication • Disk image • RAW backup Backup devices • Tape backup (D2T) • Disk backup (D2D) • D2D2T • Cloud backup Application agent • SQL Server • Oracle Database • Lotus Domino • Exchange Server PlateSpin Forge GUI overview • Single file • Bare metal • Virtual machine PlateSpin Forge usage overview Recovery PlateSpin Forge as IT-BCP solution System backup Merits and demerits of IT-BCP solutions • Open file backup 21 chapter 02 The cost of business continuity Cost considerations for business continuity: personnel Now let’s examine the cost of tasks that are likely operating system security patches must also be to arise during the day-to-day operation of infra- applied. Cold standby units will also require regular structure at a backup site. operating system and application upgrades. Maintaining the backup system in the same way as the First and foremost, if RTO and RPO are defined to production system ensures that work can proceed be short, then a state of the art replication infra- without a glitch in the event of an emergency. structure will have to be provided at the backup site. If there are plans to rebuild the production If the RTO and RPO are not quite as aggressive, environment, the opportunity should be taken to the routine backups as described earlier would still setup both sites concurrently, while incorporating apply as a strategy. Where backup is performed to data copying techniques such as replication into the tape or similar, the tapes would need to be trans- design. This is an efficient way to build a backup site. ferred to a remote location, and their storage would require careful management. When a backup site is built to support an operational production environment, it is important to Another cost that arises, regardless of the time- ensure the hardware infrastructure and software scale, is regular testing and training to confirm that are as similar to the production environment as recovery would be possible in any event. A strict possible. If an unavailable CPU type or an older schedule should be prepared and the switching version of the software is used, it may be necessary process verified, showing that the backup site to update the backup site, which may require con- could be setup under conditions where the produc- siderable time and effort in verification and testing. tion environment is offline. This element also incurs a labor cost. Once the backup site is set up, thought must also be given to labor costs for routine operational and maintenance tasks. If a hot standby unit is already operational, the same monitoring solution that is used at the production environment will be necessary, and any software upgrades and server 22 guide book Approaches to business continuity PlateSpin Forge The cost of business continuity Routine monitoring Upgrades PlateSpin Forge usage overview Recovery testing PlateSpin Forge as IT-BCP solution Security patching Backups Merits and demerits of IT-BCP solutions Network setup In addition to building the initial environment, routine operations such as monitoring and Recovery verification testing is another essential. PlateSpin Forge GUI overview upgrade operations must also be considered. 23 chapter 02 The cost of business continuity Cost considerations for business continuity: other In addition to the hardware, software and personnel Alternatively, if the backup site is always in oper- costs already addressed, there are a number of other ation, there would need to be a calculation of the costs associated with IT-BCP that cannot be ignored. costs associated with the backup, such as electricity costs. Options for power outages such as UPS If a rapid recovery is sought, routine online back- or on-site power generation should also not be ups are required, but for high volume backups this ignored. On-site power generation would also incur requires a dedicated broadband connection. Even if costs related to ensuring a source of fuel as well as incremental and differential backups or deduplica- storage costs. tion is being used, it may be impossible to run the backups on the same connection as ordinary tasks. Thinking about IT-BCP in this way, we can easily In these cases, a duplicate connection would defi- see the wide range of associated costs. Corpora- nitely aid IT-BCP. It is important to review the band- tions must ensure optimal BCP countermeasures width available on existing network connections. while keeping these cost issues in mind. The installation of the backup site would require new personnel, and would naturally generate costs associated with day-to-day operations and management. While routine monitoring tasks can be performed remotely, it is important to recognize the setup effort that would be required at the backup site in the event of an emergency. The costs associated with these efforts should also be considered. This system setup process would need to include a review of operational rules, such as how data is returned to the main site and how services are reinitiated once circumstances return to normal. 24 guide book Approaches to business continuity PlateSpin Forge The cost of business continuity Merits and demerits of IT-BCP solutions Power strategy (cost of power, UPS, self-generating devices) Training costs PlateSpin Forge usage overview Network costs PlateSpin Forge as IT-BCP solution Securing human resources IT-BCP countermeasures come with a variety of unavoidable costs, in addition to hardware and software. PlateSpin Forge GUI overview The optimal solution is found by estimating and preparing for these costs. 25 chapter 03 26 Merits and demerits of IT-BCP solutions Approaches to business continuity The cost of business continuity Characteristics of tape backup……………………………………………………… 30 Merits and demerits of IT-BCP solutions Potential solutions and comparison thereof………………………………………… 28 Characteristics of storage replication………………………………………………… 32 Key points for selecting the optimal IT-BCP solution………………………………… 36 Example cost calculation……………………………………………………………… 38 PlateSpin Forge as IT-BCP solution Characteristics of system redundancy……………………………………………… 34 PlateSpin Forge usage overview PlateSpin Forge GUI overview 27 chapter 03 Merits and demerits of IT-BCP solutions Potential solutions and comparison thereof IT-BCP solutions can take various forms depend- However, the procedure for switching systems ing on the scale and structure of the system and in, such as changing environmental settings of the the type of business. Earlier we introduced RPO installed operating system and applications, as well (recovery point objective) and RTO (recovery time as changing over networks, is complex, and requires objective), and the options available differ depend- technicians with adequate training to be present at ing on how these are set for each system. But let the backup site. Additionally, the initial installation us first look at general BCP measures. Here, we will costs tends to be high, as a high-end storage array introduce tape backup, storage replication and is needed to handle the remote replication. system redundancy methods. The key feature of system redundancy then, Firstly, the key feature of tape backup systems where a backup site is maintained with the same using portable tape media has to be the low cost infrastructure as the production environment, is of the environment. For example, the latest tape rapid system recovery. System redundancy is the standard, LTO-6 (2.5TB) can be obtained for a little optimal method for mission critical operations that over $50. A drive to run LTO tapes can also be pur- must not fail. There is, of course, a very high cost chased for around $2,000. This is the recommended associated with the installation of a mirror envi- method for organizations seeking to constrain ronment, and the need for maintenance to match initial costs. However, whatever the RPO or RTO, the production environment adds a considerable restoration using this method takes time, and there running cost. A stable network is also required for are additional running costs related to tape replace- real-time data copying. Some implementations will ment and storage. also require a broadband network, making this a method requiring a considerable investment. Next we have storage replication, which maintains the same storage baseline as the production envi- All three methods have their merits and demerits, ronment, giving it the key feature that data at the but the optimal IT-BCP approach can be found by backup site represents the situation just prior to selecting the appropriate target recovery point and any incident, so the backup data is up-to-date and recovery time. the RPO is very aggressive. 28 guide book The cost of business continuity Tape Backup ● Low ● cost Only data can be recovered ● Restoration OS and applications need to be re-installed ● High daily operational load (replacement, storage and rotation of tapes) RPO: RTO: Insufficient for BCP Merits and demerits of IT-BCP solutions ● TCO: takes time Approaches to business continuity PlateSpin Forge Storage Replication very up-to-date ● Only data can be recovered ● OS and applications need to be re-installed ● Switching ●A process is complex technician is needed at the disaster recovery site TCO: RPO: RTO: Not cost-effective PlateSpin Forge as IT-BCP solution ● Data ● Installation costs are high (reconstruction of storage infrastructure) PlateSpin Forge usage overview System Redundancy ● System can be recovered in a short time installation costs (at least doubles system installation cost) TCO: ● High-speed RPO: ● Expensive connection needed to operate (maintenance of backup systems) RTO: Highly effective, but also highly expensive PlateSpin Forge GUI overview ● High 29 chapter 03 Merits and demerits of IT-BCP solutions Characteristics of tape backup Let us look again at the characteristics of tape stores only the parts that have changed since the backup. previous backup, and differential backup, which stores only the parts which have changed since the Magnetic tapes used for backup enable informa- previous full backup. Recently it has become the tion to be recorded by magnetizing a large number norm to employ deduplication, which optimizes of minuscule particles. While various standards have the number of backup tapes and storage capacity existed in the past, including DDS, DTL and AIT, the by excluding duplicate data. current mid-range leading product is LTO (Linear Tape-Open), with some DDS still in use. There are Although tape backup can significantly con- also additional high-end standards designed by strain initial installation costs compared to other individual vendors. The present version of LTO is methods, there is a considerable cost associated Generation 6 (LTO-6: without compression 2.5TB, with routine management such as magnetic tape with compression 6.25TB), and a roadmap exists up replacement and storage. Furthermore, in the event to Generation 8. of recovery, the process involves finding the appropriate magnetic tape and performing the restora- Tape backup uses a backup tool to backup and tion manually. So while via scheduling the backup restore data. The backup method is capable of process itself can be automated, recovery requires backing up individual files, as well as images, where a lot of time and effort. the whole system including the operating system and applications is stored block-by-block as an A product similar to magnetic tapes that offers image file. There also exists a system called CDP high capacity and portability is RDX. RDX is a hard (Continuous Data Protection) that is installed as an drive based solution and is used for rapid data agent on the device to be protected, where it con- access. An RDX drive uses a USB connection, and stantly monitors its status and performs backup in when a cartridge is inserted it is recognized as a real time in the event of any change or addition. local drive, allowing simple data copying. As well as full backup of all data, there are a number of methods for reducing the time required for backup, such as incremental backup, which 30 guide book The cost of business continuity Operating site Disc redundancy (R AID board, GDS) Mirror Business continuity for minor disasters Merits and demerits of IT-BCP solutions A Integrated backup environment Built in Approaches to business continuity PlateSpin Forge Tape library Backup server Mirror Built-in B PlateSpin Forge as IT-BCP solution A B PlateSpin Forge usage overview Data storage site A Transportation Backup data PlateSpin Forge GUI overview B 31 chapter 03 Merits and demerits of IT-BCP solutions Characteristics of storage replication Next, let us look at storage replication. in accordance to a schedule. Storage hardware enabled for replication can usually handle both syn- Storage replication is a technology for real-time chronous and asynchronous replication processes. copying of the computer data to be protected, and is an effective way of mitigating the breakdown risk If synchronous replication is selected, restoration of continuous services. A storage replication sys- can be performed using data from immediately tem constantly acquires data, allowing restoration prior to the breakdown, giving an optimal RPO, to the point just prior to the breakdown, and can, although an operating system and environment from a business continuity perspective, provide would have to be arranged first. Setting a short RTO excellent RPOs (recovery point objectives). (recovery time objective) is not sufficient in itself. Skilled technical support would be required at the There are two types of storage replication: remote site to achieve rapid recovery in the event synchronous, where the computer to be protected of an emergency. and the replication storage media are written to simultaneously, and asynchronous, where the Replication solutions that come as a black box replication storage is written to according to a from a storage vendor require a network that can defined schedule. handle the load as well as a system for transferring data to the remote site. While the initial data 32 Synchronous replication completes the I/O writ- transfer requires a full copy of all data stored at ing process when it is confirmed that the write to the time, once this is completed, the data transfer the protected computer and the replication stor- volume can be reduced by employing functions to age media was successful. This means adequate transfer only those blocks in a snapshot which have network bandwidth to the remote site is required, changed, and by making use of deduplication to as well as a low-delay latency. Asynchronous rep- identify duplicate data block-by-block. These meth- lication writes data to be protected to the storage ods provide a system for minimizing data transfer media, exports data to be replicated to memory, volume on the network, while achieving high and performs copying to the replication device speed data copying. guide book Application server ACK ① ④ ② ③ Local Merits and demerits of IT-BCP solutions Write I/O An I/O complete message (ACK) is returned to the server upon completion of transfer to the remote site The cost of business continuity Synchronous Replication Approaches to business continuity PlateSpin Forge Remote Zero data loss in event of disaster Asynchronous Replication PlateSpin Forge as IT-BCP solution Additional delay for transfer to remote site Application server ACK ① PlateSpin Forge usage overview Write I/O An I/O complete message (ACK) is returned to the server before transfer to the remote site ② ③ ④ Remote No delayed I/O to server Some data loss in event of disaster PlateSpin Forge GUI overview Local 33 chapter 03 Merits and demerits of IT-BCP solutions Characteristics of system redundancy Now let us look at some ways of achieving system As mentioned above, transfer of data to a redundancy. remote site is achieved by replication. To ensure the shortest RPO and RTO while maximizing System redundancy, as the name suggests, availability, synchronous replication is required, involves the construction of a backup site with the but this does come with an additional cost, as exact same environment as the production envi- a low latency broadband network is needed. ronment, and that is capable of handling hardware Although asynchronous replication is available as breakdown by duplicating the computer itself using an option, mission critical systems requiring system clustering, as well as coping with disaster by copying redundancy are generally better suited with the data to a remote site in real-time using replication. synchronous model. As this system enables instant recovery to a state However, providing a system environment at two immediately prior to the breakdown, in IT-BCP sites has a considerable impact on system installa- terms it enables RPO and RTO to be set at will. It tion costs, and a broadband connection is essential is ideal for mission critical operations that require to achieve real-time information processing and high business continuity performance. instantaneous switching. Furthermore, day-to-day maintenance would be required at the backup site Clustering is most often achieved using failover in the same manner as the production environ- clustering, which for Windows environments ment, adding to the maintenance costs. is available in Windows Server 2012, but uses specific clustering software under non-Windows Operations that require continuous system avail- platforms or virtualized environments. The reason ability demand infallible protection through system for this is that clustering functions in a virtualized redundancy regardless of cost. environment are unable to monitor ever y application, and therefore cannot detect when a breakdown occurs. Using clustering functions incorporated into the hardware allows a higher precision of fault detection, as a separate network is provided for fault detection. 34 guide book A A Cluster Cluster Currently in use Currently in use Standby Standby Mirror between R AID devices A Merits and demerits of IT-BCP solutions Storage area network (SAN) Backup server The cost of business continuity Primary site (data center) B B Approaches to business continuity PlateSpin Forge B PlateSpin Forge as IT-BCP solution Remote mirror Wide Area Ethernet R AID device Storage area network (SAN) PlateSpin Forge usage overview B Backup server A Standby Standby Cluster Active (reference system) B A B Standby Cluster PlateSpin Forge GUI overview Active (reference system) AA Secondary site (data center) A 35 chapter 03 Merits and demerits of IT-BCP solutions Key points for selecting the optimal IT-BCP solution We have seen how each of the IT-BCP solutions monthly accounts could be missed, so the level of discussed so far has its own merits and demerits. Pre- protection should perhaps be of the order of a few venting system interruptions in the event of a break- hours. Difficulties would also arise if core sales sys- down can be costly, and while low cost methods of tems were to fail, such as plant production control avoiding operational difficulties are available, these or invoice issuing. require patience. Essentially, the key is to determine the recovery priority of all existing systems. Internal communication infrastructure such as mail servers and groupware servers represent Although the approach will differ from corpora- another field where the impact on business due tion to corporation, we should seek to minimize to a loss of service could be significant, but it may the RTO of those systems that, if interrupted, be possible to overcome this by adopting alterna- would have a significant impact on business. The tive means of communication such as cellphones, operations that require rapid recovery are those smartphones or social network services like Face- that have a considerable impact on business, such book. The RTO could well be set to be a little more as accepting orders or shipping instructions. Such flexible in this case. issues have a direct impact on customers. Important systems such as call centers handling customer inquiries should also be considered. On the other hand there are systems that have little impact externally, such as file servers. For such systems a low cost recovery method such as tape backup may be selected, rather than implementing an expensive IT-BCP solution. Some core systems such as financial accounting are also of key importance, but it must be determined whether immediate recovery is a necessity. If the financial accounting system were to fail it is possible that 36 guide book The cost of business continuity PlateSpin Forge usage overview Ordering system Sales management system PlateSpin Forge as IT-BCP solution Call center system Financial accounting system RTO Merits and demerits of IT-BCP solutions File server Approaches to business continuity PlateSpin Forge Communication system Impact on operations PlateSpin Forge GUI overview 37 chapter 03 Merits and demerits of IT-BCP solutions Example cost calculation An unavoidable key concern when considering Therefore, the focus is now on installing a custom- IT-BCP solutions is cost. Seeking recovery as rapidly ized appliance running a virtual environment at a as possible implies significant investment and orga- remote site, as a backup standby environment. In nization, and deters some corporations from adopt- the event of an emergency the appliance is capable ing measures. The reason for this is the high cost of of continuing operations in the same way as the maintaining the backup site or environment, due production environment. One such backup appli- to the dependence of the system on the hardware. ance product is PlateSpin Forge from Netiq. However, the latest virtualization technology is now delivering solutions that enable standby environ- Real-use cost calculations show this approach to ments to be built flexibly, with no dependence on be exceedingly efficient. Let us compare storage the physical environment. These backup methods replication with a backup appliance, both running use customized infrastructures such as cloud ser- storage synchronization. If we assume the corpo- vices or backup appliances to allow a mirror of the ration's production environment runs 15 physical production environment to sit in standby. servers and five virtual servers, we would expect a typical cost reduction of 44.8% for a TCO period of In other words, to address both low cost and four years from installation. rapid recovery, the approach must now consider the recovery level objective (RLO), or the level of This backup appliance option has gained consid- business that should be recovered. As a recovery erable attention as a solution that satisfies the prin- level of 100% would require considerable main- ciples of business continuity while keeping costs to tenance costs for the standby environment, a a minimum. flexible approach is needed, whereby, for example, 30% of operations are initially recovered, leaving the remainder to be recovered once the required environment is deployed. While this leads to some reduction in system responsiveness, it ensures business continuity with minimum impact. 38 guide book The cost of business continuity Breakdown of total costs (normal operations): "Inter-storage transfer vs. Backup appliance" $ Approaches to business continuity PlateSpin Forge 700K 600K 500K 400K 300K 100K 0 B: Backup appliance Service $53,240 $53,240 Testing $41,917 $16,765 $201,204 $0 $43,104 $8,714 $179,715 $154,354 Permanent operation Construction Product $ PlateSpin Forge as IT-BCP solution A: Inter-storage transfer Merits and demerits of IT-BCP solutions 200K Change in TCO (normal operations): "Inter- storage transfer vs. Backup appliance" 700K 600K PlateSpin Forge usage overview 500K 400K 300K 200K 100K 0 Year 2 Year 3 Year 4 $162,145 $281,192 $400,195 $519,229 B: Backup appliance $51,223 $111,850 $172,435 $233,047 PlateSpin Forge GUI overview Year 1 A: Inter-storage transfer From the ITR document, "Investment efficiency in business continuity solutions using virtual environments: low cost disaster recovery". 39 chapter 04 40 PlateSpin Forge as IT-BCP solution Approaches to business continuity The cost of business continuity Merits and demerits of IT-BCP solutions PlateSpin solution overview………………………………………………………… 42 Optimal virtualization planning: PlateSpin Recon…………………………………… 44 Migrating to a virtual environment: PlateSpin Migrate……………………………… 46 Overview of business continuity using PlateSpin Forge……………………………… 50 PlateSpin Forge as IT-BCP solution PlateSpin Forge as IT-BCP solution…………………………………………………… 48 PlateSpin Forge usage overview PlateSpin Forge GUI overview 41 chapter 04 PlateSpin Forge as IT-BCP solution PlateSpin solution overview We have talked about how virtualization tech- calculated accurately during the migration plan- nology can be useful for IT-BCP. The PlateSpin ning, so that they match the actual workloads. The product line from NetIQ offers a solution for each PlateSpin Recon virtualization planning tool is ideal stage (phase) of virtualization, with virtualization for these capturing and planning phases. technology-enabled solutions to support all efforts on the journey to a fully virtualized environment. Once migration planning has been done, the Essentially, the four phases are situation capture, migration phase begins, to eventually migrate from planning, migration and backup/BCP. the current environment to the new environment. Generally, if the existing environment runs on phys- In order to achieve a virtualized environment, the ical infrastructure, this is a P2V (physical to virtual) first essential phase is to capture the current state migration. If a workload is to be developed on a of affairs, such as capturing internal servers, prepar- virtual platform using an on-premise environment, ing an inventory and gathering data on workloads a V2V (virtual to virtual) migration is required. (whole environments to run on the virtual machine hosts). Before making the switch to a virtual envi- PlateSpin Migrate is a useful solution for migration ronment, assuming the purpose is to develop a of data and systems between various environ- backup environment for IT-BCP at the same time, it ments, and is capable of freely migrating workloads is important to ascertain which servers are handling between physical and virtual environments. Its key important workloads, as well as creating the sched- feature is the ability to handle all major virtualization ule for implementation. platforms such as VMware, Microsoft Hyper-V, SUSE Linux Enterprise with Xen and Citrix XenServer. Once the situation has been captured, the next 42 phase is the planning phase, which involves migra- The final phase is the IT-BCP solution: backup/ tion planning to determine the sizing of virtual BCP. When specifying any environment for backup, servers according to the size of the new environ- whether physical or virtual, it is important to inte- ment and the schedule for migration according grate backup while mitigating operational load. to prioritization, as well as careful measurement In the event of an emergency, the solution should of effectiveness following migration. The required be capable of implementing provisional recovery resources for the migration environment must be using flexible failover, and supporting rapid busi- guide book The PlateSpin backup appliance PlateSpin Forge, available for use when an environment matching with its software component PlateSpin Protect, the production environment is again ready for use. meets the needs of this phase. Phase Planning Migration Backup / BCP • Capture internal servers • Sizing of virtualized servers • P2V (physical to virtual) • BCP Solution • Compile inventory • Migration planning • V2V (virtual to virtual) • Backup of physical servers • Monitoring • Measurement of overall effectiveness • V2P (virtual to physical) • Compile workload • Identifying targets for virtualization • P2P (physical to physical) • P2I (physical to image) • V2I (virtual to image) • I2V (image to virtual) • Failover • Backup integration • Disaster recovery appliance PlateSpin Forge/PlateSpin Protect Product PlateSpin Forge usage overview • I2P (image to physical) • Backup of virtual servers PlateSpin Forge as IT-BCP solution Key Tasks Capturing current situation Virtual/Cloud Merits and demerits of IT-BCP solutions Route to virtualization and the cloud The cost of business continuity ness continuity. A failback function should also be Approaches to business continuity PlateSpin Forge PlateSpin Recon PlateSpin Migrate PlateSpin Forge GUI overview 43 chapter 04 PlateSpin Forge as IT-BCP solution Optimal virtualization planning: PlateSpin Recon Next we will take a closer look at the product in business changes and improved planning. Efficient the PlateSpin solution portfolio for planning the sizing is achieved by constantly maintaining an optimal virtual environment, PlateSpin Recon. optimal environment. PlateSpin Recon is a virtualization planning tool to When considering an IT-BCP solution, it is only assist you in achieving optimal resource planning by setting the level at which business needs to for your virtual environment. In addition to tracking be recovered, the RLO (the fallback level required CPU, memory, HDD and network use on all existing for recovery), that a plan suited to the required servers on the network in both physical and virtual performance can be created. PlateSpin Recon environments, PlateSpin Recon enables remote can help in creating such a plan. It also includes capture of inventories and performance data. The a function for analysis of cost reductions and workloads are then analyzed in detail leading to investment efficiencies compared to the existing automated creation of a server integration and environment, which even calculates hypothetical consolidation plan. PlateSpin Recon is designed electricity and cooling costs under the created plan. to automatically handle up to 10,000 server data So as well as with IT-BCP, the software also helps points. All operations are performed agent-less. with corporate social responsibility (CSR) efforts for CO2 emission reduction. Creation of server integration and consolidation scenarios can also be automated by configuring settings for the hardware or virtual platform to be used, as well as resource allocation following consolidation. Another key feature is the simulation function for estimating operational status up to 12 months ahead, enabling potential issues with the virtualization process to be identified in advance. At the same time, tracking of resource usage can be set to continue after migration to capture changes to the environment, enabling a flexible response to 44 guide book Approaches to business continuity PlateSpin Forge The cost of business continuity Merits and demerits of IT-BCP solutions PlateSpin Forge as IT-BCP solution PlateSpin Forge usage overview PlateSpin Forge GUI overview 45 chapter 04 PlateSpin Forge as IT-BCP solution Migrating to a virtual environment: PlateSpin Migrate Next in the PlateSpin solution portfolio is The software can handle all major virtualization PlateSpin Migrate, which enables workloads to be platforms including VMware ESX, VMware ESXi, freely migrated. Microsoft Hyper-V, SUSE Linux Enterprise with Xen and Citrix XenServer, and is designed to enable PlateSpin Migrate is a virtualization and migration simple migration between different virtualization tool that enables migration of servers running on platforms, such as from VMware to Hyper-V or the virtual or physical environments, called workloads, other way around. This means virtual environments to other physical or virtual environments. Because can be created with no dependence on a specific workloads can be migrated as they are, with- vendor, allowing the virtualization platform to be out altering the existing environment, PlateSpin changed in the event of new trends in virtualization Migrate is able to migrate a physical server to a test platforms or new evaluations of cost efficiency. virtual environment to verify application functionality. Conversely, a system that has been developed Because PlateSpin Migrate is capable of returning and tested in a virtual machine can be deployed virtualized workloads back to a physical environ- in a single step to a physical server to become the ment, there is always peace of mind that a V2P production environment. operation can be implemented in the event of application incompatibilities leading to issues with Ordinarily, P2V migration requires complex proce- the virtual environment. All in all, PlateSpin Migrate dures such as the setup of the virtual machine, but is a powerful solution supporting end-to-end vir- with PlateSpin Migrate this operation is simplified tual environment migrations. via automation. The entire process, from creation of the virtual machine on the destination server to adjusting the system and copying data, can be automated using the GUI drag and drop functionality. The bandwidth throttling function enables migration of workloads in a short time period without impacting the network too much. 46 guide book Approaches to business continuity PlateSpin Forge The cost of business continuity Merits and demerits of IT-BCP solutions PlateSpin Forge as IT-BCP solution PlateSpin Forge usage overview PlateSpin Forge GUI overview 47 chapter 04 PlateSpin Forge as IT-BCP solution PlateSpin Forge as IT-BCP solution The next item in the PlateSpin portfolio is NetIQ’s While business continuity for production environ- IT-BCP solution: the PlateSpin Forge backup appli- ments usually involves a performance reduction, ance and its software element PlateSpin Protect. the level of fallback operation can be adjusted Let us focus more closely on the appliance product to allow considerable improvements in RTO. If a PlateSpin Forge. network is provided to remotely access PlateSpin Forge, the provisional recovery process can be sim- PlateSpin Forge is a backup appliance product ply implemented without the need for staff to be which provides business continuity functionality deployed at the remote site. via a provisional recovery environment in the event 48 of an emergency, through integrated backup of Even if the new failback environment has multiple servers, whether physical and virtual envi- hardware from a different vendor than the one ronments, on a single unit. Because PlateSpin Forge used in the initial environment, or uses a different is an appliance server preloaded with the hardware virtual environment, the system can handle it. environment required for business continuity as PlateSpin Forge comes equipped out-of-the-box well as virtualization and recovery software, it can with a broad set of drivers for multiple virtual and start providing value as soon as it is connected to physical platforms, and is capable of flawlessly the network, and vastly simplifies the process of restoring from e.g. Microsoft Hyper-V to VMware managing BCP environments. ESX, or to Citrix XenServer. When using PlateSpin Forge, recovery, including It also incorporates a failover test function that different system settings for each and every data allows failover testing to be performed using backup, is simply a matter of a few clicks. As well different network settings than the production as being useful as a routine backup environment, environment. PlateSpin Forge can be set up to run PlateSpin Forge excels at operating as a failover its own test environment, so failover testing can environment for provisional recovery in the event be performed with no impact on the production of an emergency. It can also perform failback back environment, allowing it to be used as a verification to the original environment, once the infrastructure environment for security patches and application is ready, to achieve full recovery. upgrades. Once testing is completed, all updates guide book Forge is thin provisioning-enabled, meaning it is to a synchronized state, meaning you can freely capable of virtualizing storage volumes to support change, add or delete test environments and flexible storage operations. The cost of business continuity are deleted and the system automatically reverts Approaches to business continuity PlateSpin Forge always have the option to immediately reset. Finally, as well as offering recovery point manage- PlateSpin Forge Dashboard Merits and demerits of IT-BCP solutions ment on a workload-by-workload basis, PlateSpin PlateSpin Forge as IT-BCP solution PlateSpin Forge usage overview PlateSpin Forge GUI overview 49 chapter 04 PlateSpin Forge as IT-BCP solution Overview of business continuity using PlateSpin Forge Now let us look again at how PlateSpin Forge can be 100%, but it will provide business continuity be of use from a business continuity perspective. at a fallback level. As in practice often a network Because it is a backup appliance, PlateSpin Forge switchover is required, PlateSpin Forge can auto- excels across a range of scenarios, for routine matically convert the backup system's IP address backup use as well as for business continuity and to match the failover network before booting. This restoration in emergencies. means the system will have network access as soon as it is switched on. This eliminates the task of Firstly, during routine use in a production envi- changing network addresses following restoration, ronment, PlateSpin Forge functions as a device to as often required with system backup tools. collectively backup multiple servers. It implements replication for both data and system settings, and Once the hardware and the software that are to applies internal recovery point management. In the be used for the full recovery are ready, PlateSpin event of data loss such as due to a deletion made in Forge's internal virtual workload can be restored error, even if this error is replicated, PlateSpin Forge to the production environment. The new environ- can still recover the environment by returning to ment options can be simply selected using the GUI, one of the multiple previous recovery points. It also providing a secure failback route without multiple employs incremental backup, so backup time is complex steps. As well as failback to a physical kept to a minimum by avoiding backup of the same environment, PlateSpin Forge is also designed to data again and again. handle failback to virtual environments, and restoration can proceed even when different hardware In the event of a disaster or breakdown, the is used in the new system. Restoration can also be workload can be booted up using PlateSpin Forge's performed to a different virtualization platform internal virtual environment, providing dramatic than the original one. improvements to RTO. As you only need to boot up the backup virtual server, there is no need to carry out restoration procedures as with an ordinary backup tool. Because multiple server environments are housed in a single unit, the service level will not 50 guide book In event of disaster / failure Service level Under normal operation The cost of business continuity Overview of business continuity using PlateSpin Forge Approaches to business continuity PlateSpin Forge Full recovery 100% RLO RPO RTO Time Merits and demerits of IT-BCP solutions Provisional recovery Final backup PlateSpin Forge as IT-BCP solution Implementation of integrated backup Reduced-level operation Restoration to production using Forge's internal server once prepared backup system PlateSpin Forge usage overview PlateSpin Forge GUI overview 51 chapter 05 52 PlateSpin Forge usage overview Approaches to business continuity The cost of business continuity Key feature of PlateSpin Forge: easy-to-use GUI with low overhead………………… 56 Two methods for initial workload replication………………………………………… 58 Merits and demerits of IT-BCP solutions Key feature of PlateSpin Forge: simple and user-friendly setup……………………… 54 Routine operations example 1……………………………………………………… 60 Routine operations example 3 – system recovery training…………………………… 64 Failover……………………………………………………………………………… 66 PlateSpin Forge as IT-BCP solution Routine operations example 2……………………………………………………… 62 Failback……………………………………………………………………………… 68 PlateSpin Forge usage overview PlateSpin Forge GUI overview 53 chapter 05 PlateSpin Forge usage overview Key feature of PlateSpin Forge: simple and user-friendly setup One key feature of PlateSpin Forge is its simple for business continuity simply by placing it on the setup. This is because as a product it includes all network and making simple configuration changes the elements required for business continuity: using the GUI. Of course, the performance will not hardware such as a server and storage, recovery be equal to the production environment, but it is software with replication and backup functionality, powerful enough to implement business continuity and a virtual environment. All of this is provided in at an excellent fallback level. one unit, as a single package. System building with PlateSpin Forge is easy. Ordi When we think about ordinary business continu- narily the integration costs are considerable due ity we think of having servers and storage ready at to the range of tasks to be completed, such as the a backup site, and of synchronization settings such installation of an operating system in preparation as replication to a storage mirror. As well as setting for applications, and creating the virtual environ- up the environment we also have to implement ment. With PlateSpin Forge, you simply remove the backup of the production environment, along with unit from its box, connect it to the network, assign complex system settings. In practice there are also an IP address and password, and select the servers a number of other requirements before we have a to be protected. Backup is handled automatically functioning backup site, such as stationing a tech- simply by setting the synchronization policies, such nician on site and scheduling copying of data and as the schedule and compression level, and clicking system settings from the production environment execute. The initial backup will complete within one to the backup environment. Setting the levels for to two days, depending on the number of servers RPO and RTO will depend on the specific organiza- to be protected, and thereafter the incremental tion, but it is easy to imagine a significant time and backup system will proceed automatically. This effort required to setup each system. means a considerable reduction in time and effort needed for setup. With PlateSpin Forge, all the essential backup site equipment including the server, the storage and the recovery software is provided in one unit, making it possible to build a backup environment 54 guide book Backup target ser ver Operating environment The cost of business continuity Typical disaster recovery site setup Approaches to business continuity PlateSpin Forge 1. Application setup 2. OS installation Backup 4. Preparation of server Remote storage (critical) Local storage (critical) 3. Construction of virtual environment Backup site 5. Preparation of storage 6. Software replication Large system integration cost and effort 7. Storage replication Backup target ser ver PlateSpin Forge as IT-BCP solution Simplicity of PlateSpin Forge infrastructure Merits and demerits of IT-BCP solutions Production site Mount Synchronous Operating environment Using Forge: Production site 1. Remove from box Backup site 2. Configure IP address and password 3. Select servers for protection Simple disaster recovery site setup 4. Set synchronization timing PlateSpin Forge GUI overview A single PlateSpin Forge unit enables IT-BCP with no requirement for additional equipment or technical staff PlateSpin Forge usage overview Synchronous 55 chapter 05 PlateSpin Forge usage overview Key feature of PlateSpin Forge: easy-to-use GUI with low overhead Even more than simplicity of setup and system to be carried out in a way that ensures switching building, PlateSpin Forge of fers excellent works in practice, with no impact on the production controllability for day-to-day backup operations. environment. The test outcomes are shown on The same applies to recovery, all the while ensuring the GUI, allowing any problems to be eliminated business continuity. in advance. Once testing is complete, temporary settings for the workload used in the test are Information backup is performed incrementally discarded, and PlateSpin Forge automatically in accordance with the synchronization policies reverts to the pre-test environment. set at the time of install, so there is no need for any specific routine maintenance. Any changes, Should a protected server suffer a breakdown, including changes to system settings, are part of rapid recovery is needed. With PlateSpin Forge, the incremental backup. So if, for example, there is failover optimized for business continuity can be an operating system patch for a protected server, executed with a single button. All that is required is the patch will be automatically installed to the selecting the workload for recovery and selecting server operating system on the workload as part of the recovery point. While performance will not be the backup process. Recovery point management equal to the production environment, PlateSpin is also implemented, so the required environment Forge provides a fallback version of the produc- can be selected and restored in the event of a mis- tion environment. If just a single workload is to be take on the protected server. recovered this can be done in a few minutes, while simultaneous recovery of multiple workloads can be One task that should be performed periodically brought to provisional recovery in up to a few hours. is a failover test, to determine whether the 56 environment can be correctly restored in the Once hardware, etc. is prepared for full recov- event of an emergency. This task simply requires a ery, a failback operation is required to move from recovery point to be selected and executed using the provisional recovery environment to the full the Test Failover button on the control screen. A recovery environment. The failback function works test network is created as a virtual network, so that by configuring how failback will be executed, for when the virtual machine is booted for testing example by selecting which transfer method and it connects to the test network rather than to a which volumes are to be used, and then executing network with external access. This allows testing failback to the full recovery environment. guide book ing replication following recovery ensures that PlateSpin Forge's failback function can then be set system protection for business continuity is once for replication to ensure future protection. Restart- again maintained. Backup Tools Using Forge • Failure is detected on production system • Failure detected on production system • Switch networks to standby system • Select restore point using data stored in Forge • Suspend storage replication • Mount the storage LUN in standby system • Start standby system and verify operation • Implement failover in Forge, start up Forge environment and verify operation Provisional recovery • Prepare new ser ver • Prepare new ser ver • Prepare new ser ver product • Exchange HDDs • Prepare new storage products • Implement failback in Forge • Reinstall OS and applications (including backup tool agents) • Exchange storage • Confirm operation • Implement restoration • Reinstall OS and applications (including replication software) • Test and verify storage replication PlateSpin Forge usage overview • Start backup tool and select restore point • Implement reverse replication • Start applications and verify operation Full recover y Several days are needed before business can start PlateSpin Forge as IT-BCP solution • Apply OS updates and the latest patches to the standby system Merits and demerits of IT-BCP solutions • Failure is detected on the production system Storage Replication (non-synchronous) The cost of business continuity The full recovery environment created through Approaches to business continuity PlateSpin Forge Work can begin after several hours under provisional recovery System maintenance and personnel needed at backup site Provisional recovery can be implemented remotely using Forge, enabling minimum cost business continuity PlateSpin Forge GUI overview Work can begin after several hours under provisional recovery 57 chapter 05 PlateSpin Forge usage overview Two methods for initial workload replication Installing PlateSpin Forge initially requires the If, however, the volume of data for backup is small entire workload of the target server that is to be or the servers to be protected are distributed nation- protected to be copied, including operating system wide, it would be problematic to host a PlateSpin and other system settings and data. This process is Forge unit at each site. Therefore, the second option called replication. There are two main methods of would be to install it at the backup site and perform performing the initial full replication upon install. the initial full replication over the network. If the server to be protected holds a large quantity Although this method requires more time to com- of data or there is no broadband network to the plete than full replication using the first option, the backup site, the first option would be to copy all targets to be protected can be assigned from the data and system settings within the local environ- backup site, and the IT-BCP environment can be ment. PlateSpin Forge is installed within the same achieved by scheduling replication of workloads in LAN segment as the server to be protected, and the sequence. Detailed settings are available including entire workload is copied over. If there are multiple network bandwidth, compression level and band- servers, workloads from multiple units can be cop- width limits, so replication can be managed with- ied simultaneously, up to the limit supported by out significant impact on existing networks. Once the network bandwidth, and PlateSpin Forge will the initial full replication is complete, a continuous integrate these as a single backup environment. process of incremental backup is initiated. Next, PlateSpin Forge is transported to the backup site and powered up. PlateSpin Forge's own IP address is then changed, which changes the replication IP addresses in the backup settings of workloads to be protected, allowing data protection to be implemented passing only incremental data from the local environment across the network. Performing the initial full replication in the local environment means the backup environment can be prepared as a rapid IT-BCP solution. 58 guide book Approaches to business continuity PlateSpin Forge The cost of business continuity Copy the production environment locally and transport it, for example by truck, to a backup site where it can be installed Backup site Merits and demerits of IT-BCP solutions Production site PlateSpin Forge as IT-BCP solution Production site Production site Production site PlateSpin Forge GUI overview Production site PlateSpin Forge usage overview Install at backup site and copy the production environment across the network 59 chapter 05 PlateSpin Forge usage overview Routine operations example 1 Here we will look at a specific example of After the migration of the production environ- adopting PlateSpin Forge. Let us say the intention ment to a private cloud, PlateSpin Forge is config- is to migrate around 20 physical servers with ured to protect the workloads that are now running integrated backup to tape to a private cloud as an in the private cloud. Once the initial replication is IT-BCP solution. performed, PlateSpin Forge is installed at a remote backup site, and functions as an IT-BCP solution Firstly, the workloads of the 20 physical servers providing remote backup for the production envi- should be consolidated onto a single PlateSpin ronment in the cloud. Forge unit using its virtual environment, so all workloads are copied using a P2V migration process. At PlateSpin Forge enables a simple migration this stage, the production environment is usable to a private cloud, as well as making a valuable and IT-BCP enabled, and can be rapidly recovered contribution to IT-BCP in the private cloud to the provisional recovery state on PlateSpin Forge environment. Configuring replication settings in in the event of any problems with the physical advance means that the environment is capable servers. At the same time, the private cloud envi- of rapidly switching to provisional recovery mode ronment can be built for future use as the produc- for business continuity even when the party tion environment. In this case, we assume that the responsible is not present. environment will run 20 workload units as virtual machines on a single physical server. As a result, not only does PlateSpin Forge provide a level of business continuity infrastructure that Once the workloads of the 20 existing environ- was unavailable in the past, it also succeeds ments are copied to PlateSpin Forge, the PlateSpin in reducing the hardware cost and complex Forge unit is moved and installed at the site where operational burden associated with 20 physical the private cloud environment is to be built. A servers. In case of an emergency, PlateSpin Forge V2V migration process enables migration from provides valuable functionality as a provisional the PlateSpin Forge environment into the private recovery infrastructure. cloud environment. 60 guide book Problem: 1. A business continuity solution cannot be implemented without ensuring countermeasures in the event of a failure. 2. Downtime during failure can run into days. 3. Plans must be made to switch out old equipment as well as looking ahead to virtualization. Before 12 physical servers Phase 1: Backup existing physical production environment to Forge. Phase 1′: Backup vir tual environment to Forge. After Phase 3: The newly constructed vir tual environment is synchronized and backed up to Forge, providing a disaster recover y solution when sited remotely. One virtual server (12 instances) Vir tual environment is backed up on Forge. PlateSpin Forge usage overview Phase 2: Vir tual environment on Forge is migrated as a whole to become a vir tual environment on the new ser ver. PlateSpin Forge as IT-BCP solution Tape Two virtual servers (8 instances) Merits and demerits of IT-BCP solutions PlateSpin Forge solves this problem The cost of business continuity PlateSpin Forge: Case Study 1 Approaches to business continuity PlateSpin Forge Two virtual servers (8 instances) PlateSpin Forge GUI overview 61 chapter 05 PlateSpin Forge usage overview Routine operations example 2 Next, we will discuss how PlateSpin Forge can no longer any requirement to perform tape back- help at sites that will unavoidably continue to use ups at each site. Unlike with simple replication, multiple backup tools, by breaking down each sys- this system provides recovery point management, tem into silos. allowing the system to be reverted precisely to any recovery point. Backup operations are also Organizations with different systems at each site integrated, reducing on-site operational load, and or plant may find it difficult to establish respective allowing human resources to be diverted to other backup environments. This is because in many operations. cases backup is left to the technician onsite, meaning that backup software and tape media may This P2V process provides full integration of differ for each system, and that rationalization of cross-site backup environments, and enables usage methods and backup policies may not occur. organizations to switch away from tape backup With a wide distribution of operations and environ- environments that can require several days ments, it can be difficult to assign priorities to sys- for recovery. PlateSpin Forge delivers dramatic tems, making rapid recovery for business continuity improvements in RPO and RTO compared to such in the event of an emergency an unlikely prospect. legacy technologies. The most efficient solution for such a mixed backup environment is PlateSpin Forge. Firstly, all workloads running on the various systems should be migrated to PlateSpin Forge's virtual environment using P2V. Specifically, the physical environments running at each site are selected for protection, and the workloads are replicated to PlateSpin Forge. Once the initial full replication is complete, the workloads are kept synchronized using incremental backup at a predefined schedule, meaning there is 62 guide book 1. Each site has its own backup system. Different plants have different systems and information capture cannot be implemented using information systems. Problem: The cost of business continuity PlateSpin Forge: Case Study 2 Approaches to business continuity PlateSpin Forge 2. Backup policies are disparate. 3. Only operational staff members understand the systems, confounding system handover. 4. The system is not periodically backed up. Before Merits and demerits of IT-BCP solutions PlateSpin Forge solves this problem Site A backup tool PlateSpin Forge as IT-BCP solution Site B backup tool Site C backup tool Site D backup tool PlateSpin Forge usage overview After PlateSpin Forge GUI overview Different vendor systems, different sites and different operation methods can all be combined. Only one or two staff members are required, meaning only a simple manual is required for handover. 63 chapter 05 PlateSpin Forge usage overview Routine operations example 3 – system recovery training Here we will discuss the failover test process, ing the backup workload to be verified. Indeed, the which is easy to be carried out routinely when process simply involves selecting the workload to using PlateSpin Forge. be verified and clicking on the failover test button. This makes shorter-term TTOs achievable. The failover process to be performed at the backup site in the event of an emergency should be A traditional failover test often requires personnel routinely tested. Despite this, many organizations to be stationed at the backup site, and must be omit such verification in their routine operations performed on a non-working day in order to avoid in an attempt to avoid impact on the production impacting the existing environment. In some cases, environment. For proper implementation, however, the test can only be performed with the production failover capability should be checked on a monthly environment temporarily offline, to simulate an basis, at the minimum. incident, making the impact of the failover testing considerable. With PlateSpin Forge, however, the This factor is quantified using the TTO (Test Time test can be performed with no impact on the pro- Objective) metric, which asks how easy it is to duction environment whatsoever. test the system as part of day-to-day training for business continuity. Often several days have to be PlateSpin Forge provides its own test environment devoted to verifying the system, so that you know on a custom internal network, which can be used whether your IT-BCP solution will function correctly to perform a wide range of verification tasks such when it comes to the crunch. Daily verification as pre-verification of security patches or application can even be essential to ensure rapid and reliable upgrades. Moreover, PlateSpin Forge is designed to IT-BCP countermeasures. automatically revert to the original synchronized environment once testing is complete, by undo- PlateSpin Forge was designed from the outset to ing the configuration changes used for testing. allow failover testing to be performed simply, and This means peace of mind, no matter how many without interrupting operations. It is capable of changes or deletions you make to the system or running a test environment internally that has no data in the test environment. impact on the production environment, enabling failover testing to be performed simply by select- 64 guide book Test environment Standby ser ver Merits and demerits of IT-BCP solutions Test environment setup in Forge The cost of business continuity PlateSpin Forge Approaches to business continuity PlateSpin Forge PlateSpin Forge as IT-BCP solution PlateSpin Forge usage overview PlateSpin Forge GUI overview 65 chapter 05 PlateSpin Forge usage overview Failover Now let us look at how PlateSpin Forge can be involves selecting the workload to place in failover used in the event of an actual emergency to pro- mode from the PlateSpin Forge dashboard, and vide a failover service. pressing the execute button next to the desired recovery point. Doing so boots up the backup Under normal operation, users access specific workload within PlateSpin Forge, and recovers the servers and make use of business applications. For server state from the selected recovery point. servers subject to the backup process, these applications are running within workloads, and PlateSpin If settings for switching the IP address of the Forge draws down data on a day-to-day or hour-to- booted workload and the IP address of the offline hour basis for incremental backup of workloads in server have been configured in the Active Direc- accordance with a predefined schedule. tory or DNS server in advance, users will be able to access the server as soon as PlateSpin Forge has Let us assume that a server breakdown occurs. booted the workload, providing a temporary provi- Straight away the server connection enabling users sional recovery state. to access the server is cut, as is the connection to PlateSpin Forge. The server has lost its online status However, as this failover state operates on a and further backup is no longer possible. Users who workload within PlateSpin Forge, performance will have lost access will be aware that some issue has not be at the same level as the production environ- arisen on the server or network, but before they ment. Users should be aware that this state offers a are able to deliver this message to their manager, fallback level of service. PlateSpin Forge will promptly detect the breakdown and deliver an email to the email address of the relevant manager. The manager, having received the email, can then access PlateSpin Forge from a device such as a smartphone or notebook, and initiate the recovery process for business continuity. Specifically, this 66 guide book Normal operation Production PlateSpin Forge The cost of business continuity � Approaches to business continuity PlateSpin Forge Users In event of breakdown Offline PlateSpin Forge Forge sends breakdown alert or cellphone/tabletnotice, instruction to switch over is provided using cellphone/tablet � Forge's internal standby server boots and continues operations PlateSpin Forge PlateSpin Forge Dashboard PlateSpin Forge usage overview � PlateSpin Forge as IT-BCP solution Users Merits and demerits of IT-BCP solutions � Offline PlateSpin Forge PlateSpin Forge GUI overview Users 67 chapter 05 PlateSpin Forge usage overview Failback While PlateSpin Forge can be used to provide a Once the new full recovery environment has been failover service, the environment offers only a cer- deployed, manual adjustments can be made to tain fallback performance level, and does not reach make it a virtualized platform, if so desired. Next, the stage of complete recovery. Therefore, let us the failback target is selected, and settings similar now look at the failback process for moving work- to the initial replication are configured for failback, loads back to a full recovery environment, once the such as compression level and transfer method. original infrastructure has been repaired or rebuilt. When failback is executed on the configured system, failback is performed to provide a system While business continues in a provisional and data environment with a 100% recovery level, recovery state under PlateSpin Forge, the system achieving full recovery. manager will have the damaged server repaired or, if it cannot be repaired, will arrange for it to Once full recovery has been achieved, PlateSpin be replaced. At this replacement stage, it may be Forge replication can be once again set to execute, unavoidable that the same hardware as was used to provide an IT-BCP solution for the now recovered in the previous model cannot be ensured. It may environment. This is possible because PlateSpin also be the case that the business environment has Forge is a dual functionality appliance, providing a altered and a different virtualized platform is now running environment for provisional recovery, as sought for full recovery. well as a day-to-day backup function. PlateSpin Forge is loaded with functionality to allow recovery of workloads to physical servers from various vendors and to different virtualization platforms. PlateSpin Forge has tens of thousands of drivers available for operating systems and virtualized platforms, and is preloaded with the ability to automatically switch drivers to suit any new hardware on the restoration target. PlateSpin Forge accommodates for the differences between physical environments and virtual environments seamlessly. 68 guide book Forge synchronizes with original server PlateSpin Forge Original Server The cost of business continuity � Approaches to business continuity PlateSpin Forge Users Switch back to original server occurs once synchronization is complete PlateSpin Forge Original Server Merits and demerits of IT-BCP solutions � PlateSpin Forge as IT-BCP solution Users PlateSpin Forge Original Server PlateSpin Forge usage overview New Server No need to worry about original server spec! vendors and using different hyper visors. The drivers in Forge / Protect are installed automatically to enable data migration. PlateSpin Forge GUI overview - Data can be migrated between different machine specs NetIQ's conversion technology allows recover y to machines from different 69 chapter 06 70 PlateSpin Forge GUI overview Approaches to business continuity The cost of business continuity Merits and demerits of IT-BCP solutions Addition of a workload……………………………………………………………… 72 Initial configuration, including scheduling…………………………………………… 74 Configuration for failover and failover testing……………………………………… 76 Failback procedure under PlateSpin Forge…………………………………………… 80 PlateSpin Forge as IT-BCP solution Failover procedure under PlateSpin Forge…………………………………………… 78 PlateSpin Forge usage overview PlateSpin Forge GUI overview 71 chapter 06 PlateSpin Forge GUI overview Addition of workload A key feature of PlateSpin Forge is its easy to use WebUI. It provides a data protection and business continuity system that is easy for managers to navigate. Here we will look at the actual WebUI and introduce some of the configuration processes. 01 Firstly, when you open up the PlateSpin Forge web interface you are presented with the “Dashboard” screen. To the left of the screen is the PlateSpin Forge usage summary, which will show the number of configurable workloads. 72 guide book Workloads are added by selecting “Add workload” at the bottom of the left-hand panel, then selecting the IP address of the workload to be protected and the workload type according to the operating system it is running, and then clicking “Execute”. The cost of business continuity 02 Approaches to business continuity PlateSpin Forge Merits and demerits of IT-BCP solutions PlateSpin Forge as IT-BCP solution 03 Addition of the workload is now complete. Note that as the initial full replication has yet to complete, the workload status will show as “Unprotected”. PlateSpin Forge usage overview PlateSpin Forge GUI overview 73 chapter 06 PlateSpin Forge GUI overview Initial configuration, including scheduling Now that a new workload has been added let us look at the settings for the initial synchronization. Here we can set the synchronization timing and method. 01 02 74 Firstly, full replication must be selected on the options screen for the initial replication method. Next come the settings for “Tiers” or synchronization groups. The protection tier can be set to a specific recurrence such as “Weekly”, “Nightly”, or “Hourly”, or a “Custom” option can be selected. In addition to the incremental replication schedule, compression level and bandwidth throttling can be specified. The section “Recovery points to keep” allows version management to be configured for backups as required, and is capable of managing up to 32 recovery points. guide book Now that the tier is defined, we can move on to replication settings. A range of settings is available, including transfer method settings such as file or block level replication, data encryption, the network to be used during the transfer, volumes to be protected, and services to be suspended during replication. The cost of business continuity 03 Approaches to business continuity PlateSpin Forge Merits and demerits of IT-BCP solutions PlateSpin Forge as IT-BCP solution PlateSpin Forge usage overview PlateSpin Forge GUI overview 75 chapter 06 PlateSpin Forge GUI overview Configuration for failover and failover testing With routine replication settings configured, the next task is to make further configurations to ensure smooth failover in the event of an emergency. As soon as the failover settings are complete, protection of the workload added earlier is initiated. 04 05 76 Failover settings specify the amount of memory to allocate to the virtual machine running on PlateSpin Forge, as well as other settings such as changes to the domain or workgroup following failover, and network settings to be used when failover is executed. This is where details should be entered to ensure the IP address is changed to match that of the backup site network. Similar settings should be configured for failover testing. The details of these settings can be the same as those for failover, or they can be different settings altogether. Once configuration is complete, clicking on the “Save and prepare” button at the bottom of the screen prepares the workload for replication. guide book Next, the workload to be replicated should be selected followed by a click on the "Run Replication" button. The cost of business continuity 06 Approaches to business continuity PlateSpin Forge Merits and demerits of IT-BCP solutions 07 PlateSpin Forge as IT-BCP solution While replication is executing, a detailed progress report is displayed for each step. When replication is complete, the word “Completed” is displayed in the status bar. Now, initial replication of the designated workload is complete, and incremental replication will be performed according to the designated schedule. PlateSpin Forge usage overview PlateSpin Forge GUI overview 77 chapter 06 PlateSpin Forge GUI overview Failover procedure under PlateSpin Forge Having already looked at the initial settings screen, we shall now discuss the procedure for putting a workload on PlateSpin Forge into failover mode in the event of an emergency. Because detailed failover settings have been configured in the initial setup, the failover procedure is as simple as selecting the workload for failover and clicking “Run Failover”. 01 02 78 As soon as a problem arises with a protected server, simply click the workload to be restored and press the “Run Failover” button. If recovery point management is enabled, simply select which recovery point to be used for failover and press “Execute”. guide book A status report can be checked while failover is running. Clicking on the phrase “Running Failover” on the “Workloads” screen will display the following screen. The cost of business continuity 03 Approaches to business continuity PlateSpin Forge Merits and demerits of IT-BCP solutions Once the status shows as "Completed", the failover process is completed. The workload status will show as “Live”. PlateSpin Forge as IT-BCP solution 04 PlateSpin Forge usage overview PlateSpin Forge GUI overview 79 chapter 06 PlateSpin Forge GUI overview Failback procedure under PlateSpin Forge After failover is complete, and once an environment for full recovery has been prepared, it is time to configure settings for failback, to restore the workload to the new environment. A 100% restoration is achievable by configuring this process in the same way as the initial replication and failover. 01 02 Firstly, select “Failback” from the “Workloads” tab. Next, follow the prompts to configure failback target settings such as the failover workload settings and replication method. Once the failback process is prepared, press the “Failback” button. Here we have the same set of configuration fields for failback as we had for replication, such as data transfer method and compression level. 80 guide book As part of the failback configuration, settings can be configured to once again protect the workload using PlateSpin Forge after the failback is done. Selecting Save and failback will execute failback. The cost of business continuity 03 Approaches to business continuity PlateSpin Forge Merits and demerits of IT-BCP solutions The status will show as “Completed” to indicate that failback has completed without problems. The status of the workload displayed by PlateSpin Forge will change to “Ready For Reprotect”. PlateSpin Forge as IT-BCP solution 04 PlateSpin Forge usage overview PlateSpin Forge GUI overview 81 IT Business Continuity Planning and Management with PlateSpin Forge April 2015 Issued by: NetIQ •About PlateSpin Forge Product URL: https://www.netiq.com/products/forge/ •About other PlateSpin products PlateSpin Recon Product URL: https://www.netiq.com/products/recon/ PlateSpin Migrate Product URL: https://www.netiq.com/products/migrate/ PlateSpin Protect Product URL: https://www.netiq.com/products/protect/ Copyright 2015 NetIQ Authors: Toshiki Iida and Ichiro Kai