Virtualization of EPOCH IPS

Transcription

Virtualization of EPOCH IPS
Virtualization of EPOCH IPS
April, 2016
Tatsuya Watanabe / Assistant Manager
System Engineering Team
SKY Perfect JSAT Corporation (SJC)
0
Agenda
1. Overview of SJC and YSCC
2. Current status of virtualization in SJC
3. Function of VMware and how SJC uses it
4. Points of concern about virtualization
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
1
Overview of SJC and YSCC
2
Satellite Fleet
* Horizons-1 & 2 are Jointly owned by
JSAT and Intelsat.
JCSAT-4A
(82E)
JCSAT-85**
(Intelsat-15)
(85E)
** JSAT owns 5 Ku-band
transponders on IS-15
Horizons-1*
(127W)
GEO
36,000km
3rd Party
Customer
Satellite
Horizons-2*
(84.8E)
N-SAT-110
(Backup)
JCSAT-1B
(150E)
JCSAT-RA
(Backup)
JCSAT-5A
(132E)
*** JCSAT-110R is Jointly owned by
JSAT and B-SAT.
Superbird-B3
Superbird-C2
(144E)
Superbird-B2
(162E)
Superbird-A3
(158E)
JCSAT-2A
(154E)
JCSAT-110R***
(110E)
JCSAT-15
JCSAT-14
JCSAT-16
JCSAT-4B
(124E)
JCSAT-3A
(128E)
N-STAR c
(136E) 3rd Party Customer
Satellite(143E)
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
3
Satellite Operations Experience
As of 2014/12
’89
J C S A T -1
’90
’91
’92
’93
’94
’95
’96
’97
’98
’99
’00
’01
’02
’03
’04
’05
’06
SSL
SSL
J C S A T -3
’12
’13
’14
D e - o rb it
D e - o rb it
B o e in g
S u p e rb ird -A 3
B o e in g
B o e in g
J C S A T -1 B
J C S A T -4 A
B o e in g
B o e in g
S u p e rb ird -B 2
NTT
N -S T A R a
N -S T A R b
NTT
SSL
D e - o rb it
D e - o rb it
SSL
LockheedL Martin
社製
M
N -S A T -1 1 0
Boeing
社製
B o e in
g
J C S A T -2 A
N -S T A R c
Orbital社製
O r b ita l
NTT
Boeing
社製
B o e in
g
H o riz o n s-1
D e - o rb it
Space System
S S LLoral 社製
3 rd P a rty S a t- A
J C S A T -5 A
Lockheed
社製
L Martin
M
Lockheed
社製
L Martin
M
J C S A T -3 A
J C S A T -1 1 0 R
’11
D e - o rb it
B o e in g
J C S A T -R
J C S A T -8 5
’10
D e - o rb it
S u p e rb ird -A
J C S A T -R A
’09
D e - o rb it
B o e in g
S u p e rb ird -B
S u p e rb ird -C 2
’08
D e - o rb it
B o e in g
J C S A T -2
H o riz o n s-2
’07
Orbital
社製
O r b ita
l
Boeing 社製
Lockheed Martin 社製
Space Systems Loral 社製
Orbital 社製
三菱電機
社製
Melco
NEC 社製
NEC
三菱電機製
M e l co
LM
O r b ita l
LM
LM
J C S A T -4 B
3 rd P a rty S a t- B
N EC
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
4
SJC Ground Stations
Gunma Satellite Control Station (GSCS)
Sub Ground Station
Yamaguchi Network Control Center(SPW)
Sub Ground Station
Ibaraki Network Control Center (SPE)
Backup Satellite Control Center and Teleport
Yokohama Satellite Control Center (YSCC)
Primary Satellite Control Center and Teleport Facilities
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
5
YSCC Organization
Satellite Planning &
Operations Group
Satellite Operations
Division
Service Operations
Division
Noriko Masuda
Hideo Hayashi
Satellite Control
Team
Yoji Sato
System Engineering
Team
Hiroaki Nagai
Satellite Engineering
Team
Azuma Miyata
Network Operations
Team
Satelite Facilities
Management &
Administration Div.
Tetsuya Segawa
Team-1
Ground Systems Procurement &
Maintenance
Teleport Operations
Team-1
Teleport Operations
Team-2
Team-2
General affairs & administrations in
YSCC
Team-3
Orbital Dynamics
Team
General affairs & administrations
in SPE
Hideshi Chazono
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
6
Current status of virtualization
in SJC
7
Reasons to use virtualization

SJC will launch 4 satellites within 2 years.
 work on reducing the operational load in several ways
 One of the way is virtualization of EPOCH IPS

Virtualization refers to the creation of a virtual machine that acts like a
real computer with an operating system(#).
Traditional Architecture
Virtual Architecture
(#)Wikipedia: https://en.wikipedia.org/wiki/Virtualization
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
8
Reasons to use virtualization
SJC decided
to use VMware as a virtualization software for EPOCH IPS.
VMware
has a lot of merits as follows:
Cost effective
Resource Optimization, Saving Rack space, Consumed Power
Functions to Reduce Operational Load
vMotion, Snapshot, cloning, HA, Backup, Converter, and so on.
Last year,
SJC verified following points:
Essential functions of VMware work as expected
EPOCH IPS on Virtual Server is working just like on a Physical Server
9
The relation between virtualization and increase of EPOCH IPS


EPOCH IPS constitutes an increasingly large portion of the SJC.
The more the percentage of EPOCH IPS is increasing, the more we’ll see
the effect of virtualization of EPOCH IPS go up.
Bus Type / Operations Software
Bus
Type
Operations
software
Boeing
601
A2100
(5 satellites)
Coming Soon
StarBus2
DS2000
SMACS
(1 satellites)
(1 satellites)
(1 satellites)
(5 satellites)
TCR/STA
New
Satellites
(4 satellites)
EPOCH v4 EPOCH v4
OASYS
OASYS
BirdStar
ORAMS
SMACS
EPOCH v4
OASYS
Consider the way to change to EPOCH IPS
Plan to change to EPOCH v4 & OASYS
without extra cost (because these are old
so it’s difficult to secure a budget
Copyright for
© 2016 now)
Sky Perfect JSAT Corporation. All Rights Reserved
10
Current situation and virtualization plan in the future


SJC has virtualized only part of EPOCH IPS servers.
SJC will continue to virtualize other servers and to investigate the
appropriate method to virtualize workstation.
Percentage of EPOCH IPS servers and workstations in each site
Already virtualized
Servers for existing satellites in YSCC
Servers for upcoming satellites in YSCC
Servers in Sub-site
Workstations in YSCC
Virtualization of workstation is
under investigation.
Workstations in Sub-site
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
11
Functions of VMware and
how SJC uses it
12
Physical Configuration Overview
VMware
is Installed on both of 2 dell servers
Virtual Machines (VM) are saved to a Shared Disk.
One of 2 VMware ESXis loads and executes VM onto its memory
Virtual Machines (VM)
EPOCH SV
Compass
REDHAT
Win Server
VMware ESXi
VMware ESXi
Dell Server
Dell L2SW
NAS for backup VMs
EPOCH SV
Compass
REDHAT
Win Server
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
Dell Shared Storage
for saving VMs
13
Saving Rack Space of Physical Configuration


Lots of VM are consolidated and working on these machines.
Hardware failure has a significant impact on VM, so it’s necessary to
make each Machine as a redundant configuration.
2 Dell servers
1
equipped with 2 (6 core,HT)
CPUs and 192GB Memory each
2
3
4
2 Dell Layer-2 Switches (back)
5
6
NAS
7
8
equipped with 4 7,200 RPM
SATA HDDs
9
Dell Shared Storage
10
equipped with 24 15,000 RPM
SAS HDDs
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
14
Basic License Type of VMware

So far, SJC is using Essentials Plus License of VMware that is reasonable
but has many functional restrictions.

This License enables us to:
 Use basic functions such as vMotion, Snapshot, clone, HA, and VDP

This License doesn’t enable us to:
 Scale-out more than 3 servers
 Use advanced functions such as FT, DRS, Storage vMotion, App-HA,
and so on
VMware
ESXi
VMware
ESXi
VMware
ESXi
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
VMware
ESXi
15
vMotion

vMotion enables running VM to move into another server with zero
downtime

SJC confirmed that EPOCH server can finish its vMotion process without
disconnecting from DSS.
DSS
Telemetry
①Stop
EPOCH SV
REDHAT
EPOCH SV
REDHAT
VMware ESXi
①Activate
VMware ESXi
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
16
Snapshot

Snapshot preserves the state of the VM at a specific point in time.

When you revert to a snapshot, you discard all changes made to the VM
since you took the snapshot.
①Take a snapshot
⑤return to Stable state
Stable state
②Apply Patch
③become Unstable state
④Back to the previous state
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
17
Clone

Clone is a copy of an existing virtual machine.

If you want to make a copy of a virtual machine for separate use, create
a clone.
Clone
EPOCH SV
REDHAT
①Cloning
EPOCH SV
REDHAT
VMware ESXi

If you want to save the current state of the virtual machine, so you can
revert to that state in case you make a mistake, take a snapshot.
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
18
VMware HA

Failover function when Physical or Virtual Machine fails.
1. Virtual Machine Failover
②Reboot VM on the same host
①failure
VM
monitor
VMware ESXi #1
VMware ESXi #2
2. Host Machine Failover
②Stop all powered-on VMs
VM
①failure
③VMs start
VM
VM
VM
monitor
VMware ESXi #1
VMware ESXi #2
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
19
VMware Data Protection (VDP)

VDP is an agent-less, image-level VM backup tool

Saving a lot of backups causes unstable state of VDP, so SJC set up
backup schedule of VDP as following
 Once
a day for 7 days, Once a week for 4 weeks, Once a month for 12 months
VDP
VMware ESXi
VMware ESXi
Dell Server
Dell L2SW
Backup
EPOCH SV
REDHAT
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
Dell Shared Storage
20
vCenter Converter




It’s hard to rebuilt already running EPOCH IPS servers as a VM.
vCenter Converter enables us to convert EPOCH IPS into VM easily.
SJC conducted load tests of the converted EPOCH server on VM, and
confirmed it works fine.
SJC has already been operating the converted EPOCH Server as a VM.
vCenter Converter
Control
Control
EPOCH SV
EPOCH SV
EPOCH CL
REDHAT
REDHAT
Windows7
Copy
EPOCH CL
Target Physical Servers
VMware ESXi 5.5
Windows7
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
Destination
21
Other features of VMs

VM is just a file, so it’s easy to move VM between a production
environment and a test environment.
Test environment
EPOCH SV
EPOCH SV
REDHAT
REDHAT
VMware Player
copy
EPOCH SV
REDHAT
Cloning
VMware Player
copy
EPOCH SV
EPOCH SV
REDHAT
REDHAT
VMware ESXi (Production environment)
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
22
Points of concern
about virtualization
23
Points of resource allocation

It’s necessary to decide resource allocation size of each VM.
 Resources are such as number of CPU cores, Memory size, and HDD size

If the resource allocation size is too much, and the total of all the VM’s
actual resource consumption exceeds physical server’s resource size, all
unprotected VMs become unstable.

On the other hand, if the resource allocation size is too little, it affects the
performance.

To prevent these situations, it’s necessary for important VMs such as
EPOCH server to investigate and to reserve (a function to assure allocated
resource) the minimum requirement of resource size.
24
Resource allocation size of each EPOCH IPS server in SJC

SJC conducted load tests and observed the resource utilization of each
EPOCH IPS server.
Specific resource allocation size of each EPOCH IPS server in SJC (#)
EPOCH server
Task
for 1 telemetry Initiator
Compass
Server
CPU core 2~
2~
2~
Memory
8GB~
4GB~
4GB~
HDD
100GB~
30GB~
30GB~

These specs are smaller than on Physical servers, yet they’re working fine.

The appropriate size depends on the use case scenario.
(#) The memory and HDD sizes will greatly differ according to whether or not we use the memory over commit and the thin provisioning functions
(SJC tries not to use these functions)
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
25
Failure detecting and resource monitoring

Hardware failure and resource shortage on the Physical Machine have a
significant impact on all virtual machines.

SJC built an NMS server to detect Hardware, VMware Hypervisor,
vCenter process failure and to monitor resource utilization.
vCenter
NMS
OS
Network Management System
VMware ESXi
iDrac
SNMP Polling & VMware API
Dell Server
Dell L2SW
SNMP Polling & SNMP Trap
Dell Shared Storage
SNMP Trap
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
26
Conclusion
27
Advantages and Disadvantages of VMware


There was a lot of hard work involved in our first introduction of
virtualization of EPOCH IPS as follows

to consider hardware specs, redundancy design, and migration plan

to prepare the way of failure detecting and resource monitoring (#)
However, VMware functions provide the benefit of increasing the
efficiency of the system operation.
(#) Preparing fault isolation methods is indispensable. If something happened to a VM, He that is unfamiliar with virtualization may doubt VMware.
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
28
Thank you.
Please feel free to contact me;
[email protected]
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
29
Appendix
30
Total optimization in the next stage

SJC is working on virtualizes only servers in main site for now, however
it’s necessary to consider multi-site design. For example,
 SJC needs sufficient number of client only in case of emergency in
other site.
 There is some possibility to make unused licenses that can be used
for a purpose of replace TCR/STA to EPOCH IPS.
Other Site (Sub-site)
Main Site
Server
Server
REDHAT
REDHAT
Client
Windows
Copy VM by using
VMware or Storage
replication
Client
Most part of the clients are
deactivate in normal time
and activate in case of
emergency
Windows
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
31
Ideal design of HA configuration

It may be possible to monitor the EPOCH server on the VM with higher
class license and custom settings of VMware.

This way is smart because it’s not necessary to prepare EPOCH Server on
the each VMware ESXi, but we have to consider the failover downtime,
reconnection with client / baseband, and get over some hurdles as
mentioned above.
①failure
EPOCH Server
Processes
monitor
②reboot Application
Redhat
monitor
VMware ESXi #1
monitor
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
VMware ESXi #2
32
VMware HA in SJC

If the application's process finishes, we want VMware to reboot the
application to recover. However, due to the licensing constraints and
other problems, this is not realized for now.

SJC prepares EPOCH server on the each VMware ESXi for redundancy.
BB
Telemetry
EPOCH Server
Process
Not monitor
EPOCH Server
Process
Redhat
monitor
VMware ESXi #1
Not monitor
monitor
Redhat
monitor
VMware ESXi #2
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
33
Problem that we have to solve

SJC needs the way to confirm whether or not EPOCH IPS server process is
alive without monitoring GUI. As one of the methods, we can use “ps aux
| grep epoch”, but it’s an uncertain way.

Does anyone know any monitoring functions of the EPOCH server like the
“statusEpoch” command?
NMS
monitor
EPOCH Server
Process
Not monitor
EPOCH Server
Process
Redhat
monitor
VMware ESXi #1
Not monitor
monitor
Redhat
monitor
VMware ESXi #2
Copyright © 2016 Sky Perfect JSAT Corporation. All Rights Reserved
34