(SANs) with Brocade Fabric Switches

Transcription

(SANs) with Brocade Fabric Switches
1 YEAR UPGRADE
BUYER PROTECTION PLAN
Building
SANs
Brocade
with
Fa b r i c S w i t c h e s
How to Design, Implement, and Maintain Storage Area
Networks (SANs) with Brocade Fabric Switches
• Step-by-step instructions for establishing your SAN requirements—such as high
availability, performance, and cost savings—and translating those requirements
into an effective SAN design
• Detailed examples to guide you through the process of installing and troubleshooting your Brocade SAN
• Practical discussions about SAN components and popular SAN configurations
such as storage consolidation, disaster tolerance, and LAN-free backup
Chris Beauchamp Author
Josh Judd Author
Benjamin Kuo Contributor
140_SANs_FM
8/14/01
3:40 PM
Page i
[email protected]
With more than 1,500,000 copies of our MCSE, MCSD, CompTIA, and Cisco
study guides in print, we continue to look for ways we can better serve the
information needs of our readers. One way we do that is by listening.
Readers like yourself have been telling us they want an Internet-based service that would extend and enhance the value of our books. Based on
reader feedback and our own strategic plan, we have created a Web site
that we hope will exceed your expectations.
[email protected] is an interactive treasure trove of useful information focusing on our book topics and related technologies. The site
offers the following features:
■
One-year warranty against content obsolescence due to vendor
product upgrades. You can access online updates for any affected
chapters.
■
“Ask the Author”™ customer query forms that enable you to post
questions to our authors and editors.
■
Exclusive monthly mailings in which our experts provide answers to
reader queries and clear explanations of complex material.
■
Regularly updated links to sites specially selected by our editors for
readers desiring additional reliable information on key topics.
Best of all, the book you’re now holding is your key to this amazing site.
Just go to www.syngress.com/solutions, and keep this book handy when
you register to verify your purchase.
Thank you for giving us the opportunity to serve your needs. And be sure
to let us know if there’s anything else we can do to help you get the
maximum value from your investment. We’re listening.
www.syngress.com/solutions
140_SANs_FM
8/14/01
3:40 PM
Page ii
140_SANs_FM
8/14/01
3:41 PM
Page iii
1 YEAR UPGRADE
BUYER PROTECTION PLAN
Building
SANs
Brocade
with
Fa b r i c S w i t c h e s
Chris Beauchamp Author
Josh Judd Author
Benjamin Kuo Contributor
140_SANs_FM
8/14/01
3:41 PM
Page iv
Syngress Publishing, Inc., the author(s), and any person or firm involved in the writing, editing, or
production (collectively “Makers”) of this book (“the Work”) do not guarantee or warrant the results to
be obtained from the Work.
There is no guarantee of any kind, expressed or implied, regarding the Work or its contents.The Work is
sold AS IS and WITHOUT WARRANTY.You may have other legal rights, which vary from state to state.
In no event will Makers be liable to you for damages, including any loss of profits, lost savings, or other
incidental or consequential damages arising out from the Work or its contents. Because some states do not
allow the exclusion or limitation of liability for consequential or incidental damages, the above limitation
may not apply to you.
You should always use reasonable case, including backup and other appropriate precautions, when working
with computers, networks, data, and files.
Syngress Media®, Syngress®, and “Career Advancement Through Skill Enhancement®,”are registered trademarks of Syngress Media, Inc. “Ask the Author™,”“Ask the Author UPDATE™,”“Mission Critical™,”
“Hack Proofing™,” and “The Only Way to Stop a Hacker is to Think Like One™” are trademarks of
Syngress Publishing, Inc. Brands and product names mentioned in this book are trademarks or service marks
of their respective companies.“Brocade®,” “SilkWorm®,” and the Brocade logo are registerd trademarks of
Brocade Communications Systems, Inc., in the United States and/or any other countries.
KEY
001
002
003
004
005
006
007
008
009
010
SERIAL NUMBER
Q3G4T9U2F5
6YHQ94MLE4
VMERKJ6C4N
XD7Y4B39UN
8SRT9U6N7H
3W7YRNTEP4
LHB65TR46T
4DB9R5LZMR
N835M4KBAZ
QT6Y4RTWFC
PUBLISHED BY
Syngress Publishing, Inc.
800 Hingham Street
Rockland, MA 02370
Building SANs with Brocade Fabric Switches
Copyright © 2001 by Syngress Publishing, Inc. All rights reserved. Printed in the United States of
America. Except as permitted under the Copyright Act of 1976, no part of this publication may be reproduced or distributed in any form or by any means, or stored in a database or retrieval system, without the
prior written permission of the publisher, with the exception that the program listings may be entered,
stored, and executed in a computer system, but they may not be reproduced for publication.
Printed in the United States of America
1 2 3 4 5 6 7 8 9 0
ISBN: 1-928994-30-X
Technical Editors: Chris Beauchamp,
Josh Judd, Benjamin Kuo
Acquisitions Editor: Catherine B. Nolan
Developmental Editor: Kate Glennon
Copy Editor: Beth A. Roberts
Freelance Editorial Manager: Maribeth Corona-Evans
Cover Designer: Michael Kavish
Page Layout and Art by: Shannon Tozier
Indexer: Jennifer Coker
140_SANs_FM
8/14/01
3:41 PM
Page v
Syngress Acknowledgments
We would like to acknowledge the following people for their kindness and support
in making this book possible.
Greg Reyes, Jack Cuthbert, Doug Wesolek, Maggie Conroy, Julie Chiu, Elaine Tite,
Jeff Seltzer, and Chris Mingrone at Brocade, for championing the idea of a Brocade
SANs book. Also special thanks to Viet Dao, John Gareri, Mark Murphy, Jay Rafati,
Ron Totah, Ezio Valdevit, John Bae, James Carpignano, Steve Daheb, Derek Granath,
Jay Kidd, Omy Shani, James Bleess, Owen Higginson, Leo Kappeler, Chris M.
Nguyen, Mark Peluso, and Henry Robinson for their help in making this book a
reality.
Ralph Troupe of Callisma for his invaluable insight and guidance. Ralph’s expertise in
SAN architecture and design solutions for next-generation storage networking
implementations helped define our vision for this book.
Richard Kristof and Duncan Anderson of Global Knowledge, for their generous
access to the IT industry’s best courses, instructors, and training facilities.
Karen Cross, Lance Tilford, Meaghan Cunningham, Kim Wylie, Harry Kirchner,
Kevin Votel, Kent Anderson, and Frida Yara of Publishers Group West for sharing
their incredible marketing experience and expertise.
Mary Ging, Caroline Hird, Simon Beale, Caroline Wheeler,Victoria Fuller, Jonathan
Bunkell, and Klaus Beran of Harcourt International for making certain that our
vision remains worldwide in scope.
Anneke Baeten and Annabel Dent of Harcourt Australia for all their help.
David Buckland,Wendi Wong, Daniel Loh, Marie Chieng, Lucy Chong, Leslie Lim,
Audrey Gan, and Joseph Chan of Transquest Publishers for the enthusiasm with
which they receive our books.
Kwon Sung June at Acorn Publishing for his support.
Ethan Atkin at Cranbury International for his help in expanding the
Syngress program.
v
140_SANs_FM
8/14/01
3:41 PM
Page vi
This book was designed and written to provide information about storage area networking architectures. Every effort has been made to make this book as complete
and accurate as possible. However, the information in this book is provided to you
“AS IS,” without warranty of any kind, including, without limitation, any implied
warranty of merchantability or fitness for a particular purpose.
The authors and Brocade Communications Systems, Inc., shall have no liability or
responsibility to any person or entity with respect to any loss, cost, liability, or damages arising from the information contained in this book or the computer programs
that accompany it, and specifically disclaim any implied.
vi
140_SANs_FM
8/14/01
3:41 PM
Page vii
Brocade Acknowledgments
This book truly represents a complete Brocade team effort.We would like to
acknowledge several people in particular.Without their help, dedication, and knowledge, this book would not have been possible.The thorough technical review by Viet
Dao, John Gareri, Mark Murphy, Jay Rafati, Ron Totah, and Ezio Valdevit shaped our
manuscripts into a book that Brocade can be proud of. John Bae, James Carpignano,
Steve Daheb, Derek Granath, Jay Kidd, and Omy Shani provided several timely contributions to the content.We also incorporated material written by others within
Brocade: James Bleess, Owen Higginson, Leo Kappeler, Chris M. Nguyen, Mark
Peluso, and Henry Robinson.We would also like to thank Maggie Conroy and Doug
Wesolek for their guidance and help with the publication process.
—Josh Judd and Chris Beauchamp
vii
140_SANs_FM
8/14/01
3:41 PM
Page viii
140_SANs_FM
8/14/01
3:41 PM
Page ix
Authors
Chris Beauchamp is a Senior SAN Architect for Brocade
Communications Systems, Inc. Chris moved to Brocade in 1998 as a
Systems Engineer supporting several strategic customers with the application and qualification of SilkWorm fabric switches. Chris now focuses on
SAN design and architecture, with an emphasis on scalability and troubleshooting. His specialties include Sun servers, storage performance analysis and capacity planning, Fibre Channel trace analysis, scripting in
various languages, and SAN administration. Chris holds a Master of
Science in Computer Engineering from Villanova University and a
Bachelor of Science in Computer Science degree from West Chester
University. Chris’s background includes positions as a Systems
Administrator at General Electric and a Systems Engineer at Sun
Microsystems. Chris currently resides outside of San Jose in the Santa
Cruz Mountains with his wife Sarah and daughter Meagan.
Josh Judd is a Senior SAN Architect with Brocade Communications
Systems, Inc. In addition to writing technical literature, he provides
senior-level strategic support for major OEMs and end-users of Brocade
storage network products worldwide.When he first went to work for
Brocade, he was the company’s Senior IT Specialist, responsible for escalations in every area of the company’s network, server, and desktop infrastructure. Josh’s career as an IT consultant has given him a diverse skill set,
which includes senior-level expertise in several UNIX variants,Windows
9x/NT/2k administration, RAID configuration and optimization, storage
virtualization and clustering software (such as that produced by VERITAS
Software), and network engineering with many vendors, including Cisco,
Foundry, Lucent, and 3com. Before joining Brocade four years ago, Josh
worked at IBM Global Services, LSI Logic, and Taos Mountain
Consulting. He lives in San Jose, California.
ix
140_SANs_FM
8/14/01
3:41 PM
Page x
140_SANs_FM
8/14/01
3:41 PM
Page xi
Special Contributor
Benjamin F. Kuo is a Software Development Manager at TROIKA
Networks. Headquartered in Westlake Village, CA,TROIKA Networks is
a developer of Fibre Channel Host Bus Adapters, dynamic multipathing,
and management software for Storage Area Networks. Ben manages
development of network management software at TROIKA and is also
active as the chair of the HBA API subgroup of the Storage Networking
Industry Association (SNIA), where he spearheads efforts to develop
interoperable standards in storage networking. Ben’s background includes
positions at Paracel Inc. (now Celera Genomics), IBM, Micropolis, and
Echelon Corp. Ben also runs socalTECH.com, a Web site and daily
newsletter focused on high tech in Southern California. Ben holds a
Bachelor’s degree in Electrical Engineering from the University of
Southern California and is a member of the IEEE. Ben lives in
Moorpark, California, with his wife Jennifer and son Jonathan.
Contributor
Alex Neefus is the Lead Interoperability Test Engineer at Lamprey
Networks, Inc. Lamprey Networks offers certification testing services and
test tool development to the Fibre Channel industry. Alex has worked on
developing testing tools for the SANmark program hosted by the FCIA.
This program certifies Fibre Channel devices for conformance and interoperability. Alex has also co-authored and written a number of conformance test suites for both the FCIA and the University of New
Hampshire Interoperability Lab. Alex’s background also includes working
for the UNH Interoperability Lab in the Fibre Channel Consortium for
over a year and a half. At the lab his primary work is in developing procedures and tools for testing Fibre Channel products, and working with
members of the industry to solve interoperability problems with devices
on the market. Alex resides in Durham, New Hampshire.
xi
140_SANs_FM
8/14/01
3:41 PM
Page xii
140_SANs_toc
8/14/01
3:45 PM
Page xiii
Contents
Foreword
Learn When to
Deploy a SAN
Things to consider when
deciding whether a SAN is
the right solution:
■ The primary
application that
needs to be solved
■ Speed, bandwidth,
and distance
requirements
■ The amount of
data sharing or
consolidation needed
■ The cost of the SAN
infrastructure
required, such as
switches, cables,
and HBAs
Chapter 1 Introduction to SANs
Introduction
Overview of SANs
Taming the Storage Monster
Benefits of Building a SAN
Ensuring High Availability
Consolidating Storage
Reducing Network Congestion from Backup
Accelerating Backup Cycles
Speeding Up Data Access
Increasing Server Cycles
Ensuring Disaster Tolerance
When to Deploy a SAN
Designing Around the Application
Assessing Speed, Bandwidth, and
Distance Requirements
Data Sharing and Consolidation Needs
Resource Sharing
Volume-Level Sharing
File-Level Sharing
Steps to a Successful SAN Deployment
Summary
Solutions Fast Track
Frequently Asked Questions
xxv
1
2
2
8
10
10
11
13
14
14
14
15
16
16
17
18
19
19
19
20
25
26
28
xiii
140_SANs_toc
xiv
8/14/01
3:45 PM
Page xiv
Contents
Master Fabric Services
Fabric services provide
information to nodes in a
switched fabric topology.
Services can be distributed
across all switches,
creating the appearance
of single-service type
servers. In this chapter, we
discuss a number of
different fabric services,
including:
Login Server
Name Server
Fabric/Switch
Controller
Management Server
Time Server
Chapter 2 Fibre Channel Basics
Introduction
The Architecture of SANs
Fibre Channel Protocol
Classes of Service
Storage Network Topologies
Fabric Services
Fibre Channel Protocol Basics
Fibre Channel Levels
ULPs
Classes of Service
Class 1
Class 2
Class 3
Class 4
Class F
Storage Network Topologies
Point-to-Point Topology
Fibre Channel Arbitrated Loop (FC-AL)
Topology
Switched Fabric Topology
Fabric Services
Login Server
Name Server
Fabric/Switch Controller
Management Server
Time Server
Other Services
Summary
Solutions Fast Track
Frequently Asked Questions
Chapter 3 SAN Components
and Equipment
Introduction
Overview of Fibre Channel Equipment
Cabling and Media
29
30
30
37
37
37
38
38
40
42
43
43
44
44
44
45
45
45
47
48
49
50
50
51
51
52
52
53
54
57
59
60
61
61
140_SANs_toc
8/14/01
3:45 PM
Page xv
Contents
Understand Fibre
Channel Equipment
WARNING
Any single-mode or
multimode laser
can damage your
eyes if it is transmitted at 1300 nm.
The 1300 nm wavelength is not in the
visible spectrum, so
you will not see a
laser being transmitted like in 850 nm
fiber. A 1300 nm
laser is dangerous,
because it can cause
severe retina
damage.
GBICs and Connectors
Hubs
Switches
Storage
Host Bus Adapters
Routers and Bridges
Cabling and GBICs
Copper Versus Optical: Selecting Your Media
Copper Cabling
Multimode Optical Cabling
Single-Mode Optical Cabling
Connecting with Connectors
The DB-9 Copper Connector
The HSSDC Copper Connector
The SC Optical Connector
High-Density Fiber-Optic Connectors
Comparing GBICs to Fixed Media
Using a GBIC
Pros and Cons of Using GBICs
GBIC Ports on Equipment
Serialized Versus Nonserialized
Common Problems with GBICs
Media Interface Adapters
Using Hubs
Simple Electrical Hubs
Managed Hubs
LIP Service: Fibre Channel LIPs,
Problems, and Solutions
Getting Out of the Loop: Migrating
to Switched Fabric
Using Switches and Fibre Channel Fabrics
Basic Switch Types
Entry-Level Switches
Scalable Fabric Switches
Core Fabric Switches
xv
61
63
63
64
64
64
65
65
65
66
68
69
69
70
71
72
73
73
74
74
74
75
75
76
76
76
78
79
80
80
81
81
81
140_SANs_toc
xvi
8/14/01
3:45 PM
Page xvi
Contents
Features of Fibre Channel Switches
Zoning
Classes of Service
Fabric Services
Redundancy
Buffer Credits per Port
Self-Configuring Ports
Auto-Negotiating Speeds
IP over Fibre Channel Broadcasting
Firmware Upgrade Methods
Loop Operation: Making Your Switch
Act Like a Hub
FSPF Compliance
Management Interfaces
Serial Port
Telnet
SNMP
Web-Based Management
Application-Based Management
SCSI Enclosure Services
Connecting Your Servers with Host
Bus Adapters
Connecting Hosts to the Fabric
HBA Types
Speeds
Ports
Combination Adapters
Fabric-Capable Versus Loop Adapters
HBA-Based LUN Masking
Persistent Binding
Default LUN Access Permissions
Upper-Level Protocol Access Permissions
Dynamic Versus Static Discovery
Configuration Management Software
HBA API Support
Remote Boot across the SAN
Hot-Plug Support
82
83
84
85
86
86
87
88
88
89
90
90
91
91
91
91
93
94
94
95
95
95
97
98
98
98
99
99
100
100
101
101
101
103
104
140_SANs_toc
8/14/01
3:45 PM
Page xvii
Contents
Connecting Legacy Devices into Your SAN
Basic Features of Routers
Number of SCSI Buses
Types of SCSI Ports,Termination
Selective LUN Presentation
Extended Copy Support
Management Interfaces
Bridging and Routing to IP Networks
and Beyond
Fibre Channel to DWDM
Fibre Channel across IP Networks
IP over Fibre Channel to Gigabit Ethernet
Fibre Channel Storage
Individual Disk Drives and JBODs
High-End Storage Arrays
Selective LUN Presentation
LUN Export across Multiple Ports
Snapshot Backup Volumes
Summary
Solutions Fast Track
Frequently Asked Questions
Simplify SAN fabric
management with
Brocade WEB TOOLS
Brocade WEB TOOLS is a
software utility that
enables you to manage
and monitor your fabric
through a Web browser
interface and Java plug-in.
Using WEB TOOLS, you
can view all switches in
the SAN from a single
interface from any
workstation in your
enterprise—even at a
remote location.
Chapter 4 Overview of Brocade
SilkWorm Switches and Features
Introduction
Selecting the Right Switch
Entry-Level Switches
SilkWorm 2010 (8 Ports) and
2210 (16 Ports)
SilkWorm 2040 (8 Ports) and
2240 (16 Ports)
SilkWorm 2050 (8 Ports) and
2250 (16 Ports)
Scalable Fabric Switches
SilkWorm 2400
SilkWorm 2800
xvii
106
106
107
108
108
108
109
109
109
110
110
111
111
113
113
113
114
115
116
121
123
124
124
126
127
127
128
128
129
129
140_SANs_toc
xviii
8/14/01
3:45 PM
Page xviii
Contents
SilkWorm 6400 Integrated Fabric
SilkWorm 12000 Core Fabric Switch
Understanding the Brocade Fabric OS
Fabric OS Core Functions
Fibre Channel Services for Reconfiguration
Dynamic Routing Services
Facilities for End-to-End SAN Management
Brocade Command Line Interface
Using Optional Brocade Features
Brocade Zoning
Extended Fabrics
Fabric Watch
Understanding Loop Support, QuickLoop,
and Fabric Assist
Brocade WEB TOOLS
Future Capabilities in the Brocade Intelligent
Fabric Services Architecture
Brocade ISL Trunking
Brocade Frame Filtering
More Robust Hardware-Enforced
Zoning
Enhanced End-to-End Performance
Analysis
Secure Fabric OS
Summary
Solutions Fast Track
Frequently Asked Questions
Chapter 5 The SAN Design Process
Introduction
Looking at the Overall Lifecycle of a SAN
Data Collection
Data Analysis
Architecture Development
Prototype and Testing
Transition
Release to Production
130
131
132
133
133
134
135
135
135
136
136
138
138
139
140
140
142
142
143
143
144
144
146
149
150
151
153
153
153
153
154
154
140_SANs_toc
8/14/01
3:45 PM
Page xix
Contents
Master the seven
phases of the SAN
design lifecycle:
1. Data Collection
2. Data Analysis
3. Architecture
Development
4. Prototype and Test
5. Transition
6. Release to Production
7. Maintenance
Maintenance
Conducting Data Collection
Creating an Interview Plan
Conducting the Interviews
What Overall Business Problem Are
You Trying to Solve?
What Are the Business Requirements
of the Solution?
What Is Known about the Nodes that
Will Attach to the SAN?
Which SAN-Enabled Applications
Do You Have in Mind?
Which Components of the Solution
Already Exist?
Which Components Are Already
in Production?
Which Elements of the Solution
Need to Be Prototyped and Tested?
What Equipment Will Be Available
for Testing?
How and When Are Backups to
Be Done?
What Will Be the Traffic Patterns in
the Solution?
What Do We Know about Current
Performance Characteristics?
What Do We Know about
Future Performance Characteristics?
How Much Downtime Is Acceptable to
Production Components During
Implementation?
How Much Downtime Is Acceptable
for Routine Maintenance? How Much
Downtime Is Acceptable for Upgrades
and Architectural Changes?
When Do You Need Each Piece
of the Solution to Be Complete?
xix
155
156
156
157
158
158
160
165
165
166
166
166
167
167
168
172
174
174
175
140_SANs_toc
xx
8/14/01
3:45 PM
Page xx
Contents
Answer Your
Questions about SAN
Applications and
Configurations
Q: I would like to cluster
my databases for
better performance.
What databases can
I use?
A: Most major databases
now support fabric
switch-based
clustering, including
Oracle Parallel Server,
IBM DB2 Parallel
Edition, and Microsoft
SQL Server.
Q: I would like to have my
Exchange Mail Server
highly available. What
should I do?
A: Brocade has developed
HA solutions for the
Exchange Server that
can be used in setting
up your desired SAN
configuration.
Summary List of Questions
Conduct a Physical Assessment
Analyzing the Collected Data
Processing What You Have Collected
Establishing Port Requirements
Simple Case
Moderate Case
Complex Case
Preparing an ROI Analysis
The Return On Investment Proposition
The Rest of the Process and the
Repetition of the Cycle
Summary
Solutions Fast Track
Frequently Asked Questions
Chapter 6 SAN Applications
and Configurations
Introduction
Configuring a High-Availability Cluster
Typical HA Application or Database Server
Microsoft Cluster Server
Using a SAN for Storage Consolidation
Shared Storage Using a Web Farm
Storage Partitioning Using Switch Zoning
Switch Zoning Configuration for
Departmental SANs
Storage Partitioning Using Storage LUN
Masking
Storage Partitioning Using HBA LUN
Masking
Partitioning with Software
LAN-Free Backup Configuration
SAN Server-Free Backup
SAN-Based Third-Party Copy Data Movers
Making Your Enterprise Disaster Tolerant
Data Replication and Remote Backup
176
176
177
177
182
183
185
186
187
188
190
191
192
193
195
196
196
198
200
203
206
208
208
210
210
211
212
213
215
216
218
140_SANs_toc
8/14/01
3:45 PM
Page xxi
Contents
Metropolitan Area Network Solutions
Summary
Solutions Fast Track
Frequently Asked Questions
Develop a SAN
Architecture
xxi
219
222
222
226
Chapter 7 Developing a SAN Architecture
Introduction
Identifying Fabric Topologies and SAN
Architectures
Useful Topologies
Scalability
Cascade Topology
Ring Topology
Mesh Topologies
Core/Edge or Star Topologies
Topologies at a Glance
Complex Topologies
Working with the Core/Edge Topology
Scaling without Downtime
Adding an Edge Switch
Upgrading the Core
Determining Levels of Availability
Configuring Traffic Patterns
Leveraging Tiers
Exploiting Locality
Using Any-to-Any Connectivity
Evaluating Performance Considerations
When Is Over-Subscription Bad?
Considerations Outside the Fabric
Summary
Solutions Fast Track
Frequently Asked Questions
227
228
Chapter 8 SAN Troubleshooting
Introduction
The Troubleshooting Approach:The SAN
Is a Virtual Cable
277
278
229
235
236
236
237
238
242
246
246
246
248
248
250
256
261
261
266
268
269
270
270
272
273
275
278
140_SANs_toc
xxii
8/14/01
3:45 PM
Page xxii
Contents
SAN Troubleshooting
When you start the
troubleshooting process,
determine whether the
issue is fabric-related or
device-related. A fabricrelated issue impacts
many devices, while a
device-related issue affects
only a few devices.
A Typical Scenario: “I Cannot See My Disks”
Where to Start and What Data to Gather
Take a Snapshot: Describe the
Problem and Gather Information
Troubleshooting Tools
Using the Switch LEDs
Switch Diagnostics
Helpful Commands
SAN Profile
What Data Can a Host Provide?
When to Use portLog and Other
Advanced Tools
Troubleshooting the Fabric
What to Look for in a Malfunctioning Fabric
Host Behavior
SAN Profile
Switch LEDs
The errShow Command
The switchShow Command
The topologyShow Command
The nsShow and nsAllShow Commands
Now that You Suspect a SAN Issue:
Digging Deeper
Timeout of Edge Devices During
Fabric Bring Up
Port Configuration Conflict or
Missing Fabric License
Segmented Fabrics
Troubleshooting Devices that Cannot Be Seen
What to Look for in the Fabric
Are the Host and Storage Visible via
switchShow on Their Respective Switches?
Do the Devices Show Up in the Name
Server?
Rule Out Zoning Issues
Edge Device Not in the Name Server
279
283
284
287
287
289
290
308
312
314
316
317
317
317
318
318
318
320
320
321
321
322
323
327
329
329
332
333
334
140_SANs_toc
8/14/01
3:45 PM
Page xxiii
Contents
Troubleshooting Marginal Links
Marginal Point-to-Point/Fabric Device Links
Marginal Loop Connections
Nx_Port (Host/Storage) Behavior with
a Marginal Port in the Loop
Marginal GBIC/Cable
Connected Device
Fault Isolation
How the Switch Can Help: Fabric Watch
and QuickLoop Zoning
Overview of SilkWorm Port Error Statistics
Troubleshooting I/O Pauses
Summary
Solutions Fast Track
Frequently Asked Questions
Chapter 9 SAN Implementation,
Maintenance, and Management
Introduction
Installation Considerations
Use licenseShow to
How to Cable Your SAN for Ease of
Determine What
Operation
Licenses Are Installed
Racking Considerations
on Your Switch
In-Band or Out-of-Band Management?
IPFC In-Band Guidelines
core1:admin> licenseShow
Setting Switch Parameters
SRzy9Sz9zeTS0zAG:
What Fabric OS Version Should I Use?
Licenses
Web license
Automating
Switch Administration Activities
bbSz9eQb9zccT0AQ:
Fabric OS APIs
Zoning license
Expect Scripting
RdzdSRcSyzSe0eTn:
A Switch Management Wrapper
QuickLoop license
Using Expect
Brocade Zoning Considerations
cSczRScd9RdTd0SY:
Where to Zone?
Fabric license
Hard Zoning or Soft Zoning?
xxiii
335
335
337
338
338
339
339
339
341
342
344
345
347
349
350
351
351
354
356
357
358
361
366
367
367
369
369
372
373
375
140_SANs_toc
xxiv
8/14/01
3:45 PM
Page xxiv
Contents
Hard Zoning and Soft Zoning
Differences
Zone Management
Scripting Zoning Operations
Zoning Tips
Validating Your Fabric
Baseline Your SAN Profile
Fault Injection
Running an I/O Load
Types of Load
I/O Generators
SAN Maintenance
The Configuration Log: Key Information
to Gather and Maintain about Your SAN
Backing Up and Restoring a Switch
Configuration
Bringing Up a Fabric
Expanding a Fabric: Merging Fabrics, Adding
a Switch, or Replacing a Switch
Upgrading Your Fabric
Issues Applicable to Both Hot and
Cold Upgrades
Performing a Hot Fabric Upgrade
Performing a Cold Fabric Upgrade
How to Automate firmwareDownload
Replacing or Adding an Edge Device in
the Fabric
Summary
Solutions Fast Track
Frequently Asked Questions
378
378
379
381
382
382
384
385
386
387
391
391
393
394
395
398
398
399
400
400
401
403
405
408
Appendix Building SANs with
Brocade Fabric Switches Fast Track
409
Index
431
140_SANs_fore
8/14/01
3:44 PM
Page xxv
Foreword
Why Write a Book about SANs?
During the last few years, Storage Area Networks (SANs) have fundamentally changed
the way organizations design, build, and manage their enterprise networks. As a superior alternative to direct-attached storage models, SANs have enabled a wide range of
new configurations and applications. In turn, those applications have generated a
variety of benefits for the organizations that have implemented them.These advantages include superior scalability, simplified storage management, optimized resource
sharing, higher availability, and tremendous cost savings to name just a few.
As a primary facilitator of the networked storage model, Brocade
Communications Systems actively seeks out new opportunities to raise industry
awareness about the value of SANs. One of our primary goals at Brocade is to help
educate all kinds of organizations about the advantages a networked storage environment can offer.
Based on feedback from our customers and business partners, we realized that
there was no self-contained, effective guidebook for implementing Fibre Channel
SANs.To help fill that void, we have joined with Syngress Publishing to bring you
Building SANs with Brocade Fabric Switches. This book details the design, installation,
configuration, and troubleshooting of Brocade-based SANs —basically everything
you need to know before beginning your own SAN implementation.
Who Should Read This Book?
Building SANs with Brocade Fabric Switches is written for anyone who plans to design,
build, and manage SANs using Brocade switches and software. In particular, this book
provides a “how to” reference that describes what you can do today, given the technologies currently available.
xxv
140_SANs_fore
xxvi
8/14/01
3:44 PM
Page xxvi
Foreword
By necessity, the focus is Brocade-centric and features the theory of operation
behind Brocade SilkWorm switches and Fabric OS. However, this book is not
intended to be a comprehensive guide for every configuration and scenario possible.
After all, with the rapid expansion of the SAN marketplace, there will undoubtedly
be other technologies available in the not-so-distant future.
What Does the Book Contain?
In addition to providing an overview of current technology, tools, products, and
design topologies, this book should serve as a guideline for actual SAN implementation. For instance, the book begins with a detailed analysis of technology requirements and the benefits of implementing a SAN. Next, you can learn about Fibre
Channel concepts and definitions as well as the full range of SAN components.
We then introduce you to the Brocade SilkWorm series of Fibre Channel
switches, including guidelines for integrating these switches into your existing IT
environment.The book concludes with examples of design processes, popular SAN
applications, and detailed troubleshooting and maintenance tips. In addition, each
chapter features a high-level summary and FAQs for anyone who needs a quick
overview of the SAN basics.
Our goal is to make this book a valuable tool for implementing your own
SAN infrastructure and teach how a well-designed SAN can deliver a
competitive advantage for your organization.We welcome your feedback on our
efforts. If you have any comments or suggestions about this book, please let us know
at www.syngress.com/solutions.
—Kumar Malavalli
Vice President,Technology
Brocade Communications Systems
www.syngress.com
140_SANs_01
8/14/01
1:32 PM
Page 1
Chapter 1
Introduction
to SANs
Solutions in this chapter:
■
Overview of SANs
■
Taming the Storage Monster
■
Benefits of Building a SAN
■
When to Deploy a SAN
■
Steps to a Successful SAN Deployment
; Summary
; Solutions Fast Track
; Frequently Asked Questions
1
140_SANs_01
2
8/14/01
1:32 PM
Page 2
Chapter 1 • Introduction to SANs
Introduction
In the early 1980s, direct-attach disk storage through interconnects such as Small
Computer Systems Interface (SCSI) was the standard way to connect to data.This
worked well for the amount of data typically handled at the time, and became
the standard way to connect high-speed, high-performance storage to computer
systems.
However, as computer systems increased in speed and data storage needs
increased, the parallel bus architecture of SCSI soon started hitting performance
and distance limits. In response to these needs, Fibre Channel was developed to
provide gigabit-speed serial networking capabilities for storage. Fibre Channel
includes support for the SCSI protocols for storage, the Internet Protocol (IP) for
networking, and the Virtual Interface (VI) protocol for clustering, which are
mapped onto a network architecture.The Fibre Channel standard combines long
distances of up to 10 km, simplified serial cabling over multiple media types,
gigabit speeds, and the ability to simultaneously use more than a single protocol
over the same wire.These features won adoption for Fibre Channel throughout
the 1990s as a replacement for parallel SCSI, and Fibre Channel is now used for
most high-capacity, high-end direct storage devices.
With the advent and market acceptance of Fibre Channel as a point-to-point
replacement for parallel bus SCSI, a new technique has emerged that combines
pure storage usage with networking—the Fibre Channel Storage Area Network
(SAN). A SAN is a network of storage and system components, all communicating on a Fibre Channel network, that can be used to consolidate and share
storage, provide high-performance links to data devices, add redundant links to
storage systems, speed up data backup, and support high-availability clustered
systems.
This chapter provides an overview of what a SAN is, what types of problems
are best solved with a SAN, and some steps to make a SAN deployment successful. After reading this chapter, you should be able to determine if you should
deploy a SAN, identify the types of applications best solved by SAN technology,
and be ready to create a deployment plan for your SAN.
Overview of SANs
Throughout the 1980s, the standard way of connecting hosts to storage devices
was point-to-point, direct-attach storage through interfaces such as Integrated
Drive Electronics (IDE) and parallel SCSI (Figure 1.1). Parallel SCSI offered
140_SANs_01
8/14/01
1:32 PM
Page 3
Introduction to SANs • Chapter 1
relatively fast (5 or 10 Mbit/sec) access to SCSI-enabled disks, and several disks
could be connected at once to the same computer through the same interface.
This worked well for the time, with fairly reliable, fast-speed connections
allowing administrators to connect internal and external storage through just
simple ribbon cabling or multiconductor external cables. However, as storage
subsystems became larger and larger and computers faster and faster, a new
problem emerged—external storage (which at one time was just a simple disk
drive on the desk next to a machine) started to get bigger.Tape libraries,
Redundant Array of Inexpensive Disks (RAID) arrays, and other SCSI devices
began to require more and more space—requiring the parallel SCSI connection
to be stretched farther and farther away from the host. Input/Output (I/O) rates
also increased, pushing on the physics of keeping signal integrity in a large bundle
of wires (32- and 64-bit data bus widths). Simple parallel SCSI variants were
devised to enable longer distances and to address the signal integrity issues.
However, they all eventually ran up against the difficulties of high-speed signals
across the parallel SCSI bus architecture.
Figure 1.1 Parallel SCSI Bus Connection
SCSI
ID 1
SCSI
ID 2
SCSI
ID 3
Host
Parallel SCSI Bus
The solution to all of this was slow in coming, but eventually the storage
industry settled on using a serial protocol with high-speed transceivers—offering
good noise immunity, ease of cabling, and plentiful bandwidth. Different specifications (Serial Storage Architecture [SSA] and Fibre Channel as well as more
advanced parallel SCSI technologies) competed for adoption, and companies
began experimenting with different serial communications media. New highspeed circuits made serial transfers (using a simple pair of wires to transmit bits
serially, in order, rather than a large number of wires to transfer several bytes or
3
140_SANs_01
4
8/14/01
1:32 PM
Page 4
Chapter 1 • Introduction to SANs
words of data at a time) the most practical solution to the signal problems.The
high speed of the circuits enabled the data rates for Fibre Channel to offer up to
100 Mbit/sec transfers, versus the slower 10 to 20 Mbit/sec parallel limitations.
When Fibre Channel was first applied to the area of storage connections, the
primary reason for the technology was for the extended distances and simplified
cabling that the technology offered.This extension of direct-attach operation
basically replaced the old parallel SCSI attachments with a high-speed serial line
(Figure 1.2).The new Fibre Channel connections offered a much faster interface
and simplified cabling (four copper wire connections through DB-9 connectors,
as well as optical cabling), and could be used to distribute storage as far as 10 km
away from a host computer, or 30 km away with optical extenders.
Figure 1.2 Using Fibre Channel to Extend Distances from Storage
Host
Storage Array
Fibre Channel Link
Up to 10 km
The connections to disks at this time began using the Fibre Channel
Arbitrated Loop (FC-AL) protocol, which enabled disks to negotiate their
addresses and traffic on a loop topology with a host (Figure 1.3). Because of the
combined ability to easily cable and distribute storage, users were now able to
add separate racks of equipment to attach to hosts. A new component, the Fibre
Channel hub, began to be used to make it easier to plug in devices.The hub, a
purely electrical piece of equipment that simply connected pieces of a Fibre
Channel loop together, made it possible to dynamically add and remove storage
from the network without requiring a complete reconfiguration. As these components began to be used in increasingly more complex environments, manufacturers began to add “intelligence” to these Fibre Channel hubs, enabling them to
independently deal with such issues as failures in the network and noise in the
network from loops being added and removed. An alternative to the hub came in
the form of the Fibre Channel switch, which, unlike a hub, was not just connecting pieces of a loop, but instead offered the packet-switching ability of
traditional switches.
140_SANs_01
8/14/01
1:32 PM
Page 5
Introduction to SANs • Chapter 1
Figure 1.3 Arbitrated Loop Disk Configuration Attached to a Single Host
Disk
Disk
Disk
Fibre Channel
Arbitrated Loop
Host
Disk
Disk
Disk
Because there was now a Fibre Channel network available, other hosts (not
storage) were added to take advantage of the same network.With the addition
of SAN-aware software, it was suddenly possible to share storage between two
different devices on the network. Storage sharing was the first realization of the
modern SAN, with companies in the multimedia and video production areas
paving the way by using the Fibre Channel network to share enormous data
files between workstations, distribute jobs for rendering, and make fully digital
production possible (Figure 1.4).
The next big step in Fibre Channel evolution came with the increased reliability and manageability of a Fibre Channel switched fabric. Early implementations
of FC-AL were sometimes difficult to manage, unstable, and prone to interoperability problems between components. Because the FC-AL protocol was quite
complex, what sometimes would happen would be an inability for anything to
communicate and stay operational on a loop.The solution to this was a move to a
switched fabric architecture, which not only enhanced the manageability and reliability
of the connection, but provided switched, high-speed connections between all
nodes of a network instead of a shared loop. As a result, each port on a switch now
provides a full 1 Gbit/sec of available bandwidth rather than just a portion of the
total 1 Gbit/sec of bandwidth shared between all the devices connected to the
loop. Fabrics now make up the majority of Fibre Channel installations. A typical
Fibre Channel switched fabric installation (Figure 1.5) has multiple hosts and
storage units all connected into the same Fibre Channel network cloud through
one or more Fibre Channel switches.
5
140_SANs_01
6
8/14/01
1:32 PM
Page 6
Chapter 1 • Introduction to SANs
Figure 1.4 Multiple Host Arbitrated Loop for Storage Sharing
Disk
Disk
RAID
Fibre Channel
Arbitrated Loop
Host
Disk
Host
Host
Figure 1.5 Switched Fabric, Multiple Host, and Storage Unit Configuration
JBOD
JBOD
RAID
Fibre Channel Switch
Tape
JBOD
Host
Host
Host
140_SANs_01
8/14/01
1:32 PM
Page 7
Introduction to SANs • Chapter 1
Today, the modern SAN looks much like any other modern computer network. Network infrastructures such as switches, hubs, bridges, and routers help
transport frame-level information across the network. Network interface cards
interface computer systems to the same network (called HBAs in the SAN world,
as they replaced SCSI Host Bus Adapters). Figure 1.6 shows an example of how
these components could be used in conjunction with Fibre Channel switches.
Figure 1.6 Typical Deployed SAN Configuration with Multiple Hosts, Storage,
and Tape Devices
Legacy
Parallel
SCSI
Storage
RAID
Array
Web Server
Database Server
Fibre Channel-toSCSI Router
HBA
Remote
SAN
Fibre Channel Switch
HBA
Fibre Channel Switch
Fibre Channel Switch
ISL
(Inter-Switch
Link)
Fibre Channel-toDWDM Bridge
HBA
Fibre Channel
Cloud
Host
Fibre Channel Switch
Fibre Channel Hub
Storage
Array
HBA
Fibre Channel-toSCSI Router
Host
Tape Array
JBOD
7
140_SANs_01
8
8/14/01
1:32 PM
Page 8
Chapter 1 • Introduction to SANs
Resources for SAN Information
Rather than relying on just the equipment vendor, an effective way to
learn and become an expert on the technology is to track the industry and
attend conferences, meetings, and tutorial sessions about the subject.
Additional resources for learning more about SAN technology are
the industry organizations devoted to this area. The Storage Networking
Industry Association (SNIA) offers white papers and educational
resources, holds technical tutorial sessions and Storage Networking
World conferences, and supports both the users and vendors involved in
the storage networking field. More information can be found at
www.snia.org. The Fibre Channel Industry Association (FCIA) provides
resources for users and vendors, conducts the SANmark suite of Fibre
Channel interoperability tests, and holds conferences and meetings to
help promote Fibre Channel technology. Their site can be found at
www.fibrechannel.org.
Taming the Storage Monster
The advent of SANs has been driven by today’s insatiable appetite for storage.
The Internet, e-mail, multimedia, and the increasing digital nature of society
have resulted in an ever-increasing demand for ways to store, retrieve, and back
up that data.
For example, e-mail has been on a staggering growth path in the last few
years, as more and more people have gone online and businesses have made
e-mail a critical part of their communications infrastructure. According to the
Year-End 2000 Mailbox Report, there are over 891 million e-mail mailboxes now
in existence. Corporate mail usage grew 34 percent in 2000, bringing with it a
huge increase in the need for data storage to save all of that e-mail. Multimedia
attachments, the movement of business processes to e-mail, and just the sheer
volume of e-mail have made the storage and backup of e-mail one of the most
pressing requirements of IT departments.
140_SANs_01
8/14/01
1:32 PM
Page 9
Introduction to SANs • Chapter 1
The Internet has also affected the need for storage, with increasing numbers
of Web servers and storage required to support those Web servers. As information
is increasingly digitized and published on the Web, there is an insatiable appetite
for storage to contain that information. Music and full-motion video, even with
compression, take an immense amount of disk space, and the movement of studios and companies to run a “full digital” shop has resulted in an enormous
demand for storage capacity. Databases, which used to be considered big if they
were gigabytes in size, are now well beyond a terabyte—with companies talking
about eventually having to manage petabytes of database storage.
In addition, with caching servers,Web load balancing, and Web farms built to
distribute the processing load for Web traffic, the data being presented on Web
sites has to be duplicated 10, 20, and even 100 times to serve those distributed
hosts with information.With the increased connectivity of the Internet, information and content are being generated and distributed faster than ever before in
history—so much, in fact, that the University of California at Berkeley recently
released a study that claims that more data will be created in the next two years
than was produced in the history of mankind.
All of this data has to go somewhere, and it has exceeded the space available
and beyond what can practically be managed on local, direct-attached storage to
hosts. Because local storage is relatively fixed and difficult to expand, and because
its local nature is difficult to manage, organizations have started to look for a
better way to manage this data.The solution has come in the form of very large
storage arrays, capable of storing terabytes and terabytes of data, and farms of
inexpensive Just A Bunch Of Disks (JBOD). All of this needs to be connected,
and the logical way to connect high-speed, block-oriented traffic is through a
Fibre Channel SAN. Increased manageability, the ability to centrally manage
storage, and consolidation of storage space have made the SAN a necessity in any
growing enterprise.
Data growth is increasing at such a rapid pace that IBM recently reported
that storage sales now exceed server sales at a 70:30 ratio.The requirements to
store data are increasing at a greater rate than the requirement for CPU cycles,
and the entire industry is changing as a result.This gain has meant that data is
now managed separately from the machines that consume that data, making
SANs an ideal choice to break the dependency of hosts from the storage, and
increasing the manageability and usability of a corporation’s investment in data
storage.
9
140_SANs_01
10
8/14/01
1:32 PM
Page 10
Chapter 1 • Introduction to SANs
Implementing a SAN is an ideal technique for taming the storage requirement monster that has resulted from the growth of the Internet and increased
connectivity of our electronic age.
Benefits of Building a SAN
A number of practical, real-world uses for SANs have emerged in recent years.
Knowledgeable administrators have figured out the types of problems that SAN
technology best solves. SANs are typically used for the most business-critical,
technically challenging problems a company faces. Critical, high-availability systems used for e-mail, database, and file servers have been the first to switch to
SANs. A need to consolidate storage and centrally manage volumes has resulted
in a trend toward using SANs for storage consolidation.With the increase in data
growth, backups have also become a problem, with companies looking to accelerate backup cycles. Protocols such as IP available on Fibre Channel also make
SANs attractive for some general networking applications, and VI clustering support allows installations to leverage their SAN infrastructure for VI-enabled clustering applications. Finally, the distance capabilities of Fibre Channel and bridges
to Metropolitan Area Networks (MANs) and even Wide Area Networks (WANs)
have enabled a new level of disaster tolerance for storage resources.
Ensuring High Availability
As the Internet and digital data have grown exponentially,Web caching techniques,Web load balancers and distributed server clusters, and other techniques
have been used to handle the demands of serving up Web requests for static
pages. Images, files, and Web pages that do not change often can be copied across
a bank of hosts, all of which can service a request from a user.
However, these techniques cannot be applied in many critical applications.
For example, an e-mail server requires one single, consistent image for e-mail
storage. Back-end databases of e-commerce applications require combining live,
real-time inventory data with live pricing data to service requests correctly. None
of these can be cached across a Web server due to the real-time, non-cacheable
nature of the information.This dependency on a consistent, single image of data
cannot be solved by just replicating data or sharing across a cluster.The result is a
new, critical point of failure in the e-mail server or database. Especially with the
growth in data, more and more vital data is being trusted to those single points of
140_SANs_01
8/14/01
1:32 PM
Page 11
Introduction to SANs • Chapter 1
failure, raising the stakes and potential losses if those services go down.The concern over these critical points of failure has resulted in a renewed focus on highly
available (fault tolerant) solutions, particularly in the storage area. In combination
with failover software packages such as Microsoft Cluster Server or VERITAS
Cluster Server, high-availability hardware and software has come to the forefront
in ensuring the performance and availability of these critical systems.
For example, one area where the use of SANs is ideal has been the use of
high-availability solutions for managing and running very large Microsoft
Exchange databases.With the immense increase in data stored in Exchange
servers all over the world, there has been an increase in the amount of back-end
storage required for serving those Exchange installations. Because of the nondistributed nature of Exchange mail databases, there has been a concentration of
data tied to single hosts and storage units—a single point of failure that could
cripple many businesses.The natural solution has been to use application clustering techniques combined with a robust, fully redundant high-availability SAN
to support those clusters and share redundant storage between hosts.
High-availability systems are now regularly used for ensuring fault-tolerant
access to storage. A focus on eliminating single points of failure has stimulated
demand for fault-tolerant equipment configurations, specific fault-tolerant network equipment, and techniques for ensuring high availability. SAN technology is
ideal for these types of solutions. It allows host-to-host connectivity for heartbeat,
equipment status, and network communications, as well as for sharing critical
storage between alternate and backup servers.
The availability of SAN connections has solved one of the big problems with
high-availability, clustered installations: access to the same data across a network.
In combination with high-availability features in storage arrays and other equipment, the SAN allows for multiple redundant paths to be made from multiple
redundant hosts, dramatically increasing the reliability of critical systems. In addition, with flexible SAN interconnections, the large amount of data that needs to
be accessed can more easily be managed separately, rather than being captive to a
potential failure in a host.
Consolidating Storage
As data needs have increased, it has become increasingly difficult to manage the
hundreds of hosts and local disks attached to those hosts. In order to manage this
growth, administrators have begun to centralize their storage resources. Large
storage arrays and pooled storage are much more efficient and infinitely more
manageable than local storage.
11
140_SANs_01
12
8/14/01
1:32 PM
Page 12
Chapter 1 • Introduction to SANs
As storage needs have increased, the model of attaching local storage to hosts
has broken down. Administrators figured out that, even though a company as a
whole might own enough storage for all of its needs, that storage was not necessarily in the right place. For example, a Web server might be running out of space,
with no more space available on local disks and not enough SCSI connections to
add more external storage, while the database server next to it has gigabytes free.
In the old model of local storage, there was no way to take advantage of that fact.
You ended up purchasing much more storage than you needed, because you had a
very low rate of utilization—yet you never had enough capacity.You also ended
up purchasing more servers than you needed, because you did not need more
CPU cycles, but rather, more storage slots.
With the advent of the Fibre Channel network, the ability of both clients and
storage to coexist and share storage has spawned a new crop of solutions that take
advantage of that sharing. Sharing of storage, which previously was limited to vertical markets such as video editing and multimedia, has become a general technique
used anywhere that storage is more easily managed in a pool, such as in Internet
Service Provider (ISP) and Application Service Provider (ASP) installations. Indeed,
most corporate IT environments can take advantage of this technique.
Through software such as VERITAS Volume Manager,Tivoli SANergy, and
DataCore SANsymphony, users are now able to allocate and share storage among
multiple hosts.
By using the SAN infrastructure, large centralized pools of disks can be
divided between hosts, and new volumes allocated as needed from the general
pool.This results in a huge increase in efficiency in use of storage, eliminating the
pools of expensive, local, unusable storage. Instead, one large, easily managed virtual storage pool can be centrally administered, and storage costs and administration centralized and consolidated.
Sharing is accomplished through this high-level software, which discovers and
manages all of the storage on the network. Drivers and software in the host manage
which machines do and do not get access to a specific part of a storage device. In
general, a central administrator is able to allocate arbitrary pieces of storage to
specific hosts, all while the network and all hosts are running in real time.
A typical example of this is an ISP with a large number of user Web homepage accounts. Extensive pools of clustered and independent Web servers help to
ease the traffic load and provide redundancy on the Internet, while being tied into
a single- or dual-redundant SAN. Storage allocation and sharing software is run on
all of these hosts, and the different Web homepage accounts are allocated to different Web servers.When a failure on a host or storage device occurs, either an
140_SANs_01
8/14/01
1:32 PM
Page 13
Introduction to SANs • Chapter 1
automated process or manual intervention will re-allocate those user volumes to
another Web server, or fail over to another storage device, resulting in uninterrupted service and no dependency of specific users on a local disk. In some cases,
multiple Web servers can access the same, read-only data on the SAN, providing a
high-bandwidth pipe and eliminating the need for expensive, redundant copies of
the same data.
Reducing Network Congestion from Backup
A typical problem any administrator faces is that of data backup. Because of the
huge growth in data, even on local disks, and the increasing criticality of the data
stored on networks, backup has become very important. Software packages such
as VERITAS NetBackup, Legato NetWorker, and other packages have long relied
on agents that transport data over IP connections to a central backup host.The
result has been a noticeable slowdown due to the vast amount of data being
transported across these IP packets over Ethernet connections—and not just late
at night.The backup window for many enterprises has extended from overnight
to include hours of peak system operation, simply because there is too much data
to fit into the more traditional and convenient backup windows.
Anecdotal stories from system administrators illustrate how entire corporate
networks have become swamped with daily backups over IP, slowing not only
e-mail, but critical file servers, print servers, and Web access. Some shops have
gone as far as to install separate, high-speed Ethernet networks in an attempt to
offload this problem.
SANs lend themselves to several techniques that directly help the backup
problem. One of these techniques is the use of IP over Fibre Channel to offload
the network congestion on the Ethernet network. IP, when transported over
Fibre Channel, is identical in form and function to IP over Ethernet and other
networks.Taking advantage of the fact that there are already Fibre Channel
connections into a network for access to shared data, administrators have installed
IP drivers into their servers and entirely offloaded the backup function onto the
Fibre Channel network.This frees the corporate Ethernet from the immense job
of transporting IP backup traffic, and takes advantage of the increased bandwidth
efficiency that is characteristic of Fibre Channel. Due to the connection-oriented
protocols built on Fibre Channel, IP traffic impacts the Fibre Channel network
less and helps administrators gain better usage out of their networks. In addition,
an increasing number of applications can perform shared backups over a SAN
using the backup devices’ native SCSI protocol, which greatly increases the
efficiency of the backup process.
13
140_SANs_01
14
8/14/01
1:32 PM
Page 14
Chapter 1 • Introduction to SANs
Accelerating Backup Cycles
Another reason for SAN implementation, which attacks the problem of overall
backup cycles, has been the development of the technique of third-party copy.
Taking the advantages of the Fibre Channel network a step further, specialized
hardware called data movers work in conjunction with next-generation backup
software to skip the IP transport of backup data entirely; they directly move
storage that needs to be backed up from storage devices on the network to tape
backup devices on the same network. Because the transfer is direct, it is very fast
(no copy to server memory), and drastically reduces the CPU processing power
needed for backup.The time spent copying data to and from storage to local host
memory frees up valuable CPU cycles for something else: for example, running
the applications that the host was installed to run. Companies such as Chaparral
Network Storage and Crossroads Systems have been developing these third-party
copy devices as part of their Fibre Channel-to-SCSI bridges, in conjunction with
different backup vendors, who are now able to move data across a Fibre Channel
network without the intervention of hosts.
Speeding Up Data Access
The keyword to SANs is speed, speed, and more speed. As a block-level protocol,
SCSI over Fibre Channel Protocol (FCP) is the fastest and most efficient networking technology available to transport block-type data from storage to hosts.
Companies that previously were using TCP/IP-based networking technology
over Ethernet and attempted to migrate that to Gigabit Ethernet have found that,
despite the similar wire speeds of the technology, the efficiency and protocols
available don’t allow for the use of bandwidth that Fibre Channel does.
By using Fibre Channel, companies have found that they can speed up data
access between hosts and storage. In addition, the more efficient usage of IP over
Fibre Channel has advantages in network utilization over Gigabit Ethernet, with
a shared network making standard TCP/IP networking over SANs an attractive
solution.
Increasing Server Cycles
A growing problem has emerged with high-speed networks based on IP.
Companies have been using clustering techniques (running many, coordinated
servers in tandem to distribute processing) to attempt to get past the problems of
limited CPU speeds and server scalability. However, clustering techniques rely on
140_SANs_01
8/14/01
1:32 PM
Page 15
Introduction to SANs • Chapter 1
the latency of a network to determine what types of scaling processing are available.With most Ethernet networks comes a negative scaling effect by adding
clustered servers—adding more servers allows less and less processing power, due
to the increased dedication of CPU cycles required just to coordinate that cluster.
The Virtual Interface (VI) Architecture protocol, a standard proposed by Intel,
Microsoft, and Compaq for reducing the use of the CPU for network transfers,
has emerged as the leading protocol for network communications for clustered
environments. By providing a simplified model for direct hardware access to clustered machines,VI eliminates the complex IP stack in favor of a hardware-based,
Direct Memory Access (DMA) approach to transferring data across a network.
The FC-VI standard maps the VI protocol (also available on Ethernet and proprietary interconnects) to the Fibre Channel protocol, and makes low-latency, direct
access available to clustering applications.
The primary areas in which the VI protocol is being used today include clustered databases such as Oracle Parallel Server and IBM DB2, both of which
natively support the VI protocol over Fibre Channel and other networks. A significant base of researchers and other developers are also using the VI protocol for
scientific computing and distributing computational tasks across large networks
of machines.
SANs are beginning to be used in this area to take advantage of the FC-VI
protocol. Businesses are using the VI protocol to free server cycles on their database servers, and recent third-party copy records have been set using VI-capable
hardware. An administrator running a clustered database such as the Oracle
Parallel Server or IBM DB2 should consider taking advantage of the SAN infrastructure and installing an FC-VI-capable HBA to further accelerate the database
cluster.
Ensuring Disaster Tolerance
One of the major advantages of SAN technology is its high-performance, longdistance capability. Initially, SAN technology was mostly used to extend to larger
distances within a building or campus. However, recently this has been applied to
the problem of disaster tolerance: being able to keep an operation up and running even if catastrophe strikes. For example, data center managers are now using
Fibre Channel technology bridged through MANs to make their installations
more disaster tolerant.
A typical example of using MANs for disaster tolerance is brokerage houses
located on Wall Street. A common scenario is a large data center that supports
15
140_SANs_01
16
8/14/01
1:32 PM
Page 16
Chapter 1 • Introduction to SANs
customer operations and the trading floor in Manhattan, which needs to have a
live backup site to handle the possibility of a power or telecommunications
outage, natural disaster, or other major catastrophe. Brokerages are now locating
live, connected SANs through Fibre Channel to Dense Wavelength Division
Multiplexing (DWDM) and other types of MAN technology directly to SANs in
New Jersey.The two data centers continually share data and replicate (mirror)
data between the independently running sites, so that if a catastrophe strikes, all
information is up to date and still available.This technique is also heavily used in
continental Europe, where operations can be spread between countries through
these metropolitan connections or dark fiber, an optical communications technology that allows the transport of high bandwidth data.
When to Deploy a SAN
Before deciding to deploy a SAN, consider whether implementing a SAN is the
right thing for the situation at hand. Frequently in technology, people decide to
implement something before they have evaluated whether the technology is actually the best for their needs.The result is often disappointing for the user and for
the vendors involved when, after lots of money and time is spent, there was little
chance that the solution would have solved the overall problem in the first place.
On the other hand, if the problem is first considered and matched to the best
technology, the odds for success are much greater.Things to consider when
deciding whether a SAN is the right solution:
■
The primary application that needs to be solved
■
Speed, bandwidth, and distance requirements
■
The amount of data sharing or consolidation needed
■
The cost of the SAN infrastructure required, such as switches, cables,
and HBAs
Designing Around the Application
The most important part of determining whether to deploy a SAN is to focus on
the actual business application that will be served with the SAN deployment.
Unlike Ethernet networking technology, Fibre Channel SAN technology really
should be applied on a network application-by-application basis. Equipment and
software deployment is entirely driven by what types of applications need to be
served, as opposed to being just an interconnect to plug in all of the desktops in
140_SANs_01
8/14/01
1:32 PM
Page 17
Introduction to SANs • Chapter 1
an organization.Typical applications are storage consolidation,Web hosting farms,
database and business-critical transaction servers, and workgroup data sharing.The
types of issues to consider are the use of those applications: data sharing, faster
backup, or the disaster tolerance aspects of the network. An understanding of the
benefits of deploying a SAN is vital to driving the design, software, and hardware
used to deploy a network.
One thing to consider when starting SAN deployment is to carefully rank
and prioritize the goals of the project. Equipment, software, and solutions are all
geared toward specific types of applications, and understanding what is needed is
very important in determining if a new feature that a vendor is trying to sell is
critical to making an application deployment successful.
Assessing Speed, Bandwidth,
and Distance Requirements
Key factors in deciding to use a SAN are speed and bandwidth requirements. For
the most demanding applications, a SAN might be the only option. One
Gbit/sec (100 MB/sec) components are widely distributed now; 2 Gbit/sec (200
MB/sec) switches, storage, and HBAs are starting to hit the market; and Fibre
Channel standards of up to 10 Gbit/sec are already in development. If unimpeded access to storage is required, Fibre Channel exceeds the speeds available
from legacy techniques such as parallel SCSI, and simplified cabling and connections make it far more reliable. Compared with other technologies, such as IPbased file sharing and Network Attached Storage (NAS), the Fibre Channel
protocol provides for more usable bandwidth and faster data transfer.
Distance is another factor in deciding to use SAN infrastructure. If data needs
to be distributed across a building, campus, or city, a storage network is perfect.
Long cable lengths, multiple cabling options, and robust components make a SAN
a perfect fit for distributing data. For example, many companies have solved the
problem of not having enough space for all of the data and hosts they need in a
single server room by running Fibre Channel from different parts of the building
together. Large storage arrays, which infrequently need access by administrators,
can be housed in a “lights out” facility separate from production servers, which
often need connection to monitors, keyboards, and administration. Deployment of
17
140_SANs_01
18
8/14/01
1:32 PM
Page 18
Chapter 1 • Introduction to SANs
these solutions is also relatively easy: optical fiber is connected on either end to
devices or switch ports, and full bandwidth operation is seamlessly available.
SANs Versus NAS: What’s the Difference?
First-time data administrators are often confused as to whether they
should be using SANs or NAS. Both techniques are useful, but for different types of applications.
NAS uses common client networks such as Ethernet to connect
client computers to a host file server. Unlike SANs, the client does not
directly communicate with the storage. Instead, the client computer
uses a high-level networking file system such as Network File System
(NFS), which runs the TCP/IP protocol over Ethernet. Data exchange
occurs at the file level, unlike a SAN where data is operated at the block
level over Fibre Channel.
In general, NAS techniques are best used for client-to-host connections, and SANs are better suited to high-speed file sharing and
host-to-storage connections. NAS connections are typically easier
to deploy over existing infrastructures, albeit much slower. SANs are
typically used where bandwidth and speed are most important, and
block-level, direct connections are required. Both techniques can
coexist in the same installation. In fact, some NAS systems require a
back-end SAN to support their operation.
Data Sharing and Consolidation Needs
Determining whether data will be shared on a SAN is important. As one of the
major motivations behind moving to SAN technology, it is critical to understand
exactly how data will be shared across the SAN. Areas to consider when assessing
data sharing and consolidation needs include:
■
Will information be shared at a file level or volume level?
■
Are resources such as storage arrays (static shares) shared, or is file
sharing required as part of the workflow (dynamic shares)?
140_SANs_01
8/14/01
1:32 PM
Page 19
Introduction to SANs • Chapter 1
Resource Sharing
The simplest form of storage network data sharing and consolidation is simple
resource sharing. In this case, a large storage array or storage farm is shared
among many machines, but access to each array or disk is statically allocated.
Each machine on the network is assigned storage, and does not change that ownership very often, if at all.This is typically used to partition a large storage array
among many hosts and can be a very convenient way to manage storage
resources.This resource partitioning can be done in a variety of ways, including
zoning at the switch level, storage-based Logical Unit Number (LUN) masking,
HBA-based LUN masking, or by a virtualization device that presents “virtual
LUNs” to the hosts. Resources are generally allocated once and are infrequently
changed or modified, and rarely are resources switched between hosts.
Volume-Level Sharing
Volume-level sharing is the sharing of resources at the volume level: for example,
sharing a LUN between two systems for a clustering application, or moving volumes between machines as part of a digital media workflow.This generally
requires the intervention of software that can mount and unmount volumes as
part of an operating system, and also might require translators between different
operating system formats.
File-Level Sharing
File-level sharing is sharing of resources at a file level.This means writing and
reading files on a single volume between different machines.This is typically
done in SAN configurations with one machine that has write permissions to a
volume, and many machines with read permissions to the files on that volume.
However, if per-user security (only certain users have access to certain files) is
desired, or if many machines need to write to the same volume, it will require
more advanced file systems to achieve this functionality. Some techniques (such
as global file systems) can be used in this case, but often a better fit is NAS technology and more traditional network file-sharing techniques such as NFS.
A SAN is an ideal solution for:
■
Block-level access to shared storage
■
High bandwidth requirements
■
Need for expandability
19
140_SANs_01
20
8/14/01
1:32 PM
Page 20
Chapter 1 • Introduction to SANs
■
Required access to very large centralized storage arrays
■
Need for redundant, highly available access to storage
■
Clustered server configurations
■
Distributed applications
■
Need for disaster tolerance
■
Backing up large amounts of data nightly
■
Running clustered databases (Oracle Parallel Server, IBM DB2)
■
Need for a highly scalable infrastructure
■
Centralized storage management
A SAN is not appropriate for:
■
A small amount of storage with no sharing required
■
File-level, client access to volumes only
■
No storage consolidation required
Steps to a Successful
SAN Deployment
As with any advanced technology, the most difficult part of working with the
technology is the actual deployment of the hardware and software.This section
outlines at a high level some recommended steps to take to help ensure success
in SAN deployment, and explains the overall process of deploying a SAN. Later
chapters discuss the SAN design process in more detail, breaking the process into
seven steps: data collection, data analysis, architecture development, testing a prototype, transitioning existing hardware, release to production, and maintenance.
The first step to successful SAN deployment is to evaluate the intended goals
of the deployment. A firm ranking of the top items to achieve with the deployment is key to evaluating hardware and software options as well as determining
topology and design layout.The channel characteristics of storage devices make
topology selection and overall architectural design critical in SANs. For example,
it is critical for high availability. If this is the primary goal, consider dual-redundant SAN fabrics, fault-tolerant components, and a topology that allows for
redundant HBAs and storage ports. On the other hand, if data consolidation and
140_SANs_01
8/14/01
1:32 PM
Page 21
Introduction to SANs • Chapter 1
cost reduction is the goal, the design should be a topology without the redundancy and separate fabrics.The same considerations go into other applications
such as backups, databases, or disaster tolerance. Defining the goals and developing a detailed technical plan is an important factor in success. Future chapters
cover the data collection and analysis phases in detail.
As with any technology solution, a great deal of a SAN deployment should be
the investigation of the different software and hardware that will be part of the
solution. Because hardware and software are constantly changing, and new innovations and equipment are available every day, speaking with various vendors,
attending trade shows, and talking to current users will help in determining the
best options for a SAN. Although Fibre Channel equipment, for the most part, is
now fully interoperable and will work together, it is still a good idea to get a sense
of the different options available. Storage arrays, the foundation of a SAN and the
most important part of the solution, should be a primary focus when designing a
network. However, because everything in a network has to work together, the
infrastructure and HBAs are also a critical part of the formula.
The next step is installing a SAN prototype and testing the install, as covered
in future chapters. Networking is a complex area, and SANs are no exception. An
important deployment risk reduction item is the actual installation of a SAN
testbed to prototype the installation. Creating a SAN prototype allows for testing
the ultimate installation and working out any issues that might be encountered
with both the software and hardware being used. Unless an outside party is doing
the installation, consider it a necessity to personally install the setup, to make sure
that all of the components work together.
It is usually best to set up a lab with all the required power, cabling, and hardware needed to install and test equipment.This includes enough host machines to
run the application and hosts to install and test HBAs and software, along with
racks and benches for the storage, switches, and other hardware that will be used. In
fact, for mission-critical SANs, maintaining a testbed in parallel with the production
SAN should be seriously investigated before rolling it out to the actual production
network.The testbed allows for pretesting configuration changes and new components, debugging production problems, validating changes, and verifying that new
versions of network and system software and firmware behave as expected.
21
140_SANs_01
22
8/14/01
1:32 PM
Page 22
Chapter 1 • Introduction to SANs
SANmark and Other Interoperability Programs
The Fibre Channel Industry Association runs a program called
SANmark, which certifies equipment against Fibre Channel interoperability suites defined by the industry.
Run in conjunction with the University of New Hampshire, the
SANmark tests define how different kinds of equipment need to interoperate with other types of Fibre Channel hardware. Several levels
of SANmark certification exist, and the standards are constantly
evolving to make sure that the latest hardware is tested against
existing standards.
Because of the importance of interoperability in the Fibre Channel
area, many companies publish compatibility matrices that describe
which components and versions of software and hardware have been
tested against common configurations.
The best way to select hardware and software is to start with the information
available from different vendors on suggested configurations and interoperable
hardware.Vendor “interoperability labs” and certifications give excellent points on
how to pick the right products that will work together.
Interoperability labs are now a standard part of almost every vendor’s support
structure.These labs, where vendors extensively test and qualify equipment and
certify configurations, are set up to make sure that all of the equipment they provide is compatible with other vendors. Extensive testing by vendors ensures that
the hardware and software work together as expected. Stress testing, configuration
testing, and negative testing under various loads and in different configurations
flush out problems before a piece of equipment is shipped. Most vendors make all
of this information available to users and are happy to share what configurations
they have certified for use with their equipment. Researching the interoperability
information from the software and hardware vendors should provide a good idea
of what works together.
140_SANs_01
8/14/01
1:32 PM
Page 23
Introduction to SANs • Chapter 1
After investigating the options, installing a SAN testbed, and selecting what
seems to be the right software and hardware, it’s testing time. Even with preconfigured and specified configurations, it’s best to set up a real-world configuration
to help test the deployment on a small scale.This configuration should include a
representative sample of everything that will be deployed on the SAN, with reallife applications and representative data and with sufficient load generated to
replicate a limited deployment. No configuration or combination of hardware
and software is foolproof, so testing a real-life configuration in a controlled environment before rollout can help to flush out the last issues and major showstoppers that could derail the deployment. Key areas to cover in testing include:
■
Installing all of the major hardware vendors that have been selected
■
Testing for interoperability of components with the versions of software
and firmware that will actually be deployed
■
Testing all crucial functionality with the software and applications
running
■
Testing for error handling and tolerance
Simple testing includes plugging components in and out of the network,
powering down components to see how they recover, and moving cables. More
complex testing involves running heavy traffic to components, setting up the
application, and running simulated loads.The most important part of this testing
is running a simulated or actual load on the application that is being deployed
and making sure that even under real-life conditions, everything works as
expected, with no problems.
The final step in a successful SAN deployment is staging the actual deployment into the enterprise. Staging the deployment helps to minimize risk and
maximize the probability of success. Rather than moving to a solution in one fell
swoop, it is better to deploy on a limited basis in certain areas and expand that
deployment once everything is up and running smoothly. A technique that is
frequently used to minimize deployment risk is in-place staging of SAN deployment. In this technique, the equipment and software are set up and tested where
the network will permanently be installed. All the testing in the previous step is
done where the SAN deployment will eventually be installed, so no equipment is
moved, damaged, or inadvertently reconfigured. Instead, when the time comes to
23
140_SANs_01
24
8/14/01
1:32 PM
Page 24
Chapter 1 • Introduction to SANs
deploy, a cable is connected and the SAN configuration is instantly live and available. Using the advantage of long-distance Fibre Channel cabling to enable “live”
installation of new storage, large enterprises use this technique whenever they add
new SAN hardware.The new storage is tested, run through diagnostics and stress
tests, and then added onto the production network with the addition of cabling
and modifications to zoning—all without moving the equipment or reconfiguring the setup.
Carefully staging the deployment, applying changes on a limited basis, and
then rolling it out gradually will minimize any risk and ensure that everything
operates smoothly. Future chapters discuss the steps of the SAN design process in
more detail, how to analyze options and the underlying hardware and software,
how to design a network, and how to best take advantage of the tools available.
140_SANs_01
8/14/01
1:32 PM
Page 25
Introduction to SANs • Chapter 1
Summary
The need for more data storage is constantly growing, with the Internet, e-mail,
multimedia, and the sheer generation of data demanding more and more storage.
With predictions calling for more data to be created, stored, and managed in the
next two years than was produced in the history of mankind, it is very important
to address those storage needs in a scalable and reliable way.
In this chapter, we covered a bit of the history behind the SAN: its roots in
parallel SCSI connections, its evolution from just a SCSI replacement to more
sophisticated, loop-based storage sharing, to its current incarnation in highly scalable, reliable switched fabric networks. Fibre Channel networks are now being
deployed for the most business-critical and important areas in enterprises.Through
this evolution, SAN technology now offers a robust platform to establish and support the most important business applications.
The benefits of building a SAN include ensuring high-availability access to
data, consolidating storage resources and management, reducing backup windows
and traffic, freeing host CPU cycles for other important tasks, and ensuring data
availability through disaster tolerance techniques. Building a Fibre Channel SAN
enables a more reliable, highly scalable, large bandwidth access to data. A SAN is
typically used for the most business-critical, technically challenging problems a
company faces.
Deploying a SAN takes some planning. It is important to consider the application in use, speed and bandwidth requirements, and whether data sharing and
consolidation offer any benefit. It is also important to consider the budget for the
project.The keys to a successful SAN deployment are evaluating the goals for the
technology; fully investigating the software and hardware to purchase; taking the
time and resources to install a testbed; working with vendors to select the right
combination of software and hardware; and testing the configuration thoroughly.
Finally, stage the deployment so that problems can be solved on a limited scale
first, before rolling it out on a larger basis.
SAN technology has the ability to meet the most demanding business needs
and is the only technology currently available that meets the distance, bandwidth,
and reliability requirements of critical applications.With the explosive growth of
data storage requirements, this technology enables the efficient use and management of data resources. By following proven techniques and carefully planning
deployments, Fibre Channel SANs can help solve the most difficult data storage
problems.
25
140_SANs_01
26
8/14/01
1:32 PM
Page 26
Chapter 1 • Introduction to SANs
Solutions Fast Track
Overview of SANs
; SAN technology evolved from direct-attach interconnects like Small
Computer Systems Interface (SCSI).
; Fibre Channel supports SCSI, Internet Protocol (IP), and the Fibre
Channel Virtual Interface (FC-VI) Protocol.
; The distance between Fibre Channel nodes can be as much as 10 km.
; Fibre Channel supports copper, multimode optical, and single-mode
optical media.
; SAN technology has moved from Fibre Channel Arbitrated Loop to full
Fibre Channel switch fabric.
Taming the Storage Monster
; Data storage needs are increasing rapidly.
; Requirements due to databases, e-mail, multimedia, and the Internet
have dramatically increased the required amount of storage for data.
; Disk farms, storage arrays, and storage consolidation are the keys to
solving the storage problem.
Benefits of Building a SAN
; Fibre Channel is ideal for supporting high-availability configurations and
business-critical back-end operations, due to the ability to set up redundant networks and clusters.
; SAN technology allows for storage consolidation and data pooling for
more efficient use of storage resources.
; Backup windows are shrinking, and backup traffic on the LAN can be
easily reduced by using a SAN to reduce network congestion due to
backup.
; Block-level, high-speed access through SCSI-Fibre Channel Protocol
(FCP) can accelerate data access between storage and hosts, and can
140_SANs_01
8/14/01
1:32 PM
Page 27
Introduction to SANs • Chapter 1
free up host resources that would be occupied serving files and data
through IP.
; Cluster protocol access through FC-VI frees up CPU cycles in hosts
and enables clustered database operations.
; One of the major advantages of SAN technology is its long-distance
capability for disaster tolerance.
When to Deploy a SAN
; The most important part of determining whether to deploy a SAN is to
focus on the actual business application that will be served with the
SAN deployment.
; Speed and bandwidth requirements determine if the technology is right
for the application. Compared with other technologies, such as IP-based
file sharing and Network Attached Storage (NAS), the Fibre Channel
protocol provides for more usable bandwidth and faster data transfer.
; A SAN is ideal for block-level access to shared storage.
; Fibre Channel works well for centralized access to storage arrays,
redundant connections, clustered configurations, and disaster tolerance.
Steps to a Successful SAN Deployment
; Data collection Evaluate the goals of the deployment to determine
options in achieving high availability, redundancy, fault tolerance, data
consolidation, cost reduction, and so forth.
; Data analysis Investigate the hardware and software options that
support those goals.
; Architecture development Design and install a SAN testbed to set up
configuration and components. Select the software and hardware
carefully to avoid any interoperability problems.
; Testing the prototype Test the configuration for interoperability,
functionality, error handling, and fault tolerance.
; Transition existing hardware in a controlled release to production
Stage the deployment by rolling out the setup gradually, making changes
on a limited basis to minimize risk.
27
140_SANs_01
28
8/14/01
1:32 PM
Page 28
Chapter 1 • Introduction to SANs
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: Is Fibre Channel more expensive to deploy than Gigabit Ethernet?
A: Cabling, GBICs, and transceivers are physically identical to Gigabit Ethernet.
Costs of Fibre Channel switches and other equipment are very close to those
of Gigabit Ethernet components.
Q: Is there any way to preserve the investment in legacy SCSI storage in an
enterprise?
A: Yes, through the use of Fibre Channel-to-SCSI bridges.
Q: Where can I get expert help in setting up a SAN?
A: There are numerous system integrators and resellers who can help. Check
with the equipment or software vendors.
Q: Is interoperability a problem with Fibre Channel?
A: No, the earlier problems with interoperability in Fibre Channel were
mostly due to Fibre Channel Arbitrated Loop (FC-AL) technology. Switched
fabric technology eliminates these problems and provides very reliable performance. However, as with any technology, it is still a good idea to check for
equipment compatibility with the respective vendors.
140_SANs_02
8/14/01
1:33 PM
Page 29
Chapter 2
Fibre Channel Basics
Solutions in this chapter:
■
The Architecture of SANs
■
Fibre Channel Protocol Basics
■
Classes of Service
■
Storage Network Topologies
■
Fabric Services
; Summary
; Solutions Fast Track
; Frequently Asked Questions
29
140_SANs_02
30
8/14/01
1:33 PM
Page 30
Chapter 2 • Fibre Channel Basics
Introduction
Storage Area Network (SAN) infrastructures are built using new technologies that,
although related to and derived from other technologies such as SCSI and IP networking, has its own set of terminology and concepts. Like standard computer
system networking, Fibre Channel also has its own stack of protocol levels, ranging
from the physical connectors and media (FC-0) to upper-level protocols (FC-4).
Each of these levels defines a different and separate part of how Fibre Channel
equipment communicates. An understanding of these protocol levels, although not
required, helps in understanding the equipment and how to debug and monitor
the equipment.The different FC-4 protocols (FCP, IP,Virtual Interface [VI], and
others) are tied directly to the different kinds of applications (storage, networking,
and clustering), and enable Fibre Channel to support a robust set of uses.
This chapter introduces some of the basics of Fibre Channel and reviews the
underlying architecture of Storage Area Networks (SANs).You will discover the
major parts of the Fibre Channel protocol, the primary physical components
involved, and how they relate to the software and applications running on a SAN.
At the end of this chapter you will be able to determine the kinds of protocols
you need to run in your network, and better understand the various SAN
topologies and terminology.
The Architecture of SANs
SANs provide a topology for connecting a number of hosts to storage devices. An
exciting part of Information Technology (IT), SANs allow more users access to
more data at faster rates.The concept of a SAN is to provide an infrastructure
over which large amounts of data can be transferred robustly between servers and
storage devices such as Just a Bunch of Disks (JBODs), tape drives, and
Redundant Array of Independent Disks (RAID) systems. SANs also enable the
sharing of storage devices such as tape silos and RAID systems. Although there
are some efforts in the industry directed to using Gigabit Ethernet and
InfiniBand technologies to implement SANs, the primary SAN infrastructure
available today is Fibre Channel based.
SAN storage is useful for business, because the high level of connectivity
allows you to consolidate all your storage needs in a SAN, which is easily
expanded, as you require more space. SANs are also accessible to everyone on the
network, which makes it easy to share large projects. Another advantage of using
a SAN as a means to distribute data across your network is speed.The most
common protocol used to implement a SAN is the Fibre Channel protocol,
140_SANs_02
8/14/01
1:33 PM
Page 31
Fibre Channel Basics • Chapter 2
which most commonly operates at 1 Gbit/sec (a data transfer rate of 100 MB/sec).
There are also 2 Gbit/sec devices that just came to market and plans for 10
Gigabit devices.These high speeds mean that data is less susceptible to bottlenecks.
A Fibre Channel SAN also provides the advantage of increased reliability.The
Fibre Channel protocol uses both buffer-to-buffer and end-to-end flow control,
and it calculates a Cyclical Redundancy Check (CRC) on every transmitted
frame. Reliability can also be increased through redundancy by developing fallback connections over a large geographic area. Fibre Channel is designed to
transmit distances that exceed the epicenter of an earthquake.This means that a
SAN can stay fully operational if it is designed with redundant links at remote
points.
Another advantage of a SAN is scalability. SANs provide storage that is not
server attached, which improves performance by avoiding bottlenecks on the
connection to one machine. SANs also provide affordable scalability, because a
storage device can be directly attached to the SAN. Since disks are detached from
direct host attachment, multiple devices can allocate the same storage area
without performance limitations (Figure 2.1). Server-detached storage can offer a
more cost-effective storage solution, since a server is no longer necessary in order
to distribute a file system over a multihost network.
A SAN implemented using the Fibre Channel protocol incorporates the benefits of a channeled connection and a network. A channel is a high-speed information conduit but, unlike a network, it is hardware-intensive. Channels
specialize in streaming data between two devices, such as your computer and a
storage subsystem. Some examples of channel protocols are Small Computer
System Interface (SCSI) and High-Performance Parallel Interface (HiPPI). A network, on the other hand, specializes in connectivity, allowing flexibility to add
and remove nodes from the environment. Examples of network protocols are
Token Ring, Ethernet, and Asynchronous Transfer Mode (ATM). Fibre Channel
incorporates the flexibility of a network with the high speed and reliability of a
channel—essentially allowing you to connect a large number of devices without
degrading performance.
When we talk about a SAN, we generally think of transporting SCSI data
over Fibre Channel. Although this is what is most commonly used in a SAN,
Fibre Channel supports many other protocols. Some other protocols that can be
transported over Fibre Channel are HiPPI, Internet Protocol (IP), Fiber
Distributed Data Interface (FDDI), and ATM, although IP, SCSI, and Virtual
Interface (VI) are the predominate protocols transported on Fibre Channel today.
31
32
8/14/01
1:33 PM
Page 32
Chapter 2 • Fibre Channel Basics
Figure 2.1 Storage Server Versus SAN Architecture
Storage Server
RAID
Bottle
Client
Neck
Server
Client
TAPE
Client
SAN
Server
TAPE
Client
Switch
Ethernet
140_SANs_02
Client
RAID
Server
Client
A SAN is constructed from three primary types of elements: target devices,
initiating devices, and interconnecting devices.
A target device is usually a storage device on a SAN.There are many different
types of storage devices, including tape drives, JBODs, RAIDs, and IP targets. A
140_SANs_02
8/14/01
1:33 PM
Page 33
Fibre Channel Basics • Chapter 2
tape drive is commonly used for backup of other storage devices which might be
a database or critical file system. Fibre Channel tape technology is an emerging
technology, and testing procedures have just recently been developed.We discuss
Fibre Channel components in further detail in Chapter 3, “SAN Components
and Equipment.”
Fibre Channel storage disks and Fibre Channel-capable tape drives are the
most common types of target devices. Fibre Channel disks have a Fibre Channel
controller on them. In general, Fibre Channel disks are contained in a JBOD. In a
JBOD, each disk is visible to the SAN, each is assigned an address, and each is
treated as an autonomous device even though the physical disks are located in the
same enclosure. SCSI disks might also be contained in a RAID, in which case the
RAID controller will make the array of disks appear to be one disk to the SAN.
The RAID is one disk in the sense that it will take a single address on the SAN.
Another type of target might be an IP target. Since IP is a protocol commonly
used over Fibre Channel, you might see devices communicate by passing IP
packets back-and-forth. In this case, there is no distinction between a target device
and an initiating device, since both devices can initiate exchanges of Fibre
Channel and IP frames.
An initiating device is a device that actively seeks out and interacts with target
devices on the SAN. Examples are a server or a workstation, and they are often
called hosts. A Host Bus Adapter (HBA) is a Peripheral Component Interconnect
(PCI) or bus-type adapter that resides in a host machine.That machine can be a
server, a workstation, or other device that would request information from a
group of disks or storage. It could conceivably be an automated tape backup
system.The distinction between a target and an initiator is that an initiator
actively searches for a target with which to initiate a transfer, while a target is a
passive device.There is often a fine line between the two, since some devices
(such as IP devices or bridge devices) might read and write to each other.When
a device opens an exchange, it acts as an initiator.
From an infrastructure perspective, the most important components in a SAN
are the interconnecting devices, namely as switches. Switches create the foundation of
a Fibre Channel SAN and provide a high-speed interconnect for routing frames
from one device to another. Switches provide fabric services, additional ports for
scalability, and the linking capability of the SAN over a wide distance. Although a
Fibre Channel SAN can technically exist without any switches using arbitrated
loop topology (discussed later in this chapter), a loop-only topology does have its
challenges. Arbitrated loop topologies can be subject to performance issues,
which can be avoided by connecting the SAN in a switched topology. Switches
33
140_SANs_02
34
8/14/01
1:33 PM
Page 34
Chapter 2 • Fibre Channel Basics
are responsible for correctly routing frames from one node to another over the
entire network, with a group of one or more interconnected switches called a
fabric (Figure 2.2).
Figure 2.2 Interconnected Switches Make Up a Fabric
RAID
RAID
RAID
JBOD
RAID
FABRIC
Switch
Switch
Switch
Switch
Switch
Switch
Workstation
Host
Host
Host
Host
FC/SCSI
Bridge
SCSI Tape Library
There are two other common devices encountered in a SAN architecture:
hubs and routers. A Fibre Channel hub provides similar function to an Ethernet
hub. A hub is a box with a number of ports to which devices can be attached,
which simplifies device interconnection.The bandwidth on the hub, which is 1
Gbit/sec, is shared among all the connected devices.There are two types of hubs,
managed and unmanaged. An unmanaged hub simply provides a physical wiring
140_SANs_02
8/14/01
1:33 PM
Page 35
Fibre Channel Basics • Chapter 2
between all the connected devices. It does not do any signal processing. A device
that is transmitting data, regardless of what the data is, will be connected to the
other devices on an unmanaged hub. A managed hub, on the other hand, will
wait to connect a device to the other devices on a hub until it sees valid transmission data from the device.The disadvantage of unmanaged hubs is that they
increase disruption time during the insertion of a new device, and they will also
allow a device that is no longer functioning properly to continue transmitting
bad data over the link.This can cause a Fibre Channel loop to remain in an
unusable state, and stop traffic between other devices on the hub. For these
reasons, most hubs are managed hubs.
The terms router and bridge are interchangeable in Fibre Channel terminology.
A router generally connects two different protocols, such as Fibre Channel and
Ethernet, or Fibre Channel and SCSI. A router is usually a one-to-many connector or a many-to-many connector, whereas a bridge generally connects in a
one-to-one manner.
The American National Standards Institute (ANSI) began work on Fibre
Channel in 1988, and since then the X3T11 Task Group has developed over 20
standards. Fibre Channel’s complexity is not without reward, however: Fibre
Channel presently transmits at 1.0625 Gbit/sec over all types of physical media.
Recently, many companies have increased that number to speeds of 2.125
Gbit/sec and specifications have recently been published on 10 Gbit/sec Fibre
Channel as well. Since Fibre Channel bytes are encoded in 10-bit blocks, this
provides a transfer rate of approximately 100 Mb/sec at 1.0625 Gbit/sec.
NOTE
Right now there are over 20 Fibre Channel standards projects with many
more to come. The main organization involved in the standards process
is T11 (www.t11.org). Copies of all the current standards are available at
the T11 Web site. The Fibre Channel Industry Association (FCIA, at
www.fibrechannel.com) has also started the SANmark program to test
the conformance of Fibre Channel devices to those standards, based on a
sample set of interoperability tests. Devices can be certified to a number
of SANmark Conformance Documents (SCDs). A device’s ability to pass
these tests is an indication of its ability to interoperate with other Fibre
Channel devices.
35
140_SANs_02
36
8/14/01
1:33 PM
Page 36
Chapter 2 • Fibre Channel Basics
Fibre Channel is most easily understood if it is broken down into layers.
There are five Fibre Channel layers, labeled FC-0 to FC-4.The layered breakdown makes Fibre Channel easier to study and understand.We can break it down
further by thinking of FC-0 and FC-1 as the physical and signaling layers. FC-2
is a link, or protocol layer.The FC-3 layer specifies common services such as the
Name Server, which provide services for all nodes on a Fibre Channel network.
The FC-4 layer specifies the mapping of Upper-Level Protocols (ULPs) with the
Fibre Channel protocol.
The physical media is the FC-0 layer. Although it is called Fibre Channel, it
can be carried over either fiber-optic cables or copper twisted-pair type cables.
There are two common types of fiber-optic cable: single-mode and multimode.
Single-mode cable has the ability to transmit longer distances (100 km) than
multimode fiber (500 m).
Fibre Channel transmits in 8b/10b-encoded characters, and the signaling interface is the FC-1 layer.This means that for each 10 bits of information transmitted,
you actually receive 8 bits of information, which is encoded into a character.The
8b/10b encoding of characters provides a low level of error detection, because if
bits are lost or inadvertently changed, invalid characters will be received. Four transmission characters make a transmission word. Certain transmission words are then
used as the primitives in the Fibre Channel protocol for signaling purposes.
We consider primitives and transmission words to occur at the FC-2 level.
Primitives control the flow of frames on a Fibre Channel link. Frames are sets of
transmission words that contain routing headers and a payload.The payload is
where ULP information is stored, such as SCSI commands or data.The mapping
of SCSI commands or data into Fibre Channel frames is an ULP activity that
occurs in the FC-4 layer.
Devices in a SAN are generally interconnected with a switch. A single switch
or a group of all interconnected switches is commonly referred to as a fabric,
which provides certain services to the nodes attached to it.The services provided
are part of the FC-3 layer and include a Name Server,Time Server, Alias Server,
and so on.The Name Server is a distributed database that registers all devices on
a fabric and responds to requests for address information. On a fabric, all services
are conceptually distributed, meaning that the same server provides service to all
nodes independent of direct switch attachment.
140_SANs_02
8/14/01
1:33 PM
Page 37
Fibre Channel Basics • Chapter 2
Fibre Channel Protocol
This chapter explains the concepts of the Fibre Channel protocol.The goal is to
gain a high-level understanding of the mechanisms of the protocol, such as arbitration, arbitrated loop address selection, and frame generation and transfers.
We abstract the Fibre Channel protocol by dividing it into five layers and
analyze how each layer interacts with the other layers.We will further analyze the
FC-4 layer in detail, because it is the Fibre Channel layer that controls the mapping of ULPs that can be transported over Fibre Channel.We discuss SCSI and
IP primarily, but also consider HIPPI, ATM, and IPI-3.
Classes of Service
Classes of service are different semantics used to transfer frames using various verification and buffering mechanisms.The classes of service section later in this
chapter describes the different types of classes of service and their uses:
■
Class 1 Acknowledged connection-oriented service
■
Class 2 Acknowledged connectionless service
■
Class 3 Unacknowledged connectionless service
■
Class 4 Connection-oriented fractional bandwidth
■
Class F Inter-switch communication format
Storage Network Topologies
In the storage network technologies section later in this chapter, we will look at
different topologies and discuss how differences in architecture can affect data
flow over your SAN.There are three primary topologies, and the goal in this section is to understand how different functions can be achieved in a SAN by using
a single topology or a combination of topology models.We look at examples of
topologies and define the terminology for referring to nodes in the topology:
■
Point-to-point topology
■
Arbitrated loop topology
■
Switched fabric topology
37
140_SANs_02
38
8/14/01
1:33 PM
Page 38
Chapter 2 • Fibre Channel Basics
Fabric Services
Fabric services provide information to nodes in a switched fabric topology.
Services can be distributed across all switches, creating the appearance of singleservice type servers.The services provided by the different servers on a fabric
make the interconnection of hundreds to thousands of devices seamless.They
provide addressing, device-type, and connection-type information to requesting
nodes. In this chapter, we discuss a number of different fabric services, including:
■
Login Server
■
Name Server
■
Fabric/Switch Controller
■
Management Server
■
Time Server
Fibre Channel Protocol Basics
Fibre Channel was developed to combine the benefits of channel and network
technologies. Channels are directly connected devices that do not require large
amounts of logic to be incorporated. Channels are hardware-intensive because they
are designed for fast transfer of large amounts of data between buffers. Examples
of channels are HiPPI and the serial connection made between serial ports on
two computers. Networks, on the other hand, are capable of handling very large
numbers of nodes. Networks used to be software-intensive because packets needed
to be routed to one of many devices on a network. Most of today’s networks use
hardware-forwarding. Networks also have to adapt “on the fly” to devices being
added and removed. Fibre Channel was developed to incorporate the best features of both.
Fibre Channel allows data to be transferred at faster speeds.The base speed of
Fibre Channel is 1 Gbit/sec. Many devices, however, are running at double speed
right now, and the 10 Gbit/sec specification is presently in draft form. Another
advantage of Fibre Channel is that it incorporates the ability to dynamically connect large numbers of nodes over a very wide area. Using single-mode fiber, elaborate SANs can span many kilometers.This adds the benefit of being able to
incorporate redundancy for mission-critical applications. Fibre Channel was
designed to produce redundant dynamically reconfigurable SANs that would provide storage even in the event of a natural disaster after a large portion of the
infrastructure was damaged.
140_SANs_02
8/14/01
1:33 PM
Page 39
Fibre Channel Basics • Chapter 2
Fibre Channel is primarily used to transport the SCSI and IP protocols.The
benefits of using Fibre Channel for the mapping of these protocols is increased
speed connectivity and longer connection distances.There are three primary
topologies for Fibre Channel devices.The first topology is point-to-point.This
topology is used between two devices. In point-to-point topology, there is no
addressing, since all frames (Fibre Channel packets) are intended for “the other
device.” Device connections to switches are sometimes called point-to-point
connections.
The second topology is arbitrated loop. In this topology, devices are connected
together in a loop, with the receive fiber coming from an upstream device and
the transmit device going to a downstream device. An 8-bit Arbitrated Loop
Physical Address (AL_PA) identifies devices on that loop.
For specifics regarding FC-AL, you can consult the state diagrams in FC-AL-2
(www.t11.org).This section provides the basics on how the arbitrated loop protocol works. It is not important that as a system administrator or end user you
understand the protocol in detail. However, it is important to understand the concepts, because it will make diagnosing problems in your SAN faster and easier.
The third topology is the switched fabric configuration, which enables you to
connect a large number of devices. A switched fabric topology is sometimes
referred to as a point-to-point topology as well.The switched fabric topology is
easily scalable, allowing devices to be added and removed with little disruption to
the rest of the attached nodes. A switched topology allows more efficient use of
bandwidth by using circuits in the switches to route paths between nodes, as
opposed to arbitrated loop where there is one path between a set of nodes on
the loop.
Information is transferred in frames, which contain a header and a payload.The
header contains routing information. It specifies where the frame came from,
what kind of frame it is, and where it is going. Frames start with a primitive Start
Of Frame (SOF), which indicates the class of service the frame is being transmitted in, specifying the connection type.The class of service, discussed in detail
later in the chapter, is a set of universal rules for nodes handling the frame of that
type. Classes of service handle tasks such as frame acknowledgment and transfer
verification.
Information transfer in Fibre Channel is analogous to writing a paper. A total
message or idea is broken down into parts. Like a paragraph made up of sentences, it is a collection of related information composed of sequences. A
sequence (sentence) is a collection of frames (words) that fit together logically. In
Fibre Channel there is punctuation around the “words” as well. Frames start with
39
140_SANs_02
40
8/14/01
1:33 PM
Page 40
Chapter 2 • Fibre Channel Basics
an SOF and end with an End Of Frame (EOF).There are multiple fields in a
frame—the first six words, fields zero through five, are mandatory and are called a
frame header.The frame header specifies the source of the frame, the destination of
the frame, and what type of frame it is.The payload contains the data from the
ULP frame.
Fibre Channel Levels
When discussing Fibre Channel, it is usually easiest to break down the technology into a number of levels.You can also use the level approach when trying
to debug a situation where something is broken.You can first determine the
failure level, and work with the components of that level. In this section, we
abstract the Fibre Channel protocol by dividing it into five layers and analyzing
how each layer interacts with the other layers.There are five Fibre Channel
layers, designated FC-0 to FC-4.The layers include all aspects of the technology,
from the physical media to the ULPs that are transported on Fibre Channel.
Figure 2.3 is a commonly used visual aid to help envision how the different
layers interact.
Figure 2.3 Fibre Channel Layers
FC-4
Fibre Channel Upper Level Protocol (ULP) Mappings
FC-3
Fibre Channel Common Services
FC-2
Fibre Channel Framing and Flow Control
FC-1
Fibre Channel Encode and Decode
FC-0
Fibre Channel Physical Media
The FC-0 layer is the lowest-level layer; we have already seen most of the
components of this layer by looking at the different media types and connectors
involved in creating a Fibre Channel connection.The FC-0 layer specifies how
light is transmitted over fiber and how transmitters and receivers work for all
media types.This layer deals with the physics of transmitting and receiving a
signal at different transfer rates.When there is a problem with a GBIC or fiber
line, you know you have an FC-0 level problem. Most of the work being done at
the FC-0 level is electrical engineering work in designing transmitter and
receiver components.
140_SANs_02
8/14/01
1:33 PM
Page 41
Fibre Channel Basics • Chapter 2
The FC-1 layer is the signal encoding and decoding layer.When you consider
the FC-0 and FC-1 layers together, they are generally referred to as the signaling
interface.This layer is responsible for taking the serialized signal and encoding it
into data characters you can use.The FC-1 layer uses 8b/10b encoding.This
means that for each 10 bits sent, you get 8 bits of actual data—the other bits are
parity bits.The bits are encoded into two kinds of characters: K characters and D
characters. In Fibre Channel, all primitives (LIP, SOF, OPN, CLS, IDLE, and so
on) are delimited by a K character, which is often referred to as a special character.
Data characters are used to provide all the other 8-bit values.
The FC-2 layer is the Fibre Channel protocol level, which is responsible for
framing and flow control.The protocol level operates on primitives that are
encoded from the FC-1 level as a special character followed by three data characters (D characters). Primitives drive the state machines that control things like
arbitration, loop initialization, and data-carrying frames.The FC-2 layer is where
the firmware embedded on chips in your Fibre Channel devices is active.The
FC-2 layer controls the flow of data by sending the correct primitives to initiate
transfers. Although the FC-2 layer sends the frames, the payload of the frames is
not part of the FC-2 layer.The FC-2 layer is responsible for correctly filling in
the frame headers that are responsible for routing the frames.
The FC-3 layer is the Fibre Channel common services layer. An example of
common services is the Name Server, which provides to requestors the addresses
of other fabric-connected devices. Fabric servers are necessary to provide centralized resources to all attached nodes. Although there might be server agents on
each individual switch, all the Name Server agents will share their information
through a switch protocol, which makes the Name Server on each switch identical to every other Name Server.This creates the illusion of a single Name
Server.This concept is called distributed fabric services, and the same theory is
applied to all servers, like the Time Server, which synchronizes with all of the
other switches as well.The behavior of all servers that can be implemented in a
distributed fabric is specified in FC-GS-3, a Fibre Channel standard that specifies
generic service.
FC-4 is the Fibre Channel ULP mappings layer.This layer specifies how
ULPs like SCSI, IP, HiPPI, IPI-3, and ATM can be carried over a Fibre Channel
conduit.The most commonly transported protocol is SCSI. SCSI Fibre Channel
Protocol (SCSI-FCP) is the standard that specifies how to encapsulate SCSI
frames in the Fibre Channel protocol.The FC-4 layer is responsible for making
sure that the ULP data or commands get broken down appropriately and packaged correctly in the Fibre Channel frames.The frame is then passed down to the
41
140_SANs_02
42
8/14/01
1:33 PM
Page 42
Chapter 2 • Fibre Channel Basics
FC-2 layer where the node might query an FC-3 server to obtain the destination
address for the frame.The FC-2 layer then adds this information into a header on
the frame and sends the frame to the FC-1 decoder, which breaks the frame into
bits that can be sent over the physical wire at the FC-0 level.
ULPs
The FC-4 layer specifies the mapping of different ULPs to Fibre Channel.To
recap, ULPs are the protocols that can be transferred over Fibre Channel. A wide
variety of protocols can be transported over Fibre Channel.The advantage for
most network protocols of mapping over Fibre Channel is increased speed. For
channel protocols, the advantage is added scalability and dynamic reconfiguration.
The following are some specifics on protocols and their mappings on Fibre
Channel:
■
Small Computer System Interface (SCSI) The most widely used
ULP on Fibre Channel networks. SCSI is a parallel interface standard
capable of speeds up to 80 MB/sec. SCSI devices can be chained
together to create a channel with multiple nodes. SCSI gains speed from
Fibre Channel since Fibre Channel operates at a base speed of 100
MB/sec. FCP is the name of the FC-4 protocol for SCSI.
■
Internet Protocol (IP) IP is a standard networking protocol. IP over
Fibre Channel is used for different reasons than Gigabit Ethernet, and
has many uses, such as offloading backup traffic, in-band access for managing devices, and so on. Fibre Channel allows faster transfer speeds than
most Ethernet technology.
■
Virtual Interface (VI) VI is a standard protocol defined for low-level
clustering communications over Fibre Channel.This protocol is used by
distributed databases, file systems, and other clustering applications to
efficiently transfer cluster information over a network between hosts.
■
Intelligent Peripheral Interface (IPI) IPI is an ANSI-defined standardized protocol for controlling peripherals from a host computer.The
IPI-3 is the level-three part of IPI that deals with packetized communication between a host and a peripheral device.
■
High-Performance Parallel Interface (HiPPI) HiPPI is a channel
used to transfer large amounts of data at 800 Mb/sec or more to supercomputers that have the processing power to use that much data at such
140_SANs_02
8/14/01
1:33 PM
Page 43
Fibre Channel Basics • Chapter 2
a fast rate.The data can be read either between a file system and processor, or between memories on separate systems to create parallel
machines. Fibre Channel is a good conduit in the point-to-point
topology since it can provide the speed at the FC-0 and FC-1 levels to
achieve the goals of HiPPI.
■
Fiber Distributed Data Interface (FDDI) FDDI is one of the first
protocols developed for fiber-optic technology. FDDI networks use
token passing and support transfer rates of up to 100 Mbit/sec. FDDI
networks were typically used as backbones for WANs, but are now being
commonly replaced by Fibre Channel and other high-speed Ethernet
technologies.
Classes of Service
Classes of service specify what mechanisms will be used for the transmission of
data. Different classes of service are used for different types of data. Flow control is
one mechanism that is specified by class of service. End-to-end flow control is when a
receiving port transfers a frame back to the sender to verify receipt of the transmitted frame.When the transmitter receives the acknowledge frame (ACK) back, it
is allowed to adjust its credit by one, so it can send another frame. Buffer-to-buffer
flow control is used between fabric ports and node ports, or two node ports, to indicate the maximum number of frames the device can receive. A Receiver Ready
(R_RDY) primitive signal is sent on the link to indicate that the device can receive
a frame. If a certain number of R_RDYs are sent, it indicates that the device has
enough buffer space to accept that number of frames. In addition to flow control,
classes of service also specify whether the connection is dedicated. In a connectiontype transfer, you cannot send frames that are not addressed to the dedicated
receiver. In addition, you cannot send frames in a class other than the connection
class.This guarantees that the connection can utilize the full bandwidth.
Class 1
Class 1 service is a dedicated connection class between a transmitter and a
receiver. Class 1 connections emulate the features of channel protocols. All packets
sent are acknowledged, meaning that an ACK frame is sent back for every frame
transmitted.The connection is dedicated, which means that the communicating
device uses the full bandwidth of the connection. No other devices can communicate with the connected devices as long as the Class 1 connection is open.
43
140_SANs_02
44
8/14/01
1:33 PM
Page 44
Chapter 2 • Fibre Channel Basics
In a Class 1 connection, since the connection is dedicated, you are assured that
frames will arrive in the order in which they were sent. Only end-to-end flow
control is used in a Class 1 connection. Class 1 connections are used for timecritical applications and for transferring streaming data such as sound or video.
Intermix is an optional service of devices that support Class 1 connections.
Intermix allows the transmitter to send Class 2 and Class 3 frames when there are
no Class 1 frames to be transmitted, allowing the device to use the bandwidth
more efficiently.The Class 2 or Class 3 frames cannot be sent to the same device
with which the Class 1 connection is established.
Class 2
Class 2 service provides a connectionless conduit between two ports.This means
that the frame is transferred to the switch, and the switch is responsible for
attempting delivery at the switch’s earliest convenience. Class 2 allows devices to
share all the available bandwidth. Class 2 frames use both buffer-to-buffer and
end-to-end flow control, so the transmitter will receive either a positive or negative acknowledgment of receiving a frame.This is an acknowledged class of service.
Class 3
Class 3 service is similar to Class 2, except there is no end–to-end confirmation
of the data transfer.This is the preferred class of service for SCSI, and therefore is
the class of service most often used in transfers over a SAN. Class 3 uses bufferto-buffer flow control, which is controlled on an FC-2 level using the R_RDY
primitive. Class 3 allows devices to share the bandwidth of the SAN. Class 3
allows devices to operate at full speed when there is little traffic, but causes the
bandwidth to be shared when there is heavy traffic. It is ideal for distributed
storage solutions like SANs.
Class 4
Class 4 service is a less common service but most similar to Class 1. In Class 4
service, the bandwidth is divided into Virtual Circuits (VCs). For this reason, Class
4 is known as a fractional bandwidth class of service.Within a VC, the bandwidth
allocated is guaranteed. A node can divide the bandwidth into a number of VCs
that share the connection.VCs can be established with a number of other ports.
In a Class 4 connection, both buffer-to-buffer and end-to-end flow control are
used. Frame ordering within VCs is guaranteed. For Class 4, intermix is required
with Class 2 and Class 3 frames.
140_SANs_02
8/14/01
1:33 PM
Page 45
Fibre Channel Basics • Chapter 2
Class F
Class F service is used for internal control and coordination of the fabric. Class F
frames can be sent only between switches, so all devices are instructed to ignore
them. Switches use Class F frames to coordinate services like the Name Server
and resolve transmission hierarchy.
Storage Network Topologies
As mentioned earlier, there are three primary types of topologies in Fibre
Channel: point-to-point, arbitrated loop, and switched fabric. Each topology is useful
for different purposes, and the topologies can be combined to form a SAN specific to the solution you need. Point-to-point connections limit you to two
devices, so you will generally use a point-to-point connection only when you
have two systems that need to talk to only each other at high transfer rates, or if
one of the devices is acting as a switch or bridge.The arbitrated loop topology is
useful for connecting many devices, but because only one device can arbitrate at
a time, it severely limits your bandwidth. In any large SAN, you will need to use
a switched topology as the backbone of your connection.There might be one or
many switches that form your fabric. However, just because your SAN incorporates a fabric topology does not mean that the other topologies cannot be integrated into the SAN as well. A fabric port can easily be connected as part of an
arbitrated loop. Point-to-point connections are established when a single device
is plugged into a switch port. Figure 2.4 is a diagram of a fictional SAN that
incorporates all the different topology features.
Point-to-Point Topology
Point-to-point connections in Fibre Channel are limited to a few specific situations.The primary use of the point-to-point topology is to connect devices
directly to switches or other bridge devices. Rarely would a target device and
initiating device be connected in a point-to-point topology.This is generally a
waste of resources; since Fibre Channel components are faster than all disks manufactured today, it is not likely that the disk they would be attaching could fully
utilize the bandwidth provided to it. Furthermore, rarely would a single host
system require data streamed at that rate or be able to process it.There are exceptions, particularly if Fibre Channel is being used to build parallel systems, like in
the case where HiPPI would be the ULP used for memory sharing.
45
140_SANs_02
46
8/14/01
1:33 PM
Page 46
Chapter 2 • Fibre Channel Basics
Figure 2.4 Fibre Channel Topologies
RAID
RAID
WAN
JBOD
IP / Fibre Channel Bridge
FABRIC
Switch
Switch
Switch
Switch
Inter-Switch
Link (ISL)
Workstation
FC/SCSI Bridge
Arbitrated
Loop
Host
SCSI Tape Library
JBOD
Host
In a point-to-point topology there is no addressing, since any data transmitted
is intended for the other device. A point-to-point topology can be set up by connecting device A’s transmit fiber into device B’s receive connector, and vice versa
(Figure 2.5). Point-to-point connections have a very simple initialization routine,
140_SANs_02
8/14/01
1:33 PM
Page 47
Fibre Channel Basics • Chapter 2
since no address assignment needs to be resolved. Point-to-point connections also
have the advantage of allowing the two devices the entire bandwidth of the line
at all times.
Figure 2.5 Point-to-Point Topology
Fibre Channel Device
Transmit
(tx)
Receive
(rx)
Receive
(rx)
Transmit
(tx)
Storage Device
Fibre Channel Arbitrated Loop (FC-AL) Topology
The arbitrated loop topology is a configuration used to connect up to 127
devices without a switch. Devices in an arbitrated loop are connected in a ring
formation.The transmit fiber of the upstream device goes into the receive port of
the downstream device.This is repeated around the loop until the first device
receives the transmit fiber of the last device (Figure 2.6).
Figure 2.6 Arbitrated Loop Topology
FC-AL Disk
FC-AL Disk
FC-AL Disk
FC-AL Disk
Host
FC-AL Disk
FC-AL Disk
FC-AL Disk
47
140_SANs_02
48
8/14/01
1:33 PM
Page 48
Chapter 2 • Fibre Channel Basics
The arbitrated loop initialization process discussed earlier is complicated.The
complexity is necessary in order to fairly assign all devices in the loop an AL_PA.
The initialization state machines are difficult to implement and because of this,
interoperability among different vendors’ products is a major issue in arbitrated
loop topology. By arbitrating for control of the loop, the arbitrated loop configuration allows all devices connected on the loop to share the bandwidth of a single
line. For this reason it is important to put devices on a loop that can afford to
suffer the performance degradation that sharing bandwidth entails.You might
consider a looped topology if the devices on the loop would rarely be accessed
concurrently. It is generally best to place a small group of storage devices on a
loop rather than hosts, which constantly require access to the loop. A loop configuration is also good for archiving across many drives that will be accessed
rarely, but then need to dump large quantities of data across the network, such as
automated backup systems.
Switched Fabric Topology
Fabrics allow you to expand your SAN as need dictates, and they allow thousands
of devices to be interconnected.The switched fabric topology is easily scalable,
allowing devices to be added and removed with little disruption to the rest of the
attached nodes.This is a distinct advantage over an arbitrated loop topology,
which requires a reinitialization of all nodes every time a node is added. Imagine
how unstable a network with hundreds of nodes would be if all devices were
reset every time a device was inserted.
Switches also allow more efficient use of bandwidth by using circuits in the
switches to route paths between nodes. In this way, many transfers can occur at
once using full bandwidth.This is also an advantage over the arbitrated loop
topology. Switches have two types of ports: F_Ports and FL_Ports. FL_Ports are
fabric loop ports—arbitrated loops are attached to these ports. F_Ports are fabric
ports to which point-to-point connections are established. Switches also have
E_Ports, which are used to connect to other switches. E_Ports communicate in
Class F frames to distribute information about the different servers and to set up
circuits for the passing of frames to the appropriate nodes over the fabric.
A fabric provides a way for devices to communicate with each other over
long distances. In order to find a port, the fabric needs an address for identification. Nodes attached to a fabric receive a 24-bit address.The address has the
format XXYYZZ and is carried in the Destination ID (D_ID) of frames
intended for the device and in the Source ID (S_ID) of frames sent from the
device. XX is the domain.This two-digit hexadecimal number refers to the physical
140_SANs_02
8/14/01
1:33 PM
Page 49
Fibre Channel Basics • Chapter 2
switch itself.YY is the area, which corresponds to the port on the switch to
which the device is attached. ZZ is the AL_PA, the address assigned to the device
during the arbitrated loop initialization process.This ZZ value is set to 0x00 for
point-to-point connections between the switch and the edge device. See Figure 2.7
for an illustration of switched fabric addressing. Brocade addressing is discussed
further in Chapter 7.
Figure 2.7 Switched Fabric Addressing
Storage Array
081600
F_Port
Host
011e00
F_Port
E_Port
FL_Port
Host
0a1602
FC-AL Disk
0514ef
Fabric Services
The switches that form a fabric save information about the devices connected
directly to them in databases.The switches also provide services for notifying
devices of changes on the fabric that affect the way the device functions.
49
140_SANs_02
50
8/14/01
1:33 PM
Page 50
Chapter 2 • Fibre Channel Basics
Switches distribute this information among themselves through Class F service frames. Switches exchange information in their servers so that all individual
switch servers contain the same information.This creates a singular fabric entity
and makes it appear that there is only one of each type of server. For instance, the
Name Server information is shared among all Name Servers on all attached
switches.This creates a distributed Name Server that has information about all
devices on the fabric. By distributing the servers, the switch structure becomes
transparent to attached nodes.
There are a number of fabric services defined in FC-GS-3 (Generic
Services).The Alias Server manages aliases for multicast groups and hunt groups.
The Time Server distributes time information for setting timers and expiration
times.The Key Distribution Server provides encryption keys for secure connections
between two nodes. In this section, we cover in detail the Login Server, Name
Server, Fabric/Switch Controller, Management Server, and Time Server.
Like nodes, fabric services have addresses, but the address of a fabric service
is a fixed value called a well-known address.Well-known addresses are reserved by
the standard.
Login Server
The fabric port is at well-known address FFFFFE. It is sometimes called the
Login Server because a device is required to send a Fabric Login (FLOGI) frame
to this port before it can communicate with the rest of the fabric. A port that
needs to connect to the fabric must log in with this server.The node sends a
FLOGI frame with the S_ID field filled in only for its AL_PA value.The Login
Server then sends a response with the D_ID field filled in with the device’s
AL_PA and newly assigned domain and area values (see the Switched Fabric
Topology section earlier in this chapter).
Name Server
Directory services can be accessed at well-known address FFFFFC.The Name
Server is the primary feature of directory services. It is a database used to store
information about devices attached to a fabric.The Name Server gets information from a device through the Port Login (PLOGI) frame at initialization and
through subsequent registration frames.The Name Server acts like a database—
entries can be looked up, added, or deleted. Nodes transmit request frames to the
Name Server and receive a response containing the information requested or
confirmation of the action requested. One of the most common requests is a
Request For Transfer (RFT_ID) frame. An RFT_ID is a request to Register
140_SANs_02
8/14/01
1:33 PM
Page 51
Fibre Channel Basics • Chapter 2
FC-4 types (ULPs). A device does this so that the Name Server has a record of
what type of device it is. Often, a host computer will send a request to return the
address of all devices that support a certain type of ULP, such as FCP.This way, a
host can find all the SCSI disks on the fabric. Another common request is a Get
All Next (GA_NXT).This request obtains all information about the next highest
node in the Name Server at the specified address.This command is useful for
devices that are trying to map the fabric, such as a fabric management utility, or
for devices that are trying to find appropriate hosts with which to begin transfers.
Fabric/Switch Controller
The Fabric Controller, at well-known address 0xFFFFFD, provides a state change
notification service to registered nodes, which notifies any device registered to
receive the service when a change in fabric topology occurs. Devices that use this
service are generally hosts that want to keep track of a number of storage targets.
A device registers for state change notification by transmitting a State Change
Registration frame (SCR) to the well-known address.When there is a change in
fabric topology, the Switch Controller transmits a Registered State Change
Notification (RSCN) frame to the device.The RSCN frame is simply a notification to the device that there has been a change. It is up to the device to query
the Name Server to assess the state of the fabric at this time.
Management Server
The Management Server provides information about the fabric without stipulation as to zone. A zone is a collection of nodes defined to reside in a zone set.
Multiple zones can be defined. Nodes within a zone are aware of other nodes
within that zone, but not of nodes outside their zone. For instance, a Name
Server query will not return information for nodes outside the requestor’s zone.
The Management Server provides a single access point for managing the fabric
as well as three services. First is the Fabric Configuration Server, which provides
information to management entities trying to discover the fabric topology.The
second service is the Unzoned Name Server, which provides access to Name Server
information for nodes within all zones.The final service is the Fabric Zone Server,
which allows management entities to control zone participation and access
present zone information.
51
140_SANs_02
52
8/14/01
1:33 PM
Page 52
Chapter 2 • Fibre Channel Basics
Time Server
The Time Server is provided so devices can maintain system time with each
other.The Time Server is accessed at well-known address identifier FFFFFB. A
client will send a Get_Time frame to the Time Server, which then responds with
a Get_Time_Response frame containing the time offset, in seconds.
Other Services
Switch manufacturers often provide many other common services, such as the
Alias Server, which acts like a Name Server to handle the aliases for multicast
groups. A multicast group is a group of nodes that receives data destined for a
multicast address.The Alias Server keeps a registry of all nodes belonging to a
multicast address, and also handles registration and deregistration of nodes from
multicast groups.The Alias Service is not involved in the routing of frames for
any group.
140_SANs_02
8/14/01
1:33 PM
Page 53
Fibre Channel Basics • Chapter 2
Summary
An exciting part of IT, SANs are allowing more users access to more data at faster
rates.The purpose of a SAN is to provide an infrastructure over which large
amounts of data can be transferred robustly between servers and storage devices
such as JBODs and RAID systems. SANs provide three key advantages: speed,
reliability, and scalability.
Fibre Channel is the primary SAN technology. Currently, the most popular
protocols used over Fibre Channel are FCP and IP. A SAN implemented using
the Fibre Channel protocol incorporates the benefits of a channeled connection
and a network.
A Fibre Channel SAN is constructed from initiating devices, switches, target
devices, hubs, repeaters, and bridges. A target device is a storage device on a SAN,
and there are many different types of storage devices, including tape drives,
JBODs, RAIDs, and IP targets. An initiating device is a device that actively seeks
out and interacts with target devices on the SAN. Examples are a server or workstation, and they are often called hosts. Switches create the foundation of your
Fibre Channel SAN and provide a high-speed interconnect for routing frames
from one device to another. Switches provide the linking capability of a SAN
over a wide distance, as well as additional ports for scalability.
Fibre Channel is most easily understood if it is broken down into its five
layers, which are labeled FC-0 to FC-4.The physical media is the FC-0 layer.
Fibre Channel transmits in 8b/10b-encoded characters.The signaling interface is
the FC-1 layer.This means that for each 10 bits of information transmitted, 8 bits
of information are received, which are encoded into a character. Four transmission characters make a transmission word. Primitives and transmission words are
at the FC-2 level. Primitives control the flow of frames on a Fibre Channel link.
A fabric provides certain services to the nodes attached to it—the services provided are part of the FC-3 layer, and include a Name Server,Time Server, Login
Server, and others. On a fabric, all services are conceptually distributed, meaning
that the same server provides service to all nodes, independent of direct switch
attachment. SCSI data mapped into Fibre Channel frames is the ULP mapping
referred to as the FC-4 layer.
There are three topology types for a SAN: point-to-point, arbitrated loop, and
switched fabric. Most often, your SAN will contain examples of all three topologies. Switched fabric (also called point-to-point) is used to connect single nodes
to a switch F_Port. Arbitrated loop is a topology used to connect a number of
53
140_SANs_02
54
8/14/01
1:33 PM
Page 54
Chapter 2 • Fibre Channel Basics
devices and can be connected to a switch through an FL_Port. Devices on an
arbitrated loop share the bandwidth of one line.
One or more interconnected switches are called a fabric. Switches distribute
data about devices connected to them among the entire fabric to provide distributed services.The switches that form a fabric save information about the
devices connected directly to them in databases.They also provide services for
notifying devices of changes on the fabric that affect the way the device functions.
Classes of service specify what mechanisms will be used for the transmission
of data. Different classes of service are used for different types of data. Class 1 service provides a dedicated connection using end-to-end flow control. Class 2 service is connectionless and uses end-to-end flow control. Class 3 is used for SCSI.
It uses buffer-to-buffer flow control and is connectionless. Class 4 service provides
fractional bandwidth connections.
Solutions Fast Track
The Architecture of SANs
; A Fibre Channel SAN provides the advantages of increased speed, relia-
bility, and scalability.
; Fibre Channel presently transmits at 1.0625 Gbit/sec over single- and
multimode optical and copper cabling.
; A SAN implemented using the Fibre Channel protocol incorporates the
benefits of a channeled connection and a network.
; A SAN is constructed from three primary types of elements: initiating
devices, switches, and target devices.
; A target device is a storage device on a SAN. Device enclosures like
tapes, JBODs, or RAIDs are the most common type of target device.
; An initiating device is a device that actively seeks out and interacts with
target devices on the SAN.
; Switches create the foundation of the Fibre Channel SAN. A group of
interconnected switches is called a fabric.
140_SANs_02
8/14/01
1:33 PM
Page 55
Fibre Channel Basics • Chapter 2
Fibre Channel Protocol Basics
; Fibre Channel is primarily used to transport the SCSI and IP protocols.
; Devices are identified by an 8-bit Arbitrated Loop Physical Address
(AL_PA) in an arbitrated loop topology, and a 24-bit address for
switched fabric connections.
; Frames start with a primitive Start Of Frame (SOF) and end with an
End Of Frame (EOF) primitive.
; There are five Fibre Channel layers, designated FC-0 through FC-4.
; The FC-0 layer is the physical media layer and includes the media
selection and connectors.
; The FC-1 layer is the signal encoding and decoding layer.The FC-1
layer uses 8b/10b encoding.
; The FC-2 layer is the Fibre Channel protocol layer.
; The FC-3 layer is the Fibre Channel common services layer.The ser-
vices are servers in a Fibre Channel fabric that manage connections
between devices connected remotely through the switched fabric.
; The FC-4 layer is the Fibre Channel ULP mappings layer.
Classes of Service
; Classes of service specify what mechanisms are required for transmission
of different types of data.
; Class 1—Acknowledged connection-oriented service.
; Class 2—Acknowledged connectionless service.
; Class 3—Unacknowledged connectionless service.
; Class 4—Fractional bandwidth connection-oriented service.
; Class F—Used for inter-switch communication.
55
140_SANs_02
56
8/14/01
1:33 PM
Page 56
Chapter 2 • Fibre Channel Basics
Storage Network Topologies
; There are three primary types of topologies in Fibre Channel: point-to-
point, arbitrated loop, and switched fabric (also called point-to-point).
; The primary use of the point-to-point topology is to connect devices
directly to switches or other bridge devices.
; The arbitrated loop topology allows up to 127 devices in a ring
formation to share the bandwidth of a single line without a switch.
; Fabrics allow thousands of devices to be interconnected.
; Switches have three types of ports. FL_Ports are fabric loop ports that
attach arbitrated loops to the fabric. F_Ports are fabric ports that connect
single devices to the fabric in a point-to-point topology. E_Ports connect
a switch to another switch.
; Fabric-attached devices have a three-part address.The first segment indi-
cates the physical switch, the second part indicates the physical port, and
the last part is the arbitrated loop address in a loop device or 0x00 for a
fabric device.
Fabric Services
; Switches exchange information in their servers so that all individual
switch servers contain the same information.This creates distributed
servers.
; The fabric port is used to log a device into the fabric.The response
frame from login assigns the device its 24-bit address.
; The Name Server is used as a database to register and store information
about all devices on the fabric.
; The Fabric Controller at well-known address 0xFFFFFD provides state
change notification service to registered nodes. State change notification
is a service that notifies devices when a change in fabric topology
occurs.
; The Management Server provides information about the fabric without
stipulation as to zone.
140_SANs_02
8/14/01
1:33 PM
Page 57
Fibre Channel Basics • Chapter 2
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: When should I use a hub rather than a switch?
A: Hubs can be used on small SANs to interconnect devices in an arbitrated
loop topology or connect two devices directly. Hubs should not be used
when there is more than one active host, since there will be more competition for the limited bandwidth of the loop.The more hosts that are added to
the loop, the less efficient it is, because a larger percentage of time is spent
arbitrating for control of the loop than actually transmitting data. Remember
that a switch creates circuits to maximize bandwidth, while all devices
plugged into a hub share the bandwidth of one line.
Q: How does Fibre Channel compare to SCSI in terms of performance?
A: SCSI LVD (wide) has a maximum transfer rate of 80 MB/sec, as opposed to
Fibre Channel’s 100 MB/sec.The advantage of using Fibre Channel over
SCSI is not entirely speed, however. Fibre Channel allows you the unique
opportunity to create a switched network with Fibre Channel devices. Not
only can you attach more devices together, but the performance is actually
increased as well. Fibre Channel also allows you to use fiber-optic cable as a
media type, which extends the area you can connect the devices by 10 km
per cable length.
Q: What are some other SAN technologies?
A: Right now, Fibre Channel is by far the most common protocol in the SAN
marketplace. No other technology has the ability to incorporate the aspects of
networks and channels in the way Fibre Channel does. In the past, FDDI was
a popular technology that used a loop configuration similar to Fibre
Channel’s arbitrated loop. Some emerging technologies in the SAN industry
are InfiniBand and Gigabit Ethernet.
57
140_SANs_02
58
8/14/01
1:33 PM
Page 58
Chapter 2 • Fibre Channel Basics
Q: How do I know if my new device will interoperate properly with my
existing SAN?
A: It is always difficult to know. However, devices are getting dramatically better
at working across multivendor switched fabrics.The Fibre Channel Industry
Association (FCIA) has also started an initiative to document common interoperability problems and develop testing specification documents to determine whether a device contains interoperability bugs.The SANmark Program
has been active for a little over a year, and devices can now be certified as
SANmark-compliant. Devices that pass these tests are probably the most
interoperable devices.
140_SANs_03
8/14/01
1:35 PM
Page 59
Chapter 3
SAN Components
and Equipment
Solutions in this chapter:
■
Overview of Fibre Channel Equipment
■
Cabling and GBICs
■
Using Hubs
■
Using Switches and Fibre Channel Fabrics
■
Connecting Legacy Devices Into Your SAN
■
Bridging and Routing to IP Networks
and Beyond
■
Fibre Channel Storage
; Summary
; Solutions Fast Track
; Frequently Asked Questions
59
140_SANs_03
60
8/14/01
1:35 PM
Page 60
Chapter 3 • SAN Components and Equipment
Introduction
Whether you are building a small or large network, one aspect of a robust Fibre
Channel deployment is the SAN components used to build your solution. By
understanding the different components, features of those components, and how
they are best used, you can plan and deploy a reliable, scalable network. Upfront
qualification, testing, and selection of equipment are important pieces of making
your SAN deployment work. Understanding which features you will be using in
your equipment will help guide your testing process.This chapter discusses the
different components of a SAN and their major features, and guides you in
selecting the right equipment for the job.
Fibre Channel has its own various types of connectors and media, including
both optical and copper interfaces, and varied ways to connect between the
different kinds of media. Fibre Channel uses fiber-optic and high-speed copper
media to bring together the speed and reliability of a channeled technology, with
the scalability of networking technologies.This is the perfect medium for transporting large amounts of data quickly to many different nodes across a network.
The standardized use of Gigabit Interface Connectors (GBICs) has made
switching between media types simple and easy, and mixed-media networks are
standard.This chapter explains the different connector and cabling options, and
how to select the right one for your application. It also covers the kinds of network topologies you can implement and why.
Hubs, switches, Host Bus Adapters (HBAs), and storage make up the components of a SAN. Hubs serve as the center of simple Fibre Channel-Arbitrated Loop
(FC-AL) configurations, and range from simple unmanaged hubs to more intelligent
managed hubs capable of switching frames between ports but not acting as switched
fabrics. For more reliable, manageable, and scalable networks, Fibre Channel
switches are used instead of hubs. Switches scale between as few as eight ports to
64 or more ports, and form the core of a switched fabric. HBAs serve as the entry
point into the SAN from your servers and hosts, providing translation of Small
Computer Systems Interface (SCSI) information from the operating system to
Fibre Channel addresses on the network. High-capacity storage systems can contain
petabytes of data and form the core of the data storage infrastructure of your
storage network. Finally, routers and bridges enable you to move data between
legacy SCSI components and Fibre Channel, as well as to networks based on
Gigabit Ethernet, Asynchronous Transfer Mode (ATM), and Dense Wave Division
Multiplexing (DWDM).
This chapter reviews the hardware components that make up a SAN, explains
the major features and functionality of each, and describes the tools and techniques
140_SANs_03
8/14/01
1:35 PM
Page 61
SAN Components and Equipment • Chapter 3
available to manage each piece.We will guide you through the features to look
for on each component and how to use these features.
Overview of Fibre Channel Equipment
Fibre Channel shares much of the same terminology as Ethernet networking
with hubs, switches, network interface cards, and routers all representing a part of
the network infrastructure. However, although the names are often the same, the
way they work is quite different. In the development of the Fibre Channel
industry, many of the terms were borrowed from more familiar networking terminology, even though in actual practice the functionality has changed. For
example, in Ethernet a hub exists mostly as an unmanaged electrical device that
allows multiple Ethernet connections to connect to a single point, with all connections seeing the same network traffic. In Fibre Channel, a hub connects each
port to the one next to it in a circuit. It is important not to confuse the Ethernet
use of the terminology with the Fibre Channel terminology and usage. Figure
3.1 identifies the components of a SAN.
Cabling and Media
Many characteristics of your SAN are determined by the physical wiring plan of
your network.The type of media you select impacts the scalability and functionality of your SAN.This chapter discusses the options for choosing a physical
media, including the advantages and disadvantages of different types of fiber-optic
cabling and the choice between fiber and copper cables.
Types of media we discuss in this chapter include:
■
Copper (Shielded Twisted Pair [STP])
■
Multimode optical
■
Single-mode optical
GBICs and Connectors
The cabling and GBICs section in this chapter is dedicated to familiarizing you
with the different types of physical connectors used to connect devices and terminate cable.Your selection of a fixed connector versus a GBIC affects the capability of your SAN to adapt to new devices and support legacy connector types.
The choice of connector type can also affect your ability to add to your SAN in
the future. Different types of connectors have different considerations and might
61
140_SANs_03
62
8/14/01
1:35 PM
Page 62
Chapter 3 • SAN Components and Equipment
require special handling to use correctly.Types of connectors we discuss in this
section include:
■
DB9 (copper)
■
HSSDC (copper)
■
SC (optical)
■
High-density optical connectors (Small Form Factor Pluggable [SFP]):
—MT-RJ
—LC
Figure 3.1 Different SAN Components in a Network
JBOD
RAID
RAID
Remote
SAN
Hub
DWDM
Remote
Host
Switch
Switch
Switch
Switch
SAN
FC-to-ATM Bridge
Host
FC/SCSI
Bridge
Host
Host
HBA
HBA
SCSI Tape Library
140_SANs_03
8/14/01
1:35 PM
Page 63
SAN Components and Equipment • Chapter 3
Hubs
In Fibre Channel, hubs serve at a very basic level as electrical connections between
the different ports and are used only in FC-AL configurations. Hubs originally
started out as simple electrical devices, which, if a cable was attached to a port,
completed the electrical loop between ports. If a signal is detected, a hub will
complete the circuit and pass traffic through the attached wire. A simple resilientloop circuit is used to make sure that connections are maintained through unused
ports. A hub in this sense is a simplification of cabling that reduces the need for
separate transmit and receive wires to all devices in a system (one of the original
ways to connect Fibre Channel equipment). Intelligent managed hubs provide
basic functionality but add more sophisticated error and fault detection, switched
frames, and additional features for managing loop environments.
Hubs support the use of Fibre Channel to connect up to 127 devices in a
loop. Due to the complexity of looped environments and available bandwidth,
the number of devices is generally significantly less than the maximum of 127.
Also, while loops of 127 devices are theoretically possible, but impractical, networks larger than 127 devices could not be built due to address space limitations.
Switches
Fibre Channel switches, unlike hubs, are primarily used in Fibre Channel
switched fabric installations. Instead of a loop, where traffic is passed between all
nodes (a shared bandwidth and error segment architecture), the Fibre Channel
fabric instead routes frames directly from initiators and targets across a full-bandwidth fabric.This means that each connection across a fabric can exist independently of every other connection. Switches, which can range from as few as eight
ports to 64 ports or more, contain sophisticated switching hardware used to route
frames from any port to any other port. In addition, switches can also be cascaded
through Inter-Switch Links (E_Ports), which allow fabrics to extend to thousands
of nodes and up to a current limit of 239 switches.
Switches add the intelligence of fabric services such as name services and
management services, and provide a more robust protocol set for connecting
devices. Switches are used in almost all environments to provide a reliable mechanism for connecting hosts to storage and are a necessity for environments with
multiple initiators or more than just a few devices. Fibre Channel switches are
the foundation upon which the rest of the SAN infrastructure is built.
63
140_SANs_03
64
8/14/01
1:35 PM
Page 64
Chapter 3 • SAN Components and Equipment
Storage
Fibre Channel storage is a key component of a Fibre Channel network, providing
the shared storage resources that can be accessed through your SAN. Storage is
usually the area of focus for your SAN, except in the case where the network is
being used exclusively for IP or Virtual Interface (VI) traffic.
Fibre Channel storage ranges from single disk drives that support dual-port
Fibre Channel connections, to banks of disk drives called a Just A Bunch Of
Disks (JBOD) wired together into a cabinet, to more sophisticated Redundant
Array of Independent Disks (RAID) storage devices with hundreds of gigabytes
of capacity, and finally to enterprise-level storage subsystems that contain a terabyte or more of data.
Host Bus Adapters
HBAs are used to connect servers and hosts to the Fibre Channel network.
Analogous to Network Interface Cards (NICs), the term host bus adapter comes
from their use of connecting servers to the SCSI bus. HBAs consist of hardware
and drivers, which interface with operating systems to represent Fibre Channel
storage as devices in the operating system. HBAs are the gateway to accessing
your SAN.
HBAs typically plug into a host’s bus (for example, PCI or Sbus), although
some HBAs might be embedded on the motherboard and translate signals on the
local computer to frames on a Fibre Channel network. A key part of this process
is the driver, which controls your Fibre Channel HBA and determines how the
device behaves with the operating system as well as the general Fibre Channel
network. Unlike typical NICs, Fibre Channel HBAs tend to be far more intelligent than the standard network card, providing for negotiation with switches and
tracking devices that are attached to the network. Robust software and hardware
functionality enable these components to offload I/O processing from the host,
monitor network configurations, and support load balancing and failover
capabilities.
Routers and Bridges
Routers in the Fibre Channel sense do not serve the same purpose as routers in
the networking world. Instead, Fibre Channel routers act as bridges, translating
Fibre Channel frames to other types of transports.The most common routers
translate between legacy SCSI connections, representing a SCSI bus as a number
140_SANs_03
8/14/01
1:35 PM
Page 65
SAN Components and Equipment • Chapter 3
of individual Logical Unit Numbers (LUNs) behind a Fibre Channel port. SCSIto-Fibre Channel routers are frequently used to connect SCSI tape devices to
Fibre Channel. It is possible that true Fibre Channel routers will emerge in the
future, which might cause some confusion.
Other types of bridges include Fibre Channel-to-Gigabit Ethernet bridges,
which typically bridge IP frames from Gigabit Ethernet to Fibre Channel, and
Fibre Channel to DWDM or ATM bridges, which transport full Fibre Channel
frames across DWDM or ATM technology for extended fabrics that span from
several kilometers to several hundred kilometers.
Cabling and GBICs
The most basic layer of your SAN is the physical layer, which includes your media
and connector choices.These choices depend on the primary purpose of your
SAN. Like most technologies, improvements are happening every day, so we will
highlight the connectors and components that are most popular today.
You must consider a number of factors as you put together the physical layer
of your SAN. One factor is distance, or how far you must connect two points of
interest. Next, you need to consider your existing architecture: if there is
already fiber or copper that can be used, determine if it is compatible with the
components you would like to add.You also need to consider scalability, so your
SAN will be easily upgradeable to allow devices to be added and removed with
a minimal amount of added materials.The final consideration is cost.What
components are the least expensive, and what are their advantages and disadvantages? This section provides information on specific media and connectors, so that
you can assemble a cost-effective SAN that is efficient, reliable, and scalable.
Copper Versus Optical: Selecting Your Media
In selecting the type of media to use for your SAN, you have two primary
choices: copper and optical.The distinct advantage of copper is that it is inexpensive
compared to all types of optical.The advantage of optical fiber is that it provides
a reliable signal over a longer distance than copper.The choice between the two
types of optical fiber (multimode and single mode) is also one of distance and
cost.There are no speed differences between any of the media types.
Copper Cabling
Copper has the advantage of being the least expensive media by which to connect
components of your SAN. Copper is generally 150-ohm shielded twisted pair,
65
140_SANs_03
66
8/14/01
1:35 PM
Page 66
Chapter 3 • SAN Components and Equipment
although 75-ohm video cable and mini-coaxial cable are also used. Copper cabling,
when used at a rate of 100 MB/sec, has an effective range of 0 to 25 meters
without sacrificing quality.Transmitting at half speed and quarter speed increases
the effective distance of transmission. However, few companies manufacture half- or
quarter-speed products. Copper is usually terminated with either an HSSDC or
DB-9 male connector (on the DB-9 connector, the end with the pins is male).
Although at this time DB-9 is a more common connector, HSSDC is quickly
becoming more popular. See Table 3.1 for a comparison of media specification.
Table 3.1 Copper Media Type Comparison Chart
Media Type
Shielded Twisted
Shielded Twisted
Video Cable
Shielded Twisted
(Active)
Video Cable
Shielded Twisted
(Active)
Video Cable
Speed (MB/sec) Optimal Distance (Meters)
Pair (Active)
Pair (Passive)
100
100
100
0–30
0–15
0–25
200
200
0–10
0–10
50
50
0–40
0–40
Pair
Pair
Values in table are estimated lengths based on optimal signal strength.
Copper is highly durable and easy to store, which makes it useful for a lab or
area where devices are commonly plugged and unplugged, or when you are constantly connecting and disconnecting a device over a short distance to a number of
different hosts. An advantage to using the DB-9 and HSSDC copper connectors is
that there is only one way they fit into the complementary connector, which
means it is impossible to cross a transmit and receive line, a common mistake for
even experienced individuals dealing with fiber optics. Optical cabling is harder to
terminate and can be susceptible to scratches. In addition, copper is a better choice
in the cabinet short-length connection. For lengths longer than five meters, single
or multimode optical fiber might be a better choice.
Multimode Optical Cabling
Multimode optical cable is available in 50 micrometers (µm) and 62.5 µm sizes.
These measurements correspond to the diameter of the fiber—there is no speed
difference between the two that affects Fibre Channel. Multimode optical cable is
140_SANs_03
8/14/01
1:35 PM
Page 67
SAN Components and Equipment • Chapter 3
available in 850 nanometer (nm) and 1300 nm wavelengths.The 850 nm wavelength is within the visible spectrum and is not harmful to your eyes.This is not
true of 1300 nm wavelength lasers, which are not visible but could severely
damage your retinas. Multimode optical fiber is terminated using a variety of
optical connectors, including SC, LC, and MT-RJ (we discuss connector types
later in the chapter). A 50 µm multimode fiber has an effective range between 0
and 500 meters at a 1 Gbit/sec rate (see Table 3.2 for specifications on other
multimode fibers).The 62.5 µm fiber has about half the range of 50 µm fiber.
Multimode fiber is the more common media type and is inexpensive compared to single-mode fiber, although the two are coming closer together in price
as the demand for single mode rises. Multimode transmitting and receiving components are also much less expensive, because multimode generally uses a concentrated LED rather than an actual laser.This is because multimode fiber is
much wider in diameter than single-mode fiber.
Many multimode fibers have a feature called Open Fiber Control (OFC),
which is a feature of the transmitter and receiver pairs. In OFC, the transmitter
periodically transmits short bursts of light.When the receiver detects this light, it
begins to transmit regularly and causes the other transmitter to go out of OFC
mode.The OFC mechanism was designed to avoid the potential hazards of
having unconnected lasers transmitting in a work environment. OFC is becoming
a less common feature, since most multimode transmitters use LEDs rather than
lasers and there is no associated safety risk.
Table 3.2 Multimode Optical Media Comparison
Media Type
Laser/ LED
Type (nm)
Speed
(MB/sec)
Optimal Distance
(Meters)
50 µm multimode
62.5 µm multimode
50 µm multimode
62.5 µm multimode
50 µm multimode
62.5 µm multimode
850
850
850
850
850
850
100
100
200
200
50
50
2–500
2–300
2–300
2–90
2–1000
2–400
Values in table are estimated lengths based on optimal signal strength.
67
140_SANs_03
68
8/14/01
1:35 PM
Page 68
Chapter 3 • SAN Components and Equipment
Single-Mode Optical Cabling
Single-mode optical fiber (Figure 3.2) is the most expensive media type, but
preferable for long distances. It most often comes in 1300 nm wavelength, which
is not visible and can be harmful to your eyes.
Figure 3.2 Single-Mode Fiber with SC Terminators
Single-mode optical fiber is approximately 9 µm in diameter.The small diameter makes light waves less likely to be altered over long distances, so for longdistance SANs, single-mode fiber is the best solution. Because of its small diameter, it also theoretically has the highest transmission speed potential (the theoretical limit is around 25 Tbit/sec, as opposed to multimode, which is around 10
Gbit/sec). Single mode is the ideal media to use for long interconnects.
Single-mode fiber itself is not significantly more expensive than multimode
fiber or even copper—the added price is in the transmitting components, which
use lasers rather than LEDs. Since the fiber has such a small diameter, it takes
added precision to align the laser in the transmitter with the fiber. See Table 3.3
for specifications on single-mode fibers.
140_SANs_03
8/14/01
1:35 PM
Page 69
SAN Components and Equipment • Chapter 3
Table 3.3 Single-Mode Optical Media Comparison
Media Type
Laser/LED
Type (nm)
Speed
(Mb/sec)
Optimal Distance
(Meters)
9 µm single mode
9 µm single mode
9 µm single mode
1300
1300
1300
100
50
200
2–10,000
2–10,000
2–2,000
Values in table are estimated lengths based on optimal signal strength.
WARNING
Any single-mode or multimode laser can damage your eyes if it is transmitted at 1300 nm. The 1300 nm wavelength is not in the visible spectrum, so you will not see a laser being transmitted like in 850 nm fiber. A
1300 nm laser is dangerous, because it can cause severe retina damage.
Connecting with Connectors
There are many different types of connectors, and no particular connector makes
a difference in performance as long as the connection is clean. Some connectors
are bonded, which means that the transmit and receive fibers are physically
mounted in the same piece of plastic.This is usually acceptable, but for some less
orthodox wiring systems it might be preferable to select connectors that have
loose transmit and receive fibers.This section covers the most well-known types
of connectors.
You should try to minimize the total number of connections and patches
when building your SAN. As discussed earlier, light is reflected by poor connections and patches in the path between devices, so minimizing the number of
patches between devices makes your SAN less susceptible to loss-of-signal errors.
The DB-9 Copper Connector
DB-9 is the standard copper connector, although more organizations are switching
to HSSDC because of its improved reliability and smaller size. DB-9 has the same
appearance as DB-9 serial cabling, so it is important to understand that they are
not the same (Figure 3.3). DB-9 connectors have a metal D-shaped connector rim
69
140_SANs_03
70
8/14/01
1:35 PM
Page 70
Chapter 3 • SAN Components and Equipment
with 9-pin sockets on the female end and either four or eight pins on the male
end, rather than all nine used in serial cabling. Currently, DB-9 terminated cable is
less expensive than HSSDC terminated cable.
Figure 3.3 DB-9 Copper Connector
Copper cabling is available in two types: passive copper and active copper. Passive
DB-9 has four pins (two for transmitting and two for receiving) and, like HSSDC,
is used to terminate shielded twisted pair. In active copper cabling, four pins of a
DB-9 connector are used to transmit power in addition to the two pairs that are
used for transmit and receive. Active copper lines get twice the distance of passive
copper lines. Both active and passive type DB-9 connectors are equally priced.
Again, it is important when purchasing DB-9 cables not to confuse the connector with DB-9 serial cabling.The resistance between the two is not the same
and can severely damage your equipment.
The HSSDC Copper Connector
The HSSDC, shown in Figure 3.4, is starting to replace DB-9 connectors on
some HBAs.The most probable reason is that they are smaller than the DB-9
connectors, so more can fit on a single interface card.The HSSDC connector
uses a single plastic squeeze lock, so it is easy to insert and remove.The HSSDC
connector was specifically designed as a Gigabit copper connector, by improving
density and performance over the DB-9 style connector.
140_SANs_03
8/14/01
1:35 PM
Page 71
SAN Components and Equipment • Chapter 3
Figure 3.4 HSSDC Copper Connector
Looking Forward
The InfiniBand architecture, a widely supported and fast-moving effort,
uses a type of copper connector called HSSDC-2, as well as the same
types of cabling media as Fibre Channel. The InfiniBand protocol is
designed as a replacement for the PCI bus on server systems that need
greater width for I/O. The goal is to avoid the impact on the bus of a
server by directing data to the appropriate channel in the server.
InfiniBand will be able to encapsulate many protocols, including FCP, in
order to transfer Fibre Channel or other ULPs to the appropriate
adapters; however, this should be transparent to the SAN architecture.
With devices already coming to market, the ability to include InfiniBand
servers on your SAN might be a consideration for expansion.
The SC Optical Connector
The SC connector, shown in Figure 3.5, is probably the most widely used optical
connector.The SC connector has been commonly used to replace the ST connector, which at one time was widely used with legacy fiber technologies.The
SC connector is a square plastic block containing a glass housing for the fiber.
The plastic fits snugly in the connector slot on the board or GBIC. SC connectors are used to terminate single or multimode fiber.
SC connectors can be either bonded or unbonded, and come in either single
or duplex quantities. A single-quantity SC patch cord is a piece of single or
multimode fiber terminated at both ends with one SC connector. A duplex
quantity is a pair of fibers, one for transmitting and one for receiving.The plastic
insulation on the fiber is molded together to provide a transmit/receive pair.
There are a total of four SC connectors, two at each end of the fiber.The SC
71
140_SANs_03
72
8/14/01
1:35 PM
Page 72
Chapter 3 • SAN Components and Equipment
connectors on duplex fiber might be bonded together, meaning that the SC connector pairs are made of one piece of (or bonded) plastic. Bonded connectors
have the advantage of reducing your ability to plug the transmit fiber into the
transmit slot on the GBIC or port.
Figure 3.5 Unbonded and Bonded SC Optical Connectors
High-Density Fiber-Optic Connectors
High-density connectors represent the next generation of fiber-optic connectors.
High-density connectors are designed to be small to allow more connections in a
small space, which might be the back of a PCI adapter card or the faceplate of a
hub or switch. As the Fibre Channel protocol develops, you can connect more
nodes reliably, and you will need to have space to make those physical connections.The most popular types of high-density connectors are the LC and MT-RJ
connectors. Neither type uses any new optical technology. In fact, the connectors
use the same multimode and single-mode fiber that SC connectors use.The difference is entirely in the piece of plastic in which the fibers are housed. HSSDC
copper connectors, discussed earlier in this section, are a high-density-type connector.The HSSDC connectors are designed to accomodate more copper ports
on HBAs and switch and hub faceplates.
LC connectors are bonded pairs of miniature connectors.The design of the
plastic pieces looks like a small, elongated version of the SC connector. However,
the LC pair is comparable to the width of a single SC connector.The MT-RJ
uses a single terminator for pairs of fibers.The plastic design is similar to a miniaturized HSSDC without the copper contacts on top of the housing. Instead, the
140_SANs_03
8/14/01
1:35 PM
Page 73
SAN Components and Equipment • Chapter 3
fiber transmits and receives out of two pinholes in the end of the plastic housing.
MT-RJ connectors are also about the width of a single SC connector.
Comparing GBICs to Fixed Media
GBICs are removable transceivers used in all types of Fibre Channel devices,
including switches, hubs, and HBAs. They are used widely in Fibre Channel and
other network technologies. GBICs offer the option of interfacing with almost all
types of connectors. A GBIC fits into a GBIC port on a device. A large percentage
of Fibre Channel devices have a GBIC slot rather than a fixed media type port.
GBICs convert the electrical signal generated from the device into the appropriate
signal for transmission, depending on the type of connection the GBIC was
designed to make. GBICs can convert the device’s electrical signal to a signal that
is appropriate for single-mode fiber, copper (either HSSDC or DB-9), and multimode fiber. GBICs have a variety of connector types (Figure 3.6) and can be used
to make your SAN connection types homogeneous.
Figure 3.6 SC, HSSDC, and DB-9 GBICs
Using a GBIC
Many devices have GBIC slots.Vendors provide them to make devices easier to
connect to a variety of media.The GBIC should slide easily into the GBIC slot.
73
140_SANs_03
74
8/14/01
1:35 PM
Page 74
Chapter 3 • SAN Components and Equipment
GBIC slots generally have a trap door, which flips up on insertion of the GBIC.
The GBIC should be inserted with the single socket pointed in and the connection end facing out. (If you meet with any resistance, do not force the GBIC,
since it is most likely upside down.)
GBICS come in a variety of types, including multimode, single mode,
HSSDC, and DB-9. It is important to remember to connect the right type of
media. For instance, multimode fiber will not work in a single-mode GBIC.You
will most likely need to use GBICs in a switch or hub.
Pros and Cons of Using GBICs
The advantage to using GBICs is the versatility they give devices to work within
a topology.With a range of GBICs, a device can be attached to any SAN. Using
GBICs, however, breaks a cardinal rule of networking, which is to minimize the
number of connections. Although GBICs, when working properly, do not
degrade signal, including a GBIC in the connection introduces another element
that can malfuntion. Although the newer GBICs are highly dependable, they tend
to be used over and over in different slots. Since connectors go through so many
insertions and removals, they tend to break more quickly than a fixed connection
could.When GBICs are used frequently over a long period of time, they become
less dependable. GBICs are also expensive—second only to fiber in being the
most often replaced piece of a SAN.
GBIC Ports on Equipment
Equipment might or might not have a GBIC port.Without a GBIC port, you are
limited to the type of connection on board, whereas if you have a GBIC port,
you have additional options. Switches and hubs almost always have GBIC ports.
HBAs and storage devices often have fixed media.
GBIC ports are becoming more common on devices now.This allows you to
choose your media type based on the location of your device and other device
specifications.With fixed media, however, if the vendor decided that a particular
type of port is appropriate, it might limit your options as far as the distance and
speed at which the device can be connected. Another drawback to a fixed port is
that a failure in that port requires the unit to be replaced.
Serialized Versus Nonserialized
Serial ID GBICs provide serial number, model numbers, and diagnostic data
through embedded Electrically Erasable Programmable Read-Only Memory
140_SANs_03
8/14/01
1:35 PM
Page 75
SAN Components and Equipment • Chapter 3
(EEPROM).This allows for better asset tracking and diagnosis of GBIC
problems.This feature is used by SAN management tools and is the only way to
see if you have a mixture of GBICs in your fabric. Serial ID is a very important
feature if you want to use Brocade Fabric Watch to monitor GBICs. See Chapter
4, “Overview of Brocade SilkWorm Switches and Features,” for further discussion
about Fabric Watch.
Common Problems with GBICs
GBICs are highly reliable devices but, as mentioned earlier, their pluggable functionality causes them to break more frequently than other components. If you are
careful and never force a GBIC, it should last a long time.When trying to diagnose a connection problem with a device, start by making sure your transmit and
receive wires are not crossed before deciding to replace the GBIC.
Another common problem with GBICs is finding appropriate GBICs for the
media type you are using. Use only single-mode GBICs with single-mode
devices, and only multimode GBICs with multimode devices. It is a common
mistake to plug a fiber into an already-inserted GBIC and assume it is the correct
mode for the type of fiber you are connecting. Also, be careful when using
various GBICs between equipment from different vendors. Although GBICs can
be used between devices, vendors often ship GBICs that have been fully qualified
and tested specifically with their equipment. It is usually best to stick with the
GBICs that are shipped with a product or provided by the manufacturer, since
this reduces the possibility of support issues with your equipment. Although
GBICs have their issues, it would be almost impossible to develop a scalable SAN
without them.
Media Interface Adapters
Media Interface Adapters (MIAs) convert copper signals to optical signals by
sitting between a copper port and generating a laser from the copper signal.
MIAs convert DB-9 copper connectors to optical SC connectors and are most
commonly used when a device with a fixed media copper port needs to be connected optically to the rest of the SAN. Since the maximum range of an active
copper line is 30 meters, using MIAs extends your connection range to 500
meters, the maximum distance for multimode fiber. MIAs are most commonly
used in this manner to connect legacy devices with fixed copper media.
You should carefully consider using MIAs. Using an MIA as an adapter adds a
connection, which significantly reduces signal quality.
75
140_SANs_03
76
8/14/01
1:35 PM
Page 76
Chapter 3 • SAN Components and Equipment
Using Hubs
Fibre Channel hubs are used to connect simple FC-AL environments. Hubs were
the original interconnect mechanism used for Fibre Channel, and provide connectivity between nodes in a loop. A hub connects ports, sending frames between individual ports but not routing them to other ports. Simple hubs do this electrically,
and more intelligent hubs might also switch frames through the loop. Switched
hubs do not implement a switched fabric protocol, but they still maintain the
FC-AL environment. As a result, they have most of the same reliability, scalability,
performance, and manageability limits of unmanaged hubs. For this reason,
switched fabric is becoming the dominant SAN technology. However, for low-end
installations, hubs can offer a less-expensive alternative to switches. If you need to
scale your SAN at some point in the future, consider buying an entry-level switch
instead of a hub.
This section briefly describes the different kinds of hubs and their major
features and how hubs are best used in your network.
Simple Electrical Hubs
Simple electrical hubs consist of a simple series of circuits that detect whether a
connection has been plugged into a port on the hub.Think of a hub as being a
“feature-rich” wire. A resilient-loop circuit simply completes a connection to
ensure that a loop is continuous throughout the hub. Simple hubs generally support only copper and do not include any software functionality. Although these
hubs are still available, they are used only in the simplest of configurations due to
their lack of fault tolerance, operational difficulty, shared bandwidth, and difficulty
in maintaining a stable loop when there is more than one initiator.
Managed Hubs
As Fibre Channel has evolved, manufacturers have found that just having simple
hubs does not address problems of stability and manageability in a network.
Managed hubs were designed in response to these issues. Managed hubs, unlike
their simple electrical predecessors, do not just connect wires from port to port.
Instead, they add more sophisticated functionality such as fault detection on
ports, settable port modes, and isolated loop operation. More advanced hubs
enhance performance or usability with frame switching capabilities between
ports, privately routed frames between initiators and targets (rather than having all
nodes pass traffic through the entire loop), and advanced diagnostic capabilities.
140_SANs_03
8/14/01
1:35 PM
Page 77
SAN Components and Equipment • Chapter 3
A typical managed hub operates much like a simple electrical hub, by connecting adjacent ports in a continuous electrical circuit. However, managed hubs
add a fair amount of intelligence to monitoring the port, and might also contain
circuits that can interpret and modify frames received on the port. Instead of
blindly connecting two electrical paths, a managed hub might actually receive the
frames on a port and decide whether to transmit them further down the loop.
Another capability of these hubs is to filter out unwanted frames. For example,
when a marginal connection is sending invalid primitives into the line, a managed
hub will discard this frame. Managed hubs can also interpret a frame and send it
directly to a downstream port—essentially “switching” the packet and avoiding
the need for every node on the loop to see the frame. Managed hubs can also
segment a loop through software (loop zoning), and usually include intelligence
to handle the difficulties of managing Loop Initialization Primitive (LIP) conditions and loop bring up. LIPs are part of a loop initialization process and are
expected in a healthy and normal loop. However, certain conditions inherent to
loops create scenarios where LIPs cause an interruption to I/O or prevent
devices from effectively communicating on the loop.
Advanced managed hubs also avoid the problems of simple electrical hubs by
isolating initiators. Initiators on the loop can be configured to see and communicate only with specific storage devices and can be screened from other initiators.
This prevents some problems that can occur when initiators try to reset or otherwise communicate with each other.
The following is a list of the typical capabilities provided by managed hubs.
Each item is discussed in more detail in the sections that follow:
■
LIP isolation
■
Automatic port bypass
■
Signal retiming
■
Loop zoning
■
Web interface
■
Telnet
■
Port-event logging
■
SNMP support
LIP isolation is the capability of a hub to prevent LIPs from being transmitted
to all nodes in a loop. LIPs used to be the primary source of instability in an
77
140_SANs_03
78
8/14/01
1:35 PM
Page 78
Chapter 3 • SAN Components and Equipment
FC-AL configuration, due to the complexity of the FC-AL protocol.With LIP
isolation, LIPs are isolated from all other parts of the loop, preventing the disruption of traffic and avoiding many LIP protocol-related problems.
Automatic port bypass is the ability of a hub to automatically bypass a port if too
many errors have been detected. Software keeps track of the number and rate of
errors coming from devices, and if the error rate exceeds a certain, user-defined
threshold, it will automatically bypass a port.This helps to prevent the whole loop
from becoming unusable due to a single device having problems.When a device
in a loop experiences partial failure and is not bypassed, it normally will constantly
LIP.This essentially eliminates communication between other devices on a loop—
similar to somebody’s cell phone continually ringing during a meeting.
Signal retiming is the ability of hub hardware to clean up the signal received
from a device. Instead of just electrically passing a signal through the port
(including potential errors or noise on the line), a retimed port will take the signal
and re-encode it on the wire. Any errors will be removed from the signal and
noise removed from the line.
Managed hubs, unlike unmanaged hubs, also provide for manageability functions such as telnet, SNMP, serial ports, and port logging to monitor the ports.
This makes it easier to configure devices, diagnose problems, and view activity in
your FC-AL configuration.
Due to inherent limitations in FC-AL, hubs are being used less frequently in
Fibre Channel installations and are generally found only on low-end, low–portcount installations with only a few initiators. Early problems with instability in
Fibre Channel were due to the difficulties in implementing the FC-AL protocol.
Although managed hubs have reduced the problems, the inherent limitations of
the early technology have driven installations to migrate to fully switched fabric
switches, which are more reliable and provide a higher level of performance and
manageability.
LIP Service: Fibre Channel
LIPs, Problems, and Solutions
With hub technology and FC-AL, one of the hardest parts of maintaining a network is managing the LIP process. Because of the complexity of this process and
various early incompatibilities between equipment, the LIP process often resulted
in the instability of Fibre Channel installations. Note that LIPs were designed as,
and continue to be, a healthy aspect of FC-AL. Unfortunately, LIPs can also be
the cause of loop instability.
140_SANs_03
8/14/01
1:35 PM
Page 79
SAN Components and Equipment • Chapter 3
A typical problem occurs when nodes are added or removed from the loop,
either intentionally by an administrator, when any cable is either plugged or
unplugged, or when a device is powered up or down.When a change to an FCAL loop occurs, the LIP process starts and all of the devices on a loop stop whatever they are doing and renegotiate for addresses in the loop.When a node is
added or removed from a loop, nodes go into the LIP process. Each node passes a
frame through the loop with information used to determine which address to use
to send Fibre Channel frames to it—its device ID.
As you can imagine, this is a problem if traffic is being sent across the Fibre
Channel loop—any data that has been sent must be re-sent, drivers and software
must time out, and all transactions must be retried. Even worse, because of early
hardware incompatibilities and the complexity of the LIP process, sometimes the
process can take minutes for a loop to quiesce, and in rare cases the loop might
never settle and prevent transactions from continuing.
The term LIP storm was coined to describe what happens in these situations,
and an entire generation of managed hubs were designed to minimize or eliminate this problem.Today, with the newest hubs and switches operating in loop
mode, these problems are minimized.
Getting Out of the Loop:
Migrating to Switched Fabric
Because of the problems described in FC-AL environments, many installations
have been migrating out of loop environments into switched fabric. Loops scale
to only 127 devices, while switched fabrics support hundreds or even thousands
of devices. Because Fibre Channel switches inherently support loop devices, and
in fact implement a superset of the Fibre Channel loop protocol, migration is
fairly straightforward. By migrating to a switched fabric environment, the reliability problems, manageability problems, and bandwidth limitations of loops can
be eliminated fairly easily.
Brocade switches provide features called QuickLoop and Fabric Assist that
make it easy to migrate from a loop environment directly to switched fabric.
Note that it might be necessary to purchase a separate license for these features.
Hubs can be entirely replaced by switches and, in fact, some low-end Brocade
switches are positioned to directly replace hubs as a component (for instance, the
SilkWorm 2010 switch). Devices that cannot take advantage of switched fabrics
can still operate in private-loop mode, fabric-aware devices can operate in fabric
mode, and operation and capabilities can be maintained with only a simple
equipment swap.
79
140_SANs_03
80
8/14/01
1:35 PM
Page 80
Chapter 3 • SAN Components and Equipment
QuickLoop operates by setting up a virtual loop through switched ports.
Each of the ports in this case operates as if it were a hub port, but takes advantage
of the capabilities of the Brocade switch, including switching capabilities.
Using Switches and
Fibre Channel Fabrics
A Fibre Channel switch is logically positioned in the center of a SAN and is
connected to hosts, storage, or other switches.The fabric infrastructure can be
viewed as the foundation upon which the rest of the SAN is built.When a frame
arrives from a device, a switch accepts and then routes that frame to the proper
destination device. In fact, using the Brocade cut-through routing approach, a
frame can begin to be forwarded even before it has been completely received. A
fabric switch also contains a great deal of intelligence, providing services for
locating other nodes in a network (the Simple Name Server [SNS]), automatically establishing routes between other switches in the fabric, compartmentalizing
devices into zones (zoning), as well as monitoring and handling errors (basic
Brocade Fabric OS functions and Fabric Watch).We discuss fabric services further
in Chapter 2, “Fibre Channel Basics.”
Brocade switches also provide functionality that allows private loop devices to
participate in a fabric and translate the communication between fabric devices
and older private devices. In fact, the translative mode of operation for a port on
a Brocade switch will automatically allow any private target node (such as a private loop JBOD) to function fully as part of a fabric.This feature is a core piece
of the Fabric OS and does not require a license. Making this work for a private
HBA, on the other hand, requires QuickLoop and/or Fabric Assist options.
Basic Switch Types
Fibre Channel switches are often classified into different categories, depending on
capabilities and features. In many cases, the hardware might be based on the same
underlying architecture or Application-Specific Integrated Circuit (ASIC), but
the software features are variable and priced accordingly to meet the requirements of that class of switch.The exception is highly redundant “core-class”
switches, which tend to be developed on their own fault-tolerant hardware
platforms.This section covers the major categories of switches and explains the
differences between each kind of switch.
140_SANs_03
8/14/01
1:35 PM
Page 81
SAN Components and Equipment • Chapter 3
Entry-Level Switches
Entry-level switches are focused on small workgroups of eight to 16 ports, are
geared toward low cost, and deliver limited scalability and management.They tend
to be used to replace hubs and offer higher bandwidth and reliability. Entry-level
switches are often integrated into complete storage solutions rather than purchased
separately. Entry-level switches offer limited levels for cascading of switch ports.
Brocade entry-level switches can be upgraded with a license to handle more scalability or to add functionality such as zoning or Web management.
Scalable Fabric Switches
Fabric switches provide the ability to cascade switches together to create a larger
fabric. By connecting one or more ports between two switches, all of the ports
connected to either switch see one single image of the network, with any nodes
on the switches available to other nodes in the fabric. Essentially, by connecting
the switches together, you can create one large, virtual switch that also has the
advantage of being distributed—even over large distances.
Fabrics built with fabric switches work as a single fabric, with all ports connected into the network able to view and access any other node on the network
as if it were on the local switch. A unified Name Server and management services
allow you to view and modify fabric information for an entire fabric through
single interfaces.
An important factor in creating a distributed fabric is understanding the
bandwidth availability of the ISLs. It is important to remember that the speed
available between any two ports can be impacted by the lack of available bandwidth on an ISL and that you might need to employ multiple ISLs to maintain
the necessary bandwidth.We discuss this topic further in Chapter 5, “The SAN
Design Process,” and Chapter 7, “Developing a SAN Architecture.”
Core Fabric Switches
Core fabric switches are designed to reside in the middle of a large SAN, interconnecting multiple edge switches to form multihundred-port SANs. Core fabric
switches can also function as standalone or edge switches, of course, but their
robust feature set and internal architecture is designed to allow them to work in
carrier-class environments as well. Other attributes of core fabric switches are the
ability to support protocols other than Fibre Channel, such as InfiniBand, 2
Gbit/sec support, and advanced fabric services like security, trunking, and frame
filtering.
81
140_SANs_03
82
8/14/01
1:35 PM
Page 82
Chapter 3 • SAN Components and Equipment
Core fabric switches generally provide a high port count, from 64 to 128
ports, and employ extensive internal interconnects to route frames at full speed.
These switches are built for scalability and bandwidth, and are designed to route
as many ports as quickly as possible with the least amount of delay to a frame. In
addition, they tend to be blade-based: you can add and remove switch blades in a
chassis to add functionality as needed, to facilitate hot sparing or online repairs,
and to “pay as you grow.”
Some enterprise switches do not support arbitrated loop operation or other
loop devices directly, instead focusing on core switching capability. Brocade highend enterprise switches provide all of the functionality of Brocade mid- and
entry-level switches, including support for loops.
For environments in which availability is most important, and you are willing
to pay a premium for redundancy, highly redundant switches provide fully redundant components throughout the switch, remove single points of failure, and provide extremely high uptime. A premium is paid for highly available backplanes,
power supplies, redundant circuitry, and software to maintain availability.These
types of switches include a great deal of logic and circuitry to deal with hardware
failures within the switch. Beyond redundancy, core fabric switches support nonservice—interrupting software upgrades, virtually eliminating the need to
schedule maintenance windows. An alternate approach that provides a level of
redundancy in the network is deploying a resilient, dual fabric. A resilient, dual
fabric allows you to remove single points of failure and protect against the
unlikely event of an entire fabric going down due to a software or hardware
error, fire, natural disaster, or operator error. For the most highly available networks, you should deploy a dual fabric built with core fabric switches.
Features of Fibre Channel Switches
Fibre Channel switches provide many different features, including support for
GBICs, redundant fans and power supplies, zoning, loop operation, and multiple
interfaces for management. Each of these features adds to the overall operation of
your switched network and understanding the benefits and advantages of each
can help you design a robust and scalable SAN.This section covers the major features of Fibre Channel switches, describes what you need to know about each of
the features, and how to best use these features.The capabilities of Fibre Channel
switches are listed here:
■
Self-configuring ports
■
Loop mode operation
140_SANs_03
8/14/01
1:35 PM
Page 83
SAN Components and Equipment • Chapter 3
■
Switch cascading
■
Auto-sensing speed detection
■
Configurable frame buffers
■
Zoning (physical port- and WWN-based)
■
IP over Fibre Channel broadcasting
■
Telnet
■
Web-based management
■
Simple Network Management Protocol (SNMP)
■
SCSI Enclosure Services (SES)
Zoning
Control over which nodes in a network can view and access each other has
become a necessary part of configuring your SAN. Depending on which Fibre
Channel switch you buy, zoning is implemented in different ways and also might
support different kinds of zoning.
The simplest kind of zoning is port-based zoning, or zoning by a physical
port on the switch. A port zoning entry could be translated something like,
“Only allow devices on switch 1, port 1 to talk to devices on switch 3, port 2.”
WWN-based zoning provides the capability to restrict devices, specified by a port
or node WWN, into zones.This is much more flexible, since it allows nodes anywhere in a network to maintain the zones they are restricted to. However, it does
have its disadvantages. For example, if you replace a device, the WWN might
change, while the port address stays the same.
Zoning is classified into two types: hard and soft zoning. Soft zoning uses only
software to enforce zones—usually through selective information presented to
end nodes through the fabric SNS. Nodes in a zone are informed of each other
only through names services in soft zoning. However, frames are not barred from
being transmitted between nodes that are not in the same zone.This works fairly
well, but does suffer if zones change, if hardware caches Name Server tables, or if
you want to guarantee that no frames (intentional or accidental) are sent to
devices.
Hard zoning uses hardware, examining each frame that comes across the
fabric and ensuring that it is allowed to pass through to a node. Hard zoning
behaves exactly like soft zoning and is usually used in conjunction with it.
83
140_SANs_03
84
8/14/01
1:35 PM
Page 84
Chapter 3 • SAN Components and Equipment
However, no frames, accidental or intentional, can pass through to nodes where
permission is not given.
Newer hardware is starting to extend the features of zoning further into the
network, to the level of storage LUNs. As hardware advances and the ability to
filter traffic beyond the port level all the way down to the LUN level becomes
available, zoning will enable finer granularity control over the specific logical
units on a storage unit that a specific initiator on the Fibre Channel fabric can
access.This will enable better control and allocation of storage through zoning on
a network.
If you are sharing storage on your SAN, if the network is large, or if you want
to closely control access to data and information on your SAN, zoning is a necessary feature of your switch and should be considered a requirement. On the other
hand, if the size of your SAN is limited and devices attached to it are very well
controlled, then zoning might not be as much of a necessity.
Classes of Service
The Fibre Channel protocol supports different classes of service.The class of service determines the level of error control for transfers. For communication
between nodes to be successful, a switch has to support different modes of operation.The important part of support for classes of service is making sure that all of
the equipment that you are running supports the same classes of operation.
Otherwise, they will be unable to communicate with each other.
Most Fibre Channel switches and other hardware devices support Class 3
operation, which is a connectionless conduit without confirmation of transfers
across the SAN—the ideal case for SCSI transfers.This is because the upper-layer
protocol on top of Fibre Channel is already doing the error control. Doing it in
Fibre Channel as well would just add overhead.The majority of components you
can buy for Fibre Channel are Class 3 devices and are fully interoperable. In some
cases, the confirmation of transfers (acknowledgement frame [ACK]) between
nodes is desired for better error detection, in which case, Class 2 would be used.
However, Class 2 is not widely available on all hardware, although most Fibre
Channel switches support it.
Another class of service is Class F, which is used for internal control and
coordination of the fabric. Switches are exclusive users of Class F. Finally, there is
Class 1, which is rarely implemented in switches, but is supported by some older
equipment. See Chapter 2, “Fibre Channel Basics,” for further detail on the
various classes of service defined for Fibre Channel.
140_SANs_03
8/14/01
1:35 PM
Page 85
SAN Components and Equipment • Chapter 3
Fabric Services
Fabric services are the set of internal services available to devices in a SAN. Fabric
services determine the level of manageability and interoperability of your switch
fabric.These services are used by devices when they first attach to your network
and allow different devices to locate others on the network.This section discusses
fabric services relative to switch and SAN implementations.
Name Server support is a part of the Fibre Channel standard for switched
fabrics and provides devices with a directory of other devices on the fabric.There
is one Name Server service for an entire fabric, whether it is a single switch or
dozens of switches. However, that service is distributed across every switch in the
fabric and provides a high degree of resiliency. Any node querying the Name
Server will receive the same answer regardless of location—all of the switches
participating in the Name Server service cooperate and present a unified picture
of the fabric. All switches support this functionality since it is a basic part of
switched fabric operation.
The Management Service is an in-band fabric service that provides basic
management data about the network. Included in this data is topology information: what is connected to where on a switched fabric and basic information
about attached nodes. In addition, the Management Server provides unzoned
access to the Name Server.This is necessary when you have a management station that needs to know about everything in the network, but does not need to
have access to the storage or hosts in a zone.The Management Server is used by
SAN management software that needs additional information about the fabric,
topology (physical layout) of the network, and other management information.
Registered State Change Notification (RSCN) is a service of the fabric that
notifies nodes of changes in state of other attached nodes: for example, if a node
is reset, removed, or otherwise undergoes a significant change in status. Most
switches support RSCN, which is critical for operation of your SAN.This is particularly important for detecting error conditions and informing nodes about
problems.When the state of the node changes, devices using that node are immediately informed and can react properly, rather than trying requests and timing
out due to errors.This feature is required for hosts that add storage “on the fly,”
since the RSCN is the mechanism by which the host finds out about the newly
available storage. RSCN events are generated by devices and by the fabric itself
for any sort of physical change to the topology of the fabric. For example, a
device is added or removed from the SAN, a switch is added to a fabric, or a
device has been internally reset and has dropped off and comes back onto the
85
140_SANs_03
86
8/14/01
1:35 PM
Page 86
Chapter 3 • SAN Components and Equipment
SAN. RSCNs reduce the need for hardware to repeatedly check for changes in
equipment condition, called polling, and thus reduce the amount of nondata
traffic necessary to keep your network up and running.
There are additional services defined in the Fibre Channel standards that are
not necessarily supported by all, or even any, switches. For example, the Time
Server is not yet supported on any switching platform that we know of, but it is
defined as a standard. If you or the management software you buy requires the
additional services, you should ensure that the switch you buy supports or can
enable those services. For example,VERITAS SANPoint Control uses the
Management Server, but many switch vendors do not support that service.
Brocade does support the Management Server.
Redundancy
Because SANs are usually involved in the critical parts of your business, and
because, unlike regular network traffic, data traffic on a SAN must not be lost or
corrupted, the need for equipment protection through redundancy is important.
Redundancy in its most basic capability takes the form of redundant power
supplies and fans. In the field, power supplies and fans are the most likely
components to fail. Fan bearings, which are the most mechanical pieces of any
equipment, receive the most use, and because they are relatively inexpensive, these
components tend to have a shorter life span than an integrated circuit.
Redundant and hot-swappable power supplies help alleviate the problem of
wear and tear on power supplies and the fans that cool them.With a redundant
power supply, if one of the power supplies fails, circuitry can detect it and shut
down the offending supply and issue a warning—either through software or
through an indicator light or buzzer—all while allowing the equipment to continue running. A technician can swap in a new replacement unit for the power
supply in real time, without affecting operation.
Similarly, other components of a switch can also be made redundant,
including back planes, circuit boards, memory, and CPUs—albeit at a much
higher cost.
Buffer Credits per Port
An important aspect of Fibre Channel throughput is the amount of buffer credits
that are available on each switch port.The number of buffer credits available on a
switch port is an important factor, particularly for long-distance applications.
Although optical networks are fast, light still has a definite speed, which is
140_SANs_03
8/14/01
1:35 PM
Page 87
SAN Components and Equipment • Chapter 3
approximately 5 km/microsecond.This is slow enough that over long distances,
the amount of buffers required to keep operations running is very important. If
there are not enough buffers available on the far end of a long run of optical
cable, the hardware will run out of buffers to receive them, throttling the actual
amount of data that can be sent over that cable. By ensuring that you have
enough buffers to support this sort of operation, you can ensure smooth and
maximum throughput across your optical link.
In shorter cable length configurations, buffer credits are less important. Most
Fibre Channel switches are configured with plenty of buffer credits per port
when dealing with distances up to 30 km. However, it is worth knowing what
your switch can support so that you can ensure optimal operation, especially
when you intend to use your switch across long distances. At 1 Gbit/sec, it takes
five 2 KB frame buffers to provide enough buffering to ensure full-bandwidth
performance at 10 km.To ensure full-bandwidth performance at 100 km, you
need approximately 50 buffer credits for each switch port.
Self-Configuring Ports
Fibre Channel has many different modes of operation for ports: loop (FL_Port),
switched fabric (F_Port), and ISLs (E_Port). Even within loop ports there can be
different modes, depending on whether the attached devices are public or private
loop, and if they are initiators or targets. Also, the emerging 2 Gbit/sec and higher
Fibre Channel standards will create even more modes of operation. Self-configuring ports are able to detect what kind of mode the other side of the link is
operating on and automatically configure themselves to support that mode of
operation.This is particularly important in the case of fabric-supporting devices,
which operate much better and with more reliability if they are operated in
fabric mode (also called point-to-point when a device is connected to a switch).
A self-configuring port analyzes the primitives on the wire to properly configure
operation to match the connected Nx_Port hardware.The term Nx_Port is used
to identify either an N_Port (point-to-point connection) or an NL_port (loop
connection) for the connecting device.This also supports dynamic reconfiguration of a network: for example, changing the placement of an ISL should happen
automatically, rather than requiring an administrator to telnet or log in to a Web
interface to control the configuration of a particular port.
Some switch vendors have specific ports that support only certain operations,
requiring that ISLs be connected on only a few specified ports. All Brocade
switches can support self-configuration on all ports. Certain entry-level products
might require an additional software license to enable this support, but the
capability is present in the hardware. Self-configuring ports make it much easier
87
140_SANs_03
88
8/14/01
1:35 PM
Page 88
Chapter 3 • SAN Components and Equipment
to manage your fabric, eliminating the need for dedicating certain ports for
certain functions.
Another feature of switch hardware and software is the ability to manually set
the configuration of these ports. Sometimes, equipment is not able to properly
auto-configure a port. A device that supports switched fabrics might not be recognized as a fabric device, and a port might be configured as a loop device. In order
to ensure that those devices operate in the best mode, you might need switch software to force configuration of a particular port. Being able to manually set the type
of port indicates that there is a conflict between the connecting device and the
switch. If a port is locked to a certain type, it limits the functionality and can cause
potential problems if other devices are plugged into that port. Once a port is
locked, it does not become “plug-and-play” anymore.
Auto-Negotiating Speeds
As Fibre Channel moves from 1 Gbit/sec to 2 Gbit/sec and beyond, support for
auto-negotiation of speeds becomes necessary to support mixed-speed networks.
Auto-negotiation uses communication with devices attached to a switch to
determine if they are running at 1 Gbit/sec or 2 Gbit/sec and automatically
selects the correct speed.
IP over Fibre Channel Broadcasting
The use of IP over Fibre Channel (IPFC) is, for the most part, identical to any
other IP network. Fibre Channel, as a communications medium, does not inherently support broadcasting frames to all nodes on a fabric identically to Ethernet
or other IP networks. Fibre Channel broadcast is a function of switches that will
automatically resend broadcast frames to all attached ports on the Fibre Channel
network, effectively emulating the broadcast properties of Ethernet networks.This
helps to fully support file sharing, such as NFS, bootp, ARP, ping, and other protocols on top of IP that are dependent on broadcast and that are usually not
aware of the behavior of IPFC.
This is a necessary part of fabric operation if you intend to send any IP
frames across your network. Some HBAs do not react well to IP broadcasts, so
you might need to use switch zoning to allow them to coexist with other HBAs
that are running IP.
140_SANs_03
8/14/01
1:35 PM
Page 89
SAN Components and Equipment • Chapter 3
Firmware Upgrade Methods
Although many users will buy a switch directly out of a box and never look at
the firmware installed on it again, sometimes it is necessary to upgrade the
firmware to fix a bug with newly introduced hardware, add a new feature, or
enable management through a third-party software package. Also, if you are
building a large fabric, it might be desirable to have a unified firmware version
throughout that fabric.This will ensure a consistent feature set and the most reliable operation in a large heterogeneous fabric.
Software upgrades can be accomplished in different ways. For fast upgrades of
firmware, look for the capability to download firmware to the switch through
Ethernet.The most basic of these downloads is through the serial port, which
requires an RS232 connection and a PC or other machine that sends the
firmware to a switch through a slow serial link. Brocade switches do not support
the download of firmware through a serial connection and instead use Ethernet
for downloading firmware. However, it is possible to manage all Brocade switches
except the SilkWorm 2800 with a serial connection.
Another consideration for firmware upgrades is how much impact this will
have on your network.The ideal operation is “hot upgrades,” firmware upgrades
that can be done while equipment is running and can be “rolled in” to production. Few pieces of equipment currently support this, but equipment that does
can keep downtime to a minimum. Next is upgrade-on-reboot, where firmware
upgrades can be done, but the new firmware does not take effect until the switch
is booted. Operation can continue until a reboot is triggered.
The worst option is offline upgrading.This is required when a component
must be offline to upgrade, or even worse, when all equipment must be upgraded
at the same time. Many pieces of hardware are eliminating this, but you should
still be aware what kind of work is required when you need to upgrade switch
firmware.
The good news with firmware upgrades is that, in a dual-fabric SAN, you
can upgrade one fabric at a time.This will enable a firmware upgrade to take
place with no disruption to your environment. Using dual fabrics might require
additional software on your host, such as VERITAS DMP, a multipathing HBA
driver such as the TROIKA driver, or multipathing RAID drivers. Since dual
fabrics are always advisable in uptime-sensitive environments, the firmware
upgrade disruption question is moot for real-world applications. See Chapter 7
for more information on SAN availability models.
89
140_SANs_03
90
8/14/01
1:35 PM
Page 90
Chapter 3 • SAN Components and Equipment
Loop Operation: Making Your Switch Act Like a Hub
A convenient feature of almost all Fibre Channel switches is their capability to
act similarly to a Fibre Channel hub. By behaving like a hub, a switch can work
with FC-AL devices that do not support fabric operation. Because the Fibre
Channel standard began with simple FC-AL operation, and much of the storage
hardware and even some of the HBAs available might be only FC-AL and not
switched fabric-compatible, the ability to make a switch act like a hub can ensure
that older equipment will still work in your network.
In general, a set of switch ports or an entire switch might be configured for
loop operation, specifying which ports are running in loop mode. Some low-end
switches are actually forced to operate in loop mode, with the ability to be
license-upgraded to support full fabric.
The capability for loop operation is a must if you are directly attaching
storage devices that support only the older FC-AL protocol to your switch. Refer
to Chapter 4 for further detail regarding how QuickLoop and Fabric Assist
enable private-loop devices to fully participate in the SAN.
FSPF Compliance
Fibre Channel switches from different vendors started out fully compatible with
end nodes in a network. However, until fairly recently they were not able to pass
frames between each other (inter-vendor switch frame compatibility). Several different routing algorithms for inter-switch routing existed. Brocade switches all
used a protocol developed at Brocade called Fabric Shortest Path First (FSPF).
Recent efforts in compatibility have standardized on the FSPF routing protocol
algorithm for routing Fibre Channel frames between switches, and now all vendors are required to support this protocol in order to be Fibre Channel standardscompliant. FSPF, which was originally used by all Brocade switches, forms the
basic protocol for exchanging and routing frames between switches in a Fibre
Channel fabric.
FSPF compliance is most important if you are trying to mix and match Fibre
Channel switch hardware. Because all switches must follow this standard fully to
interoperate, you must make sure all switches in your network support the standard. In addition, because the standard is very new, it is important to make sure
that all pieces of hardware you expect to work together have actually been tested
together. Also check to see what advanced features will be lost when interconnecting switches from other vendors into a Brocade fabric. At this time, no switch
vendor supports the complete feature set that Brocade switches implement, and it
140_SANs_03
8/14/01
1:35 PM
Page 91
SAN Components and Equipment • Chapter 3
is possible that the needed functionality might be lost by introducing other vendors’ switches. Since Brocade implements a superset of most other vendors’ feature sets, it might be more practical to introduce a Brocade switch into another
vendor’s existing fabric than vice versa. Firmware versions and topology are still
very important in these mixed environments, which are still undergoing testing at
the publication of this book.
The trend toward full compatibility in inter-vendor switch compatibility is an
important basis for the future and promises not only to reduce cost, but also
more important, to permit the interchangeability of hardware in your network.
Management Interfaces
Fibre Channel switches support many different ways to manage a switch.These
different interfaces allow you to choose how you want to configure a switch, and
they also take into consideration how you intend to manage your switch, as well
as any other tools you might already have deployed in your network.
Serial Port
The most basic management interface for Fibre Channel switches and other
equipment is the serial port. A standard, RS232-based port is generally available
on Fibre Channel switch equipment that allows command-line interaction with
different configuration options.
Telnet
Telnet is the standard IP networking ability to log in to a piece of equipment
through a telnet interface from any host server attached on Ethernet, or even
in-band through Fibre Channel itself (Figure 3.7).You typically log in to a telnet
interface and execute command-line commands.Telnet has the advantage of
being convenient to run remotely or through a slow connection, and can also be
scripted for automatically configuring switch settings through a nightly script or
for difficult operations.The disadvantage is that command-line interfaces tend to
be difficult to use, especially for complex operations like zoning and viewing lots
of information at once, in which a GUI interface such as WEB TOOLS is more
practical.
SNMP
Simple Network Management Protocol (SNMP) is an IP-based protocol for managing
any kind of network equipment, including Fibre Channel switches. SNMP provides mostly read operation of switch functionality and configuration, as well as
91
140_SANs_03
92
8/14/01
1:35 PM
Page 92
Chapter 3 • SAN Components and Equipment
critical error counters and statistics. Almost all Fibre Channel switches (as well as
hubs, routers, and even some storage arrays) provide SNMP Management
Information Base (MIB) information.This is mostly used in conjunction with
traditional network monitoring applications like HP OpenView, CA Unicenter,
and Tivoli NetView, but also is used by Fibre Channel management software to
provide information from network hardware.
Figure 3.7 Telnet Session with a Brocade Switch
Standards for SNMP Management Information
SNMP defines only the basic protocol that transports management
information. The actual information transported across the network is
defined on top of the SNMP protocol through Management Information
Base (MIB) definitions. It is important to make sure that the equipment
you buy supports the correct MIBs that enable software to interpret and
use the information available from your switch.
The FibreAlliance MIB, now under consideration in the Internet
Engineering Task Force (IETF) standards organization, is supported by
most Fibre Channel network hardware. Defined by the FibreAlliance
organization, which was started by storage manufacturer EMC, the MIB
provides common information for discovery of topology and equipment
capabilities in a SAN. The MIB provides information about how many
Continued
140_SANs_03
8/14/01
1:35 PM
Page 93
SAN Components and Equipment • Chapter 3
ports exist on a piece of equipment; what is connected on each port;
and even detailed information such as error counters and frame counts.
In addition, basic asset tracking information such as manufacturer
strings, model numbers, and other information is presented in this MIB
definition.
The Fibre Channel Management MIB, which is supported by some
earlier switches, is a different MIB that provides much of the same information. This MIB preceded the FibreAlliance MIB and is required by some
software for accessing parameters not exposed by the FibreAlliance MIB.
It is best to check with your software vendor to understand if this information is required for operation.
Most hardware also presents equipment- or vendor-specific MIBs,
defined by the manufacturer. Often times, these MIB definitions are also
made available to customers and can expose things that are not
industry-standard: for example, special features of a switch, information
that is specific to the hardware, or special functions. You should check
with your manufacturer or with software shipped with your hardware
for equipment-specific MIB definitions.
Web-Based Management
Web-based management interfaces provide a graphical,Web-based way of
accessing and modifying switch settings. In general, most Web-based management
tools provide a page that you can access in your browser, view switch status, and
set most switch settings. In many cases, the Web-based GUI can help make complex tasks such as zoning much easier, and also provide a visual indicator of
switch function. However, not all switches export all functionality through a GUI
and they instead might require a telnet or serial port session to access some tasks.
Brocade switches allow practically all management through the Web interface,
which greatly simplifies the management by more casual users.
There are two types of Web-based GUIs available: pure HTML-based and
Java-based Web pages.Web-based applications usually present simple HTML
pages to access all switch functionality, where Java-based Web pages have
embedded Java applets running more like a standard application.There are advantages and disadvantages to each, but you are generally stuck with the type of
interface that ships with your switch.The advantage of pure HTML pages is the
speed of loading a specific page. A Java Virtual Machine (JVM) does not have to
be loaded, which sometimes can take quite a while across a slow link. Moreover,
because of compatibility issues between different Web browsers, you might
93
140_SANs_03
94
8/14/01
1:35 PM
Page 94
Chapter 3 • SAN Components and Equipment
encounter some difficulties if your browser is not exactly the same as the version
for which the switch GUI developers qualified their Java software.The advantage
of a Java-based applet is it offers user-friendly management of complex tasks.
When you select the switches you intend to use in your network, you should
compare what common tasks you would like to do against the Web-based GUI
tools available, and make sure that you purchase the licenses for Web-based management.Web-based tools are very helpful for accomplishing day-to-day tasks. In
addition,Web-based tools make management of your switches easy even from
remote locations or offsite.
Application-Based Management
Some switches on the market also support application-based management. Instead
of an embedded Web server or Java application, a separate, externally run program
manages your switch.These applications are usually based on Java as well, but need
to be installed on a server. Managing your switch through an application sometimes can be faster than running an application from a Web interface, and some
hardware offers identical interfaces between the Web and an application. Brocade
Fabric Manager is an example of an externally based management program.
Application-based management works best when you have a permanent network management station where you can have software installed and used normally. It is more difficult if you need to move from place to place and do not
want to have to reinstall the software on whatever machine you happen to be
using that day. Application-based management hosts are a management single
point of failure.
SCSI Enclosure Services
SCSI Enclosure Services (SES) is a SCSI protocol-based method of obtaining management information. SES support gives some information on the status of SCSI
equipment on the network. If your software supports the SES standard, you can
use this feature of Fibre Channel switches to also monitor the basic health and
well-being of your switch through the same SCSI-based management software.
SES is generally used by operating systems so that they can incorporate certain
management functionality into their environment. End users rarely use it, and its
usage by operating systems is waning due to the advent of sophisticated and
powerful alternatives such as the Management Server.
140_SANs_03
8/14/01
1:35 PM
Page 95
SAN Components and Equipment • Chapter 3
Connecting Your Servers with Host Bus Adapters
HBAs connect hosts to your Fibre Channel network. An HBA translates operating system SCSI commands to the proper Fibre Channel frames and protocols
on the wire when plugging into a bus such as PCI or SBus. Unlike Ethernet network adapters, Fibre Channel adapters are actually much more intelligent and frequently contain embedded processors and embedded firmware to negotiate the
Fibre Channel protocol. Fibre Channel HBAs provide advanced functionality
such as persistent binding and HBA-based LUN masking, which are used in conjunction with switch zoning and storage LUN masking to control and allocate
storage in your SAN.
This section covers the major types of HBAs and details the features available
on most HBAs. In addition, a discussion of specific issues about using HBAs and
how to implement these features in your own SAN will help you understand
how to best use these components in your network.
Connecting Hosts to the Fabric
HBAs operate by plugging into the internal bus of your host machine (for
instance, PCI or SBus). Loaded with device driver software, the HBAs appear to
operating systems as a SCSI adapter. In most cases, an HBA is indistinguishable
from other storage adapters, such as SCSI adapters, to the operating system. HBAs
even emulate the way the legacy SCSI adapters communicate with the operating
system.The HBA will map devices seen on the network to SCSI bus, target, and
LUN addresses associated with a SCSI adapter.
An operating system treats an HBA exactly like it does a SCSI adapter, down
to the exact same SCSI commands and packets.The HBA takes these packets and
translates them into the Fibre Channel protocol, adding network headers and
error handling. It transmits the packets across the network, makes sure of the
response from the storage, and returns the information and status back to the
operating system—all as if it were a SCSI adapter.
Other advanced HBAs also do this for the Internet Protocol (IP) and Virtual
Interface (VI) protocol, providing network and clustering adapters to the
operating system and software.
HBA Types
HBAs range from low-cost, embedded chips to high-end, dual-channel multipathing adapters.The most basic HBAs support only small FC-AL loops with a
few devices and contain minimal buffering memory or intelligence. On the high
95
140_SANs_03
96
8/14/01
1:35 PM
Page 96
Chapter 3 • SAN Components and Equipment
end, adapters might include additional buffer memory for better performance and
throughput, intelligence to handle large fabric deployments, and high-end features such as HBA-based LUN masking and failover capability.
A Plethora of Protocols
Fibre Channel networks, although primarily used for storage using the SCSI FCP
protocol, also can be used for other protocols such as IP for standard networking
and VI for clustering. Different HBAs can support different protocols and at a minimum, support SCSI FCP. It is becoming standard for adapters to support SCSI
FCP and IP, and newer adapters now support the VI protocol as well. If you are
designing a SAN and think you might want to use it for routing IP frames, backup,
or other IP traffic, it is well worth checking to see if your HBA hardware supports
IP or VI, or can support these protocols through a software or firmware upgrade.
The FCP/SCSI Protocol
FCP/SCSI is the primary protocol used to transfer data over the Fibre Channel
network. Fibre Channel Protocol (FCP) encapsulates standard SCSI commands,
which are identical to the old SCSI bus commands. Instead of signals, however,
the Fibre Channel standard transmits the commands as a set of frames containing
the usual command and data phases of the SCSI protocol. In fact, a SCSI application running on top of Fibre Channel is identical to running on the SCSI bus,
with no modification. HBAs take responsibility for translating requests to a SCSI
bus, target, and LUN, and redirecting that to a specific Fibre Channel address and
LUN address. Applications and operating systems written for SCSI can run on
top of Fibre Channel unmodified.
The IP Protocol
The IP protocol, the standard for the Internet, runs on top of Fibre Channel by
following the IPFC protocol. Using the same concepts of IP address and mask, IPcapable adapters generally look and behave identically to Ethernet adapters—only
much faster. By installing the appropriate drivers, you can add a network adapter
and a set of IP addresses, which instead of being transmitted via Ethernet or
another network can be sent by Fibre Channel. Although not typically used to
replace Ethernet, IPFC is useful for managing in-band Fibre Channel devices,
offloading backup traffic, or connecting machines over the same long-distance
links as your storage and taking advantage of the cost savings of not having to run
a different network over a different wire. In installations where server slot space is
at a premium, IPFC can save an additional slot and network infrastructure.
140_SANs_03
8/14/01
1:35 PM
Page 97
SAN Components and Equipment • Chapter 3
The one area to note is that unlike Ethernet, Fibre Channel is not well suited
to broadcasts and multicast operation. Some Fibre Channel networks do not
inherently support broadcast packets and rely on software and switch support to
properly broadcast frames to all nodes on the network. Brocade supports hardware forwarding of broadcast and multicast frames and the software necessary to
support both in a multiswitch fabric.
The VI Protocol
VI is a specification that was developed by Intel for low-latency server clustering.
In standard clustering environments, Ethernet or other IP protocols have been
used to pass data from machine to machine across a network. Unfortunately,
because of the software overhead of IP stacks and the many software layers that
data must pass through to send IP frames, clustering has struggled to reach its full
potential.To solve this problem,VI technology removes the traditional IP networking stack and instead provides a method of directly sending data across a
wire to another computer’s memory. Applications, especially cluster-aware applications like databases, have started to use this protocol for cluster communications.VI over Fibre Channel provides the ability to leverage the Fibre Channel
infrastructure to also pass VI traffic.
Fully clustered databases like Oracle Parallel Server and IBM DB2 both support the VI protocol in their inter-node communications. By configuring these
databases to use VI on a Fibre Channel card, these applications run faster and
with significantly less CPU usage.
Speeds
All Fibre Channel adapters today support the 1 Gbit/sec (100 MB/sec) speeds that
all Fibre Channel equipment supports. As network infrastructure, such as switches,
moves towards the new 2 Gbit/sec standards, so do HBAs and storage.This new
standard provides for double the speed of current adapters, and standards are
firming up that allow for auto-negotiation between the 2 Gbit/sec and 1 Gbit/sec
speeds and protocol differences.The next-generation 10 Gbit/sec standards are
now in development and should allow for 10 times the speed of operation of
current-generation SAN components in the future.With most Fibre Channel
storage, it is rare that you will even approach the 100 MB/sec performance
numbers (200 MB/sec full duplex) that current adapters allow you to reach at
1 Gbit/sec.
97
140_SANs_03
98
8/14/01
1:35 PM
Page 98
Chapter 3 • SAN Components and Equipment
Ports
The number of ports available on a Fibre Channel adapter can range from a
single port to dual-port adapters with the capability to act as two individual
HBAs on a single card. Dual-port adapters add a significant cost reduction by
enabling two separate connections into a single card and can be useful where
there is a need to connect to two separate fabrics from a single system.
One limitation to be aware of with dual-port cards is the overall bandwidth
you can achieve when using these adapters.This is not due to the HBAs, but
because of the computer system buses to which they are attached. Different
architectures such as PCI (33 MHz, 32 bit) cannot handle much more than the
100 MB/sec speeds available on a single Fibre Channel port today.You will probably not be able to get twice the performance with the two ports available on a
single card. Moreover, if the HBA were to fail and it contained several ports, they
would all fail.
Combination Adapters
A recent innovation in Fibre Channel adapters is the appearance of combination
adapters, which combine the functionality of Fibre Channel with other network
interfaces. For example, combination Gigabit Ethernet and Fibre Channel cards
exist on the market.These tend to be used where slot space is at a premium, or
in embedded applications where there is a need to support multiple interfaces in
a small space.
Fabric-Capable Versus Loop Adapters
There are two different classes of HBA capability. Legacy HBAs usually support
only loop operation to connect to a network and do not support connection to
the fabric.These adapters are termed private HBAs.They can generally connect
only to other private FC-AL devices. Some low-end HBAs still support only
loop operation. However, it is possible to upgrade the drivers on these HBAs to
support fabric attachment. Sun and HP HBAs originally were capable of private
mode only and are examples of legacy HBAs.
Fabric-capable HBAs support both loop and fabric and can address thousands
of nodes connected into a switch.They use the fabric Name Server to access different fabric devices. In addition, fabric switches are aware of the different fabric
protocols used to monitor and find other nodes in the network and do not
require special modes in a switch to operate in fabric. In general, it is best to find
140_SANs_03
8/14/01
1:35 PM
Page 99
SAN Components and Equipment • Chapter 3
a fabric-capable adapter for your switched fabric SAN, since they are much more
advanced and able to deal with the complexities of a switched fabric operation.
HBA-Based LUN Masking
HBA-based LUN masking is the capability of an HBA to selectively hide—
“mask”—storage devices on the network from a host. By masking specific LUNs
from a host, you can control which storage a host maps into the operating
system.This is important when you have mixed operating systems on the same
network, since you can prevent corruption of data because of ownership contention. LUN masking also provides a method for dividing storage capacity in
your network.
LUN masking is very important in the context of operating systems such as
Windows NT and Windows 2000.Windows operating systems, which are not
natively Fibre Channel-aware, do not expect storage volumes to ever be shared
with any other hosts. If a storage volume is exposed to more than a single host,
the operating system might not be able to mount the file system located on this
disk. As a response, the operating system might write a signature on a disk it does
not own and will most likely corrupt any data that is on that volume already. By
masking the LUNs to only the volumes a host is permitted to see and own, you
can avoid these problems entirely.
LUN masking is also very important when you are mixing operating systems
in a network. Because the way file systems are written varies for different operating systems, if a LUN is formatted for one operating system, the other operating system will not recognize that it is in use. If LUN masking is not used, the
second operating system could assume that, because it does not recognize an
operating system, it can write its own data on the identical disk—corrupting data
that is already there.
Persistent Binding
Persistent binding, sometimes referred to as LUN mapping, is the mapping of a
Fibre Channel device into an operating system at a specific device location.
Persistent binding is particularly important for applications that use the operating system SCSI address to address a device: for example, a fixed device node in
Solaris or a raw volume used by an Oracle database. In both of these applications,
the ID must be persistent and fixed from reboot to reboot.
In some implementations of HBAs, persistent binding and LUN masking are
the same. Some vendors use persistent binding to enforce LUN masking: only
99
140_SANs_03
100
8/14/01
1:35 PM
Page 100
Chapter 3 • SAN Components and Equipment
devices that have been persistently bound by hand or through software are
allowed access into the system.
Other implementations do not couple persistent binding and LUN masking,
and instead automatically bind devices as they are discovered—as long as LUN
masking allows those new devices to show up to the operating system.This
allows for more flexibility, since devices do not manually have to be configured
for masking settings.
Default LUN Access Permissions
Default LUN access permissions are used by HBA software to determine
whether a disk device should be mounted and accessible to a host operating
system. HBA drivers can usually be configured to always allow access (automatic
mapping of devices to an operating system), or to never allow access (manual
mapping of devices to an operating system).
For example, it is important for very large networks to never allow automatic
access and require manual mapping of a device to the operating system. For
example, you might have 20 LUNs exported from a storage array, but only the
first two LUNs should be accessible to your host. If you set access permissions to
default to allow automatic access, potentially all of those 20 LUNs would be
claimed and probed by the operating system. By setting the default access to
deny, only the intervention of an administrator will allow the operating system
access to the disks (through LUN masking).This keeps hosts from trampling on
the data already written to LUNs in a network.
Typically, for very small networks the HBA drivers are configured to always
allow access to new devices in the network. For large networks, it is a requirement to never allow access unless an administrator specifically grants access.
Upper-Level Protocol Access Permissions
As HBAs have begun to add IP and VI capabilities to their cards, an important
option that is beginning to appear is the control over IP and VI access permissions. Like LUN access permissions, this allows you to control which devices are
allowed to receive IP or VI frames.The major use of this currently is to prevent
IP or VI frames from being sent to hosts or storage that do not understand those
protocols. In some cases, receiving these frames causes errors or software to crash
on these storage devices, even though they should not recognize the frames and
should ignore them.The ability to set these permissions adds the ability to
control this functionality and prevent these types of errors.
140_SANs_03
8/14/01
1:35 PM
Page 101
SAN Components and Equipment • Chapter 3
Dynamic Versus Static Discovery
Fibre Channel is a networking protocol, and unlike a parallel SCSI bus, devices in a
network are not usually powered up and down at the same time. In fact, devices
can be added or removed at any time, and the whole network continues to stay up.
Older parallel SCSI devices used to require a reboot to discover any new storage
devices attached to a host, because of the static nature of the parallel SCSI bus.
Dynamic discovery of devices is the capability of an HBA to discover new
devices on a network without rebooting.This allows the most flexibility, and by
rescanning drives with operating system software a new storage volume can be
added or removed without rebooting the system. Static discovery occurs when
an HBA requires a reboot to discover new devices, which is still the case for
some hardware.
Understanding if your hardware supports dynamic or static discovery is
important. If you want to run a Fibre Channel network, dynamic discovery is
necessary for 24x7 network operation.
Configuration Management Software
The suite of software available in most network adapters has usually been limited
to very basic command-line utilities or a few simple configuration pages attached
to the device driver. However, in the Fibre Channel HBA world, the features and
capabilities of cards are more advanced than a few simple configuration settings.
The advent of sophisticated management software has made it possible to not just
change card settings, but also monitor what a Fibre Channel card sees in the network; monitor the status of connections; identify externally connected nodes; and
run diagnostics. Figure 3.8 shows the TROIKA SAN Command utility, and
Figure 3.9 shows an example of Emulex’s configuration utility.
Different vendors provide varying amounts of configuration management
software, ranging from simple command-line utilities to sophisticated GUI
applications.
HBA API Support
The HBA API is a C-level API supported by Fibre Channel HBA manufacturers
to enable the collection and management of information available from HBAs.
This API is used by SAN management software to collect information such as
model numbers, vendor names, hardware and software version numbers, port
speeds and settings, as well as ports attached to a Fibre Channel HBA. Support
101
140_SANs_03
102
8/14/01
1:35 PM
Page 102
Chapter 3 • SAN Components and Equipment
for the HBA API is widespread and is a requirement for managing your Fibre
Channel HBAs through most management software.
Figure 3.8 HBA Configuration Management Software (TROIKA
SAN Command)
Figure 3.9 Emulex Configuration Software
140_SANs_03
8/14/01
1:35 PM
Page 103
SAN Components and Equipment • Chapter 3
The HBA API consists of two major parts: a shared, common library that is
loaded onto your operating system and accessible to any applications, and a
vendor-specific library that supports the HBA that you are loading onto your
system.The common library allows applications to generically call specific functions from the HBA API and is dynamically linked with applications such as
VERITAS SANPoint Control, SANavigator, and other SAN management software.This library is installed into a common location (such as C:/WINNT/
SYSTEM32 on Windows NT and \usr\lib on UNIX).The vendor-specific
library is provided by the HBA manufacturer and is usually installed in a manufacturer’s install location.
The HBA API has an advantage over the previous technique used for managing HBAs, which tended to be vendor-specific I/O Controls (IOCTLs).The
API, which was developed as part of the efforts of the Storage Networking
Industry Association (SNIA) to increase interoperability and manageability of
Fibre Channel networks, has the advantage of running with any HBA vendor’s
hardware across different operating systems and allowing different vendors’ hardware to be addressed in the same box.
Remote Boot across the SAN
Remote boot is the capability of an operating system to use a Fibre Channel HBA
to access and mount the boot volume for a system. Unlike parallel bus SCSI, volumes in a SAN are not limited to a local bus, and additional logic is necessary to
boot an operating system from the network. A boot binding between a specific
volume in the network—WWN and LUN—is required for remote boot to start.
This choice is usually made through software accessible before boot and startup.
Remote boot allows for an interesting use of the Fibre Channel SAN.
Because boot volumes can be on a network and be available to practically any
device connected to the SAN, they make it possible to dynamically change
which physical hardware a system is booting up from by changing the remote
boot binding. For example, you could have the operating system image of your
Web server stored on the network on a disk, and if the hardware that was running that Web server fails, all you would need to do is reassign the boot volume
to a new server.This makes it possible to easily reallocate functions of your
servers across your network when hardware failure occurs, without requiring
moving, reinstalling, restoring, or any other typically cumbersome and lengthy
processes associated with local storage utilization.This is also used to enable
103
140_SANs_03
104
8/14/01
1:35 PM
Page 104
Chapter 3 • SAN Components and Equipment
advanced disaster tolerance, so that you can boot the image of a host if using different hardware.
The difficulty of remote booting is that when you use network volumes to
run your operating system, if there is a network error or failure in your network,
you will not be able to access anything on the boot operating system. Errors will
not be logged, and in many cases, everything will come to a grinding halt until
the network error is fixed. A simple case of someone unplugging the Fibre
Channel connection on the back of a machine could be catastrophic, versus the
normal situation where an internal disk is unlikely to be removed without
powering down a machine and opening the case.This situation can be remedied
by using a dual-fabric configuration.
Hot-Plug Support
With newer operating systems and hardware, the capability to swap failed equipment while your computer and operating system are still running has begun to
become possible. Peripheral Component Interconnect (PCI) hot-plug systems
allow the removal and insertion of hardware, even while I/O is occurring.This
makes it possible to fix problems without taking down critical systems or
rebooting systems. In redundant configurations, this ensures that operation can
continue even through single hardware failures.
The typical sequence of events in a hot-plug situation is this: an HBA fails or
is otherwise showing signs of problems.Through system error logs, an administrator determines that a certain piece of hardware is at fault, and either indicates
through software or through buttons or levers on the system that he or she
would like to swap this HBA.This signals to driver software and to the hardware
bus to stop I/O to a card and isolate the connections of that card from the
remainder of the system. Usually, a light or other indicator will show that it is
acceptable to remove the offending card, and the administrator will swap it with
an identical piece of hardware. By pressing another button or lever, or through
running software, the administrator tells the operating system that a new piece of
hardware has been swapped in. Finally, that signals drivers and the operating
system to start using that hardware again.
This generally works great. However, there are several Fibre Channel-specific
complications that need to be addressed with PCI hot-plug.These are Fibre
Channel settings, storage LUN masking, and zoning settings, which are usually
tied to a very specific piece of hardware, and which will not understand that the
newly swapped card is identical to the other piece of hardware it replaces.
Fibre Channel settings are the specific settings for how a piece of hardware
should behave on the Fibre Channel network. Because there are different modes
140_SANs_03
8/14/01
1:35 PM
Page 105
SAN Components and Equipment • Chapter 3
of operation, and specific compatibility modes and settings needed to operate
with different Fibre Channel hardware, these settings are important to maintain.
Often, these settings are stored in Nonvolatile RAM (NVRAM), which is a
physical piece of hardware attached to an HBA.When you swap the HBA, these
values are no longer remembered, and instead will use the settings that are
written into the new card.Two things need to be done in this case to properly
set up these Fibre Channel settings. First, if possible, replacement cards and spares
should be configured ahead of time identically to the running system. Different
timeout values, modes of operation, and settings should be logged and set on all
spare hardware so that they do not need to be reset on a hot-swap card. In addition, it is best to force settings in your driver software (typically done through
registry entries or configuration management software in Windows systems and
through .conf files in UNIX), so that the driver forces the setting no matter what
is programmed into the card.
Storage LUN masking is the capability of Fibre Channel storage to enforce a
level of security on which devices are allowed to access specific volumes on the
storage. High-end RAID arrays are typically the only pieces of hardware that
contain this functionality. A piece of storage will contain settings within its software and memory that allow only specific devices (HBAs) access to certain
LUNs.This is usually defined either with a port WWN or node WWN of the
HBA that is installed into your host system. If you swap this HBA, the WWNs
will change, because they are globally unique to that specific HBA.You will have
to reconfigure your storage to accept the new port WWN or node WWN of the
HBA that you have hot-swapped into a system, or no I/O operations will be
allowed to the LUN.To some extent, you can avoid this by knowing in advance
the port and node WWNs of the hot swap and preprogramming your storage to
accept the hot swap’s WWNs. If this is not possible, you will have to ensure that
as part of your hot-plug procedures that the reprogramming of your storage array
is included.
The last Fibre Channel-specific setting that is important in hot-plug situations is switch zoning. Switch zoning operates much like storage LUN masking,
since it also relies on the WWNs programmed on an HBA to perform the
zoning operation (unless you are using port zoning, in which case this does not
apply).When you insert a new HBA, the WWNs programmed into different
zones will no longer match, meaning that any storage that your host previously
had access to will not be available if switch zoning is in effect.There are different
things you can do to minimize the amount of work needed to do a hot-plug
swap. One is to use an alias in your switch to define the host, so that it is easy to
105
140_SANs_03
106
8/14/01
1:35 PM
Page 106
Chapter 3 • SAN Components and Equipment
change the actual port or node WWN of the alias and have all other zone settings
change automatically. Another is to do as in storage LUN masking, and preprogram, if possible, the WWNs of your spares into the proper zones so that they are
automatically included as a part of the zone. Finally, some HBAs allow you to
force a node WWN to apply to an entire set of HBAs in a system and use that
node WWN as the zoning member, which makes it possible to swap in new
HBAs without requiring reconfiguration of your switch zoning. If you use this
method, perform all zoning by using the node WWN of each HBA, rather than
the port WWN.
Connecting Legacy
Devices into Your SAN
When Fibre Channel networks were first put into operation, many storage
devices were not able to natively communicate using the Fibre Channel protocol.
However, the parallel SCSI bus was widely supported and many different kinds of
devices were implemented with parallel SCSI support. Because Fibre Channel
provides the same SCSI protocol over a different kind of medium, devices called
Fibre Channel-to-SCSI routers were developed to translate Fibre Channel frames
to the appropriate parallel SCSI commands.These routers include one or more
Fibre Channel ports on one side, and one or more parallel SCSI bus connections
on the other. Devices on the parallel SCSI bus side are presented to the Fibre
Channel network as any other Fibre Channel native storage devices, as LUNs
available from a Fibre Channel port.
Fibre Channel routers make it possible to use legacy parallel SCSI devices by
simply plugging a box into the network on one side and the legacy parallel bus
on the other side. In particular, tape libraries have relied on Fibre Channel routers
to help enable direct backup of storage on the SAN.
Basic Features of Routers
Fibre Channel routers should more accurately be called bridges, as they bridge
legacy SCSI devices and Fibre Channel, translating Fibre Channel SCSI-FCP
transactions and parallel SCSI bus transactions. A Fibre Channel router plugs into
the Fibre Channel network on one side and a SCSI bus on the other.To the
SCSI bus, a router looks like an initiator such as a host. It issues SCSI commands
such as resets and inquiries, and determines what SCSI devices exist on the SCSI
bus.To the Fibre Channel network, a router looks like any other storage node on
140_SANs_03
8/14/01
1:35 PM
Page 107
SAN Components and Equipment • Chapter 3
the network. Presenting each of the SCSI devices it has found on the SCSI bus as
a LUN connected to a port, the router takes any frames sent to the SCSI devices
and translates them from SCSI-FCP to parallel bus SCSI. Likewise, any responses
received from the SCSI devices are changed to SCSI-FCP, tagged with network
headers, and sent back to the initiating device.
Routers support any type of parallel SCSI device and are used for everything
from legacy RAID arrays and disk storage to non-Fibre Channel-capable tape
drives. Most routers act as Class 3 devices, because they are geared toward the
transport of storage SCSI traffic (which operates best with Class 3).They also tend
to support loop-only operation, rather than full Fibre Channel fabric protocols,
due to the fact that they are target and not initiating devices. In addition, these
routers support additional levels of error recovery on top of normal SCSI error
recovery, including all error recovery in FCP and newer FCP-2 error recovery
procedures.The capabilities of Fibre Channel-to-SCSI routers are as follows:
■
Number of SCSI buses
■
Types of SCSI buses
■
Internal or external SCSI termination
■
Selective LUN presentation
■
Extended copy support
■
SNMP
■
Telnet
■
Ethernet ports
■
Serial ports
Number of SCSI Buses
The most basic routers have one SCSI bus and one Fibre Channel port connected to the Fibre Channel network. More capable routers have more than one
bus and multiple Fibre Channel ports.The advantages of having more than one
bus include available bandwidth and isolation of error conditions on a bus.With
high-bandwidth SCSI devices and the architecture of parallel (shared bus) SCSI,
speeds are limited by the amount of available bandwidth that can be shared on a
SCSI bus. By providing more than a single SCSI bus, routers with multiple SCSI
ports allow you to reduce the contention for resources on those buses for
high-speed devices.
107
140_SANs_03
108
8/14/01
1:35 PM
Page 108
Chapter 3 • SAN Components and Equipment
Types of SCSI Ports,Termination
Another consideration in routers is the types of SCSI ports available.With different kinds of connections available for SCSI, you need to make sure that the
type of SCSI ports available on your router matches the device you are running.
The different types of SCSI ports that might be supported by your device range
from narrow, single-ended SCSI to fast, wide, ultra-wide, ultra2, and Low Voltage
Differential (LVD) devices.
One area to look at is the type of SCSI termination used by your router.
Parallel SCSI termination can be either internal or external, depending on the
manufacturer. In the case of internal termination, flipping a switch can turn on
termination, enabling the device to be the end of a SCSI chain.With external
termination, you either need to buy an external terminator or the device needs
to be in the middle and not at the end of a SCSI chain. Internal termination
makes it easier to manage this aspect of parallel SCSI operation and also means
one less component you need to track.
Selective LUN Presentation
More advanced routers provide the capability to selectively filter which hosts are
allowed to access a specific SCSI target. Like HBA-based LUN masking and
switch zoning, this can be used to help enforce which devices are allowed to
access a specific storage volume.To configure this LUN presentation, management software is used to specify which hosts are allowable for specific LUNs.This
is usually specified with a port WWN of the HBA.
Selective LUN presentation can be used to limit access to a specific SCSI
device on one side of a router to only one Fibre Channel device, such as to allow
only a backup server access to your tape drive. It can also be used to partition
data between hosts, in order to assign different SCSI targets to different hosts.
Extended Copy Support
Extended copy support is the capability of a Fibre Channel router to support the
Extended SCSI Copy command, which is used for server-free backup on your
SAN. In conjunction with special Fibre Channel SAN-aware backup software,
routers help form a part of a solution that can offload the backup traffic from the
hosts on your network to the Fibre Channel router, enabling storage to be
backed up directly to a tape drive connected to the router instead of requiring
the intervention of different hosts in the network.
140_SANs_03
8/14/01
1:35 PM
Page 109
SAN Components and Equipment • Chapter 3
Extended copy appears as a feature of a Fibre Channel router, which interprets SCSI commands received from backup software to fetch and store data from
storage arrays.
Management Interfaces
Like switches, routers also support different management interfaces, including
serial ports, Ethernet ports, SNMP, FTP,Web interfaces, and also emerging standards like Sun’s Jiro management standard. Most Fibre Channel routers also support the FibreAlliance MIB through their SNMP interface, which allows most
management software to derive basic information from the equipment.
Bridging and Routing to
IP Networks and Beyond
As Fibre Channel expands from just a few machines and storage to much larger
networks, interoperability and connectivity are becoming much more important.
Routers and bridges are starting to be used to take Fibre Channel traffic and
send it across regional and wide area networks, as well as for remote backup and
IP traffic.
Fibre Channel to DWDM
Fibre Channel-to-DWDM technology is beginning to be used to help extend
the distances of SAN operation. Fibre Channel-to-DWDM equipment operates
by multiplexing Fibre Channel optical signals onto a higher wavelength fiber.
Typically, a single switch E_Port is connected to DWDM equipment, which multiplexes the Fibre Channel signal to a remote DWDM port. On the other side, a
DWDM multiplexer is connected to a remote SAN where the frame is sent out
through a switch E_Port back into the network.To the Fibre Channel switch, the
existence of a DWDM link is almost transparent—no change occurs to the protocol, information is received at full speed, and there is no indication that a frame
went over a long-distance link. However, because of distance and the speed of
packets of light, switches do need to be configured for more available frame
buffers between the two switch links connected by the DWDM equipment. Fibre
Channel-to-DWDM equipment is used primarily for regional transport, because
of the distance limitation of Metropolitan Area Networks (MANs).
Using DWDM has many advantages for Fibre Channel.The first is extending
the distance limitation of Fibre Channel beyond 10 km. DWDM technology can
109
140_SANs_03
110
8/14/01
1:35 PM
Page 110
Chapter 3 • SAN Components and Equipment
extend Fibre Channel to MAN distances up to 100 km. Second is the ability to
multiplex a large number of Fibre Channel connections to fewer fiber links,
reducing the amount of optical fiber needed between facilities and simplifying
long-distance cabling.
Fibre Channel across IP Networks
Developing standards allow for the encapsulation of Fibre Channel onto an IP
frame (FC_IP) for transport across any IP-capable network. Like Fibre Channel
across DWDM, this is being targeted at extending the distance of SANs across
regional and wide area distances.The FC_IP protocol encapsulates Fibre Channel
frames within IP packets.This works through special equipment that connects
into a Fibre Channel network and encapsulates Fibre Channel frames and
transmits them on IP networks.This allows any IP-capable network—including
Gigabit Ethernet, ATM, and any other technology—to transport Fibre
Channel frames.
Transporting Fibre Channel over IP networks can help extend SANs well
beyond the campus networks and regional networks now in use.With suitable
bandwidth available,WAN distances can be possible for SANs with this emerging
technology.
The major limitation of running FC_IP is the speed and latency of the IP
network.The IP network must be able to handle the amount of bandwidth generated by Fibre Channel, or otherwise risk being bottlenecked in the IP transport. In addition, different vendors are not compatible with each other using
FC_IP, since the standard is still in a draft phase. However, as the standard progresses, expect that vendors will begin to interoperate.
IP over Fibre Channel to Gigabit Ethernet
Fibre Channel to Gigabit Ethernet routers allow for the transport of IP frames
generated on either the Fibre Channel network or Gigabit Ethernet network to
appear transparently on the other network.This can be done through dedicated
hardware, or even through a host computer set up to route IP frames between
different network interfaces. Standard IP protocol services such as Web pages,
FTP, and telnet can be seamlessly run across your Fibre Channel network and
routed across to an IP network like Gigabit Ethernet.
This is now usually done primarily as a way to retrieve management information from network equipment through in-band management. In particular,
with expensive DWDM or remote links being used to connect remote sites,
140_SANs_03
8/14/01
1:35 PM
Page 111
SAN Components and Equipment • Chapter 3
using IPFC and bridging that information to other general-purpose networks
makes it more cost-efficient to manage the remote equipment. However, as IPFC
becomes more prevalent, these bridges and routers can make it possible to merge
traffic between both IP and Fibre Channel networks.
Fibre Channel Storage
Selecting Fibre Channel storage depends on what kinds of applications and, most
importantly, how much data and protection you need. Fibre Channel storage
ranges from individual disk drives that support the Fibre Channel protocol, to
devices with dozens of connections and different kinds of storage connection
ports, such as Fibre Channel, SCSI, and Enterprise System Connection
(ESCON).This section attempts to briefly describe the Fibre Channel aspects of
this storage, but does not cover the very complex task of
evaluating storage systems.
Individual Disk Drives and JBODs
Individual disk drives, although they support the Fibre Channel protocol, are
rarely deployed alone in a SAN. In general, these individual disk drives are added
to JBOD enclosures that hold four or more single disks in a single, loop-ready
configuration.These drives are wired together into a miniature FC-AL loop with
one or two ports to connect to the drives.The first port of a JBOD is generally
wired to one of the dual ports on a disk drive, and the secondary port of the disk
drives is wired on a secondary loop.To an HBA and system on the other side,
there is no difference between a disk drive and a JBOD. In fact, you cannot tell
that individual disks are tied together into a JBOD electronically or through software. Different JBOD systems differ in the number of ports, physical enclosure
features, and individual disk drives used in the systems. Most JBOD systems do
not add any actual Fibre Channel functionality and just physically connect the
internal Fibre Channel disk drives to the Fibre Channel network. Differences in
JBODs include the number of disk drives included in an enclosure; rack-mount
and standalone options; amount of cooling and the power supplies; temperature
sensors and alarms; and the ability to hot-swap and replace faulty components.
Fibre Channel RAID systems can also be connected into a Fibre Channel
network with one or more ports, and range from low-end systems with only
several gigabytes of capacity and little cache to higher end, hundreds of gigabyte
capacity arrays with extensive cache.They provide varying levels of redundancy
111
140_SANs_03
112
8/14/01
1:35 PM
Page 112
Chapter 3 • SAN Components and Equipment
and performance, and generally can be configured into different types of RAID
levels for differing protection. An important Fibre Channel consideration when
you are selecting a RAID system is the ease of configuration of the RAID. For
example, some systems might require specific operating systems and HBAs in order
for configuration software to work.You should make sure that the RAID system
you select supports the other components in your system and has been qualified
with switches in your system.The capability to configure a RAID system from the
serial port or from an Ethernet port can ease management of that system. RAID
systems also are available in different rack-mount and standalone configurations and
provide redundant cooling and power component options.
RAID Levels
RAID arrays generally support different levels of protection and redundancy for your data. By selecting different RAID levels, you can trade off
speed of operation and recovery against the amount of protection for
your data. The following is a brief description of the most common RAID
levels available:
■
RAID 0: Striping Provides very rapid access to data by
striping information across different disks. By distributing
data across different spindles (disks), the data can be
retrieved very rapidly and at a rate greater than an individual
disk drive can support. However, RAID 0 provides no redundancy or protection for data if any disks fail.
■
RAID 1: Mirroring Duplicates data across disks on a one-toone basis. This provides 100 percent protection of data
across disks and instant access to data if one disk fails. A
duplicate copy of all data is simultaneously written to a mirrored disk, which is made available if the primary disk fails.
However, this is fairly expensive since it requires purchasing
twice as much capacity.
■
RAID 3: Striping with Parity Data is striped as in RAID 0,
but with the addition of a parity disk. This provides speed
and fault tolerance.
■
RAID 5: Striping with Distributed Parity Similar to RAID 3,
but parity is instead distributed across all disks, allowing for
better read performance and fault tolerance.
140_SANs_03
8/14/01
1:35 PM
Page 113
SAN Components and Equipment • Chapter 3
High-End Storage Arrays
High-end storage arrays generally support multiple terabytes of data and often
include the capability to support Fibre Channel connections as well as interfaces
such as SCSI, FICON, and other specialized storage interconnects. Built to
support dozens or even hundreds of storage users, the arrays can range from
refrigerator-sized to half a room. In addition to many storage connections, these
devices provide large amounts of memory cache to accelerate disk accesses, as
well as advanced capabilities like LUN masking through selective LUN presentation, snapshot backup volumes, redundant controllers, and failover and replication
capabilities.
Selective LUN Presentation
Selective LUN presentation is the capability of a storage device to filter or mask
which hosts are allowed to see a LUN. For example, storage can be configured to
show LUN A to host X and LUN B to host Y, but not vice versa, making it possible to partition the storage entirely by hosts on the network.This has many
advantages, including the capability to allocate the storage in the box within the
network at a single interface, to guarantee that users do not accidentally mount
an incorrect volume and corrupt data, and to better control how hosts see the
storage in the network.
Selective LUN presentation works through hardware and software that examines frames coming into a storage subsystem from the network. By examining the
frames and comparing the source of those frames with an administratorconfigured list of allowable hosts, the storage array can allow or deny access to
specific LUNs in the array. In addition, arrays can also enforce LUN numbers
that a host sees, even if those are not the actual physical LUN numbers of the
internal volumes.This is typically used for operating systems like Windows NT,
which requires that every storage volume have a LUN 0—which is fine for
simple RAID arrays but impractical on a storage array that might have a hundred
LUNs allocated by dozens of hosts.Through selective LUN presentation, every
host can see a different LUN 0, which is physically different but can be addressed
through the same LUN 0 address.
LUN Export across Multiple Ports
With Fibre Channel, the capability to make data highly available through a network has resulted in software support for highly available transport of data from
113
140_SANs_03
114
8/14/01
1:35 PM
Page 114
Chapter 3 • SAN Components and Equipment
storage.This software can identify volumes that are the same, even if they are seen
at different points of a network.
High-end storage arrays have a feature that allows a single logical LUN to be
exported across multiple Fibre Channel ports on an array. For example, a single
volume for an e-mail database can be exported across two different storage ports,
across two separate fabrics. If a storage port’s hardware fails or a switch fabric is
disrupted, the other port can still be accessed across a different path.This ability
to export LUNs across multiple ports enables software and dynamic multipathing
software and drivers to intelligently access, either on an active-passive or activeactive basis, the same LUN.
Array management software typically must be configured to allow for this
multiple LUN export. Exporting a LUN across multiple interfaces without the
addition of dynamic multipathing or volume management software can result in
data corruption or collisions due to duplicates in the operating system image. In
addition, the arrays need to identify that these different exported LUNs are actually and logically the exact same volume images.This is referred to as Page 83h
information, named after the SCSI mode page that describes the identification of
storage volumes on a storage device such as a RAID.
Snapshot Backup Volumes
Snapshot backup volumes are a special feature of high-end arrays that take a “snapshot” of an operational LUN’s data at a point in time and copy that data to
another volume.This snapshot, which is made instantaneously while traffic is
running and sometimes in coordination with a halt in traffic to a storage LUN,
enables an easy way to back up a very busy storage array.
Because high-end storage arrays are typically highly utilized and attached to
business-critical systems that must be available 24x7, backup is a very challenging
task. Usually, backup has to be done to a static system, or a system that is not
being written to during the entirety of the backup.With these critical systems,
this never happens—and backups still have to be done. In fact, backups are probably even more important for these systems.
Snapshot backup volumes solve this problem by providing a copy of the data
at a point in time, which can be backed up without the problem of data constantly being changed on a volume.These backup volumes are exported to different hosts or backup hardware on the SAN and at predetermined periods will
refresh their information from the “live” data and allow for the next backup.
140_SANs_03
8/14/01
1:35 PM
Page 115
SAN Components and Equipment • Chapter 3
Summary
The keys to robust Fibre Channel deployment are the components that you
use to build your network. By understanding these components, you can better
design a robust, scalable network. Understanding the features and capabilities
of your hardware will help you select and qualify the best equipment for
your needs.
The most basic layer of your SAN is the physical layer, which includes your
media and connector choices.There are a number of choices that are dependant
on the primary purpose of your SAN. In selecting the type of media to use for
your SAN, you have two choices: copper and optical.The distinct advantage of
copper is that it is inexpensive compared to all types of optical fiber.The advantage of optical fiber is that it provides a reliable signal over a longer distance than
copper.There are two types of fiber: single mode and multimode.The difference in
the fibers is the diameter of the fiber. Smaller-diameter single mode can transmit
at a far greater distance.
The cardinal rule for connecting a SAN is to minimize the total number of
connections and patches. SC is the standard optical connector, but high-density
connectors are becoming more popular as more devices are connected to SANs.
Examples of high-density connectors are LC, MT-RJ, and HSSDC. HSSDC and
DB-9 are the copper connectors available. In most SANs, you will need to use
some types of GBICs. GBICs provide an easy way to adapt devices to whatever
connection type you prefer.This lets you customize your SAN based on distance
and speeds required between devices. Hubs serve as the most rudimentary support for Fibre Channel, providing basic FC-AL connectivity to smaller networks.
Simple hubs provide only basic electrical connectivity, shared bandwidth, and no
intelligence. Intelligent hubs provide much better error recovery and management, especially in multi-initiator fabrics. Key features to look at in hubs are the
different management interfaces available, error recovery, and LIP isolation.
Switches form the core of a switched Fibre Channel fabric, and not only switch
frames between nodes but also contain intelligent services to locate and manage
nodes in the fabric. In addition, the capability to work in loop mode also makes
Fibre Channel switches a good replacement for hub deployments. Performance and
functionality such as zoning and the ability to cascade switches together make
switches a key to controlling access in the SAN and scaling your installations.
HBAs connect hosts to the Fibre Channel network. In conjunction with device
drivers, these devices translate SCSI commands and protocols from operating
115
140_SANs_03
116
8/14/01
1:35 PM
Page 116
Chapter 3 • SAN Components and Equipment
systems and send them across the Fibre Channel network. Advanced functionality
such as persistent binding and LUN masking add to your ability to control how
storage is allocated in your network. HBAs are also capable of non-SCSI traffic
such as IP frames and VI clustering protocols, which are enabling new applications
for your SAN.
Fibre Channel-to-SCSI routers provide an important bridge from legacy parallel bus SCSI devices to new Fibre Channel networks. By translating between
Fibre Channel SCSI and older parallel bus SCSI protocols, older non-Fibre
Channel SCSI devices can seamlessly be used in your network. Advanced features
such as selective LUN presentation and support for extended copy can also help
support applications on your SAN.
Companies are starting to extend their SAN infrastructure over long distances,
adding disaster tolerance and remote backup to their data centers.Technologies
such as Fibre Channel-to-DWDM equipment and encapsulation of Fibre Channel
over IP networks are gaining ground as a way to extend SANs to MANs and even
WANs. Bridging IP from Fibre Channel is also helping to drive the interoperability between standard corporate IP networks and specialized data SANs.
Fibre Channel-capable storage forms the core of a SAN and is present as
low-cost JBOD disk drive cabinets, midrange RAID arrays, and high-end storage
subsystems. JBOD arrays present individual Fibre Channel-capable disks drives to
the network. RAID arrays add error tolerance, cache, and management ability to
the network, with high-end storage subsystems providing much higher functionality and capacity to Fibre Channel and other storage interconnects. High-end
features such as selective LUN presentation, multiple export of LUNs, and snapshot backup are some of the features that can help better manage the allocation
of storage on your SAN, add redundancy to your networks, and help to keep
your data available 24x7.
These components together form the SAN. By studying them and understanding their major features and benefits, you can better select components and
design your network.
Solutions Fast Track
Overview of Fibre Channel Equipment
; Understanding the features of your Fibre Channel equipment is key
when building a robust infrastructure.
140_SANs_03
8/14/01
1:35 PM
Page 117
SAN Components and Equipment • Chapter 3
; A Fibre Channel network is comprised of cabling, GBICs, hubs,
switches, HBAs, and routers.
; Fibre Channel shares much of the same terminology as Ethernet
networking, but the functionality of similarly named equipment is not
necessarily identical.
Cabling and GBICs
; Copper cabling is almost always terminated with either an HSSDC or
DB-9 male connector.
; Multimode optical fiber is terminated using a variety of optical
connectors, including SC, LC, and MT-RJ.
; Single-mode fiber is the most expensive media type, but preferable for
long distances.
; Single-mode fiber, because of its small diameter (9 µm), has the highest
transmission speed potential.
; Copper cabling is available in two types: active and passive. Active copper
lines provide twice the distance of passive copper lines.
; The HSSDC connector was specifically designed as a Gigabit copper
connector, improving density and performance over the DB-9 style
connector.
; GBICs are removable transceivers used in all types of Fibre Channel
devices, including switches, hubs, and HBAs.
; GBICs offer the option of interfacing with almost all types
of connectors.
; A Media Interface Adapter (MIA) converts DB-9 copper connectors to
optical SC connectors.
Using Hubs
; Hubs serve as a very basic level for connecting different ports in a
network together.
; Hubs can connect up to 127 devices together in an FC-AL loop.
117
140_SANs_03
118
8/14/01
1:35 PM
Page 118
Chapter 3 • SAN Components and Equipment
; Simple hubs contain no intelligence, just electrical connections.
; Managed hubs provide a level of error tolerance and
management features.
; Managed hubs provide LIP isolation, automatic port bypass, signal
retiming, and management interfaces.
; Fibre Channel LIPs can be a major source of problems in arbitrated loop
configurations.
; To avoid an earlier generation of problems due to loop architecture,
most people are moving to switched fabric devices instead.
Using Switches and Fibre Channel Fabrics
; Switches are classified into three categories: entry-level, scalable fabric,
and core fabric switches.
; Entry-level switches are focused on small workgroups of 8 to 16 ports,
usually are geared toward low cost, and deliver limited scalability and
management. Fabric switches provide the capability to cascade switches
together to create larger fabrics.
; A core fabric switch is designed for interconnecting multiple edge
switches to form multihundred-port SANs.
; HBAs are used to connect servers to the network.They map SCSI
commands in the operating system to Fibre Channel frames on the network. HBAs range from low-end, loop-only devices to high-end, fabric
multipathing adapters.
; Major protocols supported by HBAs are SCSI-FCP for storage, IPFC for
networking, and VIFC for clustering.
; HBAs either support 1 Gbit/sec or 2 Gbit/sec speeds, with current gen-
eration cards supporting 1 Gbit/sec, and emerging cards supporting both.
; HBAs can be found in single one-port configurations or dual-port
adapters for higher density.
; LUN masking enables control of access to devices in the network from
the HBA.
140_SANs_03
8/14/01
1:35 PM
Page 119
SAN Components and Equipment • Chapter 3
; Persistent binding is the mapping of a Fibre Channel device into an
operating system at a specific device location.
; Dynamic discovery is the capability to dynamically add and remove
drives from your system without reboot.
; HBA API support is an important feature that allows management of
your HBA by SAN management software.
; Remote booting is the use of an HBA to boot an operating system
image across the SAN and is used to dynamically change hosts and
enable ease of disaster recovery.
Connecting Legacy Devices into Your SAN
; Α Fibre Channel router, which is also known as a bridge, allows legacy
parallel SCSI devices to attach to your Fibre Channel SAN.
; A Fibre Channel router plugs into Fibre Channel on one side and
a SCSI bus on the other.
; Frames are translated from SCSI-FCP to parallel SCSI bus signals
through routers.
; Routers provide many different features, including different numbers
of SCSI buses and different support for parallel SCSI protocols
and termination.
; Advanced features include selective LUN presentation, extended copy
support, and various management interfaces.
; Selective LUN presentation is the capability of a router to mask the
presence of devices to different hosts in the network and allow for better
security and control over resources.
; Extended copy support (third-party copy) allows software to directly
back up data on the SAN, saving CPU and network traffic.
; Available management interfaces include telnet, SNMP, Ethernet, and
serial ports.
119
140_SANs_03
120
8/14/01
1:35 PM
Page 120
Chapter 3 • SAN Components and Equipment
Bridging and Routing to IP Networks and Beyond
; Fibre Channel-to-DWDM technology multiplexes Fibre Channel signals
onto higher bandwidth fiber for transmission over MAN distances (up
to 100 km).
; Use of DWDM is transparent to Fibre Channel switches, except for
buffer settings.
; It is necessary to increase buffer credit settings to handle the long
distances/delays involved in MANs.
; Fibre Channel can also be transported across IP networks like ATM and
Gigabit Ethernet.
; FC_IP (not to be confused with IPFC) encapsulates Fibre Channel
frames in the IP protocol and can be used for remote backup and
extending SAN distances.
Fibre Channel Storage
; Fibre Channel storage is important as the core of the data storage on
your network.
; Fibre Channel storage ranges from simple JBOD devices to multi-
terabyte storage arrays.
; A JBOD is a cabinet of independent disks, all connected into the Fibre
Channel network in a loop.
; Hosts individually address disks in a JBOD.
; RAID arrays provide additional protection and performance to
your storage.
; Different RAID levels are appropriate for different applications.
; High-end storage arrays add support for multiple terabytes of data.
Other types of connections include parallel SCSI, ESCON, and FICON.
; High-end arrays also generally include a large amount of cache, which is
used to speed up data access.
140_SANs_03
8/14/01
1:35 PM
Page 121
SAN Components and Equipment • Chapter 3
; Selective LUN presentation is the ability of high-end storage to control
access by hosts to data and to ensure data integrity.
; LUN export across multiple ports is used for redundancy and high avail-
ability, but requires dynamic multipathing software or drivers to work.
; Snapshot backup volumes are used to enable backup on live databases
and data images.
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: Should I buy a switch that supports GBICs or fixed media?
A: This depends on your application. GBICs offer much more flexibility in the
configuration and media in your network, allowing you to mix copper and
optical media and different copper and optical connectors. However, GBICs
do add some cost to your installation and are another possible point of failure.
Using fixed media generally lowers the cost of your switches and is more reliable. However, a failure of a transceiver in fixed media means a swap of the
entire switch and not just a GBIC.
Q: I notice that many Fibre Channel devices support a common SNMP MIB.
How do I get this to work with my network management software?
A: You can get MIB files from the manufacturer of your hardware, usually from a
distribution disk or through a download from their Web site. Using an MIB
browser in your network management software, you can browse this data.
However, a better way to use this is through Fibre Channel management software that understands how to interpret these MIBs and show (for example) a
visual topology of the network from this data.
121
140_SANs_03
122
8/14/01
1:35 PM
Page 122
Chapter 3 • SAN Components and Equipment
Q: Should I be buying 1 Gbit/sec or 2 Gbit/sec components for my network?
A: You should consider the bandwidth requirements of your application. 1 Gbit/sec
networks are the most proven and mature technology. However, as 2 Gbit/sec
equipment starts to become available, you can expect to see all parts of the network (storage, switches, and HBAs) supporting 2 Gbit/sec standards.The higher
bandwidth will provide significant performance improvements when used on
ISLs, though.
Q: It looks like most Fibre Channel components provide some sort of masking
ability to control access to either ports or LUNs.Which one of these
methods should I use in my network?
A: Even though different components all offer some sort of zoning, LUN
masking, or LUN presentation techniques, you will probably need to use all
of them in one way or another to control your network. For example, switch
zoning is invaluable for isolating specific SAN segments and control, but you
will have to use either HBA-based LUN masking or storage-based LUN
presentation to more finely partition your volumes.
140_SANs_04
8/14/01
3:34 PM
Page 123
Chapter 4
Overview of
Brocade SilkWorm
Switches and
Features
Solutions in this chapter:
■
Selecting the Right Switch
■
Understanding the Brocade Fabric OS
■
Using Optional Brocade Features
■
Future Capabilities in the Brocade
Intelligent Fabric Services Architecture
Summary
Solutions Fast Track
Frequently Asked Questions
123
140_SANs_04
124
8/14/01
3:34 PM
Page 124
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
Introduction
This chapter covers the Brocade switch models and describes their differences
and similarities. It also describes how to select the most effective switch for specific requirements or SAN applications, a critical part of the overall process of
implementing and managing a SAN. A discussion of the Brocade Fabric OS helps
to explain the core aspects of switch functionality, and how that functionality
relates to your own installation. In addition, this chapter gives you an overview of
the features of Fabric OS and how they control the behavior of Brocade
switched-fabric SANs. Finally, a discussion about the future capabilities of SANs
assists you with developing your SAN vision.The information in this chapter
presents an overview of the Brocade product family to provide context for the
other chapters in the book and to enable this book to be a self-contained reference on Brocade SANs. For the latest and most detailed information about
Brocade products, visit the Brocade Web site at www.brocade.com.
Selecting the Right Switch
By choosing among the wide variety of switches available today, you can deploy
them in simple, standalone configurations or by networking them with other
switches to build larger or more resilient fabrics.
Selecting the right switch for your application is extremely important, since
the switch is the building block of the infrastructure for your SAN.The right
SAN infrastructure can significantly improve information management and allow
you to address some of your most challenging business requirements.
As with other types of technology implementations, selecting a switch is a
strategic decision.You must understand your current IT infrastructure and requirements, as well as how you might need the switch to fit into your strategic direction. As a result, you need to understand key factors such as scalability, compatibility,
and interoperability with other hardware resources. After all, your SAN is hardly a
static platform, and you should expect to expand it or integrate it with other components as new technologies emerge.This is how a SAN provides you with investment protection by enabling you to scale and adapt it to meet future requirements.
At a very high level, you should be aware of a few key characteristics when
selecting a switch, including hardware redundancy features, which reflect the level
of reliability and availability you require; port count, or the overall size of your
installation and the number of devices you need to connect; function, that is, the
capabilities you need for certain types of applications or environments; and finally,
cost, in planning the overall budget to achieve your SAN goals.You can use these
140_SANs_04
8/14/01
3:34 PM
Page 125
Overview of Brocade SilkWorm Switches and Features • Chapter 4
characteristics to differentiate the various members of the Brocade SilkWorm
family of switches:
■
Hardware Redundancy The low-cost SilkWorm 2000/2200 entrylevel models have a single fixed power supply and cooling mechanism.
The enterprise-class SilkWorm 2400/2800 products have dual hotswappable power supplies and hot-swappable cooling fans. All except the
lowest-end SilkWorm 2000 series of switches offer pluggable Gigabit
Interface Converters (GBICs) to enable fast replacement of optical
transceivers.
■
Port Count Brocade offers 8- and 16-port switches, a 64-port integrated fabric and a dual 64-port core fabric switch (in the future).You
can use these switches alone or together to form fabrics consisting of
hundreds of ports.
■
Function The Brocade Fabric OS software installed on all Brocade
switches is inherently capable of supporting all key features. However,
software license keys might be required to activate some of these features. Each switch comes bundled with certain software keys, and if you
need features that are not bundled with the product, you will need to
budget for the additional licenses.
■
Cost Brocade has a wide range of switch models, from low-cost entrylevel departmental switches all the way up to robust, enterprise-class
fabric switches, integrated fabrics, and core fabric switches.
One of the first steps in selecting a switch is to decide how many servers and
storage devices you want to connect.There are some general guidelines you can
follow when deciding what type of switch best meets your needs. For networks
of less than eight devices with no growth plans, an 8-port switch will suffice. If
you plan to have more than eight devices at some point in time, you will probably want a 16-port switch. If you have more than 16 devices, you will want a
network built from a number of 16-port switches. If you have over 50 devices,
you will want to either build a network from 16-port switches or consider a
more advanced solution such as the SilkWorm 6400 Integrated Fabric, which
provides a preconfigured network in a convenient package.
If you have business-critical applications that must be continuously available,
you will need to connect them in a resilient manner.You might even want to use
a dual-fabric configuration.We discuss this in more detail in Chapter 5, “The
SAN Design Process,” and Chapter 7, “Developing a SAN Architecture.” Since
125
140_SANs_04
126
8/14/01
3:34 PM
Page 126
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
redundancy lies within the fabric, it might be possible to utilize the less expensive
switches with a single fixed power supply. If one of the switches fails, multipathing software on the host and storage array will route traffic over the redundant fabric until the failed switch is replaced.
For applications that can tolerate occasional downtime for periodic maintenance, a single fabric is less costly and can be built upon switches with redundant,
hot-swappable power supplies and fans—so the fabric can withstand the loss of
one of the components without downtime.We discuss the topics of redundancy
and resilience in detail in Chapter 7.
Most Brocade switches support hot-pluggable GBICs that provide both flexibility—you can choose optical or copper media—and high availability. If a single
GBIC fails, you can replace it without disturbing the other ports on the switch.
However, the lowest priced 8-port entry model supports one GBIC and seven
fixed optical ports.
The Brocade family of fabric switches includes a wide variety of cost options,
ranging from the least expensive 8-port SilkWorm 2010 switch with fixed media
to the highly available 16-port SilkWorm 2800 and the redundantly configured
SilkWorm 6400 Integrated Fabric.The high end of the family is the SilkWorm
12000 Core Fabric Switch, which will provide 128 ports in a single enclosure
when it is released. All Brocade switches are fully interoperable with each other
and can be mixed in fabrics to provide the optimal balance of cost-effectiveness,
expansion capability, and high availability.
Entry-Level Switches
Brocade entry-level switches are designed for environments with smaller storage
growth requirements where cost considerations outweigh the need for high availability.The 8-port SilkWorm 2010, 2040, and 2050 are the same physical 1Uhigh switch with different feature sets enabled (Figure 4.1).The 16-port
SilkWorm 2210, 2240, and 2250 are the same physical 1.5U-high switch with
different features enabled (Figure 4.2).
Figure 4.1 SilkWorm 2000 Series Switch
140_SANs_04
8/14/01
3:34 PM
Page 127
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Figure 4.2 SilkWorm 2200 Series Switch
SilkWorm 2010 (8 Ports) and 2210 (16 Ports)
The SilkWorm 2x10 are arbitrated loop-only switches and are a viable alternative
to a managed hub-based solution.The SilkWorm 2010 and 2210 series deliver
superior performance compared to a hub by providing simultaneous, full-bandwidth data transfers and enhanced availability through fault isolation and intelligent Loop Initialization Primitive (LIP) management.The SilkWorm 2010
features seven fixed optical ports and one pluggable GBIC slot, while the 2210
offers 16 pluggable GBIC slots.The fixed media solution in the SilkWorm 2010
reduces cost by eliminating the need to purchase GBICs for the remaining slots.
A consideration to this configuration is that, if one fixed optical component fails,
you need to replace the entire switch rather than a single component.
The SilkWorm 2x10 are bundled with Brocade Zoning, Brocade WEB TOOLS,
QuickLoop, and the Simple Name Server (single switch) enabled.The 2x10 can
be upgraded to a full-fabric switch with a license key.This capability is key to
investment protection: you might need only a loop-switch now but decide in the
future that you need fabric capabilities as well.With the 2x10, you do not need to
perform a forklift upgrade of your SAN in order to add that functionality.
SilkWorm 2040 (8 Ports) and 2240 (16 Ports)
The SilkWorm 2040 and 2240 are the same physical switches as the SilkWorm
2010 and 2210, respectively, with additional support for entry-level fabrics.You
can use these switches in dual-switch configurations to support fabrics or loop
environments of up to 30 ports. Alternatively, you can have one E_Port connection per switch to connect the switch to a larger Brocade fabric.
The SilkWorm 2040 is bundled with Brocade Zoning, Brocade WEB TOOLS,
and the Distributed Name Server enabled. It can be upgraded to a full-fabric
switch (SilkWorm 2x50) with a license key. However, you must then also purchase Brocade QuickLoop to connect Fibre-Channel Arbitrated Loop
(FC-AL) hosts.
127
140_SANs_04
128
8/14/01
3:34 PM
Page 128
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
SilkWorm 2050 (8 Ports) and 2250 (16 Ports)
The SilkWorm 2050 and 2250 are the same physical switches as the SilkWorm
2010 and 2210, respectively, but they feature a full-fabric configuration that
enables multiple E_Port connections and networked fabrics of multiple switches
(equivalent to what is provided by the SilkWorm 2400 and 2800 switches).This
design enables redundant paths between switches.
In a networked configuration, any device can connect to any other device,
with I/O taking the shortest available path to the target device.The SilkWorm
2x50 is bundled with Brocade Zoning, Brocade WEB TOOLS, and the
Distributed Name Server enabled. However, optional Brocade QuickLoop software is necessary to connect private FC-AL hosts.
While the SilkWorm 2400 and 2800 switches offer more high-availability
features (such as redundant, hot-swappable power and cooling), the highest availability environment requires that dual fabrics be used with connections from each
server and storage to both fabrics. Operator errors, natural disasters, and other
catastrophic events can cause an entire fabric to become inoperable. In that
respect, a single fabric is a single point of failure. Path failover software—such as
EMC PowerPath,VERITAS Dynamic Multipathing software, Compaq
SecurePath, or other solutions—allows traffic to flow over either fabric. In these
environments, the SilkWorm 2050 and 2250 switches can provide cost-effective
solutions when availability of an individual switch is not the highest requirement.
NOTE
By enabling data routing, rerouting, self-healing, and high scalability,
Brocade full-fabric products enable switch-to-switch networking to produce a resilient multiswitch fabric. If you think you might eventually
upgrade to a full fabric, this feature set is highly useful.
Scalable Fabric Switches
The Brocade SilkWorm family of switches is designed to work with other
popular storage hardware and servers to enable a best-of-breed open systems
environment. By integrating with heterogeneous IT infrastructures, Brocade
switches help leverage existing storage investments while providing a strategic
path to manage continued data growth.
140_SANs_04
8/14/01
3:34 PM
Page 129
Overview of Brocade SilkWorm Switches and Features • Chapter 4
The SilkWorm 2400 and 2800 are the most widely deployed members of the
Brocade SilkWorm family. Both provide a fault-tolerant solution with dual power
supplies and redundant cooling fans.To further increase availability, both the
power supplies and cooling fans can be replaced without taking the switch
offline.These switches are bundled with Brocade Fabric OS, including
Distributed Name Server, FSPF routing, automatic discovery, and advanced diagnostic and management functions. Most Brocade switches are sold bundled with
Brocade Zoning and Brocade WEB TOOLS optional software packages.
SilkWorm 2400
The SilkWorm 2400 is an 8-port full-fabric switch (Figure 4.3). It is 1U high
and has two slim power supplies that can be removed and replaced while the
switch is online.These power supplies are the same power supplies used for the
SilkWorm 2800 and therefore are interchangeable with the SilkWorm 2800.The
SilkWorm 2400 supports pluggable optical or copper GBICs on all ports and
offers an Ethernet connection and a serial connection for management. All of the
ports are capable of supporting and automatically detecting fabric connection
(F_Port), loop device connection (FL_Port), or connections to other fabric
switches (E_Port).
Figure 4.3 SilkWorm 2400 Fabric Switch
SilkWorm 2800
The SilkWorm 2800 is a 16-port full-fabric switch (Figure 4.4). It is 2U high and
has dual power supplies (the same power supplies are used for the 2400) that can
be removed and replaced while the switch is online.The SilkWorm 2800 supports pluggable optical or copper GBICs on all ports and offers an Ethernet connection, a built-in two-line LCD display, and a four-key keypad for management.
This management capability enables you to configure a switch without additional
equipment (such as an ASCII terminal). All ports are capable of automatically
129
140_SANs_04
130
8/14/01
3:34 PM
Page 130
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
detecting and supporting either fabric connection (F_Port), loop device connection (FL_Port), or connections to other fabric switches (E_Port). All ports can be
used in any mode.
Figure 4.4 SilkWorm 2800 Fabric Switch
SilkWorm 6400 Integrated Fabric
The SilkWorm 6400 is a 64-port integrated fabric designed for data center environments (Figure 4.5). It harnesses the strong networking capability of Brocade
switches to create an integrated 64-port solution that is typically half the cost of
director-class products. In addition, it enables interconnection of a large number
of hosts and storage devices for enterprise-wide distributed applications.The
SilkWorm 6400 is simple to install, cable, and configure.The high-port-count
solution features a modular design of six SilkWorm 2250 switches in a custom
housing.The six switches are preconfigured and preconnected to form a highly
available fabric with 64 user ports and no single point of failure. Brocade Fabric
Manager software enables consolidated management of the switch modules.
The SilkWorm 6400 is interoperable with other SilkWorm switches, and it
supports private loop Host Bus Adapter (HBA) environments through Brocade
QuickLoop.The switch is bundled with Fabric OS, Brocade Zoning, Brocade
WEB TOOLS, and Brocade Fabric Watch.
The internal topology of the SilkWorm 6400 is designed to provide a costeffective high-availability solution.Two of the switch modules operate as the core
of the fabric and use eight ports to provide dual connections to each of the four
edge switch modules.The remaining eight ports are used for device connections.
Each edge switch module provides two connections to each of the core switch
modules, leaving 12 ports for device connections.The use of dual core switch
modules and dual connections from each edge switch ensures that no single
failure can bring down the entire integrated fabric.
140_SANs_04
8/14/01
3:34 PM
Page 131
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Figure 4.5 SilkWorm 6400 Integrated Fabric
Fabric Manager
By providing a centralized view of the SilkWorm 6400 switch modules, Brocade
Fabric Manager simplifies SAN administration and maintenance of the SilkWorm
6400 Integrated Fabric. A portable, Java-based management application that is
easy to install on a Windows management station, Fabric Manager makes it easy
to view the status of all switch modules, drill down to individual switch modules,
and access Brocade WEB TOOLS.
SilkWorm 12000 Core Fabric Switch
To further support enterprise-level SAN deployment, Brocade has developed the
SilkWorm 12000 Core Fabric Switch, which will provide up to 128 ports of
connectivity in a single enclosure (Figure 4.6).The switch will be the first model
based on a third-generation ASIC that enables auto-sensed link speeds of 1 and 2
Gbit/sec.With its high-performance and high-reliability characteristics, the
SilkWorm 12000 will provide the same capabilities as director-class switches, but
with improved intelligence, scalability, and interoperability.With superior built-in
intelligence, the SilkWorm 12000 will enable a centrally managed core-to-edge
network model based on proven core backbone networking methodology.
The SilkWorm 12000 will also feature a protocol-independent backplane
that supports 2 Gbit/sec Fibre Channel blades on release, and in the future,
10 Gbit/sec Fibre Channel blades.The protocol-independent design will also
support emerging storage protocols such as Small Computer Systems Interface
over IP (iSCSI), Fibre Channel over IP (FC_IP), and InfiniBand. In addition to
131
140_SANs_04
132
8/14/01
3:34 PM
Page 132
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
Fibre Channel, InfiniBand, and IP, the SilkWorm 12000 will support an optional
Application Platform blade that enables the deployment of high-performance
fabric services such as storage virtualization and third-party copy.With the
Application Platform integrated into the switch, higher data rates will be possible
and management between switches and applications will be much easier.
Figure 4.6 SilkWorm 12000 Core Fabric Switch
NOTE
At the time of printing, the SilkWorm 12000 represents an unreleased
product. This section is, therefore, a visionary statement of future SAN
design capabilities.
Understanding the Brocade Fabric OS
The Brocade Fabric OS enables you to easily configure, manage, and maintain a
SAN for your specific needs.The de facto industry standard, Fabric OS simplifies
management for both FC-AL and switched-fabric SANs. Fabric OS allows you
to discover the network of connected storage and host devices and automatically
determines the available data paths through the switches and fabric. In addition,
Fabric OS enables you to customize your fabric via telnet commands or a Webbased Graphical User Interface (GUI). Figure 4.7 shows the high-level functions
provided by Fabric OS.
140_SANs_04
8/14/01
3:34 PM
Page 133
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Figure 4.7 Fabric OS Functions
Optional
Fabric
Software
Base
Services
BROCADE
SES
BROCADE
WEB TOOLS
BROCADE
QUICKLOOP
FABRIC SERVICES
Distributed Simple Name Server
Alias Server (Multicase)
Routing Services, FSPF
Universal Port
EXTENDED
FABRICS
MANAGEMENT SERVICES
Management Server
SNMP Agent
Telnet / Serial (Front Panel)
Web Server
Fabric Watch
REMOTE
SWITCH
Fabric OS
Fibre
Channel
Platform
BROCADE
ZONING
SILKWORM SWITCH FAMILY
Fabric OS Core Functions
Fabric OS provides core functions such as:
■
Automatic discovery of devices Fabric devices log in to the Simple
Name Server (SNS).Translative mode is automatically set to allow fabric
initiators to communicate with private loop targets.
■
Universal port support Fabric OS identifies port types and automatically initializes each connection specific to the attached Fibre Channel
system, whether it is another switch, host, private loop, or fabric-aware
target system.
■
Continuous monitoring of ports for exception conditions Fabric
OS disables data transfer to ports when they fail. Ports are automatically
enabled after the exception condition is corrected.
Fibre Channel Services for Reconfiguration
Fabric OS provides a standard set of Fibre Channel services that provide fault tolerance and automatic reconfiguration when a new switch is introduced to the
fabric.These services include:
■
Management Server Supports in-band discovery of fabric elements
and topology.
■
Simple Name Server (SNS) Incorporates the latest Fibre Channel
standards and registers information about SAN hosts and storage devices.
133
140_SANs_04
134
8/14/01
3:34 PM
Page 134
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
It also provides a Registered State Change Notification (RSCN) when a
device state changes or a new device is introduced.
■
Alias Server Supports the multicast service that broadcasts data to all
members of a group.
Dynamic Routing Services
Fabric OS provides dynamic routing services for high availability and maximum
performance.These routing services include:
■
Dynamic path selection via link-state protocols Uses Fabric
Shortest Path First (FSPF) to select the most efficient route for transferring data in a multiswitch environment.
■
Load sharing to maximize throughput through Inter-Switch
Links (ISLs) Supports high throughput by using multiple ISLs between
switches.
■
Automatic path failover Automatically reconfigures alternate paths
when a link fails. Fabric OS distributes the new configuration fabricwide and reroutes traffic without manual intervention.
■
In-order frame delivery Guarantees that frames arrive in order.
■
Automatic rerouting of frames when a fault occurs Reroutes
traffic to alternative paths in the fabric without interruption of service
or loss of data.
■
Routing support for link costs Enables network managers to manually configure the link costs of individual ISLs to create custom FSPF
function that supports unique network management objectives.
■
Support for high-priority protocol frames (useful for clustering
applications) Ensures that frames identified as priority frames receive
priority routing to minimize latency.
■
Static routing support Allows network managers to configure fixed
routes for some data traffic and ensure resiliency during a link failure.
■
Automatic reconfiguration Automatically reroutes data traffic onto
new ISLs when they are added to the SAN fabric.
140_SANs_04
8/14/01
3:34 PM
Page 135
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Facilities for End-to-End SAN Management
Fabric OS includes an extensive set of facilities for end-to-end SAN management, including:
■
Management Server based on FC-GS-3 Permits in-band access to
fabric discovery.
■
An SNMP agent and a series of comprehensive Management
Information Bases (MIBs)
—Assists with monitoring and configuring the switches.
—Provides an extensive set of trap conditions.
—Immediately alerts administrators about critical exception conditions.
■
In-band (through IP or over a Fibre Channel link) or external
Ethernet interface Gathers SNMP information and provides access to
all the switches in the fabric through a single fabric connection.
■
Syslog daemon interface Directs exception messages to up to six
recipients for comprehensive integration into a host-based management
infrastructure.
■
Switch beaconing Identifies an individual switch among a group of
remotely managed fabric elements.
Brocade Command Line Interface
The command line interface can be an excellent tool for managing your switch.
You can log in to the command line interface through two methods: you can
telnet into the switch or, on some models, you can connect an ASCII console to
the DB9 serial port and log directly in.
Using Optional Brocade Features
Brocade offers a wide variety of optional features designed to simplify the
deployment, management, and administration of SAN fabric environments.These
optional features are designed to help you fully leverage your SAN resources to
ensure a fast Return On Investment (ROI).
135
140_SANs_04
136
8/14/01
3:34 PM
Page 136
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
NOTE
SilkWorm switch features are enabled through a licensing system. When
you purchase additional feature products, you simply enter a license key
into the switch to enable the feature.
Brocade Zoning
Brocade Zoning, now bundled with almost every switch, provides advanced SAN
management capabilities. It enables the separation of a fabric into smaller, isolated
subfabrics to address closed user group requirements. As a result, zoning is an
excellent way to enhance security within a fabric.
With zoning, fabric-connected devices are arranged into logical groups of
devices over a single physical configuration, enabling you to segregate certain
devices from other devices.This can be helpful when you have devices that do
not interoperate well, or when you want to separate a development environment
from a production environment without purchasing additional switches. Devices
in a zone only “see” other devices in that zone and can access only those members. Any device not included in a given zone is not available to the devices in
that zone.
Brocade Zoning is available in both software and hardware formats, and you
can intermix both formats within a fabric. In general, software-enforced zoning
provides more flexibility, while hardware-enforced zoning provides the highest
level of security.
Brocade Zoning involves zone specification, enforcement, and management.
You can use a set of telnet commands (either in-band or out-of-band) to create,
delete, and display zones; to add or remove zone members; and to configure a set
of zones. Further information about Brocade Zoning appears in Chapter 6, “SAN
Applications and Configurations,” and Chapter 9, “SAN Implementation,
Maintenance, and Management.”
Extended Fabrics
Because you might need to connect multiple data centers over a long distance,
Brocade offers Extended Fabrics support. Extended Fabrics reconfigures the
switch to support the rigors of transmitting I/O over long distances in conjunction with technologies such as Dense Wave Division Multiplexing (DWDM).
140_SANs_04
8/14/01
3:34 PM
Page 137
Overview of Brocade SilkWorm Switches and Features • Chapter 4
This feature extends all of the scalability, reliability, and performance benefits of
Fibre Channel SANs beyond the native 10 km distance specified by the Fibre
Channel standard. Moreover, it enables the use of full-performance applications
over extended distances, including disaster recovery, remote backup, extended
storage consolidation, remote mirroring, and tape consolidation.
Extended Fabrics can be especially useful if you want to connect remote
locations—such as a disaster recovery facility—with the high performance and
reliability associated with a Fibre Channel SAN. Using Extended Fabrics, you
can leverage existing high-speed public and private networks to connect your
Fibre Channel SANs over Metropolitan Area Networks (MANs) and Wide Area
Networks (WANs).
Brocade Fabric Access Layer API
As the proliferation of storage networking increases, the need for
storage applications to directly access and control fabric resources has
become a critical requirement. Fabric Access, the Brocade Fabric OS
Access Layer API, provides a flexible way for applications to access a
variety of SAN information. Through Fabric Access, applications can control the fabric for functions such as discovery, access (zoning) management, performance, and switch control.
Fabric Access consists of a host-based library that interfaces the
application to switches in the fabric through an out-of-band TCP/IP connection or an in-band IP-capable HBA. You can develop your own SAN
management applications from the API or take advantage of applications from third-party software developers, such as EMC and VERITAS.
Key benefits of Fabric Access include:
■
Single point of access to the fabric
■
Secure access control
■
Multifabric access
■
Transaction-based management
■
Object-oriented XML interface
■
Multiplatform support
■
Conformity to industry standards
137
140_SANs_04
138
8/14/01
3:34 PM
Page 138
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
Fabric Watch
An optional SAN monitor for Brocade SilkWorm switches running Fabric OS
2.2 or higher, Fabric Watch enables each switch to constantly watch fabric elements for potential faults—and automatically alert you about problems before
they become costly failures. Fabric Watch tracks a variety of SAN fabric elements, events, and counters—including fabric-wide events, ports, GBICs, and
environmental parameters. Using Fabric Watch, you can quickly identify and isolate faults while optimizing fabric-wide performance. In addition, Fabric Watch
integrates easily with standard enterprise systems management tools—sending
traps as opposed to requiring polls to report exceptions. Compared to other types
of management protocols—such as SNMP—Fabric Watch provides a more robust
solution that enables proactive management of your SAN environment.
Understanding Loop Support,
QuickLoop, and Fabric Assist
Brocade switches can replace intelligent hubs by creating virtual loops.This is
done by logically connecting ports on one or two SilkWorm switches, with one
or more private loop devices connected to each of the ports. Each switch port
and the devices attached to it form a “looplet” that can independently transfer
data at 100 MB/sec. Unlike hub-based environments, bandwidth is not shared
across looplets, so full-bandwidth data transfers can occur simultaneously on all
switch ports. Multiple hosts can simultaneously transfer data to different looplets
in parallel. Also unlike hubs, Brocade switches provide superior fault isolation
capabilities, preventing errors on a single device from disrupting the entire SAN.
The Brocade QuickLoop software product provides a cost-effective solution
for using private loop hosts with your switched-fabric-based SAN. QuickLoop is
an alternative to various hub-based solutions, and since you are connecting these
devices to a switch as opposed to a hub, private loop environments should exhibit
significant performance and reliability improvements. As a result, QuickLoop can
help protect your existing technology investment in hub solutions while integrating them into a higher performance environment. If you want to connect
private loop HBAs to your Brocade Fabric, you will need QuickLoop or
QuickLoop with Fabric Assist.
QuickLoop operates as a translator between the legacy loop initiators and the
fabric target devices.When setting up QuickLoop, you specify which ports are
connected to loop initiators and which fabric targets will be available to them.
140_SANs_04
8/14/01
3:34 PM
Page 139
Overview of Brocade SilkWorm Switches and Features • Chapter 4
NOTE
Only private loop initiators (HBAs) require the QuickLoop/Fabric Assist
products. Public loop devices, and even private loop targets (storage) use
the Fabric OS built-in translative mode feature instead.
QuickLoop offers two modes of private loop host attachment: Hub
Emulation and Fabric Assist.With Hub Emulation, QuickLoop actually builds
private loops across a set of switch ports on one or two switches. A combination
of QuickLoop functions and Brocade Zoning, Fabric Assist creates virtual loops
capable of spanning the entire fabric while allowing private loop hosts to function as if they were attached to a physical loop switch.
Brocade Fabric OS assigns a “phantom” fabric address to each private loop
target device, enabling each device to be registered transparently in the fabric.To
migrate a private loop host to the fabric, a QuickLoop port is reconfigured to
operate in Fabric Loop Attach (FLA) mode. Next, a single private loop host is
attached to the port (which cannot connect to any other devices). A Fabric Assist
zone is then configured to include the private loop host and all attached storage
with which it needs to communicate.The storage can be private loop, public
loop, or fabric aware.The private loop host appears to reside on a dedicated
private loop with all of the storage in the Fabric Assist zone.
Brocade WEB TOOLS
To simplify SAN fabric management, Brocade WEB TOOLS is a software utility
that enables you to manage and monitor your fabric through a Web browser
interface and Java plug-in. Using WEB TOOLS, you can view all switches in the
SAN from a single interface from any workstation in your enterprise—even at a
remote location.You can perform administration and configuration tasks for the
entire SAN fabric, fabric switches, and individual ports.The utility presents a
graphical representation for each switch licensed for WEB TOOLS, and you can
manage and manipulate each switch through the GUI.
In addition to showing a graphical representation of each switch, the
WEB TOOLS screen indicates the status of the switch.When a switch has a
warning, you can click on that switch to obtain a detailed view to see power
supply status, GBIC/link status, and activity indicators.This approach enables you
139
140_SANs_04
140
8/14/01
3:34 PM
Page 140
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
to glance at a display and know immediately if there is a problem with the fabric
so you can take corrective action.
Future Capabilities in the Brocade
Intelligent Fabric Services Architecture
To provide a powerful yet flexible framework for addressing critical SAN requirements, Brocade has developed the Intelligent Fabric Services Architecture, which
will provide both the basic switching functions and the advanced services that
improve manageability, availability, security, and scalability (Figure 4.8).This architecture, which will help transform the network into an intelligent SAN fabric,
consists of the following building blocks:
■
The SilkWorm family of fabric switches
■
Advanced Fabric Services
■
Open Fabric Management tools
■
Enterprise-class security products
NOTE
At the time of printing, components of the Brocade Intelligent Fabric
Services Architecture represent unreleased products or product functions.
This section is, therefore, a visionary statement of future SAN design
capabilities.
Beginning with the SilkWorm 12000, the Intelligent Fabric Services
Architecture will enable a wide range of advanced switch fabric services,
including Brocade ISL Trunking, Brocade Frame Filtering, more robust hardwareenforced zoning, more comprehensive performance monitoring, and enhanced
security with Brocade Secure Fabric OS.
Brocade ISL Trunking
Brocade ISL Trunking enables as many as four Fibre Channel links between
switches to be combined to form a single logical ISL with an aggregate speed of
up to 8 Gbit/sec.These high-speed trunks simplify network design, optimize
bandwidth utilization, and ensure that server-to-storage performance remains
balanced under heavy network loads (Figure 4.9).
140_SANs_04
8/14/01
3:34 PM
Page 141
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Figure 4.8 The Brocade Intelligent Fabric Services Architecture
SAN Management and Administration,
Storage Resource Management, Storage Administration, Storage Virtualization,
LAN-Free Backup
Enterprise-Class
Security
Fabric OS
Advanced Fabric
Services
2000
2200
2400
Open Fabric
Management
12000
6400
2800
Intelligent Switching Platform
Fibre Channel
IP
InfiniBand
Multiprotocol Support
Figure 4.9 Brocade ISL Trunking Relieves Congestion and Enables High-Speed
Data Traffic
Optimal bandwidth utilization using load
balancing on up to four 2 Gbit/sec links
Without Trunking
1G
With Trunking
1.5 G
0.5 G
1G
1G
2G
1.5 G
0.5 G
1G
2G
Congestion
2G
1.5 G
0.5 G
1G 2G
2G
1.5 G
0.5 G
1G 2G
ASIC Preserves
In-Order Delivery
ISL Trunking is an optional software product available with all Brocade 2
Gbit/sec Fibre Channel fabric switches.This new technology is ideal for optimizing
141
140_SANs_04
142
8/14/01
3:34 PM
Page 142
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
the performance and simplifying the management of a multiswitch SAN fabric
containing Brocade 2 Gbit/sec switches.When two, three, or four adjacent ISLs are
used to connect two switches, the switches automatically group the ISLs into a
single logical ISL, or a “trunk.”The throughput of the resulting trunk is 4, 6, or 8
Gbit/sec.
Brocade Frame Filtering
Another advanced feature incorporated into Brocade 2 Gbit/sec switch hardware
is the ability to filter Fibre Channel frames to increment counters or to perform
other actions such as blocking the frame itself. Overall, Frame Filtering enables a
variety of new capabilities for monitoring and managing SAN fabrics, including
the ability to:
■
Increase zoning capabilities and security
■
Facilitate the deployment of new SAN management applications that
improve visibility into, and control over, the fabric
■
Enable new fabric services for clustering, virtualization, and shared
file systems
Wire-speed filtering of each frame based on the content of several fields in
both the header and the payload enables fabric zoning based on Logical Unit
Number (LUN), network protocol, or I/O request type.This approach enables
fabric-wide heterogeneous LUN masking managed from a central point.
Brocade Frame Filtering combines the security of hardware zoning with the
cabling flexibility of software zoning.When an administrator moves a cable from
one port to another, the Frame Filtering capabilities can monitor the unique
address of the device, change the zone, and block inappropriate data from communicating with it.This critical zoning improvement helps ensure security,
while minimizing the time and effort required to manage the SAN fabric and its
associated zones.
More Robust Hardware-Enforced Zoning
Hardware-enforced zoning by World-Wide Name (WWN), port ID, or Arbitrated
Loop Physical Address (AL_PA) simplifies administration while providing the
highest level of secure control over data access.This capability provides administrators with much more flexibility in how they partition storage and servers to secure
the overall fabric.
140_SANs_04
8/14/01
3:34 PM
Page 143
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Enhanced End-to-End Performance Analysis
Enhanced end-to-end performance analysis enables more effective tracking of
resource utilization on a fabric-wide basis. Administrators can capture I/O performance levels associated with specific initiator and target device IDs anywhere in
the fabric, independent of fabric topology. In addition to reducing management
cost through more proactive capacity planning, this capability enables reporting at
a level required to demonstrate adherence to service level agreements.
Secure Fabric OS
Within the Intelligent Fabric Services Architecture, Brocade provides Secure
Fabric OS, the most comprehensive SAN security architecture available. Based on
state-of-the-art networking security technologies, Secure Fabric OS addresses
vulnerabilities in the SAN fabric and supports authentication methods at the
following access points:
■
User access to the management interface
■
Management console access to the fabric
■
Server access to the fabric
■
Switch access to an existing fabric
To prevent unauthorized configuration or management changes, Fabric OS
employs policies with multilevel passwords, extensive use of Access Control Lists
(ACLs), and centralization of fabric configuration changes on “trusted” switches.
Fabric OS prevents WWN spoofing—the practice of assuming the WWN of
another server to gain access to storage in its zone—at both the HBA and server
level by locking certain WWNs to certain ports.With Secure Fabric OS, new
switches are assigned digital certificates, enabling an existing fabric to authenticate
any switch that joins the fabric.While Secure Fabric OS prevents unauthorized
access to the fabric from the outside, Brocade Zoning ensures that devices can
access only their authorized storage resources.
143
140_SANs_04
144
8/14/01
3:34 PM
Page 144
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
Summary
Before selecting what switches you should use to build your SAN fabric configuration, you need to consider a wide variety of variables. As with all resource planning, you should identify and prioritize your key requirements—both current
and future—before comparing the available products in the marketplace. Brocade
offers a full range of Fibre Channel switches—from hub alternatives to highlyavailable fabric switches, integrated fabrics, and core fabric switches with 64 ports
of connectivity. Each Brocade SilkWorm switch provides unique characteristics
designed for entry-level SANs, medium-sized data centers, or very large enterprises.
Along with these switches, Brocade provides Fabric OS, a real-time operating
system designed to deliver all the key functions for managing your SAN environments. Fabric OS includes a wide range of basic functions, including Fibre
Channel services and rerouting services. It also features an API that you can use
to write your own SAN management applications (or you can take advantage of
applications developed by third-party software vendors).
To simplify the deployment, management, and ongoing administration of
fabric-related tasks, Brocade offers a comprehensive set of software products.
Products such as Brocade QuickLoop, Zoning, and WEB TOOLS can help you
fully leverage your existing hardware investments and help position you for continued growth. In addition, Brocade provides optional features—such as Brocade
Extended Fabrics and Fabric Watch—designed for specific types of SAN environments or functions. By taking advantage of the full range of these products, you
can significantly increase the overall return on your SAN investment.
Brocade will also provide strategic software functions as part of its Intelligent
Fabric Services Architecture. Functions such as ISL Trunking and Frame Filtering
will provide high-performance and high-reliability characteristics to meet the
most demanding enterprise-level requirements. In addition, enhanced hardware
zoning functions and Secure Fabric OS will greatly improve security within SAN
environments, enabling SANs to grow in a safe, controlled manner.
Solutions Fast Track
Selecting the Right Switch
Identify your requirements for availability, port density, functionality,
and cost.
140_SANs_04
8/14/01
3:34 PM
Page 145
Overview of Brocade SilkWorm Switches and Features • Chapter 4
Decide whether you need an arbitrated loop or full-fabric environment.
Learn which switch functions best satisfy your requirements.
Consider what strategic direction you want to take, and whether your
current switches will scale easily to meet your needs.
Understanding the Brocade Fabric OS
Fabric OS is the operating system for all Brocade SilkWorm switches.
Key functions include auto-discovery, in-order frame delivery, zoning,
and others.
Fabric OS provides the capability to work with other storage
management applications.
Using Optional Brocade Features
You can use Brocade Zoning to isolate devices into separate,
virtual SANs.
Zoning is ideal for multiple customer environments where data security
is critical.
Extended Fabrics enables the benefits of Fibre Channel technology at
distances up to 100 km.
Fabric Watch tracks switch and fabric events to help you optimize
fabric-wide performance and proactively identify problems before they
happen.
QuickLoop integrates private loop-based devices into switched fabric
environments.
QuickLoop helps support legacy devices to protect existing investments
while also providing performance and reliability advantages.
WEB TOOLS is an advanced monitoring tool that sends alerts about
fabric events to help prevent downtime.
You can use a Web browser interface and Java plug-in to monitor
switched-fabric SANs from any workstation in your enterprise.
145
140_SANs_04
146
8/14/01
3:34 PM
Page 146
Chapter 4 • Overview of Brocade SilkWorm Switches and Features
Future Capabilities in the Brocade
Intelligent Fabric Services Architecture
The Brocade Intelligent Fabric Services Architecture includes the
SilkWorm family of fabric switches, advanced fabric services, open fabric
management tools, and enterprise-class security products.
ISL Trunking is an optional product ideal for optimizing πperformance
of Brocade 2 Gbit/sec Fibre Channel fabric switches.
Frame Filtering enables a variety of new capabilities for monitoring and
managing SAN fabrics and enhancing both security and reliability.
Secure Fabric OS is the most comprehensive SAN security architecture
available, addressing vulnerabilities in the SAN fabric and supporting
multiple authentication methods.
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: Can I use Brocade switches in an arbitrated loop environment?
A: Yes. Brocade offers switches that are viable alternatives to managed hubs. In
addition, the Brocade QuickLoop product enables the integration of private
loop-based devices into a switched fabric.
Q: What switch is most capable of providing high availability at a low cost?
A: For a relatively inexpensive switch that provides high-availability characteristics, try the SilkWorm 2400 and 2800. Both switches have redundant, hotswappable components at a cost-effective price point.
Q: What is the most reliable way to keep certain hosts from interacting with
other hosts or storage devices?
140_SANs_04
8/14/01
3:34 PM
Page 147
Overview of Brocade SilkWorm Switches and Features • Chapter 4
A: Brocade Zoning enables you to configure distinct zones to restrict
interoperability between various devices.
Q: How can I perform fabric administration and management?
A: Brocade offers several tools for simplifying these types of tasks. Fabric Watch,
Fabric Access, and WEB TOOLS all provide timesaving functions that help
reduce SAN management costs.
Q: What kind of switch is most suited for a very large enterprise SAN?
A: The Brocade SilkWorm 6400 provides 64 ports of connectivity with highavailability characteristics, making it ideal for data center environments and
enterprise SAN implementations.
147
140_SANs_04
8/14/01
3:34 PM
Page 148
140_SANs_05
8/14/01
3:35 PM
Page 149
Chapter 5
The SAN Design
Process
Solutions in this chapter:
■
Looking at the Overall Lifecycle of a SAN
■
Conducting Data Collection
■
Analyzing the Collected Data
Summary
Solutions Fast Track
Frequently Asked Questions
149
140_SANs_05
150
8/14/01
3:35 PM
Page 150
Chapter 5 • The SAN Design Process
Introduction
We intend this book to allow you to effectively design, implement, and maintain
storage networks. Doing so requires an understanding of the processes in each of
the seven phases of a SAN’s lifecycle, and their relationships with each other.
Without taking a moment to review the process from the highest level, it is easy
to get lost in the details of SAN hardware.
In this chapter, we provide that high-level view.We show how the SAN
design process is really an ongoing lifecycle.We take you through the process
from the moment the decision is made to deploy a SAN, through releasing the
SAN to production.Then we explain the extent to which the process should be
repeated when upgrades and architectural changes are needed.We also provide
detail on the first two parts of the lifecycle.
The processes presented here are derived from other areas of Information
Technology (IT) and they are normal parts of any large-scale IT project. For
example, when implementing a SAN, you should interview people who will have
a key interest in the finished product—the same is true when putting in a Local
Area Network (LAN) or Wide Ares Network (WAN). Much of this material
should be second nature to any IT network architect, Database Administrator
(DBA), or senior systems administrator. For the more advanced users to whom
these techniques are well understood in general, this chapter will serve as reference
material showing how these processes are applied to SANs in particular.We have
attempted in this book to provide material that will allow both the beginner and
the expert alike to successfully design a SAN.
It is true that more attention must be paid to SAN design than to most other
networking technologies.This is because SANs typically have more stringent
availability and performance requirements than other networks. A SAN is similar
to a traditional network in its requirements, but is also somewhat like a channel
(for example, a CPU/RAM interconnect mechanism, or a PCI bus). Channels
require very high performance, and are almost assumed to be 100 percent reliable.This is in stark contrast to the traditional Ethernet LAN, where things like
five-nines uptime for all node connections, in-order packet delivery, and tuned
approaches to bandwidth management are rare indeed.
Fortunately, SANs provide the tools necessary to achieve these performance
and availability goals. For example, it is commonplace in a Fibre Channel SAN to
use a dual-fabric approach to SAN architecture.This means having two completely separate networks for data to travel over, and potentially using both networks as active paths.While it is certainly possible to do this sort of thing using
IP/Ethernet networks, it is substantially more difficult, since Fibre Channel was
140_SANs_05
8/14/01
3:35 PM
Page 151
The SAN Design Process • Chapter 5
designed with this in mind, and Ethernet was not.The SAN designer must provide for higher availability and spend some time thinking about performance, but
will know going into the process that these goals are entirely achievable.
We should also note here that the process outlined in this chapter is designed
to make a complex SAN design successful.With less complex designs (that is, the
majority of SAN deployments to date), it is perfectly acceptable to skip over
much of the process. For example, if you are deploying a SAN with only three
servers and two storage arrays, spending much time on architectural analysis is
unnecessary.The complexity is presented here so that users with complex
requirements will have it available to them; users with simpler scenarios can use
their judgment about which bits to incorporate into their design process.
The seven phases of the lifecycle of a SAN at the very highest level can be
broken down into three broad categories: design, implementation, and maintenance.The first of these, designing the SAN, includes the collection and the analysis of data, which defines the requirements of the network.We will go into
detail on these first two phases of the design process in this chapter.These phases
provide a solid launch pad for your journey through the remainder of the SAN’s
lifecycle.
The third and fourth phases of the SAN lifecycle—architecture development
and prototype testing—complete the design process. Implementing the SAN
encompasses the transition phase and the release to production phase, the fifth
and sixth phases of the lifecycle.These phases are discussed in Chapters 6 and 7
of this book. Chapters 8 and 9 cover the troubleshooting, maintenance, and management—the final phases of the lifecycle model.
When you are finished reading this chapter, you should have a solid understanding of the design processes, and have a valuable reference tool to enable project planning on any future SAN deployments.
Looking at the Overall
Lifecycle of a SAN
Any SAN will go through certain phases over the course of its life. Depending
on the size and complexity of the SAN, some phases might take months to complete, and some might be only glanced over. For example, a single-switch SAN
does not require much in the way of network design. However, if the solution
involves hundreds of devices, including storage components from many different
vendors that were not already pretested and determined to be interoperable, it
could require extensive testing or validation.
151
140_SANs_05
152
8/14/01
3:35 PM
Page 152
Chapter 5 • The SAN Design Process
When an existing SAN must undergo a fundamental change, be it at the
architectural level or simply the introduction of a new type of storage array, you
should cycle back through the phases of SAN development.This will ensure that
the critical applications running on the SAN are not unexpectedly disrupted by
changes. However, when the change is fundamental but small (like adding a new
type of storage array) it is possible to take a fast track through this process.
The SAN’s lifecycle, which can be described at a high level as design, implementation, and maintenance, translates directly into action-oriented phases on the
part of the SAN designer: data collection, data analysis, architecture development,
prototype and testing, transition, release to production, and maintenance. See
Figure 5.1 for a flowchart of these phases and their relationships to each other.
Figure 5.1 An Overview of the Lifecycle of a SAN
Upgrade / Architectural Change
Design
Implementation
Data Collection
Transition
Data Analysis
Release to
Production
Architecture
Development
Prototype and
Test
Maintenance
Add / Change /
Remove /
Management /
Troubleshooting
140_SANs_05
8/14/01
3:35 PM
Page 153
The SAN Design Process • Chapter 5
Data Collection
You must define the requirements of the SAN before building it.What business
problem is being solved by the SAN? What are the overall goals of the project?
To determine the requirements, you should interview all affected parties, to find
out what they all hope to achieve (in other words, their goals and objectives),
and develop both a detailed technical requirements document and a timeline for
the project.
Data Analysis
Once you have gathered input from all parties, you must analyze it and put it
into a meaningful format.The first two phases together will allow you to start
with the business goals that are driving the project, and determine at a high level
the necessary technical properties required of the SAN. Once this phase is completed, all business requirements should be translated into technical requirements.
The technical requirements document will be created during the collection
phase, and completed during the analysis phase.You will also have created a
working document for a Return On Investment (ROI) proposition to justify the
expense of the project.
Architecture Development
Now that you have a list of technical requirements, you will develop a SAN
architecture that meets those requirements.This process will involve balancing
many factors. For example, there might be a tradeoff between performance considerations and cost. It might be necessary for you to cycle back to the data collection and analysis phases to gather more input to make compromises with input
from all affected parties.When finished, you will have a detailed architecture of
the SAN that you intend to build. A SAN architecture includes the fabric
topologies of all related fabrics, the storage vendors involved, the SAN-enabled
applications being used, and other considerations that affect the overall SAN solution.This step is the most likely to be skipped over quickly when the SAN
requirements are small.
Prototype and Testing
SANs deal directly with the mission-critical data of today’s enterprises.When
building any mission-critical solution, you must test it before releasing it to production. In this phase, you will build a prototype of the SAN solution and test it
153
140_SANs_05
154
8/14/01
3:35 PM
Page 154
Chapter 5 • The SAN Design Process
to ensure that it will function properly when released.This should be done using
nonproduction systems. It might be necessary to cycle back to the architecture
development phase if problems are found.
Wherever possible, build a test bed identical to the solution you are implementing.This will provide the greatest assurance of success in production.
However, budgetary concerns, limits on time and space, and other factors will
usually prevent this from being practical. Imagine a 200-port SAN. Now imagine
200 hosts and storage arrays plugged into it. Now imagine asking the CFO to
buy another 200 devices to test with, and to provide administrators, space, power,
and cooling for all of it.
Because of this, the test phase will be a balance of conducting your own
testing, and leveraging other organizations’ test results. Finding a document that
says “vendor X already tested or certified this configuration” might be as good or
better than testing it yourself. Even if the components of a solution have been
tested by you and/or others to your satisfaction, you must test all aspects of the
complete system prior to releasing it to production.This is due to the fundamental
nature of a large networked system where interactions, timing, and other factors
can produce different results from devices tested individually.The actual final test
will occur during the release to production phase, but creation of the test plan
should occur in this phase. At the end of this phase, all parties with an interest in
the outcome of the project will approve it, and the transition to production will
begin.
Transition
Now that you have a working prototype, and all interested parties have signed off
on it, you will begin to transition your existing hardware onto the new SAN. If a
SAN is already in place, this phase might be as simple as adding a new node to
the SAN, or changing the Inter-Switch Link (ISL) architecture. If the SAN is
completely new, it might involve a long migration process consisting of moving
one production system at a time. In any case, there might be a need to cycle
between this phase and the release-to-production phase repeatedly. Once a component has completed the transition onto the SAN, release to production can
occur for that component.
Release to Production
Once a component has been transitioned onto the new SAN, it must be tested
again and then approved before becoming a part of the enterprise’s production
environment. Since there might be many components that must be transitioned
140_SANs_05
8/14/01
3:35 PM
Page 155
The SAN Design Process • Chapter 5
and released, it might be necessary to cycle between the transition and release-toproduction phases repeatedly until all components have entered production. After
this phase is complete, the SAN will enter the maintenance phase.
Maintenance
This is the useful life of the SAN. All of the benefits that prompted the SAN
designer to implement the SAN in the first place are found in this phase. It is
therefore desirable to have a SAN spend as much time as possible in this phase,
and as little as possible in the other phases.The goal of this phase is to keep the
SAN running as well as possible for as much of the time as possible, and to
expand its capabilities only according to the original, tested, and approved parameters.This phase includes adding, changing, or removing components, as well as
managing, monitoring, and troubleshooting existing components.
During the maintenance phase, no changes should be made to the SAN that
fall outside of the original blueprint that was established in the previous phases.
Any such change necessitates a repetition of the entire lifecycle. For example, if
the SAN were originally built using vendor X storage arrays, an additional
vendor X array could be added as part of maintenance, but an array from vendor
Y would require thought and testing before its introduction. It might not require
much thought and testing, but it must, in any case, be looked into.
NOTE
Any fundamental change to the SAN requires a repetition of the
entire lifecycle.
In summary, the seven phases of the SAN design lifecycle are:
1. Data Collection
2. Data Analysis
3. Architecture Development
4. Prototype and Test
5. Transition
6. Release to Production
7. Maintenance
155
140_SANs_05
156
8/14/01
3:35 PM
Page 156
Chapter 5 • The SAN Design Process
Conducting Data Collection
The data collection phase of SAN design is the foundation upon which the SAN
will be built. It is vital that the information collected in this phase be both complete and accurate. If the SAN requirements are poorly defined, it is guaranteed
that the resulting SAN will meet business objectives poorly.You should therefore
take your time with this phase.
Some of the information you will collect is generic to any major IT project.
If you already have an established data collection process in your company, integrate the SAN-specific material from this section into that process.
Data collection consists of determining which people you will need to interview, interviewing them, and conducting a physical assessment of existing equipment and facilities.When this process is complete, you will have a technical
requirements document consisting of a list of the business problems that the SAN
will solve, the business requirements for the SAN, characteristics of all devices
that will be attached to it, and detailed information about all relevant facilities.
You will also have a timeline for implementation.
Creating an Interview Plan
Who has a stake in the SAN solution? Well, you could argue that every person
who uses a system attached to the SAN has a stake in it.While true, this is not
useful for creating an interview list, because there would be too many people
involved. Similarly, you could argue that only the person who initiated and
“owns” the project should be consulted. Again, this is not useful, because it leaves
out people who have a strong interest in the project, and might have knowledge
that is critical to its success.
A balanced approach to creating an interview list is critical.You can view the
people on this list as a SAN solution “core team.”Think about having all of these
people together in a room, and trying to solve the SAN solution problem
together.Try to include everyone needed to solve the problem, but nobody else.
Typically, a core team might include:
■
At least one systems administrator
■
At least one storage administrator
■
A network administrator
■
A DBA, if a database server will be involved
140_SANs_05
8/14/01
3:35 PM
Page 157
The SAN Design Process • Chapter 5
■
At least one application specialist associated with each application that
will run on the SAN
■
At least one manager who can act as an overall “owner” of the project
It is probable that you will be one of these people, in addition to being the
SAN designer. Unless you are an external consultant, this is typically the case.
Once you have a list of the desired members of the core team, you must contact them and ask them to take time to help with the project. Ensure that each
team member has allocated the necessary time and that their management appreciates the demands of participating in this team. As the SAN design goal of the
team might require a long-term process, getting this buy-in initially will minimize disruption to the team later. Often in the past, SAN design teams did not
include network administrators, as the focus was on the storage side. Experience
has shown that SANs are networks, and should be coordinated with the traditional IP network groups to ensure that proper networking experience is at hand.
Whenever possible, schedule an interview as a face-to-face, one-on-one
meeting.This format will allow you to communicate the questions and understand
the answers most effectively.You should also have a group meeting with the entire
core team after conducting individual interviews.This will allow you to resolve
any differences before analyzing the data, and review the analysis as a team.
Conducting the Interviews
Now that you know who to interview and have a schedule of when you will
interview them, you need to know what questions to ask, and what format to
put the collected data into.This section contains a suggested set of questions that
you should ask, and some detail on what each question is about. It is followed
with a summary that could be used to create an interview form.
NOTE
Not every person you interview will be able to answer every question.
Between the members of the core team, the expertise necessary to answer
all of these questions should be completely represented. Some members
might provide conflicting answers. You will be in a key position to resolve
these differences, and achieve a compromise. It is vital that all affected
parties agree with the deployment strategy before implementation begins.
157
140_SANs_05
158
8/14/01
3:35 PM
Page 158
Chapter 5 • The SAN Design Process
What Overall Business Problem
Are You Trying to Solve?
A business problem that would initiate a SAN design might be something like:
■
“We need to keep our business running in case of a disaster like an
earthquake or fire.”
■
“Our backups take so long to finish that they are impacting our ability
to process customer orders.”
■
“We need to save money on storage by utilizing free space
more efficiently.”
Chapter 6 discusses some of the more common business problems that SANs
can solve. Brocade maintains a series of documents that detail specific SAN solutions.These documents are known as Brocade SOLUTIONware configuration
guidelines and are available on the Brocade Web site at www.brocade.com/SAN.
NOTE
A SAN might be intended to solve multiple business problems. In this
case, you should separate each business problem into a different set
of questions and answers. You will correlate these during the
analysis phase.
What Are the Business Requirements of the Solution?
Once you know the business problem that you need to solve, it should be easy to
figure out what the business requirements of the solution must be.This is simply
a matter of rephrasing the previous answers, with more specific criteria:
■
“The SAN must allow all functionality of all business-critical servers at
site X to resume within Y minutes at site Z.”
■
“The SAN must allow the following list of servers to complete backups
within X minutes: …”
■
“The SAN must allow the following list of servers access to the
corresponding list of storage arrays: …”
140_SANs_05
8/14/01
3:35 PM
Page 159
The SAN Design Process • Chapter 5
This is useful because it acts as a migratory step toward turning the business
problem into a matching technical solution.
Moving from Business Requirements to Technical Requirements
You should not deploy a SAN simply for the sake of adopting the “hot new
technology.” SANs are hot because they solve important business problems and
allow companies to make more money.This could be fairly direct—for example,
a matter of saving more money on IT than the project cost, since SANs are very
efficient at providing a clear ROI. ROI is often achieved by management efficiencies, resource efficiencies, or better utilization of resources. On the other
hand, it could be indirect—by making IT systems more efficient, thus increasing
users’ productivity.
The first key to a successful SAN deployment is the accurate and complete statement of what business problem(s) you intend for the SAN to solve. Unfortunately,
you cannot turn a business problem into a technical solution without work.
There is no silver bullet to make your backups run faster so that your users will
not have to work on a slow system. However, there are tape libraries that run fast,
and can be shared by many devices.This, when combined with an appropriate
Fibre Channel fabric, and a SAN-enabled backup application, could amount to
the same thing as the silver bullet.
In order to know which hardware and software will solve your business
problem, you have to define in a technical way what you need to accomplish.
This is a necessary intermediate step between the business problem and the purchase of specific technical solutions.
It is fairly straightforward to change a sentence like, “We need to keep our
business running in case of a disaster like an earthquake or fire” into a sentence
like, “The SAN must allow all functionality of all business-critical servers at site
X to resume within Y minutes at site Z.” Once you have done this, you will have
the business requirements of the solution.You know that you have a business
requirements statement when you could phrase it like this, and still have it make
sense: “Our business will run better if we have a SAN that can allow all functionality of all business-critical servers at site X to resume within Y minutes at site
Z.”The components of the business requirements statement are “our business will
run better” (or something to that effect) followed by a reasonably specific statement about what the SAN must do to make that happen.
However, you will still not have the technical requirements detailed.This is not
something that you, the SAN designer, can simply ask in an interview.This is a
large part of what you will bring to the table as the SAN designer once you have
159
140_SANs_05
160
8/14/01
3:35 PM
Page 160
Chapter 5 • The SAN Design Process
gathered the data and then analyzed it in the next phase. A technical requirements
document set should list, in detail:
■
All of the devices that are to be attached to the SAN
■
Their locations
■
The communication patterns between them (random I/O, streaming
access such as video, I/O-intensive database access)
■
Their performance characteristics (reads, writes, max/min/typical
throughputs)
■
What software will run on them relative to the SAN (for example, a
LAN-free backup application, or anything SAN-specific)
■
How all of this is expected to change over time (storage growth,
server growth)
The technical requirement statement would be, “The SAN needed to meet
the business requirements outlined must have the following characteristics: …”
This would be followed by the body of the technical requirements document.
The rest of the questions to ask in the interview process will provide you with
the body of this document.
What Is Known about the Nodes
that Will Attach to the SAN?
You should try to get a list of all information possible about every node attached
to the SAN. For each node, the relevant information can include questions about
each host, storage device, facilities where hosts and storage will be located, and
questions about the SAN itself. Questions about each host could include the
following:
■
What operating system is installed? What patch or service pack level?
■
Are fabric HBA/controller drivers available? Are they well tested?
■
What type of connection is supported (private loop, public loop,
or fabric)?
■
Which applications will run on this host (databases, e-mail, data
replication, file sharing)?
140_SANs_05
8/14/01
3:35 PM
Page 161
The SAN Design Process • Chapter 5
■
How much storage does it require?
■
How will its storage requirements change over time?
■
Physically, what are its dimensions? How heavy is it?
■
Does it rack mount? Does it have a rack kit? Will it set on a shelf?
■
If there is a management console, what type is it? (Is it a traditional keyboard/video/mouse combo [KVM], or is it a serial connection, like a
TTY?) Does it need to be permanently attached? (For example, a Sun
SPARC server could have a keyboard, mouse, and monitor permanently
attached, or it could be managed through a serial port attached to
a modem.)
■
How many HBAs will it have?
■
If it has more than one HBA, what software will be used to provide
failover or performance enhancements of multiple paths?
■
Do these interfaces exist, or do they need to be purchased? (You should
keep track of every piece of equipment that you need to buy for the
project, for budgeting and ROI analysis.)
■
If they exist, what are the make, model, and version information?
■
If not, what kind will be purchased to meet the objective?
■
How many Ethernet interfaces will it have?
■
In what temperature range will it operate?
■
Will it need a telephone line for management?
■
Where will the node be physically located?
These questions could be used to create an interview form for each host,
which might look like the following:
161
140_SANs_05
162
8/14/01
3:35 PM
Page 162
Chapter 5 • The SAN Design Process
OS
HBA Drivers
fabric
PTP
private loop
public loop
New
make
model
version
Existing
make
model
version
HBA Count
DMP / Failover Support
HBA New or Existing
Application List
Initial Storage Requirements
Projected Storage Requirements
Dimensions
Weight
Mounting
rack mount
Console Type
KVM
rack shelf
switched KVM
floor
TTY
table top
terminal server
modem
Console Location
Ethernet Interface List
Operating Temperature
Power Requirements
Voltage
Amperage
Connector Type
Need Telephone Line for Management?
Physical Location
Questions about each storage device could include the following:
■
What are the make, model, and version information?
■
What type of connection is supported (private loop, public loop, fabric,
SCSI, SSA)?
■
How many hosts can this device serve?
■
If it is a multiport device, does it have limits on how many hosts can
access it through each port?
■
Physically, what are its dimensions? How heavy is it?
■
What is its capacity in gigabytes?
140_SANs_05
8/14/01
3:35 PM
Page 163
The SAN Design Process • Chapter 5
■
Does it rack mount? Does it have a rack kit? Will it sit on a shelf?
■
If there is a management console, what type is it? Does it need to be
permanently attached?
■
How many Fibre Channel interfaces will it have?
■
Do these interfaces exist, or do they need to be purchased?
■
If they exist, what are the make and model? If not, what kind will
be purchased?
■
How many Ethernet interfaces will it have?
■
In what temperature range will it operate?
NOTE
Obviously, some of these questions do not relate directly to the SAN
deployment. However, they are generally relevant whenever making a
large architectural change in a data center. For example, it is necessary to
know what temperature a server can operate at in case the server is in a
location where temperature control is an issue. In this case, adding a
large number of switches might increase the room temperature beyond
operating levels. As always, use your judgement about which questions
to include in your interview, and which to skip over.
■
Will it need a telephone line for management?
■
Where will the node be physically located?
■
What is the firmware level?
■
For tape libraries, what is the capacity of each cartridge, number of cartridges the library can hold, number and speed of drives, and number
of transports?
■
SCSI or Fibre Channel interface? What type of SCSI (wide/narrow, differential/single ended)?
163
140_SANs_05
164
8/14/01
3:35 PM
Page 164
Chapter 5 • The SAN Design Process
NOTE
While it is possible to manage an entire fabric through a single Ethernet
connection, this is not the method that Brocade currently recommends.
You should plan on one Ethernet connection per Brocade switch, in addition to planning connections for hosts and other SAN devices. It is also
advisable for the highest availability plan to balance switches across multiple electrical circuits, even if an Uninterruptible Power Supply (UPS)
protects them.
Questions about facilities where hosts and storage will be located could
include the following:
■
Who is responsible for this facility?
■
Are there any existing optical cables, and what type?
■
Is there sufficient electrical power?
■
What about cooling?
■
Is there enough rack space?
■
What is the network infrastructure?
■
Physical access? If the location is on an upper floor, is there a
freight elevator?
Answers to questions about the SAN itself must be considered preliminary.
They will indicate preconceptions that members of the core team have, but
all members should be prepared to be flexible on these preconceptions as the
SAN design process progresses. Questions about the SAN itself could include
the following:
■
Are there any distance considerations? (For example, long cable runs
between floors of a building, campuswide networks, or MAN/WAN
connections.)
■
How many hosts will attach to the SAN?
■
How many storage devices will attach to the SAN?
■
If known at this point, do they require any-to-any connectivity?
Alternately, are there groups of devices that need to communicate only
among themselves?
140_SANs_05
8/14/01
3:35 PM
Page 165
The SAN Design Process • Chapter 5
Which SAN-Enabled Applications
Do You Have in Mind?
Will the SAN use a serverless backup application? How about clustering software? How about volume management? This category of software requires special
attention because of its close ties to the SAN hardware you choose to build the
solution. For example, if you plan to use vendor X serverless backup software,
you must make sure that your backup hardware (tape libraries, Fibre Channel/
SCSI gateways, etc.) is supported.
Which Components of the Solution Already Exist?
Any hardware or software that is already in place and that must be included in
the solution will create points for you to build around.You must find out as
many details as possible about everything in this category.When you are finished
with the interviews, and conduct the physical assessment, you should personally
inspect every piece of hardware.This will prevent surprises later in the process.
Make sure that you find out exactly where all hardware is located, and how to
access it.
You must pay special attention to devices that already exist and already have
Fibre Channel interfaces. Find out which kinds of HBAs are installed in hosts,
and which driver revisions are installed on them. Find out code levels for RAID
arrays and Fibre Channel tape libraries. Find out if upgrades to driver/code levels
are planned or at least allowed.
NOTE
You must know if each device is public loop, private loop, or full fabric.
Some devices might even be SCSI and require additional hardware to
bridge between SCSI and Fibre Channel.
If possible, you should not use private loop drivers on initiators unless the
device does not support fabric drivers or is not easy to upgrade. Private loop
hosts require special licenses, typically Brocade QuickLoop and Zoning. Find out
if the existing devices are configured as full-fabric devices. If not, find out if their
drivers support full fabric, or if they can be upgraded to full fabric.This is not
intended to discourage incorporation of private loop devices into a fabric:
165
140_SANs_05
166
8/14/01
3:35 PM
Page 166
Chapter 5 • The SAN Design Process
QuickLoop and Fabric Assist exist specifically to enable this to occur. However, if
a device can support full fabric, then integration into the SAN will be easier if it
does so.
Which Components Are Already in Production?
Components that are in production require special attention in two areas:
■
Duplicate equipment might be desired for testing.
■
The transition phase is more complex.
It is vital to know as much as possible about production systems that are
going to transition onto the SAN.Therefore, somebody intimately familiar with
and responsible for every such system should be included on the core team.
Which Elements of the Solution
Need to Be Prototyped and Tested?
For relatively simple solutions that involve only components already certified to
work together, it might be that you do not have to do any testing at all. For
example, if you are implementing a SAN-based solution on a Brocade
SOLUTIONware document, you might feel that you need only to do minimal
validation.This is opposed to a solution where no documentation or testing
information exists, which generally requires extensive validation.
For more complex solutions involving a large number of devices that might
be from many different vendors, you might feel that every single element needs
to be tested in combination before release to production can occur.You should
get input on this from every member of the core team. If any team member feels
that you should conduct inhouse testing on a component, you should strongly
consider doing so.
What Equipment Will Be Available for Testing?
Any existing equipment that is not in production, and any equipment that is
going to be purchased specifically for this project might be good material with
which to build a test bed. Existing equipment that is in production is not good to
test with. If existing equipment already in production will be transitioned onto
the SAN, it might be beneficial to budget for a representative sample of duplicate,
nonproduction systems with which to prototype the solution. It is generally a
good idea to have such systems available for testing in any case. It may also be
possible to borrow systems to test with. In any case, it's probably worth asking
your vendors for such loans.
140_SANs_05
8/14/01
3:35 PM
Page 167
The SAN Design Process • Chapter 5
Whether or not test equipment is available, you should research what testing
third-party vendors or third-party organizations have already done. In this way,
you will avoid duplicating their efforts. If you cannot get representative test
equipment for an element that needs to be prototyped, it might be acceptable—
and necessary—to rely entirely upon the work done by others to validate
the solution.
Again, with many solutions, this is a perfectly acceptable way to go. If you do
not feel that inhouse testing is warranted, then you can save time and money by
skipping the prototype and test phase. Just make sure that you have documentation certifying the solution before you make this decision.
How and When Are Backups to Be Done?
You need to get a list of everything that relates to the system’s backups:
■
What backup hardware will be used?
■
What backup software will be used for each host?
■
Which storage arrays will be backed up by which tape libraries?
■
When will these backups occur?
■
How long can they take?
■
How much data needs to be backed up?
■
Will snapshots be used? How do they work?
■
Will split mirrors be used? How do they work?
What Will Be the Traffic Patterns in the Solution?
You should produce a matrix showing every initiator-to-target communication
expected in the SAN.This is necessary to determine performance characteristics,
and to set up zoning on the fabric:
■
Which hosts will use a specific storage array?
■
Which hosts in a cluster will talk directly to each other over the SAN?
■
Which backup devices will be performing serverless backups?
■
Which arrays will they be backing up?
Create a table listing every device on the SAN that can act as an initiator in
one column.This will include every host, every storage virtualization product,
167
140_SANs_05
168
8/14/01
3:35 PM
Page 168
Chapter 5 • The SAN Design Process
and every serverless backup server. It might include storage arrays, if they have
data replication capabilities.Then put a second column next to it with all of the
targets that each initiator will communicate with (Table 5.1).
Table 5.1 Initiator-to-Target Mapping
SAN Traffic Patterns
Initiators
Targets
host1
host2
array3
array1
array2
tape1
array1
array1
array2
array1
array3
array4
array3
host3
host4
tape1
array3
array4
Note that some devices on a SAN can act as both an initiator and a target.
If so, they will appear in both columns. See array3 and array4 in Table 5.1.
This is how you would indicate that array3 and array4 perform data replication
between them.
You will not necessarily be able to build this table by interviewing one person;
it will likely be developed over the course of the interview process, changed as the
implementation takes place, and maintained for the life of the SAN.
What Do We Know about Current
Performance Characteristics?
Any devices that currently exist, and will be transitioned onto the SAN, are candidates for empirical performance testing.
Create a second set of columns next to the traffic pattern columns, as shown
in Table 5.2.You will need entries for peak utilization and sustained utilization.
Obviously, you will only be able to enter current data for initiators that already
exist, and already communicate with the same targets they will talk to after the
SAN is complete.
140_SANs_05
8/14/01
3:35 PM
Page 169
The SAN Design Process • Chapter 5
Table 5.2 Current Traffic
SAN Traffic Patterns
Current Peak
Current Sustain
Initiators
host1
host2
MB/sec
10
MB/sec
5
50
10
host3
host4
tape1
array3
array4
Targets
array3
array1
array2
tape1
array1
array1
array2
array1
array3
array4
array3
In this example, host1 and host3 already exist, and are already talking to
array3 and array1, respectively. All of the other devices are to be added, are not
talking to the same targets that they will be after the SAN is up, or performance
data might simply be unavailable.
If the owner of a system has already done this kind of analysis, you will
simply transfer the numbers to your table. If not, you should work with the
owner to get the performance information, as this might have a substantial
impact on your SAN design.
Gathering Performance Data
On almost any kind of system, some facility exists for measuring performance.
More often than not, there will be multiple options for gathering disk I/O performance information.
For example, on a Windows NT system, you might use the diskmon feature.
You have to install this from the Windows NT Resource Kit. If you do not install
diskmon, standard Windows perfmon will not have a disk monitoring tool.
Alternately, you could install a package like Intel’s Iometer, and use that to generate a simulated load and measure performance.This tool is presently available as
a free download from Intel’s Web site.
Under Sun’s Solaris operating system, performance can be measured using the
iostat utility, the GUI utility perfmeter, or one of a number of third-party utilities like
Extreme SCSI.There are similar tools in every UNIX variant.We are providing
169
140_SANs_05
170
8/14/01
3:35 PM
Page 170
Chapter 5 • The SAN Design Process
examples for Solaris only, since the details of these commands will vary between
every flavor of UNIX, and providing examples for every variant is impractical.
Refer to the man pages for your particular version of UNIX for the exact syntax.
There are also a number of options for generating loads under Solaris, ranging
from the dd command, to—again—a utility like Extreme SCSI.
NOTE
Tools like Iometer, dd, and Extreme SCSI should be used with care. It is
tempting to use them to generate maximum load. A more useful test to
run is to generate a representative load. Try to determine what your
application will actually be doing in terms of read/write ratio, and total
bandwidth consumption, and use these tools to generate that kind of
load on the system.
In cases where performance data cannot be collected empirically—such as
when the system in question does not exist yet—there is still hope. Most hosts
are not capable of generating sustained load at full wire speed.They are generally
going to be limited by other factors.These could include:
■
CPU speed Although Fibre Channel has much lower overhead than
the TCP/IP stack, it still takes a fast processor to get near to full performance on a 1 Gbit/sec Fibre Channel link, simply because the processor
will be busy running whatever task is actually generating the I/O.
While almost all hosts now shipping have sufficiently fast CPUs, you also
need to estimate how much of that CPU resource is taken up by other
tasks the host is performing that do not result in disk I/O (such as running a TCP/IP stack). Moreover, many data centers have older CPU
servers that might not be capable of running at 1 Gbit/sec even without
taking these tasks into consideration.
■
PCI bus speed Fibre Channel full duplex is 200 MB/sec. A 32-bit 33
MHz PCI bus can only sustain about 120 MB/sec. A 64-bit 33 MHz or
32-bit 66 MHz PCI bus can handle about 240 MB/sec, and a 64-bit 66
MHz bus can handle about 480 MB/sec. Even on the higher rate buses,
you must bear in mind that it is a shared bus. If you put two Fibre
Channel HBAs onto a bus that can handle 240 MB/sec, that will be the
total possible full-duplex speed for both HBAs.Therefore, you would on
140_SANs_05
8/14/01
3:35 PM
Page 171
The SAN Design Process • Chapter 5
average get 120 MB/sec out of each interface. For example, this could—
in a balanced read/write environment—mean that you get only 60
MB/sec of read performance out of each card. Also bear in mind that
there may be other cards on the bus taking up some of that bandwidth.
■
HBA speed Although designed to work on a 1 Gbit/sec SAN, many
HBAs cannot achieve or at least cannot sustain full 1 Gbit/sec transfers.
Newer HBAs typically have better performance. Older HBAs might
only be able to achieve 60 MB/sec, regardless of the other possible
issues.
■
RAID controller speed Many RAID controllers cannot sustain 100
MB/sec per interface on all interfaces simultaneously. Some barely
operate at 30 MB/sec per interface, which is more than acceptable for
many applications! Finding out the limits of your RAID array should be
as simple as calling the vendor’s support channel. Of course, you might
also check third-party testing results such as those done by many
industry magazines for an unbiased opinion.
■
RAM quantity and speed If your system is short on RAM, it might
spend a lot of time paging. If it does, performance will be substantially
degraded.
■
Disk seek time If your application does a lot of random I/O, the disk
heads will have to jump all over the platform. Since disk seek time is an
order of magnitude or more slower than a Fibre Channel link, you
might have to allocate substantially less bandwidth for random I/O
applications like a file server than for sequential I/O applications like a
video server or decision support system.
■
Application overhead This ties into the CPU-limit factor. How much
CPU do you have, and how much of it is free for handling I/O?
■
Write speed of tape device Most tape drives cannot come anywhere
near 100 MB/sec. It is usually sufficient to ask a vendor for performance
data in the case of tape drives, although optimistic compression ratios
can inflate the performance numbers they provide.
In addition, if anything is known about the application that is running on the
host, you might be able to make a good guess about how much load it will even
try to place on the disk subsystem. For example, if you know that the host is an
intranet Web server, and that it receives only 500 hits a day, you can safely guess
that its I/O requirements will be minimal.
171
140_SANs_05
172
8/14/01
3:35 PM
Page 172
Chapter 5 • The SAN Design Process
Once you have collected your best empirical or estimated numbers for each
factor, use the lowest common denominator approach to estimate the maximum
bandwidth that the system could need.You can guarantee that the overall system
will not outperform its weakest link.
Also note that on systems with multiple HBAs, I/O load might be distributed
across these HBAs. Achieving active-active distribution across HBAs might
require third-party applications like the VERITAS Dynamic Multipathing software,Troika’s HBA driver, or one of the storage vendor’s dual-path products. If
this is the case, you might estimate that each HBA will usually have a fraction of
the total load. In a dual-fabric, active/active HBA architecture, each HBA normally has 50 percent of the total load. If a system is capable of sustaining 70
MB/sec, then each HBA will sustain 35 MB/sec. Note that this might change
during system maintenance if you shut down one path, and the remaining path
could then take on the full 70 MB/sec, so the design should incorporate the
worst-case scenario. It is usually also good practice to add some padding to the
top of this estimate (perhaps 10 percent) to allow for the unexpected.
NOTE
Unlike physical-disk counter data, logical-disk counter data is not collected by the NT operating system by default. To obtain performance
counter data for logical drives or storage volumes, you must type
diskperf -yv at the command prompt. This will cause the disk performance statistics driver used for collecting disk performance data to
report data for logical drives or storage volumes. By default, the NT operating system uses the diskperf -yd command to obtain only physical
drive data. For more information about using the diskperf command,
type diskperf -? at the command prompt.
What Do We Know about
Future Performance Characteristics?
Performance numbers change over time. Consider a customer database for a catalog retail company. Perhaps you will install the SAN in February, because this is
your slow month of the year, and you can get the necessary downtime.You might
know that the database host will start talking to its storage array(s) at a sustained
rate of 5 MB/sec during the business day, with a peak of only 10 MB/sec.
140_SANs_05
8/14/01
3:35 PM
Page 173
The SAN Design Process • Chapter 5
However, when the Christmas season comes along and your business picks up,
you might move to a 50 MB/sec sustained rate, peaking at 70 MB/sec. Because
of the potential for substantial changes in performance requirements over time, it
is essential to plan for both current and projected performance. Most of this
might be educated guesswork, since many of the systems you are going to deploy
might not yet exist.
Again, you will need to come up with numbers for both sustained traffic and
peak traffic for each communication. Also try to determine what days/times peak
performance will occur.This will be added to your table (Table 5.3).
Table 5.3 Adding Traffic Projections
SAN Traffic
Patterns
SAN Peak
Performance
SAN Sustained
Performance
Initiators
host1
Targets
array3
Initial
10
Expected
10
Initial
5
Expected
5
host2
array1
array2
tape1
array1
0
0
20
50
70
70
20
50
0
0
0
10
50
50
0
20
array1
array2
array1
0
0
0
90
90
20
0
0
0
50
50
0
array3
0
20
0
0
array4
array3
10
5
30
5
5
0
5
0
host3
host4
tape1
array3
array4
Peak Times
Initial
M–F
8a–5p
Expected
same
M–F
8a–5p
+ Sa
10a–4p
Sa
same
5p–9p
Sa
same
9p–11p
Again, you can only enter data for systems about which you can make an
educated guess. If you know about what the peak traffic could be based only on
the limitations of a system, you might not have any way of guessing when this
would occur.You should also enter projected data for systems that you know that
you will add later.
In Table 5.3, host2 and the application it is running might not exist yet, so
every piece of data about that system is pure guesswork. Let us say that host2 is a
Return Merchandise Authorization (RMA) system, and your rapidly growing
173
140_SANs_05
174
8/14/01
3:35 PM
Page 174
Chapter 5 • The SAN Design Process
company has never had an RMA system before.You might not be able to reliably
guess when customers are going to call in with RMA requests most often, or
even how many RMAs you are going to get in a given day.The best you can do
is determine what performance the hardware and software you are installing
could reasonably run at, and design the SAN to support it all the time it could be
in use.While this approach might result in over-engineering your network, this is
better than the alternative. During future design phases, you can alter the SAN
design to adjust or scale back the design accordingly, as well as incorporate other
additions and changes.
For backup devices, peak usage will always correspond with your backup
schedule.This will usually not correspond with peak usage of the rest of the
system.This is particularly useful knowledge when planning an ISL architecture,
because you can often count on having low nonbackup-related utilization of ISLs
during backup windows. An obvious exception to this is a SAN that is used
solely for performing LAN-free backups.
How Much Downtime Is Acceptable to
Production Components During Implementation?
It will likely be necessary to shut down some existing production devices during
implementation, to ensure a safe transition onto the SAN. For example, you
might have to shut down a host to install an HBA. Determine how much downtime is acceptable for each host, and at what times this can occur. Generally, you
should try to schedule more downtime than you think you need to ensure that
any unforeseen issues that arise during the implementation can be handled within
the downtime window.
How Much Downtime Is Acceptable for Routine
Maintenance? How Much Downtime Is Acceptable
for Upgrades and Architectural Changes?
These two questions are intimately related, because—to an end user—there is
really no difference between downtime to a production system for maintenance,
and downtime for an upgrade. Once systems are in production, you will want to
keep them running as much as possible.
Many upgrades can be accomplished with zero downtime by using a doubleor triple-redundant fabric architecture. No matter how well you plan the upgrade
and maintenance processes beforehand, you will need to shut down specific hosts
140_SANs_05
8/14/01
3:35 PM
Page 175
The SAN Design Process • Chapter 5
on occasion. For example, you might want to upgrade an HBA driver, which
would typically require a reboot.
NOTE
Wherever possible, a redundant fabric architecture should be used. This
will ensure the best performance and reliability, and will simplify maintenance tasks. In a redundant fabric architecture, every host has at least
two paths to every storage device it connects to, and these paths traverse two completely unconnected fabrics. While it might appear on the
surface to be more expensive, if hosts are to be dual-attached anyway, it
is actually less expensive to attach them to two separate fabrics than to
use one larger fabric, or a director-class switch. This does not even
include the downtime ROI calculation, which, in high-availability environments, will usually overshadow the entire cost of the SAN. More details
about redundant and resilient fabrics are provided in Chapter 7.
You should therefore determine in advance when you will be able to
schedule downtime for every host and storage array, and for the fabric itself.You
might not have to use every scheduled outage, but having them available to you
when you do need them is essential.
One way to do this is to make a list of applications and services provided by
the hosts on the SAN, and determine an owner for each.Take your list of SAN
devices and map these devices to the applications and services they affect.This will
provide a mapping of application/service owners, who are typically responsible for
scheduling downtime, to devices that typically require downtime. Have each owner
approve the downtime calendar for each device that affects his or her service.
The mapping of owners to devices should be kept up to date as changes in
personnel, applications, and/or SAN infrastructure occur.
When Do You Need Each Piece
of the Solution to Be Complete?
Once you have a table detailing which of the initiators communicate with which
targets, you can begin to create a timeline for the project. Other members of the
core team will tell you something like, “the customer database application must
be online by mid-June.” It is your task to define which SAN components you
175
140_SANs_05
176
8/14/01
3:35 PM
Page 176
Chapter 5 • The SAN Design Process
need to accomplish this, and to develop a timeline for adding these components
that meet their requirements.
Summary List of Questions
This is a high-level list of some of the questions that should appear on a SAN
design interview form:
■
What overall business problem are you trying to solve?
■
What are the business requirements of the solution?
■
What is known about the nodes that will attach to the SAN?
■
Which SAN-enabled application do you have in mind?
■
Which components of the solution already exist?
■
Which components are already in production?
■
Which elements of the solution need to be prototyped and tested?
■
What equipment will be available for testing?
■
How and when are backups to be done?
■
What will the traffic patterns in the solution be?
■
What do we know about current performance characteristics?
■
What do we know about future performance characteristics?
■
How much downtime is acceptable to production components during
implementation?
■
How much downtime is acceptable for routine maintenance?
■
How much downtime is acceptable for upgrades and architectural
changes?
■
When do you need each piece of the solution to be complete?
Conduct a Physical Assessment
You should now have the location of every piece of hardware that currently
exists. In addition, you should know where each piece of hardware in the eventual SAN will be located.
Look at each piece of hardware. Make sure that it does exist, and has all necessary pieces to function.This could include things like power cords, keyboard,
140_SANs_05
8/14/01
3:35 PM
Page 177
The SAN Design Process • Chapter 5
mouse, monitor, Ethernet card, Ethernet cable, HBAs, and Fibre Channel cables.
Note the physical dimensions of the hardware, and its power/cooling requirements. Does it rack mount? Does it have a network interface? How many Fibre
Channel interfaces does it have? How much does it weigh? You should already
have this information from the interview process, but you should verify that the
information you were given is correct.
Go to each location where SAN equipment or nodes will be installed, and
again check to see that your information was correct. Notice how the equipment
will fit into the space available. Notice how the equipment will enter the
building.You should also have a meeting with the person in charge of the facility
to discuss power, cooling, and equipment locations.
Analyzing the Collected Data
Now that you have collected information from all key stakeholders in the project, and verified the accuracy of this information, you will analyze it to determine the characteristics of the required solution.When you have completed this
process, you will have a list of technical requirements, and an ROI analysis to
justify the project.
Processing What You Have Collected
You have a matrix detailing communication between nodes. Attempt to group
the nodes by communication patterns.The purpose of this is to determine the
amount of known locality in the SAN. Locality of reference is a concept prevalent in many areas of computer science, from disk drive construction to LAN
design. Locality is important in SAN design because if you can localize traffic into
specific areas of a SAN, you directly improve the SAN’s performance and reliability.This will allow a more cost-effective SAN design as well, preventing overdesigning the network to handle nonexistent cross traffic. Locality is discussed in
greater detail in Chapter 7.
A SAN with a great deal of known locality might be constructed out of
many separate fabrics, with no ISLs whatsoever. A SAN with little or no known
locality might require a high-performance ISL architecture (Table 5.4).
177
140_SANs_05
178
8/14/01
3:35 PM
Page 178
Chapter 5 • The SAN Design Process
Table 5.4 Initiator–to-Target Mapping for Locality Example
SAN Traffic Patterns
Initiators
Targets
host1
host2
array3
array1
array2
tape1
array1
array1
array2
array1
array3
array4
array3
host3
host4
tape1
array3
array4
In Table 5.4, array3 would be grouped with host1, tape1, and array4. None of
those devices will need to communicate with any of the other devices.They
could be grouped onto a single switch, or even put onto a totally separate fabric.
You might find it helpful to do the grouping in a diagram. For another example,
look at Figure 5.2.
Figure 5.2 SAN Diagram without Grouping
Serv
ge
Stora
ers
SAN
140_SANs_05
8/14/01
3:35 PM
Page 179
The SAN Design Process • Chapter 5
Nothing is known about the communication patterns in this SAN.
Consequently, there is no way to optimize ISLs for performance. After grouping
the initiators with their targets, the SAN diagram could look something like
Figure 5.3. If you look carefully, you will notice that there are only 12 connections into this SAN. If there are fewer connections than there are ports in your
switches, you do not really need to go through the grouping exercise because
localization of traffic will happen automatically. It is only useful if you will be
using ISLs; however, as most systems scale well past the size of the largest switches
available, it will be a frequent exercise. For the purposes of making the examples
more readable, we will just assume that they are all dealing with a subset of the
devices that the SAN will support.
Figure 5.3 SAN Diagram with Simple Grouping
SAN
Group 1
Group 2
Group 3
Group 4
Making a diagram such as this will allow you to see at a glance what the
communication patterns for your SAN are.
This example is simplistic, and in large SANs, there will likely be conflicts.
When you cannot effectively group all of the communication patterns, you
should focus on grouping faster performing devices. For example, if you find that
179
140_SANs_05
180
8/14/01
3:35 PM
Page 180
Chapter 5 • The SAN Design Process
the bulk of traffic will be between host1, array3, and array4, these could be
grouped separately from tape1 and host2 if necessary.This could happen if you
find that there are so many interrelationships that you end up with very many
devices, but very few very large groups.The grouping technique does not help
for performance if you only have one big group. It could also happen if you have
a few devices that are shared by a great many devices, such as a large RAID array
in a storage consolidation solution.
Another way to combat this “group growth” problem is to account for multiple interfaces on storage arrays. Let us say that you have a redundant fabric
architecture.Your RAID array has eight interfaces, and each host will access only
two of them—one interface on each fabric. List each interface on the array separately in your traffic pattern table.Then, you associate servers or groups of servers
with specific interfaces.With the array listed as a single entity, a diagram of the
communication could look something like Figure 5.4.
Figure 5.4 SAN Grouping Diagram with Single-Entity Arrays
Server Group
Server Group
SAN A
SAN B
RAID Array1
Server Group
Server Group
If, however, you separate the interfaces, your diagram could look more like
Figure 5.5.
You can indicate that a device crosses groups but does not need much in the
way of performance by varying the line color, weight, or pattern. Figure 5.6
shows that the tape robot crosses all groups, but does not need much bandwidth.
140_SANs_05
8/14/01
3:35 PM
Page 181
The SAN Design Process • Chapter 5
Figure 5.5 SAN Grouping Diagram with Separated Interfaces
Server Group 1
Different array controllers
attaching to different groups.
SAN A
Server Group 2
Group 1
Group 2
Group 3
Group 4
SAN B
Group 1
RAID Array1
Group 2
Group 3
Group 4
Server Group 3
Server Group 4
Figure 5.6 SAN Grouping Diagram with Tape Robot Addition
Server Group 1
RAID Array1
SAN A
Server Group 2
Group 1
Group 2
Group 3
Group 4
SAN B
Group 1
Group 2
Group 3
Group 4
Server Group 3
Tape1
One Interface Going to
Multiple Groups
Server Group 4
181
140_SANs_05
182
8/14/01
3:35 PM
Page 182
Chapter 5 • The SAN Design Process
If you are able to make relatively small performance groups, your SAN will
benefit greatly from applying the principal of locality. For now, you simply need
to be able to determine the category of architecture you will require: one that has
lots of known locality (has well-defined performance groups), or one that does
not.This will affect how many switch ports you need to allot for ISLs. If traffic is
localized within an area of the SAN, it will obviously not need to make use of
ISLs leaving that area. In this case, you will be able to get superior performance
even with far fewer ISLs, resulting in more ports available for servers and storage.
Establishing Port Requirements
Now you will determine how many switch ports you will need to purchase.
(This is a general estimate for calculating ROI; it might be a bit more or less than
your final estimate.)
Take the ports you found out about during the interview process. Make sure
that you account for all ports on each node. Some RAID arrays have many ports,
and many hosts have at least two HBAs. Add up these ports to get the total
number of exposed ports your SAN will require.You will then divide this by the
number of different fabrics you will be using. If you have dual-redundant fabrics,
you will divide by two. If you have triple-redundant fabrics, divide by three, and
so on.This will give you the number of required exposed ports per fabric.The
number of “overhead” ports you must allocate for ISLs and for unused ports will
depend on several factors:
■
The total number of required ports per fabric.
■
The amount of known locality.
■
Your need to manage all switches as a single entity.
■
The physical layout of your SAN—any MAN/WAN connections, or
intra-building campus connections, or intra-floor building connections—
might dictate use of additional ISLs and less than perfect utilization of the
ports on each switch.
■
Your applications’ expected performance characteristics.
■
The rate of expected growth in port count of the fabric.
■
Your maintenance policies regarding port usages on network devices. For
example, you might require that a certain number of ports be left available
for expansion or troubleshooting during the course of normal operation.
140_SANs_05
8/14/01
3:35 PM
Page 183
The SAN Design Process • Chapter 5
Simple Case
If the number of required exposed ports is less than the number of ports on a
single switch, you will generally need zero ports for ISLs. In this case, you will
require one switch per fabric. However, as larger switches utilize more hardware
internally to connect the higher number of user ports, a decision might need to
be made between using a larger switch versus utilizing a network of smaller ones.
The appropriate decision will depend on performance requirements, budget, and
design factors. In addition, if you have made small performance groups that have
no components in common, you might be able to localize traffic 100 percent,
and require no ISLs.You would have many small, unconnected SAN islands if
you follow this approach. One reason not to use isolated islands is that requirements change. Someday you might need access between islands at a moment’s
notice. A robust architecture can achieve your immediate connectivity requirements, and give you the flexibility to handle change as well.
You will require each fabric to be a network if this is not the case, or if
you wish to design in flexibility to your configuration.You will have to reserve
port count for these. Simple case requirements include the following:
■
Fewer ports required than exist on a single switch, or…
■
Each performance group is well defined and smaller than the number of
ports on a single switch.
■
Future requirements for growth and change are minimal.
Assume that you have two 16-port arrays (32 storage ports total), 10 dualHBA servers (20 ports), and two single-port tape libraries (two ports).Your total
port count is 54. However, assume further that you are using a dual-redundant
SAN architecture.Your port count per fabric is 27.You are building the fabric out
of 16-port switches. It is possible that some ISLs are required.You will need to
determine how many are needed.
Variant A
With a relatively small fabric like this and relatively high locality, you can assume
that you will have about 14 free ports per switch.Two switches with two ISLs
between them will yield 28 ports per fabric.You are using a dual-redundant
architecture, so there will be two fabrics, for a total of four switches.Your
grouping diagram will look like Figure 5.7.
183
140_SANs_05
184
8/14/01
3:35 PM
Page 184
Chapter 5 • The SAN Design Process
Figure 5.7 Determining ISL Requirements for Variant A
RAID Array1
Server Group 1
(5 Hosts)
A1-1
A1-2
4
SAN A
A1-3
4
Group 1
A1-4
4
A2-1
4
5
Group 2
4
5
5
SAN B
4
Group 1
A2-2
4
4
A2-3
A2-4
Group 2
5
Server Group 2
(5 Hosts)
RAID Array2
This grouping would result in an actual implementation resembling Figure 5.8.
Figure 5.8 Variant A Implementation
RAID Array1
Server Group 1
(5 Hosts)
SAN A
A1-1
4
A1-2
5
4
A1-3
A1-4
4
5
4
4
A2-1
A2-2
A2-3
A2-4
RAID Array2
4
5
4
5
4
SAN B
Server Group 2
(5 Hosts)
Variant B
If you decide that you cannot guarantee the localization of traffic for some
reason, grouping will not help. Assuming also that you have a requirement for
high performance between the switches, you would add two ISLs per switch to
140_SANs_05
8/14/01
3:35 PM
Page 185
The SAN Design Process • Chapter 5
the estimate, for a total of about four ISLs per switch.Your architecture might
look Figure 5.9.
Figure 5.9 Adding ISLs for High Performance in Variant B
36 Ports per Fabric
(Balance storage and hosts across the
3 switches for best performance.)
RAID Array1
Server Group 1
(5 Hosts)
SAN A
A1-1
4
A1-2
5
4
A1-3
A1-4
4
5
4
4
A2-1
A2-2
A2-3
A2-4
RAID Array2
4
5
4
5
4
SAN B
Server Group 2
(5 Hosts)
The same technique can be applied to any SAN, no matter how complex. In
fact, the larger the SAN, the greater the benefits will be from grouping traffic.
Moderate Case
If the required exposed port count is about double or triple the per-switch port
count, and some locality is known, you will be able to use very few ISLs. In this
case, estimate two ISLs per switch. Let us say that you need 26 ports, and you are
using 16-port switches.Two ISLs per switch means that you actually get 14 ports
per switch.Two switches will give you 28 ports, so you would budget for two
switches per fabric, or four switches total.
Moderate case requirements include the following:
■
No more than three times as many ports are required than are present
on a single switch.
■
Performance groups are reasonably well defined. Some locality is known.
■
Future requirements for growth and change are minimal.
185
140_SANs_05
186
8/14/01
3:35 PM
Page 186
Chapter 5 • The SAN Design Process
NOTE
The low port count/high locality/low ISL count configurations work well
for either two or three switches. Two switches would be cascaded
together with two ISLs, with 16-port switches yielding 28 ports. Three
switches would be connected in a ring, supporting about 40 devices. If
you are over that limit, a four-switch full mesh can support about 50
devices. The full-mesh architecture does not scale well beyond that point,
and none of these work well if you have performance groups with more
than 13 or 14 members. It is feasible to build ring or partial-mesh
topology fabrics with higher port counts, but it is generally better to use
a core/edge topology for higher port count solutions. These topologies
are explained in detail in Chapter 7.
Complex Case
If you need more ports than one of these configurations will handle, you will
need to allocate about four ISLs per switch.You might use fewer than four ISLs
on some switches, and perhaps nothing but ISLs will be present on other
switches. In the complex case for port count estimates, the intent is to average
the ISL requirements.
Until a detailed architecture is developed, you will have to make general estimates for a few things. If you have any distance requirements, add two ISLs per
switch. If you have very high-performance requirements, and very little known
locality, add two ISLs per switch.
Take the estimated number of ISLs per switch (I) and subtract it from the
number of ports per switch (PS). Divide the total required ports per fabric (P) by
this number and round up.This is the estimated number of switches (S) that you
need to budget for. For estimating complex SAN switch counts, S=P/(PS – I).
For example, if you have a need for 30 ports per fabric (P=30), are using 16port switches (PS=16), and each switch will use about two ISLs (I=2), then the
number of switches you estimate needing per fabric is 30/(16–2).This is 2.14,
which rounds up to 3. If you have a single fabric, this is the number of switches
you should budget for. If you have a dual-fabric SAN, you should budget for six
switches. Complex case requirements include the following:
140_SANs_05
8/14/01
3:35 PM
Page 187
The SAN Design Process • Chapter 5
■
Any number of exposed ports might be required.
■
Performance groups might or might not be defined.
■
Future requirements for growth and change are significant.
Preparing an ROI Analysis
In any business transaction, it is important to understand the economic benefits
or the Return On Investment (ROI) that your company will receive. Preparing an
ROI analysis for your SAN project will show how your company will not only
return the capital investment, but also save additional money as well in time,
management, and other efficiencies.
During the interview process, you made a list of all of the equipment that
you would need to purchase.To begin the ROI analysis of your SAN, determine
which components are specific to the SAN project. For example, if your company will need to buy additional storage arrays whether or not a SAN is used,
these would not be included on the expense side of the analysis. If the SAN is
expected to prevent you from having to buy an array, this cost savings would go
onto the benefit side of the analysis.You should include any hardware you intend
to buy for testing that will not be used elsewhere.
When accounting for staff time spent on the project, make sure that you only
charge the project for time spent beyond what would be spent by not building the
SAN. If you are expected to save staff time in the long run, apply this to the benefit side.Your ROI analysis will be a living document, and will be updated as the
SAN project develops.
The Return On Investment Proposition
Technical justifications for SAN infrastructure deployments can often be made
more credible by adding an ROI analysis for the proposed implementation.
Follow the guide in the following sections to produce an ROI analysis based on
SAN solutions to particular problems.
Step One: Pick a Theme or Scenario
Most implementations have a purpose.That purpose could be a server or storage
consolidation to improve infrastructure usage and gain economies of scale,
ensuring storage and server resources are utilized in the most cost-effective
manner. High-availability clustering can improve the availability of mission-critical
applications, thus ensuring business continuance and the cost saving associated
187
140_SANs_05
188
8/14/01
3:35 PM
Page 188
Chapter 5 • The SAN Design Process
with it. SAN-based backup deployments improve data integrity by performing
backups and restores more efficiently and quickly, again saving in business continuance time and effort.
Step Two: Identify the Affected Infrastructure Components
Most SAN deployments will focus on affected servers. Servers can be grouped
according to the applications they run or the functional areas they support.
Examples of application groupings include Web servers, file and print servers,
messaging servers, database servers, and application servers. Functional support
servers might include financial and personnel systems or engineering applications.
Once the server groups are known, get the characteristics of servers in each
group. For example, if your solution fits into a storage consolidation theme, you
should consider factors such as:
■
Amount of attached disk storage
■
Storage growth rates
■
Storage space reserved for growth (headroom)
■
Availability requirements
■
Server downtime and an associated downtime cost
■
Server hardware and software costs
■
Maintenance costs
■
The administration effort required to keep the servers up and running
Step Three: Identify the SAN-Enabled Benefits
The scenario approach allows you to focus more closely on the benefits. Server
and storage consolidation, for example, will concentrate on benefits accrued from
more efficient use of server and storage resources, improved staff productivity,
lower platform costs, and better use of the infrastructure. Simply take the list of
characteristics you developed in step two, and show how a SAN can provide benefits in those areas. Establishing specific cost savings is one of the two key elements in the ROI process, so be sure to look hard for every area of benefit.
Step Four: Identify the SAN-Related Costs
Determining the costs associated with the scenario involves identifying the new
components specifically required to build and maintain the SAN.These can
140_SANs_05
8/14/01
3:35 PM
Page 189
The SAN Design Process • Chapter 5
include software licenses, switches, Fibre Channel HBAs, optical cables, and any
service costs associated with the deployment. Be careful to include only those
items that relate directly to the SAN implementation.This is the second key element in the ROI process: if you do not correctly estimate expenses, the ROI
might be substantially better or worse than your estimate.
Step Five: Calculate the ROI
There are several standard ROI calculations in common use, such as net present
value (in dollars), internal rate of return (as a percentage), and payback period (in
months). Briefly, these can be defined as:
■
Net Present Value (NPV) A method used in evaluating investments
where the net present value of all cash flows is calculated using a given
discount rate.
■
Internal Rate of Return (IRR) A discount rate at which the present
value of the future cash flows of an investment equal the costs of
the investment.
■
Payback Period The length of time needed to recoup the cost of a
capital investment on a nondiscount basis.
Detailed explanations of these techniques and how to use them can be found
in most accounting textbooks. It is likely that your company has a preferred
method for calculating ROI.You should determine which method this is, and if
there are standard forms for presenting your analysis. Asking your accounting
department might be a good first step.
This approach to calculating ROI allows you to focus on a particular project
or infrastructure-based problem. It allows you to reduce deployment risk by
deploying SANs in phases by scenario. Deploying by scenario will keep investments limited to the solution at hand and create an investment base for future
deployments.The initial investment will improve the ROI on other scenarios by
reducing some of the investment required to deploy them.
189
140_SANs_05
190
8/14/01
3:35 PM
Page 190
Chapter 5 • The SAN Design Process
The Rest of the Process and
the Repetition of the Cycle
Now you have the following documents:
■
Detailed results from the interview process, which define what the SAN
project needs to accomplish.This includes:
■
A technical requirements document
■
A timeline for accomplishing the tasks associated with implementing
the SAN
■
A list of everything that you will need to buy to make the
project work
■
A rough idea of how the SAN will be designed.
■
An ROI analysis to justify continuing with the project.
These will be used and maintained throughout the life of the SAN.The
timeline will be the framework in which all activities in the SAN’s lifecycle will
reside. In later chapters, you enter the architecture development phase and will
use these documents to develop a detailed architecture for your SAN.This will in
turn be used to develop a test plan.These documents will be used in the approval
process for implementation, and will be kept up to date during the maintenance
phase as part of the SAN’s documentation set. If any major changes to the SAN
are needed, the lifecycle will be repeated and another set of documentation will
be produced.
140_SANs_05
8/14/01
3:36 PM
Page 191
The SAN Design Process • Chapter 5
Summary
The SAN design process consists of seven phases, which are cycled through as
needed throughout the life of your SAN. Data collection and analysis together
define the requirements of your SAN.These requirements feed into the architecture development process to produce a SAN design blueprint. After you have a
plan in place for your SAN, you must test certain components to ensure that it is
working the way you thought it would, before you can begin to transition and
release it into production. Once the SAN has entered production, it falls into an
ongoing maintenance phase, and continues in that phase until a change occurs
that causes the cycle to repeat.
The first two phases (data collection and analysis) are critical to the health of
the SAN. Simply put, if the information on which the design is based is incomplete and/or inaccurate, the design will be incorrect.
Data collection consists of a series of interviews, collecting the answers into a
meaningful format (a technical requirements document), and verifying the accuracy of the collected data. It is imperative that all key stakeholders in the SAN
project be included on the interview list.
While listed as a separate phase, data analysis actually coincides with data collection.The objective of the analysis phase is to turn the raw data, which is generally in the form of business requirements, into a more technical format—the
technical requirements document. Some of this occurs “on the fly” during the
interview process. However, certain tasks are done after the interviews are complete. For example, detailed port count and performance requirements are generated “on the fly,” and an ROI proposition is created after the fact. Once the
requirements of the SAN are well defined, the remaining phases can take place.
These phases are covered in subsequent chapters.
Solutions Fast Track
Looking at the Overall Lifecycle of a SAN
The SAN design process is a cycle.
This process consists of seven phases:
1. Data Collection
2. Data Analysis
191
140_SANs_05
192
8/14/01
3:36 PM
Page 192
Chapter 5 • The SAN Design Process
3. Architecture Development
4. Prototype and Test
5. Transition
6. Release to Production
7. Maintenance
Whenever there is a fundamental change to the SAN, the cycle
should repeat.
Conducting Data Collection
Data collection is the foundation on which a SAN is built.
You should interview everybody who has an interest in the project.
During the interview process, create a technical requirements document.
Analyzing the Collected Data
There are several things that you need to get out of data analysis:
— The number of different fabrics that will make up the SAN solution
— The port count and performance characteristics of each fabric
— An estimate of the hardware required to meet these requirements
You might be able to localize traffic for better performance if you can
create well-defined groups.
Prepare an ROI proposition to justify your SAN project.
140_SANs_05
8/14/01
3:36 PM
Page 193
The SAN Design Process • Chapter 5
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: Once I have designed my SAN, shouldn’t it be done? I don’t want to have to
keep reinventing the wheel!
A: Yes and no. After a SAN enters production, it is “done” until you want to
change it in a fundamental way. As long as you are happy with leaving your
SAN the way it is, there is no reason why you would have to repeat the
design cycle. Simply adding a new storage array does not require a repetition
of the cycle. Moreover, events that do cause the cycle repeat might cause it to
repeat relatively quickly. For example, if you decide to go through the design
process because you are adding a new type of storage array to the SAN, and
want to validate that doing so won’t break anything, you will be able to take
a fast track through most of the process. After all, adding this device will not
by any stretch of the imagination require that you change your fabric
topology, or affect much of your SAN architecture.
Q: Every end user in my company is a stakeholder in the SAN. Do I need to
interview everybody?
A: No. It is true that everybody who uses a system is a stakeholder in that
system. However, we mean something a little less broad.When we refer to a
stakeholder, we mean somebody whose job revolves around taking care of
one or more of the systems that will attach to the SAN.This can include systems, database, and storage administrators, as well as other technical people. It
can also include people responsible for the data that resides on these systems.
For example, a manager responsible for a call center at a phone-in catalog
company might be a key stakeholder in the SAN, because he or she is
responsible for the data entered into that company’s business system—which
is attached to the SAN.Why is this person a key stakeholder? Because he or
she might have something to say about the availability and performance
requirements of the system.When in doubt, try to include anybody on the
193
140_SANs_05
194
8/14/01
3:36 PM
Page 194
Chapter 5 • The SAN Design Process
team who wants to be there. It is usually better to have more data than you
need, rather than less.
Q: Do I need to wait until data collection is complete before beginning data
analysis?
A: Actually, the data collection and analysis phases are most effective if there is
some degree of overlap. If you have analyzed data from the first interview
when you go into the second, you will be able to better understand the
answers, and might also be able to direct the line of questioning along more
useful lines. Be careful not to develop firm convictions too early on, though.
Always approach SAN design scientifically. Never start an interview with a
firm preconception of the outcome! Collection and analysis are divided into
two phases because some of the analysis naturally occurs after all data collection is complete. For example, you can’t prepare an ROI proposition until
you have a fairly complete picture of what the SAN will need to accomplish,
and some idea of the technical infrastructure that will be involved.
140_SANs_06
8/14/01
3:37 PM
Page 195
Chapter 6
SAN Applications
and Configurations
Solutions in this chapter:
■
Configuring a High-Availability Cluster
■
Using a SAN for Storage Consolidation
■
LAN-Free Backup Configuration
■
SAN Server-Free Backup
■
Making Your Enterprise Disaster Tolerant
Summary
Solutions Fast Track
Frequently Asked Questions
195
140_SANs_06
196
8/14/01
3:37 PM
Page 196
Chapter 6 • SAN Applications and Configurations
Introduction
This chapter covers configurations for some of the most common Storage Area
Network (SAN) applications.The surest route to SAN success is to base your
installations on proven configurations—equipment layouts that have been found
through practice and repeated refinement to be best suited for the desired application.
In this chapter, we review the major SAN configuration types and discuss the
advantages of each.This chapter does not go into specific detail (we do not give
specific vendor names or driver revisions) in order to keep the material generally
useful.We give you enough information to understand and set up your own
cluster, but we do not identify a specific storage or Host Bus Adapter (HBA)
vendor for use in that cluster; thus, the material will be useful to users of any
storage or HBA. If you do desire low-level detail that identifies specific configuration information such as vendor, model, and revision level, Brocade
SOLUTIONware might be helpful.
Using Brocade SOLUTIONware
Brocade provides a number of pretested configurations on its Web site
(www.brocade.com) for administrators who wish to configure their own
SANs, and integrators who want to have a head start on developing
solutions for general deployment. Brocade SOLUTIONware guides can be
used to help define the basic configurations for your SAN and can be
modified to fit your solution. SOLUTIONware guides offer very specific
model and part numbers for configurations similar to the solutions
given in this chapter. The solutions are specific as to the model of
storage, switch, and HBAs, but can be extended to similar models. Thus,
a SOLUTIONware can be used either as a “cookbook” for building a SAN
identical to the one discussed in the paper, or as a “reference solution”
from which other, similar solutions can be derived.
Configuring a High-Availability Cluster
High-availability (HA) clusters are used to support critical business applications.
They provide a redundant, fail-safe installation that can tolerate equipment,
140_SANs_06
8/14/01
3:37 PM
Page 197
SAN Applications and Configurations • Chapter 6
software, and/or network failures, and continue running with as little impact
upon business as possible. HA clusters have been in use for some time now.
However, until the advent of Fibre Channel, they were very limited in size and
reliability.This is because clusters require shared storage, and sharing Small
Computer Systems Interface (SCSI) storage subsystems is difficult and unreliable.
In fact, sharing a SCSI device between more than two initiators is completely
impractical due to SCSI cabling limitations, and SCSI’s poor support for multiple
initiators.Thus, clustering technology has been greatly enhanced by the network
architecture of SANs. SANs provide ease of connectivity, and the ability to interconnect an arbitrarily large number of devices. Because of this, SANs can support
more than just dual failover, and can be easily extended to support many-to-one
failover configurations.
The advantages of an HA cluster fall into three categories: availability, manageability, and scalability. Availability is the capability of a cluster to be tolerant of
hardware, network, or software errors. In short, it is the capability of a system to
“stay up.” Clustering software automatically detects error conditions, and restarts
or transfers applications from one server to another. Little or no downtime will
result from these problems, as the HA software can be configured to automatically act to correct them. Manageability is the set of processes with which an
administrator keeps a system running. HA clusters enhance manageability by
allowing all servers in each cluster to be managed as a group. Moreover, when
software and/or hardware needs to be upgraded, each node in a cluster can be
upgraded separately without taking important applications or the system
offline—a concept known as a rolling upgrade. Scalability is the ability of a system
to grow. Factors that limit scalability might prevent you from adding servers to a
data center. For example, you might not have enough rack space, power, network
connections, or budget to add another server. Cluster-aware applications can take
advantage of the distributed nature of the cluster to distribute processing, dynamically balancing load between servers.Thus, an HA cluster can provide better utilization of the server resources you already have, saving the data center and
budgetary resources for scalability in other areas. Moreover, adding servers to the
cluster can be easier with HA software. A new server can be added to the cluster
while other servers are still online.Then, applications can be transferred to the
new server to distribute the load evenly, in much the same way that a rolling
upgrade is accomplished.
The next section covers the configurations of an HA application or database
server, and Microsoft Cluster Server (MSCS) on Windows NT/2000.
197
140_SANs_06
198
8/14/01
3:37 PM
Page 198
Chapter 6 • SAN Applications and Configurations
Typical HA Application or Database Server
A typical HA application or database server consists of several
components:
■
Two or more redundantly configured servers
■
One or more Fibre Channel SANs to enable the sharing of storage
■
At least one fault-tolerant/redundant storage volume
■
An interconnect for cluster messaging (which might also be Fibre Channel)
■
A software mechanism for providing failover operation
Through clustering software, the application server continually communicates
with the clustered spare using network heartbeats to indicate to the other machines
that everything is operating correctly.This heartbeat is typically carried over a dedicated network for clustering traffic. In case of a problem (for example, a software
crash on the operational server or a hardware component failure), a heartbeat link
will indicate to the other server that something has failed or is otherwise inoperable. If that heartbeat is lost, the spare server takes over the function provided by the
application service. Depending on the clustering software, either the entire server or
only specific services on the server can be failed over or failed back.
NOTE
HA clusters can use either an active/active or an active/passive model of
operation. In the active/passive model, the passive server does not provide
any service until a failure condition causes it to assume control of the
cluster and become the active server. Thus, the passive, or standby server
is not utilized at all in normal operation. The active/active model, by contrast, allows each server to provide service—in other words, to be utilized—even during normal operation. The downside to active/active
clustering is that when a failure does occur, performance will be
impacted, since there will suddenly be fewer resources providing the
same services.
HA clusters are also used to enable rolling upgrades—the upgrading of one of
the machines in a cluster to new software or hardware. In a rolling upgrade,
resources are manually transferred to a standby server from the operational server,
and the operational server is taken offline for maintenance. Hardware or software is
140_SANs_06
8/14/01
3:37 PM
Page 199
SAN Applications and Configurations • Chapter 6
added or upgraded, tested, and then the operational server is brought back online
and resources are transferred back to the server.Then, the standby server is similarly
upgraded, without end users encountering a substantial interruption in service.
This enables systems to be upgraded and maintained without affecting critical
business operations.
NOTE
Almost all HA servers in use today require a small interruption in service
during failover. This is the time it takes the standby server to decide that
the primary has actually failed, and then to start up the applications that
that server had been running. The technology required to provide a true
zero-downtime failover is understood. Some HA databases even implement zero downtime failover; however, this functionality is not in general
use, due to its complexity and the need to have applications specifically
written to take advantage of it.
Redundant components are used throughout a system to provide high availability. Eliminating all single points of failure is important to ensure that the
failure of a single component does not bring down the entire cluster. Redundant
HBAs are used to provide two paths to the cluster’s shared storage from each
host. Redundant, noncoupled fabrics provide separate network paths to the
storage.We discuss redundant fabrics further in Chapters 5 and 7. In addition to
redundancy, fault-tolerant equipment designed with dual power supplies, internally redundant controllers, and circuitry is preferable in these environments.
To eliminate single points of failure, a dual-fabric SAN architecture is used.
These fabrics are separated intentionally to prevent loss of service due to operator
error, the need for a rolling upgrade, or major software or hardware problems on
one of the fabrics. By using separate redundant paths, physical cabling problems
are minimized, and disruption of the network from operator error can be isolated
to just one segment.
Dual HBAs are used in the servers to connect to each fabric. Multipathing
software, which is normally provided by your RAID vendor, is used to detect
I/O errors and redirect traffic to the other HBA, thus avoiding unnecessary
cluster node failover.Third-party software such as VERITAS’ Dynamic MultiPathing (DMP) product can also be used to provide this functionality. Finally,
some HBA vendors (such as TROIKA) provide multipathing software support
built into their HBA drivers.
199
140_SANs_06
200
8/14/01
3:37 PM
Page 200
Chapter 6 • SAN Applications and Configurations
NOTE
The overriding design principal in an HA environment is to keep all failures at as low a level as possible. You want to minimize the chance of
the HA software (the highest level of protection) having to actually perform a failover simply because a fan or power supply (the lowest level)
fails. This is particularly critical in large environments, where it is statistically predictable that even components with high Mean Time Between
Failures (MTBF) will fail frequently, simply because there are so many of
them. Therefore, you should not assume that the presence of an HA
environment eliminates the need for fault-tolerant components.
Critical to any HA application server is the use of at least one HA storage
device, supporting either active/passive or active/active storage controllers. In the
active/passive case, a single Logical Unit Number (LUN) is exported to two or
more separate Fibre Channel ports. One port is used at a time, with either an
automatic or manual failover causing the alternate port to become active. Only
one port can be active at any given time. For active/active devices, this limitation
does not apply. A LUN or target is exported to any number of paths.Traffic can
flow across these multiple paths from any node to this LUN.
Microsoft Cluster Server
Microsoft Cluster Server is the most common way that Windows administrators
add HA capabilities to their critical IT systems. Figure 6.1 shows a typical MSCS
configuration.The basics of how a generic MSCS configuration works and
descriptions of the critical parts are explained further later in this section.
An MSCS cluster consists of two (Windows NT,Windows 2000 Advanced
Sever) or four (Windows 2000 Data Center) server nodes connected via redundant
HBAs to a dual-ported storage subsystem via a redundant/resilient architecture
SAN (see Chapter 7,“Developing a SAN Architecture,” for further explanation of a
redundant/resilient architecture).The most common MSCS setup in use today is
the two-node cluster.The two servers are configured as Active and Standby nodes
in the cluster.This employs the active/passive model of HA.The Active node owns
the cluster LUN(s), but the Standby has the right to take ownership when required.
Having ownership rights does not imply sharing. MSCS uses a share-nothing
140_SANs_06
8/14/01
3:37 PM
Page 201
SAN Applications and Configurations • Chapter 6
architecture, which means that only one server can use a LUN at a time.The management software provided by the storage vendor must allow a LUN to be accessed
by two or more hosts for use in a cluster environment.
Figure 6.1 Microsoft Cluster Server Configuration
Dual-Controller
Storage Array
C1
C2
DualFabric
SAN
Architecture
Fabric A
HBA 1
Fabric B
HBA 2
HBA 1
HBA 2
Heartbeat
Active Cluster
Server
Ethernet LAN
Standby Cluster
Server
201
140_SANs_06
202
8/14/01
3:37 PM
Page 202
Chapter 6 • SAN Applications and Configurations
The clustering software uses one or more heartbeat networks to ensure that
the Standby server knows the status of the Active server at all times.These heartbeats can be sent over a dedicated (private) IP network (either Ethernet or IP
over Fibre Channel), a public Local Area Network (LAN), or by using a shared
disk volume. Generally, at least one dedicated connection is used to ensure that
the heartbeat is not interrupted by outages to the public network.
Absence of the heartbeat detected by the Standby server triggers the node
failover function.The Standby node then takes ownership of the cluster LUN
and continues serving the application’s clients. Once the primary node server has
been fixed, a manual failback operation will restore the original configuration.
Manual failback is recommended to avoid a ping-pong effect that can result from
an automatic failback setting.The software can also be manually moved over to
enable server maintenance or rolling upgrades.
A dual fabric and dual HBAs are typically used in HA clusters to ensure that
there are fully redundant hardware components in the system.The recommended
storage is a dual-controller storage array to allow for redundant connections to
the data.
Microsoft Windows 2000 Data Center is a four-node cluster configuration
that has been certified using fabric switches. Unlike MSCS, Data Center absolutely
requires a SAN, since SCSI storage cannot support more than two initiators.
Microsoft WHQL Certification
Microsoft Windows Hardware Quality Lab (WHQL) provides lists and references to hardware that has passed certification by Microsoft for certain applications. WHQL certification is often useful in determining if
hardware and drivers for that hardware have been certified for working
with Microsoft operating system software and advanced OS capabilities
like MSCS. Special certifications are given to hardware (storage, HBAs,
and network components) for running with MSCS, which ensures that
Microsoft has tested all of the hardware for that application and certified that it works. Brocade switches are cluster-certified through Original
Equipment Manufacturers (OEMs) in bundle configurations along with
the OEM storage and qualified HBAs. It is also possible to build custom
HA solutions with Brocade switches, in addition to buying pre-integrated bundles.
Microsoft provides an online database of WHQL-qualified hardware, which can be searched by keyword or category of equipment
(www.microsoft.com/hcl).
140_SANs_06
8/14/01
3:37 PM
Page 203
SAN Applications and Configurations • Chapter 6
Using a SAN for Storage Consolidation
One of the major uses of SAN technology has been for storage consolidation.
The availability of a storage network has enabled data center administrators to
centralize where their storage resources are located, making better use of precious
storage dollars through storage sharing and pooling. By having storage available in
a central pool, usage of that data space is more efficient and the data is easily
backed up, managed, and accessed.This section describes the major methods of
using a SAN for storage consolidation, reviews several of the techniques used to
accomplish this goal, and provides sample configurations.
Before the use of storage networks, storage was dedicated to a specific host.
This resulted in often underutilized or poor distribution of storage capacity. For
example, you might have 100 GB available on one server that was being used for
user’s home directories, but only 1 GB available on another server that was running a business-critical e-commerce database.With dedicated storage, it was
impossible to reallocate those disks from the less critical, and less resourceconstrained, system to the more important one.You would have to buy additional
storage for the critical database server, even though you already owned 100 GB
of unused storage! With the use of a SAN, reallocation could be as simple as reassigning some of the disks associated with less important user files to the businesscritical database in a few minutes without any recabling or downtime.
With dedicated storage, whether externally attached or internal to a server,
adding storage capacity typically requires the full reconfiguration of a system:
shutting down the host, connecting the new storage, and restarting the system.
External arrays might have the capability to add disks “on the fly,” but what if
you need to add a new cabinet? There are always fundamental limits to directattach storage that do not apply to storage attached via a SAN.
With Fibre Channel SANs, making storage available to hosts in a network is
less complex.You connect any Fibre Channel storage device to your switch, and
the device is immediately available to any of the hosts that are connected to the
fabric. Minimal configuration of the fabric is required for those storage resources
to show up on the SAN, and minimal effort is required for that storage to
become available to hosts in the network.You simply connect the storage to the
network, and all of those resources become immediately available. Usually,
all that is required is to configure the new device into the fabric’s zoning table,
if necessary, and to import those newly available volumes into the operating
system. On Windows 2000 and Solaris, this does not require a reboot, and can be
accomplished entirely without downtime.
203
140_SANs_06
204
8/14/01
3:37 PM
Page 204
Chapter 6 • SAN Applications and Configurations
NOTE
Some storage devices require specialized drivers, which might have different procedures for adding capacity. You should check with your storage
vendor to find out how to add access to new storage arrays “on the fly.”
Storage consolidation is supported in almost any configuration of the fabric.
The topology of a Fibre Channel network for storage consolidation and sharing
really is the simplest case for SAN layout.You can use a “SAN islands” approach,
with an interface on each multiport storage array connected to each island.You
can also construct one large fabric.The route you take will depend on your performance and management goals.These topics are covered in greater detail in
Chapters 5 and 7.
With Fibre Channel, devices are dynamically added or removed at any time
from the network.This is an advantage in terms of flexibility and control, but also
a potential difficulty due to the way operating systems handle dynamic volumes.
For example, some operating systems have been built to assume that they own all
storage they are connected to, and will even try to overwrite data on any volume
that they come across.This is particularly the case for Windows NT, which will
write an operating system signature on anything that it discovers, even if it has
previously been claimed by another system in the network.These problems have
led to a number of hardware- and software-based approaches, like Brocade
Zoning, for controlling which systems on your network have access to the
devices you have added.To truly add storage “on the fly,” one or more layers of
functionality might be required. For example, you might want to be able to
dynamically resize logical volumes. Additional software and configuration might
be required in order to facilitate this. For example,VERITAS Volume Manager
and VERITAS File System provide this functionality.
Operating systems currently available generally do not have native support for
Fibre Channel volumes. Instead of directly representing Fibre Channel devices to
an operating system, Fibre Channel HBA drivers abstract storage devices in a
Fibre Channel network, and present them to the operating system as SCSI targets.This leads to an area where configuration is important.You need to understand how this mapping occurs, so that you can translate between the devices as
they appear in the operating system, and the reality of where they are located in
the SAN. Figure 6.2 depicts a simple Fibre Channel SAN.
140_SANs_06
8/14/01
3:37 PM
Page 205
SAN Applications and Configurations • Chapter 6
Figure 6.2 Simple Fibre Channel SAN for Storage Consolidation
Server
Server
Server
Server
Fibre Channel Storage
Shared Access
The access management techniques currently in use fall into one of three categories: switch zoning, LUN masking, and software control. In any SAN where
storage is expected to be shared between different hosts, it will be necessary to
utilize at least one of these methods to control which device has access to which
205
140_SANs_06
206
8/14/01
3:37 PM
Page 206
Chapter 6 • SAN Applications and Configurations
storage volumes. Examples of the different kinds of storage partitioning techniques follow. Zoning is also discussed from an implementation perspective in
Chapter 9, “SAN Implementation, Maintenance, and Management.”
Shared Storage Using a Web Farm
A common use of HA clustering is for making Web farms and their data available
at all times. Unlike most HA scenarios,Web farms do not necessarily require specialized HA software to be used as a group.This is because read-only file systems
can be used to provide shared access to storage, and front-end IP load balancing
switches provide the rest of the solution. In this use of Fibre Channel SANs, a
read-only, centralized storage array is used to support a large number of Web
servers.This approach helps to enormously reduce the costs of acquiring and
managing storage.
The traditional approach to designing a Web farm involves buying large
amounts of local storage for all the servers.With a SAN, far less storage can be
used, and it can be managed in a centralized way. Multiple Web servers are connected on a single Fibre Channel network, with shared access to the same pieces
of storage. Static Web data is kept on this storage, which has been mounted readonly on all of the Web servers. Because of the high speeds of Fibre Channel
versus the slower IP network connections to the Internet, many systems can
access the same storage with no impact on performance.
In a typical configuration for Web farm storage sharing, as shown in Figure 6.3,
a large farm of Web servers would be connected to the Internet through an IP
load balancer (Layer 4 switch).This allows traffic to be distributed to the least
busy server, while making all servers in the farm appear as one logical entity to
the clients on the Internet. All of the hosts would have access to the same volumes on a read-only basis. One “content master” host would need to have readwrite access. A shared file system is required to enable this configuration, unless
some care is taken in the design.
This technique drastically reduces the cost of a Web farm, because you gain a
great amount of efficiency in the use of your storage resources. Instead of dozens
of replicated environments with the same Web data, a single large storage array is
used to support Web requests.This results in significant savings in data storage,
and simplifies management of the data. Less manpower is required to manage
fewer storage devices, use of floor space is minimized, maintenance contract costs
are reduced, and electricity and cooling costs go down.
140_SANs_06
8/14/01
3:37 PM
Page 207
SAN Applications and Configurations • Chapter 6
Figure 6.3 Web Farm Using a SAN
Internet
Web Load Balancer
Ethernet Backbone
Web Server
Web Server
Fibre Channel Storage
Web Server
Web Server
207
140_SANs_06
208
8/14/01
3:37 PM
Page 208
Chapter 6 • SAN Applications and Configurations
Storage Partitioning Using Switch Zoning
Brocade Zoning, the use of the fabric to partition storage and servers into different accessible areas, can be used to partition storage into different pieces. For
example, each disk within a Just A Bunch Of Disks (JBOD) could be assigned to
a different host. By zoning different pieces of storage with different servers,
sharing of the network can be used to allocate storage among hosts. Zoning
within the fabric is particularly important in larger configurations, as it not only
acts to provide partitions to control disk access, but also provides “broadcast containers” similar to IP Virtual LANs (VLANs).Thus, Brocade Zoning acts to
increase the scalability and reliability of fabrics.
To set up storage partitioning, administrators typically map out on paper what
kind of storage distribution they would like across their SAN. Specific storage
targets are assigned to specific servers and workstations.The administrator then
uses a Graphical User Iinterface (GUI) (such as Brocade WEB TOOLS) or a
Command Line Interface (CLI), (such as the Brocade telnet CLI), to create individual zones for all of the servers and workstations.
If your SAN contains multi-LUN devices such as RAID storage arrays, current fabric zoning cannot partition individual LUNs to different hosts, although
hardware offering this capability is on the horizon. Using the software supplied
with such a RAID or using HBA-based LUN masking might be required in
these cases.These should not be seen as a substitute for fabric zoning as they do
not provide an equivalent security model, do not allow for centralized management, and do not act as Registered State Change Notification (RSCN) containers. Instead, these techniques are supplemental to fabric zoning.
Switch Zoning Configuration for Departmental SANs
When an administrator wants to take advantage of storage consolidation and
enhanced manageability, he or she might want to collect several departments onto
one large storage network. In the example in Figure 6.4, three departments
(Engineering, Finance, and Marketing) have been consolidated onto a single
SAN. Each department has a dedicated storage array for its operations and a set
of hosts. Hosts and storage throughout the company can be connected into a
single Fibre Channel storage network. By using switch zoning, engineering hosts
have access to only the engineering storage array, finance hosts have access to
only the finance array, and marketing hosts have access to only marketing data on
the marketing storage array. Using the Brocade Zoning tools, the SAN administrator would create three zones: Zone1, Zone2, and Zone3. Engineering Host A,
140_SANs_06
8/14/01
3:37 PM
Page 209
SAN Applications and Configurations • Chapter 6
Engineering Host B, and Engineering Storage Array would be added to Zone1,
as shown in Figure 6.4. Finance Server would be added to Zone2 with Finance
Storage Array. Marketing Host and Marketing Storage Array would make up
Zone3. All of these zones would be added to a single zone configuration, which
would be set as the active configuration for the fabric.The advantage of this partitioning includes the capability to install a single Fibre Channel-connected
infrastructure in a building or campus that can support any desktop or server
connection to storage, the ability to centrally manage access to storage by an
administrator through the switches, and the ability to centrally back up all storage
through the network.
Figure 6.4 A Departmental SAN Partitioning Using Switch Zoning
Engineering
Server
Engineering
Server
Marketing
Server
Finance Server
Zone2
Zone3
Zone1
Engineering
Storage
Finance
Storage
Marketing
Storage
209
140_SANs_06
210
8/14/01
3:37 PM
Page 210
Chapter 6 • SAN Applications and Configurations
An administrator would set this up by first defining which departments had
storage that could benefit from central connectivity. A Fibre Channel SAN would
be wired throughout a building or campus, and storage for each department
brought online within specific zones. Individual servers and hosts would be
added, one at a time, to the appropriate zones to which they belong by identifying their World-Wide Names (WWNs) or port addresses and assigning them
into a zone set.
Storage Partitioning Using
Storage LUN Masking
Storage partitioning is also accomplished using LUN masking on the storage.
Storage arrays from all major vendors have added this feature to control host
access to storage volumes.The storage administrator determines which hosts talk
to which storage volumes. Usually, this is done by specifying the port or node
WWNs of the HBAs (and thus, hosts) that are connected in the network, and
which physical LUNs they are allowed to access. If a host that is not granted
access attempts to access a volume, the storage array will prevent this and reject
any commands to that device from the alien host. Hosts that are not allowed
access to a LUN will simply not get access. Rogue hosts on a network, human
error, and operating system vagaries will not jeopardize the integrity of your data.
Storage LUN masking controls access where the data is being stored. However, in
an environment where many storage arrays exist, this decentralized management
model might be time consuming to manage. Storage array manufacturers might
charge a substantial extra fee for the capability and software to manipulate storage
LUN masking.You should make sure that you understand up front whether your
array provides this capability, and if so, at what cost. Finally, if you use more than
one manufacturer’s array in your SAN, you need to ensure that you have expertise in each LUN management application.
Storage Partitioning Using HBA LUN Masking
The other end of your storage network is the HBA.This is another feasible point
for controlling which hosts can or cannot access certain LUNs. HBAs also offer
LUN masking functionality for this reason. An HBA can mask what devices the
operating system accesses.This control is typically exercised through a console
application, registry settings, text files, or third-party software. Host-based LUN
masking produces much the same effect as storage LUN masking. It is typically
140_SANs_06
8/14/01
3:37 PM
Page 211
SAN Applications and Configurations • Chapter 6
less expensive, although it requires the active participation of every HBA on your
network.The security model is substantially weaker, since one host—either intentionally, through operator error, or through software malfunctions—can compromise data integrity on your SAN. If even one host on your network does not
have the LUN masking set up correctly, it could mean corrupted data. A rogue
host could also gain access to your data without permission.This should not
imply that HBA LUN masking should not be used, but rather that it should be
used in conjunction with fabric zoning for maximum effectiveness.This type of
multilayer security is common in traditional IP networks: any security consultant
will tell you that the correct place to provide security is not just at the host or in
the network, but everywhere.
Some HBAs allow you to change LUN masking “on the fly,” meaning that
changes you make to masking are reflected immediately on the network. Some
software might require a reboot of the system for the changes to take effect. In
addition, LUN access changes need to be propagated across a network to every
host that is accessing common storage—not a trivial feat.The principal advantages of HBA-based LUN masking are cost and accessibility.When HBA-based
LUN masking is available for an HBA, generally the LUN masking is included as
a standard part of the cost of purchasing the hardware.
Partitioning with Software
To tackle the problems of allocating volumes across devices, software companies
have come up with a number of solutions to help you control which systems in
your network have access to which devices.These solutions usually exist as
drivers that are layered on top of file systems. Like HBA LUN masking drivers,
this software must be loaded on every system in order to work.The existence of
hosts that are accessing storage in the same zone as these machines but that are
not running the software is almost guaranteed to result in data corruption.These
software applications have one thing in common: they act as a filter for other
drivers that control which LUNs are seen by a host, and selectively allow or disallow access to these devices depending on administrator or user requests.
Some software packages actually allow multiple hosts to have read-write access
to the same device at the same time.This is a shared volume/shared file system
approach. Other software packages merely allow convenient and dynamic reallocation of resources between hosts, with only one host having control at a time.
Data-sharing applications are loaded onto servers in a network, and generally
all machines in a network are required to have cooperating servers, or must be
zoned into special areas for volume sharing. Some software applications also
211
140_SANs_06
212
8/14/01
3:37 PM
Page 212
Chapter 6 • SAN Applications and Configurations
require the installation of a metadata server to coordinate access to volumes.
Software applications like VERITAS Volume Manager,Tivoli SANergy, and HP
LUN Manager allow you to manipulate which hosts in the network are allowed
to see which volumes in a network.Through metadata passed between devices,
information about which systems are using which volumes is exchanged with the
drivers on all of the systems.When there is a request to share or unshare a
volume, the software tells the hosts that are no longer allowed access to unmount
a volume. It then tells the hosts that now have access to go ahead and mount and
access the volume. All of this can happen automatically, without additional
user intervention.
These techniques are suited for very dynamic environments.With all of the
packages available today, sharing of data can be changed “on the fly,” from minute
to minute if necessary. A common use of this has been for rendering farms at
Hollywood computer graphics firms, animation houses, and special effects companies.Volumes are shared to individual animators to store work in progress.The
high speeds of the Fibre Channel network are used to transfer the many gigabytes
of data generated by those animators.Volumes are shared between workstations
and groups, and can be reallocated based on changing workload requirements.
Shared file systems also can be used to facilitate the creation of Web farms.
The disadvantage of using software is primarily one of security. All of the
software available today requires that all of the hosts running in the SAN zone be
loaded with the software. Accidentally attaching hosts that are not running the
software could cause massive data corruption.The software relies on cooperating
hosts to be loaded properly to control individual access to volumes. Because of
this, these software packages are best used in tightly controlled environments, and
in conjunction with fabric zoning.
LAN-Free Backup Configuration
Traditional backup systems used SCSI direct-attached tape storage as a method to
back up business-critical data accessed by application servers.This meant that
each application server had its own tape storage, which backed up the data stored
on locally attached disks. Server RAM, I/O bus, and CPU resources were used to
drive the backup process.To combat management problems associated with coordinating the growing number of local tape drives and libraries, inefficient use
of secondary storage resources, and ineffective use of personnel, companies
implemented LAN-based backup using a server-client model.
140_SANs_06
8/14/01
3:37 PM
Page 213
SAN Applications and Configurations • Chapter 6
A central backup server would be installed on the LAN.The application
servers and workstations were configured as clients of this backup server.The
central backup server would accept requests from backup agents running on its
clients, and transfer the data through the LAN to the locally attached tape
resources it managed.This method provided a centralized, easy-to-manage backup
scheme and allowed greater efficiency by sharing tape resources over the network. Unfortunately, the LAN-based solution has several shortcomings.
Backup jobs require a large amount of block data movement, which in this
scenario is carried across the LAN.With so much data being generated every day
and backup windows extended into normal working hours, LAN connections
become swamped with backup jobs. End users complain that they cannot access
network resources, and systems administrators see network performance dwindle.
Administrators can attempt to minimize the effect of the backups on the operation of the LAN by running backups after business hours. However, the amount
of data being backed up continues to increase at nearly exponential rates.With
24x7 operations, the LAN-based solution has turned out to be unfeasible for
most enterprise environments.
LAN-free backups using storage networks have helped solve these problems.
Because Fibre Channel is a high-bandwidth channel, and has been designed from
inception as a separate network for bulk data movement, the bandwidth problems
that appear in running backups over LANs disappear.This separate network, in
addition to offloading traffic from the LAN, also performs its operations with
less CPU overhead than the LAN approach requires.This is due to the fact that
Fibre Channel connections do not need to go through the server’s TCP/IP stack,
and because certain levels of error checking are accomplished in Fibre Channel
hardware.
It is necessary in many solutions to have a host to manage the shared storage
and to store the backup database, which is used to locate and recover data.VERITAS NetBackup is an example of backup software that facilitates LAN-free
backup.
SAN Server-Free Backup
Server-free backup is the use of the SAN to remove backup traffic from the current Ethernet or other IP network without requiring a separate dedicated server.
A SAN-based server-free backup is therefore also LAN-free. Because Fibre
Channel storage networks are now used for data sharing or consolidation, it is
natural to design and implement a server-free fabric-wide backup scheme.This
213
140_SANs_06
214
8/14/01
3:37 PM
Page 214
Chapter 6 • SAN Applications and Configurations
type of backup implementation is in contrast to legacy LAN-based approaches,
where each server reads the data and using IP, sends it on the corporate LAN to
another server with locally attached hardware or LAN-free schemes, where
backup traffic is isolated to a separate Fibre Channel-based network. In contrast
to one host backing up to another, whether it be on a LAN or SAN, a data
mover, at the request of a host, reads from the disk and writes directly to SANbased shared tape resources, without the requesting host ever having to be
directly in the data path.
What can be a data mover? Just about anything. Old servers gathering dust
can be recycled and turned into data movers. Native “smart” Fibre Channelattached tape drives might have embedded data mover functionality. Fibre
Channel-to-SCSI bridges and routers, which allow legacy SCSI device attachment to the SAN, almost universally have this feature.Typically, data movers can
be either Network Data Management Protocol (NDMP)-based or use the
Extended Copy command, sometimes called third-party copy.
NDMP is an open standard protocol for enterprise-wide backup of heterogeneous storage. NDMP clients and servers pass metadata about the backup job
status as well as the data itself.Traditionally, NDMP was used in the networkattached storage model.
Extended Copy is a SCSI protocol command that allows a remote block-level
copy to occur.The reason this method is called third-party copy is that the host
actually requesting the copy command does not send it directly to the devices in
question. Instead, it sends the request to a third-party device, which then sends the
command to the appropriate targets. In general, Fibre Channel-to-SCSI routers
and native Fibre Channel tape drives use Extended Copy, while NDMP uses
legacy hosts for data movement from disk to tape.
Why do server-free backups? Typically, backup used to be done through a
centralized console or backup server.This backup server would communicate
with backup agents on all of the hosts or servers in the corporate Ethernet network at a convenient hour, and request certain files to be sent from storage to
tape.This works well for installations with a few file servers and small data sets.
Enterprise companies are now finding that all other IP traffic comes to a halt
when backups are occurring, and server CPU cycles are being saturated from all
of the IP traffic. Moreover, with the immense data growth occurring throughout
the industry, the amount of time a backup takes has extended from a few hours
each night to sometimes an entire day. In extreme cases, the times required to
back up an enterprise data set, even incrementally, have gone beyond the
Ethernet network’s capability to transport that data within a day.To make matters
140_SANs_06
8/14/01
3:37 PM
Page 215
SAN Applications and Configurations • Chapter 6
worse, the move toward 24x7, always-on Web-based business models, has made no
time of the day available for loading the corporate network. Continuous customer access and nonstop transaction processing have become critical for
business survival.
Today, the best solution to these dilemmas is server-free backup. Existing or
new Fibre Channel storage networks can take full advantage of server-free
backup technology. For most new SAN installations using LAN-free only
backups, the hardware functionality exists by default.What problems are solved in
the data center? Server-free backup is done directly by each data mover writing
to tape directly, without the need for agents or IP traffic, or any consumption of
CPU time on other servers. A master backup server coordinates tape sharing
between media servers. Other features such as snapshot copy, which exist as hardware- and software-bundled features for enterprise RAIDs, in addition to a
backup software application option, enhance server-free backup implementations
for database servers.The idea is that instead of the data being read from disk
drives, into the memory of servers, and sent through an IP network to the
backup server, the data is block-copied from disk to tape directly. Figure 6.5
shows an example of this, where three different hosts share a single tape drive
across the SAN.
SAN-Based Third-Party Copy Data Movers
Third-party copy backup systems are very similar to the LAN-free systems discussed previously. However, with this technique, specialized pieces of hardware
and software called “data movers” are used to back up critical data from storage
arrays in the SAN without the need for a dedicated server or server(s) to handle
the data copying and movement.
All data movers support the SCSI Extended Copy command.The third-party
copy hardware actually moves the data from the disk to tape.The backup software
controls this operation without the need for the servers in the network to get
involved in the actual movement of data. Agents are not required to run on the
server, and critical servers are not occupied backing up data.
Third-party copy hardware, such as some Crossroads or Chaparral Fibre
Channel/SCSI bridges, works in conjunction with third-party copy-enabled
backup application software (from VERITAS, Legato, Computer Associates, and
others).This backup software operates identically to normal backup software in
terms of user interface and operation. Overall, this technique increases performance, greatly reduces the time needed for backup windows, and eliminates the
task of backup job processing from the CPU of the server.
215
140_SANs_06
216
8/14/01
3:37 PM
Page 216
Chapter 6 • SAN Applications and Configurations
Figure 6.5 Using Storage Networks for Server-Free Backup
Host A
Fibre Channel Storage
Host B
Fibre Channel Storage
Host C
Fibre Channel Tape
A typical third-party copy configuration with currently available hardware typically uses a Fibre Channel-to-SCSI router as a bridge to legacy SCSI tape drives,
and combines that functionality with third-party copy support (Figure 6.6).
Making Your Enterprise Disaster Tolerant
As computer systems become increasingly central to business operations, the
integrity and availability of those systems has become one of the most important
charters of IT organizations.The loss of access to computer systems and data today
even for a few minutes can mean millions of dollars of lost revenue, and damage
to reputation. Because of the criticality of those systems and data, service-level
agreements and in some cases government regulations require that systems must be
available to provide business continuance in the face of major disasters.
140_SANs_06
8/14/01
3:37 PM
Page 217
SAN Applications and Configurations • Chapter 6
Figure 6.6 Third-Party Data Mover Configuration
Host A
Host C
Host B
Command
to move
data
er
ov
aM
t
Da
Fibre Channel/SCSI Bridge
Fibre Channel Storage
Fibre Channel Storage
Actual block
data
movement
Fibre Channel Tape
Enterprises that are implementing a SAN are finding that the ability to
mirror and operate their devices spread across large distances helps to provide disaster tolerance to their critical installations.The presence of a SAN helps enhance
the ability of a company to protect and recover data in the case of a disaster, and
provides the tools to enable an administrator to design a disaster-tolerant system.
Fibre Channel technology can be coupled with technologies like Dense Wave
Division Multiplexing (DWDM) for Metropolitan Area Networking (MAN), and
tunneling through existing high-speed Wide Area Networks (WANs).Thus, it is
217
140_SANs_06
218
8/14/01
3:37 PM
Page 218
Chapter 6 • SAN Applications and Configurations
now possible to separate key data sites across great distances, while still allowing
them to share disk subsystems and backup devices.The following sections summarize the creation of a geographically separated, but fully connected SAN.They
describe how Brocade switches are uniquely suited to handle these needs with
features known as Remote Switch and Extended Fabrics.
Data Replication and Remote Backup
Data replication is used to make enterprises disaster tolerant across long distances,
particularly in the case where bandwidth is not readily available or is very expensive. Data replication is the technique of taking a snapshot of your operational
storage images at a specific point in time, and moving across a network to a geographically separate storage facility.This data is then moved, on schedule, across a
potentially slower network across large distances, and replicated at the backup
facility. Data replication is typically done every day or at most every few hours,
and it can be done across a city or across the globe. Because there is a delay
between updates, data can sometimes be sent over a slow link and reassembled on
the other side.
Data replication can be done directly across Fibre Channel, enabling very fast
replication and minimal mismatch between a replica and live data.This technique
can also be used across existing IP and other network infrastructures to transport
the data across large distances. For example, it is possible to tunnel Fibre Channel
connections over an Asynchronous Transfer Mode (ATM) network using the
Brocade Remote Switch product and an appropriate Fibre Channel-to-ATM
gateway.This is an optionally licensed product available for all Brocade SilkWorm
2000 and higher switches.
Remote backup is the use of a long-distance link to enable backup to a
remote site. Normal backup techniques are used, with the difference being that
backup tapes and media are stored far away from the servers being backed up.
This helps ensure the safety of that data in the case of a geographically limited
disaster. Like remote replication, the task can be done via a Fibre Channel-toATM or Fibre Channel-to-IP gateway. Figure 6.7 shows a typical remote backup
configuration utilizing existing WAN infrastructure.
140_SANs_06
8/14/01
3:37 PM
Page 219
SAN Applications and Configurations • Chapter 6
Figure 6.7 Remote Backup over WANs
Primary Site
Server
ATM
Recovery Site
Fibre Channel/ATM
Fibre Channel Storage
Fibre Channel/ATM
Fibre Channel Tape
Metropolitan Area Network Solutions
A recent innovation in optical technology, DWDM hardware has enabled the
transport of native Fibre Channel over greater distances up to 100 km. A DWDM
allows for real-time, full-speed transport of Fibre Channel to match the very high
bandwidth requirements of real-time, mission-critical applications.
219
140_SANs_06
220
8/14/01
3:37 PM
Page 220
Chapter 6 • SAN Applications and Configurations
This approach is used for creating disaster-tolerant solutions, by enabling
remote mirroring of operations across large distances. Because these solutions can
transmit full-speed Fibre Channel frames, two separate data centers can share the
same data, and can create remote mirrors of data.This remotely mirrored data
allows for a hot-standby system that can take over the operations of a failed
system at a moment’s notice, with all data intact and no need for data recovery. In
fact, DWDMs can be used in conjunction with HA software to allow this failover
to occur both automatically and quickly.
Brocade switches compensate for the signal delays that happen when transmitting frames over long distances with the use of extended amounts of
buffering (buffer-to-buffer credits) available on Inter-Switch Links (ISLs).This
delay is caused by the speed with which light travels through the glass in the
fiber-optic cable. By configuring two switches to use extended buffer credits on
the long-distance E_Ports, Brocade switches can handle this delay without
losing bandwidth.
A Brocade switch can be configured to handle extended distance fabrics by
installing the Extended Fabrics software license, and then setting the long-distance fabric settings to “1” in the switch configure command. Individual switch
E_Ports can either be set to handle distances of 50 km (mode 1) or 100 km
(mode 2, 60). See the Brocade Extended Fabrics documentation for details
on configuration.
One of the most typical uses of Fibre Channel SANs over MANs is sharing
data for disaster tolerance. Banks, brokerages, and other businesses in Manhattan
are some of the biggest users of this technology.These organizations require realtime backup of data to a remote site. A combination of disaster tolerance requirements and the cost of real estate in Manhattan has resulted in a large number of
organizations establishing disaster recovery sites in New Jersey for a secondary
operations center.
Figure 6.8 shows a typical disaster tolerance configuration used for a MAN
topology.Two parts of the same SAN exist on either side of a MAN (DWDM),
operating just as if they were not geographically separated. Data is mirrored
between both sides of the SAN, and failover software can be used to provide high
availability with backup servers on either side of the link.
140_SANs_06
8/14/01
3:37 PM
Page 221
SAN Applications and Configurations • Chapter 6
Figure 6.8 Metropolitan Area Network, Bridging Fibre Channel over DWDM
Site B
Site A
Server
Server
DWDM
DWDM
Data Mirror
Fibre Channel Storage
Fibre Channel Storage
221
140_SANs_06
222
8/14/01
3:37 PM
Page 222
Chapter 6 • SAN Applications and Configurations
Summary
In this chapter, we discussed the most common overall applications of SANs, and
some sample configurations for those applications.
HA clusters are used with Fibre Channel networks to support mission-critical
business applications utilizing redundant SAN components.These can include
database clusters like Oracle Parallel Database Server, and specialized failover
packages such as Microsoft Cluster Server.
Storage consolidation is another typical application of Fibre Channel networks. SANs have made it possible for data center administrators to centralize
their storage resources, making better use of the storage they have, and leveraging
their budget for storage through more efficient allocation and capacity planning.
With some attention paid to managing the allocation of storage through fabric
zoning, LUN masking, or storage sharing software, the use of a SAN for storage
consolidation can pay off handsomely in better use of IT budgets, enhanced manageability of data, and more reliable operation of your data center.
The use of a SAN for LAN-free backup, server-free backup, and newer techniques like third-party copy, offers solutions to backing up the vastly growing
amounts of storage in your enterprise. Because of the bandwidth and efficiency
of Fibre Channel, SANs are fundamentally better suited for backup than LANs
are. Server-free backup configurations offer even more efficient backup of storage
resources, requiring more advanced software capable of directly backing up data
without the intervention of servers. Finally, third-party copy provides even more
efficient use of the network without requiring even a backup server.
With the advent of DWDM equipment that can transmit high-speed Fibre
Channel data across MANs, enterprises can add disaster tolerance to their data
centers. By enabling remote mirroring and replication of data, clustering across
long geographical distances enhances the ability of the enterprise to keep critical
systems up and running in the most extreme conditions.
Solutions Fast Track
Configuring a High-Availability Cluster
HA clusters are used for redundant, fail-safe installations of mission-crit-
ical business applications.
Clustering provides availability, manageability, and scalability.
140_SANs_06
8/14/01
3:37 PM
Page 223
SAN Applications and Configurations • Chapter 6
Availability is the capability of a cluster to tolerate hardware, network, or
software errors.
The most common use of clustering is two servers configured to share
storage through Fibre Channel.
Redundant HBAs and switches should be used to provide fault tolerance.
The use of dynamic multipathing software, drivers, or HBAs can provide
higher levels of availability to your cluster.
Using a SAN for Storage Consolidation
Storage consolidation enables administrators to centralize
storage resources.
Consolidation provides more efficient use of storage, enhances
manageability, and improves accessibility.
Almost any layout of a storage network can be used for
storage consolidation.
Consolidation requires attention paid to how operating systems treat
shared volumes.
In order to properly partition data in a consolidation environment, you
need to use fabric zoning, LUN masking on storage or the host, or software to control permissions.
It is generally best to use fabric zoning even when also using another
access control product to achieve a more effective security model, and to
provide a “broadcast container,” which can increase the scalability and
reliability of a SAN.
An example of a typical storage consolidation setup is a shared SAN
used to provide data storage for a Web farm, where many servers read
the same disks to present data.
Storage LUN masking is used to ensure that only specific hosts are
allowed access to specific logical units of a storage array.The advantage
of storage LUN masking is that the storage guarantees which host is
allowed access to any volume.
223
140_SANs_06
224
8/14/01
3:37 PM
Page 224
Chapter 6 • SAN Applications and Configurations
HBA LUN masking is also used to limit what storage a host can see, and
requires that every host in the network participate in the same
masking scheme.
Software partitioning provides another type of control over LUN
presentation, but it generally requires upper-level software and demands
that every host in the network be loaded with that software.
Switch zoning, available in Brocade switches, provides a convenient way
to allocate storage to hosts, and to consolidate different departments into
a single company network.
Switch zoning does not currently support control at the LUN level, only
at the port and WWN levels. Upcoming products will add this capability.
For now, other access control techniques might need to be used in addition to switch zoning to provide access control at the LUN level.
Storage LUN masking provides another way to control access to
volumes in a shared SAN.
High-end storage arrays provide the capability to specify the port or
node WWN of a host HBA, and specify which volumes in the array will
respond to requests.
By using storage LUN masking, you can ensure that only hosts with
permission can read or write from a specified volume.
Storage LUN masking requires the participation of the storage only to
enforce permissions.
HBAs provide access control to volumes through LUN masking.
LUN masking controls which volumes an operating system can see
through a particular HBA.
HBA LUN masking requires the participation of all of the hosts in the
network to avoid contention for storage resources.
LAN-Free Backup Configuration
Traditional backup systems used SCSI direct-attached tape storage.The
LAN-based client-server backup model, although an improvement,
cannot account for ever-increasing amounts of data through the LAN
connection. LAN-free backups using storage networks solve LAN-based
problems by offloading traffic from the LAN and increasing bandwidth.
140_SANs_06
8/14/01
3:37 PM
Page 225
SAN Applications and Configurations • Chapter 6
SAN Server-Free Backup
Server-free backup is the use of a SAN to remove backup traffic
from a LAN.
Backup is done directly on the SAN for each device, rather than each
host being involved in data transfer.
Third-party copy provides an even more efficient way to transfer data to
tape, freeing a backup server from needing to directly access disks and
copy data to tape.
Making Your Enterprise Disaster Tolerant
Fibre Channel SANs are ideal for mirroring and accessing data across
large distances.
It is now possible to separate critical systems many miles apart.
Brocade switches provide extended credits on ISLs to enable high
performance and reliable long-distance operation.
225
140_SANs_06
226
8/14/01
3:37 PM
Page 226
Chapter 6 • SAN Applications and Configurations
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: My configuration does not look exactly like any of these. Is this a problem?
A: These examples represent typical configurations for applications. Many reallife configurations might be more complex.
Q: Where can I get more information on configurations?
A: Interoperability programs through most Fibre Channel manufacturers provide
example configurations, along with more detailed version numbers and specific model information for tested configurations. Brocade has an extensive
set of SAN solutions called SOLUTIONware Guides available on their Web
site at www.brocade.com/SAN.
Q: I am trying to control access to storage, and do not know what type of control I need: zoning, LUN masking, or software? What should I do?
A: The kind of control over your storage depends entirely on your application.
Analyzing how dynamic your environment is will determine whether you
can just use zoning or software. In most cases, you might actually use a combination of these techniques to achieve what you need.
Q: I would like to cluster my databases for better performance.What databases
can I use?
A: Most major databases now support fabric switch-based clustering, including
Oracle Parallel Server, IBM DB2 Parallel Edition, and Microsoft SQL Server.
Q: I would like to have my Exchange Mail Server highly available.What
should I do?
A: Brocade has developed HA solutions for the Exchange Server that can be
used in setting up your desired SAN configuration. For more information,
visit the Brocde Web site: www.brocade.com/SAN.
140_SANs_07
8/14/01
3:38 PM
Page 227
Chapter 7
Developing a SAN
Architecture
Solutions in this chapter:
■
Identifying Fabric Topologies and
SAN Architectures
■
Working with the Core/Edge Topology
■
Determining Levels of Availability
■
Configuring Traffic Patterns
■
Evaluating Performance Considerations
Summary
Solutions Fast Track
Frequently Asked Questions
227
140_SANs_07
228
8/14/01
3:38 PM
Page 228
Chapter 7 • Developing a SAN Architecture
Introduction
In Chapter 5, “The SAN Design Process,” you performed the requirements
analysis to determine what your SAN needs to accomplish. In Chapter 6, “SAN
Applications and Configurations,” you explored some of the solutions that could
be built on your SAN. At this point in the book, you should know the following
information about your SAN:
■
How many ports you need for hosts
■
How many ports you need for storage
■
What the traffic patterns will be
■
The network’s performance requirements
■
Where all of the equipment will be located
■
What, if any, MAN/WAN or campus distances will be involved
■
What type of solution you are building (such as, storage consolidation)
■
How all of this will likely change over time
In this chapter, you will take this information and determine the fabric
topology or topologies that best suit your needs as part of your overall SAN
architecture.We discuss the different categories of fabric topologies that you
could apply and which topology is most appropriate in any given case. Furthermore, we describe how you could use multiple fabrics to form a highly reliable
and scalable SAN architecture.We delve into detail on one particular topology,
the core/edge fabric, also commonly known as a star topology network.There are
subtle differences between “normal” star networks and a core/edge SAN that
require using the new term. However, for the most part, if you think of a star
network, you will not be far off base.
NOTE
Do not view the focus on the star topology as an indictment of the other
approaches you could take. We chose to highlight this approach because
it is simple. One of the strongest features of Brocade Fabric OS is the
robust implementation of Fabric Shortest Path First (FSPF), which allows
arbitrarily complex networks to be possible. However, the simplicity principle of “Occam’s razor” tells us to use simpler solutions whenever we
can, and the core/edge SAN is simple indeed. Thus, we felt it to be an
appropriate design to highlight.
140_SANs_07
8/14/01
3:38 PM
Page 229
Developing a SAN Architecture • Chapter 7
Methods for providing the best redundancy in all topologies and a discussion
of performance follow. Finally, we talk about how to deal with SANs that must
span long distances.
Identifying Fabric Topologies
and SAN Architectures
Before embarking on a discussion of SAN architectures, we will review a set of
working definitions for those SAN components particular to these architectures.
The terminology for describing fabric topologies, SAN architectures, and their
components is still evolving. Much of the current terminology is derived from
other networking technologies, like Ethernet, frame relay, or even high-performance computing node interconnects in supercomputers. Since there are occasionally multiple terminologies that might apply to the same entity, the terms we
use in this book are the ones that we consider the most useful in describing any
given SAN. For example, the term ring topology fabric is quite self-explanatory, and
is therefore useful in describing “a fabric in which the Inter-Switch Links (ISLs)
form a logical ring.”
■
Fabric A fabric consists of one or more interconnected Fibre Channel
switches.The term fabric can refer to the physical switches or to a set of
global software components such as the routing tables, zoning configurations, and Name Servers.
■
SAN A SAN can consist of one or more related fabrics and connected
edge devices. It is possible to build a SAN using various networking
technologies. However, all SANs discussed in this book are Fibre
Channel fabric-based. Several emerging technologies might complement
Fibre Channel in the future, so it is important to make the distinction.
For now, though, the vast majority of SANs in production—and all
SANs based on Brocade technology—use Fibre Channel.
■
Fabric Topology Fabric topology is the arrangement of the switches
that form a fabric.This term is used in the context of ISL interconnection and does not relate to the way in which nodes are connected to the
fabric. Moving a storage device from one port to another does not
change the fabric topology.
■
Resilient Core/Edge Fabric Topology Resilient core/edge fabric
topology is when two or more switches act as a core to interconnect
229
3:38 PM
Page 230
Chapter 7 • Developing a SAN Architecture
multiple edge switches. Nodes are attached to these edge switches.We
discuss this topology in greater detail later in the chapter.The convention for describing a simple core/edge fabric involves stating the number
of edge switches, the number of core switches, and the number of ISLs
used to interconnect each edge switch to each core switch. (For now, all
switches are assumed to be 16 ports.) It is written as 16ex4cx1i, and is
read as “A simple core/edge fabric consisting of sixteen edge switches,
each of which is connected to four core switches by one ISL.” A shorter
reading could be “A sixteen edge by four core by one ISL core/edge
fabric.” Figure 7.1 illustrates this nomenclature.
Figure 7.1 Core/Edge Fabric Nomenclature
16e x 4c x 1i
4c
x
by
1i
x
4 co
L
re
by
1 IS
16 e
dge
230
8/14/01
16 e
140_SANs_07
■
Node A node is any device—usually either a host or storage device—
that attaches to a fabric.The terms node and edge device can be used
interchangeably.
■
Node Count Node count is the number of nodes attached to a fabric.
Each node might take up one or more ports in one or more fabrics.
■
Fabric Port Count Fabric port count is the number of ports available
for connection by nodes in a fabric.The term port count used alone refers
to a fabric port count.
■
SAN Port Count SAN port count is the number of ports available for
connection by nodes in the entire SAN.This is the sum of the port
counts in all fabrics that make up the SAN.
■
SAN Architecture SAN architecture is the overall design or structure
of a storage network solution.This includes one or more related fabrics,
each of which has a topology. It might also include other networks over
which the SAN is bridged or tunneled—such as a Metropolitan Area
Network (MAN). More broadly, SAN architecture can include software
140_SANs_07
8/14/01
3:38 PM
Page 231
Developing a SAN Architecture • Chapter 7
components—such as path failover or data backup software—and the
nodes that are attached to the fabric(s).
■
Hop Count Hop count can be defined in several ways. For the purpose of evaluating SAN designs, the hop count is identical to the
number of ISLs that a frame must traverse to reach its destination. If
traffic is localized to a switch, the hop count is zero. If traffic has to cross
one ISL, the hop count is one.
■
Latency Each hop takes a small amount of time.This time is referred to
as the latency of the link. It is a very small amount of time (2 microseconds maximum across a switch, 1 microsecond typical), and will
influence performance only when the hop is a long-distance link (as in a
SAN/MAN or SAN/WAN).This number is usually so small when
compared to disk access times that it normally can be treated as inconsequential and eliminated as a factor.
■
Over-Subscription Whenever more nodes could potentially contend
for the use of a resource—such as an ISL—than that resource could
simultaneously support, that resource is said to be over-subscribed. Oversubscription can be a desirable attribute in a fabric topology and is
common in most networks as a cost/benefit trade-off, as long as it does
not produce unacceptable levels of congestion. Although a design might
have over-subscription, it does not necessarily mean that it will have
congestion. Most nodes cannot sustain full Fibre Channel speeds, typically running at 50 to 80 percent of the maximum 100 MB/sec. For
performance-limiting congestion to occur, several devices must not only
all operate at their peak at the exact same time, but must also sustain
simultaneous peak operation. As most traffic is bursty, as well as relatively
random, the chance of significant congestion is usually reduced to an
insignificantly small amount.The exception to this rule is traffic such as
video streaming.This kind of application produces a long, constant
stream of data. If designing for this type of traffic, you must ensure
adequate bandwidth for these streaming sources or localize the traffic
onto a switch.
■
ISL Over-Subscription Ratio The over-subscription ratio for an ISL
is the number of different ports that could contend for the use of its
bandwidth.This should be calculated for an edge switch in a core/edge
SAN by making a ratio of the number of free ports (non-ISL) on that
231
140_SANs_07
232
8/14/01
3:38 PM
Page 232
Chapter 7 • Developing a SAN Architecture
switch to the number of ISLs. For example, in the 16ex4cx1i SAN, each
edge switch has four ISLs (one ISL going to each of four cores). Since
each edge switch has 16 ports, there are 12 left. 12:4 reduces to 3:1, so
the over-subscription ratio is 3:1 for this SAN. Keep in mind that this
just indicates the number of potential devices that could contend for
access to an ISL.This does not necessarily mean that they actually are
contending for access.
NOTE
In discussing over-subscription ratios, assume that all ports operate at
the same speed: for example, 1 Gbit/sec. In a network where some ports
are different speeds (some are 1 Gbit/sec, while others are 2 Gbit/sec),
the process of calculating over-subscription can be fairly complex.
The worst-case scenario of meaningful over-subscription for an ISL
on a 16-port edge switch is 15:1.This ratio means that 15 devices could
be contending for the use of one ISL.This is not a property of Brocade
switches. It is a mathematical property of “networks built with 16-port
switches where all ports operate at the same speed.” One could argue
that more than 15 nodes outside a switch could contend for access to it.
However, this is not a meaningful definition of ISL over-subscription,
since the nodes would be subject to performance limitations of node
over-subscription. If two hosts are trying to access one storage port, it
does not matter how well the network is built—the over-subscribed
storage port will be the limiting factor. Over-subscription of a storage
port is a completely different performance metric. See the definitions for
fan-in and fan-out.
As switches get larger, over-subscription will continue to be a design
factor. Networks and node counts will continue to grow faster than the
ability to build larger switches. (In fact, the Fibre Channel addressing
standards themselves limit the maximum size of a switch to 256 ports,
while a fabric could be significantly larger than that.) Networking will
still be required in SANs and will have the potential for congestion
regardless of the vendor or the switch size.
140_SANs_07
8/14/01
3:38 PM
Page 233
Developing a SAN Architecture • Chapter 7
■
Congestion Congestion is the realization of potential oversubscription. A congested link is one on which multiple devices are
actually contending for bandwidth.These devices will be throttled down
to consume the total bandwidth, assuming that all the devices are
peaking and congesting the link at the same time.
A frequent point of confusion is the difference between congestion
and blocking. Blocking means that the data actually does not get to its
destination, whereas congestion means that the data might simply have
to wait a bit.
As an analogy, consider the checkout line in a supermarket.When
there are more customers in the store than there are cashiers, the
checkout line is over-subscribed. However, if only one customer wants
to check out at a time, the presence of other customers in the store is
irrelevant to him or her. If more customers need to check out than there
are cashiers, the checkout line is congested.This will have an impact on
how quickly each checkout can occur. In a congested but nonblocking
supermarket, each customer might have to wait in line a bit, but will
eventually get to go through the checkout. If the supermarket took a
blocking approach, excess customers would be turned away, and would
not be allowed to wait in line. Some networks block; Brocade Fibre
Channel networks do not.
■
Fabric Shortest Path First (FSPF) The FSPF protocol was developed by Brocade and subsequently adopted by the Fibre Channel
standards community for allowing switches to discover the fabric
topology, and route frames correctly. (As an interesting side note,
Brocade proposed FSPF as a standard to ANSI one and a half years
before its adoption. A number of competing protocols were proposed by
other companies in the interim, but the T11 standards committee
selected only FSPF.) FSPF also provides for load sharing between equalcost links.This capability is the key to eliminating congestion from a
fabric. Fabric topologies that have many equal-cost links—such as the
resilient core/edge topology—benefit greatly from FSPF load sharing
features. Fabrics with many ISLs, but few equal-cost paths—like the full
mesh—do not use FSPF as efficiently.
■
Single Point of Failure (SPOF) A single point of failure in a SAN is
any component—either hardware or software—that could bring down
an entire SAN solution.This could be a switch or an ISL in a nonresilient topology, or a host with no clustering software installed. In an
233
140_SANs_07
234
8/14/01
3:38 PM
Page 234
Chapter 7 • Developing a SAN Architecture
environment where uptime is critical, even distributed fabric services—
such as the Name Server—can be viewed as single points of failure.
This is why dual fabrics are always recommended in high-availability
applications.
■
Fan-out This is the ratio of host ports to a single storage port. It is the
view of the SAN from the storage port’s perspective: “How many different hosts will be trying to access me via this port?” If you have many
hosts accessing a single storage port, you can be assured that all of them
together will not use more than the 1 Gbit/sec the port has available.
Thus, if you have a fan-out of 10:1 (10 hosts sharing access to one
storage port), it is possible that on average each host will get only onetenth of the available bandwidth. Using high fan-out or ISL over-subscription ratios can be a perfectly acceptable way to build a SAN, as long
as the performance characteristics of the applications involved are well
understood.
■
Fan-in This is the ratio of storage ports to a single host port. It is the
view of the SAN from the host port’s perspective:“How many different
storage devices will I be trying to access from this HBA port?” Fan-in
and fan-out are both useful for planning the aggregate bandwidth needed
in the ISL matrix. If, for example, you have a fan-in of 3:1 (three storage
ports for each host port), and your ISLs are 3:1 over-subscribed (three
devices could potentially contend for access to one ISL), proper device
placement will allow this SAN to run with an actual congestion ratio of
1:1.This is because you can guarantee that certain devices will simply
never try to talk to each other.Thus, the SAN will not adversely affect
performance, despite the ISLs being over-subscribed.
There are several categories of information for which we need to define
terms.These categories cover such topics as how the switches are interconnected
(the topology), what strategy is used for optimizing traffic patterns (locality or
tiering), and what the distance requirements are (MAN/WAN). In order for distance to affect topology, the distances involved do not have to be long, they just
have to affect your ability to cable components together. Even having SAN components located in different rooms or floors of the same building might affect the
SAN topology you choose.You will need to know how many fabrics are
involved, and whether they all have the same topology.The last two items are part
of the SAN architecture, not the fabric topology.
140_SANs_07
8/14/01
3:38 PM
Page 235
Developing a SAN Architecture • Chapter 7
A SAN that attaches to a long-distance network such as a MAN or WAN is
described as being “over” that technology. A SAN tunneled over a WAN is
referred to as “SAN over WAN” or “SAN/WAN.” If a specific WAN technology
is known to be in use, then you can refer to technologies in the same way. For
example, if Fibre Channel is used on the SAN, and ATM is used on the WAN,
you can refer to this as “FC over ATM” or “FC/ATM.”
The following sections provide terms and definitions for the pieces that are
not quite so easily defined. In particular, we provide terms for fabric topologies
and SAN architectural strategies. Each of the topologies that we cover includes a
description of the topology, a picture of a representative network built using that
topology, and a summary list of its properties. All examples and properties assume
that the networks are built using 16-port switches, unless otherwise noted. Size
limits are based on current production-level support. Production level implies that
Brocade has tested fabrics to these size limits, and/or has customers who have
successfully done so.This does not mean that all vendors will support the maximum size listed, but rather that somebody does, and that there are no fundamental
technical reasons why fabrics of that size cannot be deployed in production
environments.
Also note that sizes are based on the most symmetrical designs. It is usually
possible to increase the size of a fabric by “hanging another switch off the edge.”
This means that you would take a “pure” design like a core/edge fabric, and plug
another switch or two into it anywhere you like to increase the port count.
Useful Topologies
The topologies for which we provide definitions are not the only topologies possible.This list does not represent all of the standard topologies that can be applied
to fabrics. Instead, we have attempted to compile a list of topologies that we have
found to be the most useful in practical, real-world SANs. For example, we
include the core/edge topology, but not the tree topology.This is because we
have not seen any instance where the tree is the best way to solve a design
problem, not because there is anything that would prevent you from using it.
You can also combine these topologies to form complex networks, provided
that you do not exceed the supported numbers of switches or exceed the Fibre
Channel standards defined hop count limit. (There can be no more than seven
hops between an initiator and a target.) These complex networks are desirable
when building SAN/MAN or SAN/WAN solutions, but should be avoided to
keep the network as simple as possible.
235
140_SANs_07
236
8/14/01
3:38 PM
Page 236
Chapter 7 • Developing a SAN Architecture
Scalability
One of the properties we discuss for each topology is its ability to scale well.
There are two metrics for measuring scalability: the size that the topology can
scale to (in terms of port count and switch count), and the ease of performing
this process. A number of scalability factors are involved in determining these
metrics, including the fundamental “geometric” properties of the design, and the
way that the channel-like fabric services perform in it. Certain topologies, such as
the resilient core/edge topology, are well suited to taking a “start small and grow
large” approach without needing much effort to add switches. However, some
topologies, like the partial mesh, might scale to a large size, but require extensive
recabling and possibly even downtime to do so.There is nothing fundamentally
wrong with the latter categories of design, but they might not be well suited for
dynamic environments that have high uptime requirements.
Cascade Topology
A cascaded fabric, illustrated in Figure 7.2, is like a bus network: it is a line of
switches with one connection between each switch and the switch downstream
of it.The switches on the ends are not connected.
Figure 7.2 A Cascaded Fabric
Cascaded fabrics are very inexpensive, easy to deploy, and easy to expand.
However, they have the lowest reliability and limited scalability.They are most
appropriate in situations where most if not all traffic can be localized onto individual switches, and the ISLs are used primarily for management traffic.
There are cascade variations that use more than one ISL between switches.
This will eliminate ISLs as a single point of failure and greatly increase the
140_SANs_07
8/14/01
3:38 PM
Page 237
Developing a SAN Architecture • Chapter 7
reliability of the solution. However, this also greatly increases the cost of the
solution, and each switch can still be a single point of failure.Table 7.1 charts the
properties of a cascade topology.
Table 7.1 Properties of a Cascade Topology
Limit of scalability (edge port count) 114 ports / 8 switches
Properties
Ease of scalability
Performance
Ease of deployment
Reliability
Cost (per edge port)
Ratings
3
1
3
1
3
Ratings indicate how well the topology meets the ideal requirements of that
property (1 = Not well, 2 = Moderately well, 3 = Very well).
Ring Topology
A ring is like a cascaded fabric, but with the ends connected (Figure 7.3).
Figure 7.3 A Ring Topology
The ring has superior reliability to the cascade because traffic can route
around an ISL failure or a switch failure. It does cost more than a cascade, but not
significantly more.The ring is usually preferable to the cascade for this reason.
Rings still do not perform well, or scale well.The ISLs are still very much
over-subscribed and will perform acceptably only when significant localization of
237
140_SANs_07
238
8/14/01
3:38 PM
Page 238
Chapter 7 • Developing a SAN Architecture
traffic is possible. Of course, you can improve performance by using more than
one ISL per switch interconnect, but—just as in a cascade—this substantially
reduces the scalability and increases the cost of the fabric.To scale the fabric, you
must disconnect at least one ISL, which might be disruptive to a production SAN.
This design is fine if you plan for your SAN to start small and stay small. It
can also be used when implementing SAN over MAN or WAN, where the
topology of the MAN/WAN might dictate the topology of the Fibre Channel
network. (Rings are common MAN/WAN topologies.) Finally, it is a good
choice when the ISLs are mostly used for management—when performance over
the ISLs is not a concern—but reliability of the ISL structure is still required, and
cost is a driving factor. For example, this design might be well suited for an
enterprise backup SAN with many backup libraries dedicated to server
groups.(The ISLs do not carry much in the way of volume of traffic, but what
they do carry is important traffic.) If your SAN performance requirements are low
(as in many Windows NT environments), the ring architecture might also work
for you.Table 7.2 charts the properties of a ring topology.
Table 7.2 Properties of a Ring Topology
Limit of scalability (edge port count) 112 ports / 8 switches
Properties
Ratings
Ease of scalability
Performance
Ease of deployment
Reliability
Cost (per edge port)
2
1
3
3
3
Ratings indicate how well the topology meets the ideal requirements of that
property (1 = Not well, 2 = Moderately well, 3 = Very well).
Mesh Topologies
Technically, almost any topology can be described as some sort of mesh. Since
this is not a very useful definition—and above all else, a definition must be
useful—we will discuss and provide working definitions for two meshes: the full
mesh and the partial mesh.
140_SANs_07
8/14/01
3:38 PM
Page 239
Developing a SAN Architecture • Chapter 7
Full-Mesh Topology
In a full-mesh topology (Figure 7.4), every switch is connected directly to every
other switch. Using 16-port switches, the largest useful full mesh consists of eight
switches, each of which has nine available ports.This gives you a total of 72 available ports. Adding more than eight switches will actually reduce the number of
available ports. Figure 7.5 shows the maximum size of a full mesh.
Figure 7.4 A Full-Mesh Topology
Figure 7.5 Maximum Size of a Full-Mesh Topology
NOTE
Scaling a full mesh can require unplugging edge devices. If you have a
4-switch full mesh (52 edge ports) and you use all the ports with edge
devices, you will need to unplug one device from each switch in the
mesh in order to add another switch. Make sure that you plan for this by
either leaving some ports free on all switches, or making sure that there
are some devices that can withstand downtime on each switch. Because
of this, full meshes do not have a high rating for ease of scalability.
239
140_SANs_07
240
8/14/01
3:38 PM
Page 240
Chapter 7 • Developing a SAN Architecture
Full meshes are best used when you know that your fabric will not grow
beyond four or five switches, since the cost of the ISLs becomes prohibitive after
that. Also, networks that use six or more switches are very good candidates for a
core/edge design, which will cost less, perform better, and scale much better than
a full mesh.The good thing about performance on a full mesh is that there will
never be more than one hop between switches unless there is a failure. However,
since the ability to support load sharing is more of a performance enhancement
than is keeping hop count low, this is not a good choice for most performancecritical applications.Table 7.3 charts the properties of a full-mesh topology.
NOTE
Special cases:
A 2-switch full mesh is identical to a 2-switch cascade.
A 3-switch full mesh is identical to a 3-switch ring.
Table 7.3 Properties of a Full-Mesh Topology
Limit of scalability (edge port count) 72 ports / 8 switches
Properties
Ratings
Ease of scalability
Performance
Ease of deployment
Reliability
Cost (per edge port)
1
2
3
3
1
Ratings indicate how well the topology meets the ideal requirements of that
property (1 = Not well, 2 = Moderately well, 3 = Very well).
Partial-Mesh Topology
The common definition for a partial mesh (Figure 7.6) is broad enough to
encompass almost all SANs that are not full meshes. A partial mesh is defined by
Brocade as follows: “A partial mesh is similar to a full mesh, but with some of the
ISLs removed. In most cases, this will be done in a structured pattern (for
example, each switch will directly connect to its neighbor, and to every other
140_SANs_07
8/14/01
3:38 PM
Page 241
Developing a SAN Architecture • Chapter 7
switch across from it).”While this definition is not in general use outside of
Brocade, it describes a desirable variant on the full mesh.
Figure 7.6 A Partial-Mesh Topology
The network in Figure 7.6 might be useful if you knew that traffic would
never flow between the two left-hand switches.The network still is fully resilient
to failure, but you are not paying a price premium for an ISL that will not be
used. Partial meshes also scale farther than full meshes. Figure 7.7 shows a partial
mesh that has 176 free ports. (Remember that the largest full mesh has 72 ports.)
Each switch is connected to its neighbor.Two switches are skipped before the
next connection.The worst-case hop count between switches in the event of an
ISL failure is three hops.
Figure 7.7 Maximum Size of a Partial-Mesh Topology
241
140_SANs_07
242
8/14/01
3:38 PM
Page 242
Chapter 7 • Developing a SAN Architecture
While these networks can be scaled to produce a large number of edge ports,
they still have performance characteristics that are less than ideal. None of the
networks listed thus far will benefit much from FSPF load-sharing capabilities, for
example. Since bandwidth is more frequently a concern than hop count (Fibre
Channel latency being effectively a second-order derivative in most real-world
performance calculations), the ability of a topology to load share across ISLs is
key to its performance.
Moreover, partial meshes can be difficult to scale without downtime.The
procedure for moving from the full-mesh fabric in Figure 7.5 to the partial-mesh
fabric in Figure 7.7 would require not only adding new switches and potentially
disconnecting nodes, but actually disconnecting ISLs that were already in place.
The same is true for scaling between many partial-mesh designs.This would be
disruptive to many production SANs, especially if redundant fabrics were not
used.Therefore, meshes—either full or partial—are recommended only for networks that will change infrequently.They might also be used as a static component of a network. For example, a full mesh could be used in an environment
where the “SAN islands” architecture was employed, or as the core of a complex
core/edge design (which we discuss later).Table 7.4 charts the properties of a
partial-mesh topology.
Table 7.4 Properties of a Partial-Mesh Topology
Limit of scalability (edge port count)
176+ ports / 16+ switches
Properties
Ratings
Ease of scalability
Performance
Ease of deployment
Reliability
Cost (per edge port)
1
1
2
3
2 to 3
Ratings indicate how well the topology meets the ideal requirements of that
property (1 = Not well, 2 = Moderately well, 3 = Very well).
Core/Edge or Star Topologies
In a resilient core/edge fabric (Figure 7.8), two or more switches will reside in
the center of the fabric (the core) and interconnect a number of other switches
(the edge). Switches that reside in the middle of the fabric are referred to as core
140_SANs_07
8/14/01
3:38 PM
Page 243
Developing a SAN Architecture • Chapter 7
switches.The switches that are interconnected by the core switches are referred
to as edge switches. Devices such as hosts and storage are attached to free ports
on the edge switches.These ports are referred to as edge ports. Free ports (if any)
on the core switches should be reserved for use as ISLs, in order to avoid limiting
the fabric’s growth potential.
Figure 7.8 A Core/Edge or Star Topology
Edge
Core
We will focus on the core/edge fabric as being a solution for scalable fabrics
for a number of reasons.The core/edge topology is:
■
Easy to grow without downtime (“pay as you grow”).
■
Easy to transition to future large core fabric switches, and good at
providing investment protection.
■
Economical, with a good cost-to-performance ratio.
■
Simple and easy to understand.
■
Well tested and reliable.
■
Proven in traditional data networks such as Ethernet.
■
Capable of exhibiting stellar performance, with full utilization of FSPF
load sharing and redundancy features.
■
Conducive to performance analysis. Performance in a core/edge SAN is
deterministic, and you can easily determine how much bandwidth any
given switch has to “get to” any other switch.
■
Scalable to hundreds of ports now, and thousands in the future.
■
Able to solve most design problems, and a good choice when design
requirements are not well known.
243
140_SANs_07
244
8/14/01
3:38 PM
Page 244
Chapter 7 • Developing a SAN Architecture
Again, we must stress that the simple core/edge SAN is by no means the only
way to build scalable networks with Brocade switches.There are limitless ways in
which Brocade switches could be interconnected.We will focus on this particular
topology in the hopes that it will be beneficial to the majority of designers, and
we will leave the limitless variations on this and other topologies as exercises for
the reader.
Simple Resilient Core/Edge Topology
A simple resilient core/edge fabric has two or more core elements, each of which
consists of a single switch. All of the core/edge fabrics depicted thus far are
simple.Two core elements are recommended to maintain a high level of resilience
and avoid a single point of failure (Figure 7.9).Table 7.5 charts the properties of a
simple resilient core/edge topology.
Figure 7.9 A Simple Resilient Core/Edge Topology
Two core elements
(
and
)
each of which is a
single switch
Table 7.5 Properties of a Simple Resilient Core/Edge Topology
Limit of scalability (edge port count)
224 ports / 20 switches
Properties
Ratings
Ease of scalability
Performance
Ease of deployment
Reliability
Cost (per edge port)
3
3
3
3
2
Ratings indicate how well the topology meets the ideal requirements of that
property (1 = Not well, 2 = Moderately well, 3 = Very well).
Complex Core Resilient Core/Edge Topology
A complex core resilient core/edge SAN (Figure 7.10) has two or more core
elements, each of which consists of multiple switches.These designs are more
140_SANs_07
8/14/01
3:38 PM
Page 245
Developing a SAN Architecture • Chapter 7
complex, more expensive to build and maintain, and are generally not necessary.
However, in certain cases, you might want to expand beyond 16 edge switches,
and large-port-count core switches might not be available. In this case, you could
build a larger fabric out of complex core elements.Table 7.6 charts the properties
of a complex core/edge topology.
Figure 7.10 A Complex Core Resilient Core/Edge Topology
Table 7.6 Properties of a Complex Core Resilient Core/Edge Topology
Limit of scalability (edge port count)
300+ ports / 28+ switches
Properties
Ratings
Ease of scalability
Performance
Ease of deployment
Reliability
Cost (per edge port)
2
2
1
3
1
Ratings indicate how well the topology meets the ideal requirements of that
property (1 = Not well, 2 = Moderately well, 3 = Very well).
Composite Resilient Core/Edge Topology
A composite resilient core/edge SAN has two or more cores (Figure 7.11). Each
core consists of two or more single-switch elements. It could be viewed as two
245
140_SANs_07
246
8/14/01
3:38 PM
Page 246
Chapter 7 • Developing a SAN Architecture
stars “glued together at the edge.”This is useful when using a tiered approach to
performance optimization (we discuss tiered SANs later).
Figure 7.11 A Composite Resilient Core/Edge Topology
Two cores (
and
) each of
which has two elements, each of
which is a single switch
Topologies at a Glance
Table 7.7 on the following page provides a compilation of all of the characteristics of topologies listed thus far, so that you can compare them easily.
Complex Topologies
It is possible to build arbitrarily complex architectures by connecting switches
together in a seemingly random way. For example, a ring could be connected to
the edge of a core/edge fabric, which could also have a cascade and a smaller
core/edge fabric hanging off it.What would you call this SAN other than “complex?”While this approach might be desirable in certain cases, it is usually better
to use a more structured approach to network design.This will ensure flexibility
and the maintainability of the network.
Even without deviating from the “simple” core/edge network—or, indeed,
from a simple full mesh, or ring—it is possible to provide for variations in performance requirements. Figure 7.21 later in this chapter is an example of an asymmetrical core/edge network, which is still geometrically “simple.”
Working with the Core/Edge Topology
Building one or more core/edge fabrics is, at the time of this writing, the best
way to implement a general-purpose scalable SAN. Again, more experienced
designers should feel free to implement whatever topology they see fit; we are
simply advocating the core/edge approach because it can solve most problems
easily, which is, after all, what most users want.This section provides you with
some tips on how to use your core/edge fabric most effectively.
1
3
1
3
247
3
3
3
1
2
3
1
3
2 to 3
1
2
1
Partial Mesh
Ratings
176+ ports /
16+ switches
3
2
3
3
3
1
2
1
2
Complex
Core/Edge
Ratings
Simple
Core/Edge
Ratings
3
300+ ports /
28+ switches
224 ports /
20 switches
Ratings indicate how well the topology meets the ideal requirements of that property
(1 = Not well, 2 = Moderately well, 3 = Very well).
1
3
2
3
Ease of
scalability
Performance
Ease of
deployment
Reliability
Cost (per
edge port)
Full Mesh
Ratings
72 ports /
8 switches
3:38 PM
Ring
Ratings
Cascade
Ratings
Topology
Properties
112 ports /
8 switches
114 ports /
8 switches
8/14/01
Limit of
scalability
(edge port
count)
Table 7.7 Topology Properties Comparison
140_SANs_07
Page 247
140_SANs_07
248
8/14/01
3:38 PM
Page 248
Chapter 7 • Developing a SAN Architecture
Scaling without Downtime
You can scale a resilient core/edge network without any downtime.This requires
a small amount of planning and attention to detail, but it is well worth the effort.
Here are two examples of how you can scale these networks easily, by adding an
edge switch and upgrading the core.
The procedures listed here are designed to be applicable in a large, real-world
production environment, where a fine degree of control over the SAN is desired.
Consequently, they are much more complex than the plug-and-play approach,
which would be used in a less-demanding environment. For example, it is possible to add an edge switch by simply plugging it in, turning it on, and allowing
the Brocade Fabric OS’s plug-and-play features to do the configuration for you.
In larger production environments, however, it is usually desirable to control the
process manually, to ensure that the new switch not only enters the fabric, but
also does so in accordance with your site-specific administration policies. For
example, Fabric OS can automatically pick a domain ID for you. Most large sites
have a structured approach to domain ID assignment, and in these cases you
would want to assign the domain ID manually before adding the new switch to
the fabric.The following manual procedures are not necessary, but are recommended in these more tightly controlled environments.We leave it to you to
decide what level of manual control you wish to take in this process, and how
much control you want to give to the Fabric OS.
NOTE
It is best to use dual fabrics to achieve the highest possible uptime.
The use of resilient fabrics, like the mesh, ring, or core/edge, is part of a
fully redundant—or High-Availability (HA)—SAN solution, but not the
whole of it.
Adding an Edge Switch
You might want to implement only part of the edge of your core/edge fabric at
the beginning, and add the rest of the edge switches later. In other words, you
can take the “pay-as-you-grow” route.This is easy to do without downtime, as
long as you do not use the free ports in the core switches for edge devices, but
rather reserve them for scalability.
140_SANs_07
8/14/01
3:38 PM
Page 249
Developing a SAN Architecture • Chapter 7
Let us say that your architecture (as shown in Figure 7.12, Step C) is a 10switch resilient core/edge fabric. Let us say further that you have only implemented half of the edge (Figure 7.12, Step A), and then decide that you want to
add one more edge switch (Figure 7.12, Step B).
Figure 7.12 Adding an Edge Switch
Step A
4ex2cx2i
Step B
5ex2cx2i
Step C
8ex2cx2i
Step One: Setting Up the New Switch
Set the new switch up by itself in the location where it will attach to the SAN
(bolt it to the rack), but do not attach it to the SAN at this point.
NOTE
We discuss the process of adding a switch to the fabric in greater detail
in Chapter 9, “SAN Implementation, Maintenance, and Management.”
Power on the switch, assign it a host name and an IP address, and then use
either the telnet or Brocade WEB TOOLS interface to configure its domain ID,
switch name, and any special variations of fabric parameters—such as ErrorDetect Time Out Value (E_D_TOV)—that you are using at your site. Ensure that
no zoning configuration is in effect.
Step Two: Connecting the New Switch
Issue the portDisable command on the new switch for each port that you
intend to use as an ISL. For example, if you are using ports 0 through 3 as the
ISL ports, you would execute the following commands:
newswitch:admin> portDisable 0
newswitch:admin> portDisable 1
newswitch:admin> portDisable 2
newswitch:admin> portDisable 3
249
140_SANs_07
250
8/14/01
3:38 PM
Page 250
Chapter 7 • Developing a SAN Architecture
Attach cables to these ports and route them to the appropriate core switches.
For example, ports 0 and 1 could be connected to one core switch, and 2 and 3
could be connected to the other core switch.
Re-enable the first port (port 0). At this point, the new switch should join
the fabric. After the fabric has reconfigured, you can issue the fabricShow command to see if it has indeed joined the fabric:
newswitch:admin> portEnable 0
newswitch:admin> fabricShow
Once you have verified that the switch joined the fabric successfully, reenable the remaining ports.While not necessary, this procedure will help ensure
minimal disruption to your fabric.
Upgrading the Core
As technology progresses, inevitably there will be a “bigger, better, faster” product
available that you might wish to purchase. It is quite likely that you will want this
new product to reside in the core of your fabric, as this position will probably
benefit the most from the features that caused you to buy the product in the first
place.This approach is common in all areas of networking: when you build a
LAN in a rapidly growing company, you might use several entry-level Ethernet
switches as your backbone.When the environment grows, you might replace
them with workgroup switches, and then perhaps a bladed chassis, but the entrylevel switches that you used at the start will still be included in the network; they
will simply not form the core of the network anymore.
For this example, we will say that you have a fully populated 18-switch
resilient core/edge fabric (16ex2cx1i), as shown in Figure 7.13.
Figure 7.13 A Fully Populated 18-Switch Resilient Core/Edge Fabric
140_SANs_07
8/14/01
3:38 PM
Page 251
Developing a SAN Architecture • Chapter 7
Since the core switches in this design are fully populated, you cannot add any
more edge switches. However, you can change the core to a larger switch without
downtime.This will allow you to add more edge switches and will even give you
the first two new edge switches “for free” in the bargain!
Step One: Adding the New Core Switches
As in the previous example, you should put the new core switches in their correct physical location, but not attach them to the fabric at this point.You should
configure their IP addresses, domain IDs, and so on, and then clear the zoning
configuration as before. However, you do not need to disable all of the ports that
will be used for ISLs. Use the switchDisable command to disable all ports on
that switch. Again, this is not necessary, and you can allow Fabric OS to do the
work for you if your site does not require such fine control.
Telnet into the first of the two existing core switches that you want to replace.
Issue the switchDisable command:
oldswitch:admin> switchDisable
Verify that all traffic is going over the remaining core switch. After you are
sure that the SAN is functioning correctly on its one remaining core, you can
power off the disabled core. Remove the ISL cables from the switch and the rack
cable management structure.The switch should now be completely separated
from the SAN (Figure 7.14). Cable the first new core switch—the one that you
have configured and disabled—to one of the edge switches.
Figure 7.14 Removing the First Core Switch
Step Two: Configuring the New Core Switches
Issue the switchEnable command. As before, the new switch should join the
fabric.You might see a fabric reconfigure message on the telnet session. After the
fabric has reconfigured, you can issue the fabricShow command:
251
140_SANs_07
252
8/14/01
3:38 PM
Page 252
Chapter 7 • Developing a SAN Architecture
newswitch:admin> switchEnable
newswitch:admin> fabricShow
One at a time, cable in the remaining edge switches.When you are done, the
SAN should look like Figure 7.15.
Figure 7.15 Adding the New Core Switch
Repeat the procedure with the other new core switch, and you will have the
SAN shown in Figure 7.16.
Figure 7.16 Newly Installed Core Switches
You also have two extra switches (the former core switches that are sitting off
to the side disconnected). Follow the procedure for adding an edge switch to
bring these back into the fabric (Figure 7.17).This set of procedures allows you
to future-proof your SAN, because it gives you an architecture that has a
migration path to future technologies already built into it.
140_SANs_07
8/14/01
3:38 PM
Page 253
Developing a SAN Architecture • Chapter 7
Figure 7.17 Redeploying the Former Core Switches
Install core
fabric switches
Redeploy to the edge
Target Designs for a Core/Edge SAN
If you have a rough idea of your port count and performance requirements, but do not want to spend much time designing an optimized
SAN architecture, you can pick one of the following four designs and
“be done with it.” You do not have to build the entire design at once.
You can build toward the target design, and “pay as you grow.”
Design One (the 10-switch core/edge), Design Two (the 20-switch
core/edge), and Design Three (the 18-switch core/edge) are “pure”
designs. They are simple, symmetrical, and easy to understand and
deploy. You can “start small and grow large“ with them. Design Four
(the 14-switch core/edge) is a combination of Designs One and Three.
The approach used in Design Four is less symmetrical, but might be
desirable if parts of your SAN have different performance requirements,
or if you are expanding an existing SAN.
Design One
The 10-switch core/edge design, shown as Design One in Figure 7.18, is
ideal for SANs that require high performance, have little known locality,
and will not immediately grow beyond 96 edge devices.
Identification
8 edge by 2 core by 2 ISL (8ex2cx2i)
Edge ports
ISL over-subscription
Switch count
96
3:1
10
Continued
253
140_SANs_07
254
8/14/01
3:38 PM
Page 254
Chapter 7 • Developing a SAN Architecture
Figure 7.18 Design One (10-Switch Core/Edge)
Design Two
Design Two, the 20-switch core/edge design shown in Figure 7.19, is
very similar to Design One. It uses twice as many switches, produces
twice as many ports, and performs exactly the same way. This design is
appropriate for SANs that require high performance, have little known
locality, and will not immediately grow beyond 192 edge devices.
Identification
16 edge by 4 core by 1 ISL (16ex4cx1i)
Edge ports
ISL over-subscription
Switch count
192
3:1
20
Figure 7.19 Design Two (20-Switch Core/Edge)
Continued
140_SANs_07
8/14/01
3:38 PM
Page 255
Developing a SAN Architecture • Chapter 7
Design Three
Design Three, the 18-switch core/edge design shown in Figure 7.20,
produces more ports, and takes fewer switches to implement than the
previous design. It is substantially more cost-effective. The tradeoff is
that the performance is lower, so it is appropriate if high performance is
not critical, or if some locality is known.
Identification
16 edge by 2 core by 1 ISL (16ex2cx1i)
Edge ports
ISL over-subscription
Switch count
224
7:1
18
Figure 7.20 Design Three (18-Switch Core/Edge)
Design Four
Design Four, the 14-switch core/edge design shown in Figure 7.21,
balances the higher performance of Design One with the lower cost and
higher port count of Design Three. This approach is appropriate if you
have some devices that require high performance, and some that either
do not require that degree of performance or have known locality. Many
variants are possible that use this approach, and it is possible to yield
anywhere between 96 ports and 224 ports.
Identification
12 edge by 2 core by [1 or 2] ISL
(12ex2cx[1/2]i)
Edge ports
ISL over-subscription
Switch count
160
3:1/7:1
14
Continued
255
140_SANs_07
256
8/14/01
3:38 PM
Page 256
Chapter 7 • Developing a SAN Architecture
Figure 7.21 Design Four (14-Switch Core/Edge)
Like the SilkWorm 6400
It is also useful to take this approach when expanding a SilkWorm
6400 Integrated Fabric. The internal ISL topology of the SilkWorm 6400
is like an incomplete Design One SAN. You can complete the SAN exactly
like Design One by attaching four switches to the SilkWorm 6400’s outer
modules (modules 1 and 6), attaching eight switches to get Design Four,
or attaching some switches like Design One and some like Design Four.
These four designs will solve most SAN design problems and will all
migrate easily to incorporate high-port-count core fabric switches in the
future.
Determining Levels of Availability
Resiliency is the capability of a fabric topology to withstand failures.This is equivalent to the fault-tolerant level of availability seen in many high-uptime RAID
arrays or enterprise-class servers.The core/edge, mesh, and ring topologies provide at least two internal fabric routes and are considered resilient because each
topology can withstand a switch or ISL failure while the remaining switches
remain operational.Thus, the fabric can “heal” an “injury” without any operator
intervention.This self-healing capability is enabled by the Brocade-authored
FSPF protocol. (FSPF is now the standard Fibre Channel routing protocol.Visit
www.t11.com for details on this and other Fibre Channel standards.)
Redundancy is the duplication of components up to and including the entire
fabric to prevent the failure of the SAN solution. For example, an airplane
hydraulic system is resilient to failures. However, most jumbo jets also have
redundant hydraulic systems so that the jet will not crash even if the resiliency
fails to keep the system up. Redundancy is equivalent to the HA model popular
140_SANs_07
8/14/01
3:38 PM
Page 257
Developing a SAN Architecture • Chapter 7
with critical server farms using products like VERITAS Cluster Server or
Microsoft Cluster Server (MSCS). In a redundant SAN architecture, you must
have at least two completely separate fabrics—just as an HA server solution
requires at least two completely separate servers.This type of design not only
accounts for software and hardware failures, but also for the human errors that
can be the more common cause of system downtime. As the fabrics are independent, an incorrect operator-initiated change on one fabric (such as disabling an
entire switch instead of just a port) would not affect the redundant SAN. Since
the other SAN has a separate connection to the devices that have been disconnected from the first fabric, the services performed by those devices can continue.
Because of this, even a larger (director-class) single switch with highly available
characteristics is still a single point of failure, and should be used with caution in
HA environments.
There are four primary categories of availability in SAN architecture. In order
of increasing availability, they are:
■
Single fabric, nonresilient All switches are connected to form a
single fabric, which contains at least one single point of failure.The
cascade topology is an example of this category of SAN.
■
Single fabric, resilient All switches are connected to form a single
fabric, but there is no single point of failure that could cause the fabric
to segment.The ring, mesh, and core/edge topologies are examples of
single, resilient fabrics.
■
Dual fabric, nonresilient Half of the switches are connected to form
one fabric, and the other half form an identical, separate fabric.Within
each fabric, at least one single point of failure exists.This design can be
used in combination with dual-attached hosts and storage devices to
keep a solution running even if one fabric fails.
■
Dual fabric, resilient Half of the switches are connected to form one
fabric, and the other half form an identical, separate fabric. Neither
fabric has a single point of failure that could cause the fabric to segment.
This design can be used in combination with dual-attached hosts and
storage devices to keep an application running even if one entire fabric
fails due to operator error, catastrophe, or quality issues.This is generally
the best design approach for HA requirements. Another key benefit of a
dual-SAN design is the capability to take half of the dual fabric offline
for upgrades or maintenance without affecting production operations on
the remaining fabric.
257
140_SANs_07
258
8/14/01
3:38 PM
Page 258
Chapter 7 • Developing a SAN Architecture
Figure 7.22 depicts the failure of a switch in a cascade topology. Switches A
and B are unable to communicate with the remaining switches when the switch
marked with the “X” fails, resulting in segmentation of the fabric into three
smaller fabrics.
Figure 7.22 Switch Failure in a Nonredundant Cascade Topology
B
A
C
Segmentation
D
E
This fabric is neither resilient nor redundant.The uptime of this fabric solution could be improved by duplicating the fabric, as in Figure 7.23. In this
example, even though one of the fabrics segmented, all devices are attached to
another SAN that is still working.
A switch failure in a ring, mesh, or core/edge topology SAN does not cause a
loss of communication with the remaining switches, as shown in Figure 7.24, for
the core/edge topology. If one core switch fails, the alternative core switch can
still communicate with all edge switches.This will allow the SAN to continue to
operate.The failover to alternate paths is performed by FSPF and is completely
transparent to the users of the fabric.These topologies are therefore considered
resilient.
In order to ensure maximum uptime, you should use both resilience and
redundancy. Figure 7.25 shows a resilient and redundant SAN solution.
Redundant fabrics are desirable because they provide the most effective protection against hardware and software failures, and protect against user error. For
example, if a SAN administrator were to telnet into SAN A and accidentally
damage the zoning configuration—making Fabric A unusable—SAN B would
not be affected.
140_SANs_07
8/14/01
3:38 PM
Page 259
Developing a SAN Architecture • Chapter 7
Figure 7.23 Switch Failure in a Redundant Cascade Topology
Devices
B
A
C
D
E
Figure 7.24 Switch Failure in a Resilient Ring, Mesh, or Core/Edge Topology
Failed Core Switch
Alternate Path
259
140_SANs_07
260
8/14/01
3:38 PM
Page 260
Chapter 7 • Developing a SAN Architecture
Figure 7.25 A Resilient and Redundant SAN Solution
Hosts
SAN A
SAN B
Storage
NOTE
Whenever you have a mission-critical application that requires the
highest uptime possible, you should use resilient/redundant fabrics to
build the SAN infrastructure, in conjunction with other HA software
and components.
140_SANs_07
8/14/01
3:38 PM
Page 261
Developing a SAN Architecture • Chapter 7
Configuring Traffic Patterns
There are a number of approaches possible for optimizing traffic patterns within
a fabric, or an overall SAN.This section discusses two approaches: the tiered
approach and the localized approach.
Leveraging Tiers
In many SANs today, traffic predictably flows between hosts and storage devices,
not between hosts and other hosts, or storage devices and other storage devices.
If your SAN is used primarily to support hosts accessing storage, you can optimize traffic flow in your SAN by following the tiered approach described in
this section.
NOTE
The trend in leveraging tiers might change as IP over Fibre Channel,
third-party copy, peer-to-peer copy, and cluster technologies (VI) are
more widely adopted. These technologies require host-to-host or
storage-to-storage data flow. However, these new technologies will
represent a relatively small fraction of the traffic in a SAN, so even if
these technologies are widely adopted, the tiered approach will still
be valid.
For example, in Figure 7.26, you have a 4-switch full mesh with hosts
attached to two of the switches and storage attached to the others. Since we can
assume that the only traffic patterns we will see with any regularity are between
hosts and storage, we can change the ISL topology as seen in Figure 7.27.
261
262
8/14/01
3:38 PM
Page 262
Chapter 7 • Developing a SAN Architecture
Figure 7.26 Analyzing Traffic Flow in a Full Mesh
Hosts
Traffic
140_SANs_07
No Traffic
Storage
Removing the unused ISLs increases the number of available ports without
affecting resiliency.This improves the scalability and the cost/benefit ratio of the
SAN, with very little effort.
8/14/01
3:38 PM
Page 263
Developing a SAN Architecture • Chapter 7
Figure 7.27 Optimizing Traffic Flow by Using a Two-Tier Approach
Hosts
Traffic
140_SANs_07
Storage
In the example in Figure 7.28, the switches on the right are the host tier
switches and the switches on the left comprise the storage tier.This is a tiered SAN
because of the way it is being used, not because of its topology.The initial
topology was a full mesh, and the modified topology is a ring. It can also be
viewed as a two-tier SAN from a performance standpoint.
263
140_SANs_07
264
8/14/01
3:38 PM
Page 264
Chapter 7 • Developing a SAN Architecture
Figure 7.28 Storage Tier and Host Tier in a Two-Tier SAN
Hosts
Host Tier
Storage Tier
Storage
NOTE
Performance-related terms such as tiered SAN or localized traffic do not
refer to a topology. However, the use of these performance techniques
might influence the choice of topology for a SAN.
140_SANs_07
8/14/01
3:38 PM
Page 265
Developing a SAN Architecture • Chapter 7
The benefit of tiered SANs is that they do not need ISLs or bandwidth
optimization between switches on the same tier. Figure 7.29 is an example of a
three-tier SAN. In addition to a host and storage tier, it also has a core tier.This
SAN is a variation on the core/edge architecture that exploits the performance
characteristics of tiered SANs while maintaining the advantages of the core/edge
SAN. It is actually a composite of two 18-switch resilient core/edge SAN
components.Tiered SANs are effective for managing your SAN.When you need
to add hosts, you also add a host switch.When you need to add storage, you add
a storage switch if ports are not available.This organization makes it easier to
manage your SAN and simplifies your job as a SAN administrator.
Figure 7.29 A Three-Tier SAN
Hosts
Hosts
Core Tier
Storage
265
266
8/14/01
3:38 PM
Page 266
Chapter 7 • Developing a SAN Architecture
Exploiting Locality
Locality is another performance optimization strategy. It is the opposite of a
tiered approach and will always outperform the tiered SAN. However, it also
takes more planning.
You can attain the best performance in any network design by understanding
the traffic patterns the network will transport. If these patterns are well understood, it might be possible to localize traffic by putting ports that need to communicate with each other close together.This concept is known as locality and is
used not just in SAN design, but also throughout computer science.
The amount of known locality in a SAN will combine with application performance requirements to determine the number of ISLs required to achieve the
SAN design objectives.This in turn will affect the number of edge ports and the
cost of the SAN infrastructure.
In Figure 7.30, the servers attached to Switch A are using storage also
attached to Switch A.The servers attached to Switch B are using storage also
attached to Switch B. No data should cross the ISL connecting the two switches.
In the SAN shown, traffic has been 100 percent localized.
Figure 7.30 Localizing Traffic 100 Percent
Traffic
Flow
Switch A
Switch B
ISL
140_SANs_07
Traffic
Flow
140_SANs_07
8/14/01
3:38 PM
Page 267
Developing a SAN Architecture • Chapter 7
Sometimes it is impossible to determine anything at all about locality during
the SAN design phase. In the case of Figure 7.31, locality is at zero percent.The
single ISL in Figure 7.31 is over-subscribed, because many more devices could be
trying to use it than it can simultaneously support. Over-subscription is not
necessarily a bad thing and does not always affect SAN performance. Only
congestion—the realization of the potential of over-subscription—can affect
performance.
Figure 7.31 Localizing Traffic Zero Percent
Traffic
Flow
Traffic
Flow
Switch A
Switch B
Traffic
Flow
Frequently, locality knowledge will be neither zero percent nor 100 percent.
For example, it might be possible to localize all traffic between a group of hosts
and a RAID array, but the hosts might be sharing access to a tape device that
needs to be located on a different switch (Figure 7.32).
While smaller SANs will not benefit as greatly from the use of locality as large
SANs will, all SANs will benefit somewhat.The use of locality will reduce cost,
increase scalability, increase reliability, and improve performance. It is always worth
making some effort to use locality in a SAN design. However, for low-bandwidth
applications, the management benefits of organizing your edge devices in a tiered
fashion are significant, and zero-percent locality can be quite acceptable.
267
268
8/14/01
3:38 PM
Page 268
Chapter 7 • Developing a SAN Architecture
Figure 7.32 Using Locality in a SAN Design
90% of traffic
10% of traffic
140_SANs_07
Using Any-to-Any Connectivity
A well-designed SAN should employ locality to minimize congestion as much as
possible. However, congestion usually occurs only in unique traffic conditions,
such as the following:
■
The SAN application is extremely bandwidth-intensive. For
example, certain video applications use a large I/O size to the order of
64 KB or larger, and typically consume 80 MB/sec to 100 MB/sec of
bandwidth. More common are low-bandwidth applications such as
Online Transaction Processing (OLTP) or e-commerce, where the
typical I/O size is approximately 2 KB to 8 KB, and bandwidth consumption peaks at 16 MB/sec.
■
The majority of I/O is streaming, as opposed to bursty at peak
utilization. In a SAN environment, it is unlikely that all devices will
concurrently require peak utilization, since most SAN traffic is bursty
and in this respect similar to LAN traffic.
140_SANs_07
8/14/01
3:38 PM
Page 269
Developing a SAN Architecture • Chapter 7
■
The majority of SAN devices support 100 MB/sec or 200
MB/sec throughput. However, most current storage and server
devices are incapable of sustaining 100 MB/sec or 200 MB/sec
throughput. For example, some of the fastest tape drives can deliver a
maximum native throughput of only 14 MB/sec. In many cases, network
congestion is only theoretically possible. Some newer SAN deployments
might end up with a larger mix of higher throughput devices than
existing SANs that have evolved gradually over time. Even when the
hardware is capable of maximizing the HBA’s theoretical bandwidth, it is
rare to have an application that needs to do so on a sustained basis. Due
to the typical capabilities of the SAN edge devices and normal traffic
patterns, congestion rarely occurs in most SANs.
If you cannot determine locality or host/storage locations in advance, or simply
feel that your SAN does not have much potential for congestion and you do not
need to use locality, you can plan for any-to-any connectivity.This could also be
necessary if you are using a clustering application or a data replication application
that requires hosts to talk to other hosts and storage to talk to other storage.
The core/edge topology is the preferred way to build any-to-any connectivity SANs, since it is symmetrical and deterministic performance characteristics
are well suited for it. It might be desirable to use a “thicker” ISL structure for an
any-to-any SAN.This provides a lower degree of over-subscription. In a case
where traffic patterns are totally unknown, high levels of over-subscription are
likely to turn into congestion.The core/edge designs that are best suited for this
are the 8ex2cx2i and 16ex4cx1i networks. Each of these SANs provides for a 3:1
over-subscription ratio, which is usually perfectly acceptable.
Evaluating Performance Considerations
Building a fabric with very low ISL over-subscription can eliminate performance
constraints within the fabric. However, this might not be desirable. If the devices
attached to the fabric have performance limits greater than the limits imposed by
the ISL topology of the fabric, the extra ISLs used will add nothing to the solution but cost.This section describes that principle, and will help you evaluate
how much performance you need to build into your fabric.
269
140_SANs_07
270
8/14/01
3:38 PM
Page 270
Chapter 7 • Developing a SAN Architecture
When Is Over-Subscription Bad?
Since over-subscription is only a potential for link contention, it is never a
problem in and of itself. In fact, over-subscription is the normal state in almost all
networks in use today. For example, over-subscription is deliberately designed
into the Internet as a way of reducing cost.
Congestion, which is the realization of the potential of over-subscription, is
usually only a problem when the congestion is sustained. If you have a link that
becomes congested for a total of five minutes per day and is underutilized the
rest of the time, there is not really a justification for adding another link. If,
however, the link is congested half of the day, you should re-evaluate the SAN’s
performance optimization strategy.The crossover point is usually this: If the
congestion is significantly affecting your application’s performance, you need to
eliminate or at least reduce it. Fabric OS provides visibility into the fabric’s
performance by providing accurate performance metrics for ports and switches,
as well as methods for setting thresholds for proactive notification of overutilization. Refer to the Fabric Watch manual for information on thresholds and
the Fabric OS manual on general performance measurements.
Considerations Outside the Fabric
The fabric itself can affect performance by being congested or having many
long-distance (for example, 50 to 100 km) and relatively latency-heavy links.
Again, we must stress that “latency-heavy” is relative. Even such a long-distance
link is typically at least one order of magnitude faster than the storage devices
that it serves. Moreover, the higher degree of latency on these links is caused by
the speed with which light travels through the glass fiber-optic cable—since there
is as yet no way to make a signal travel faster than light, this latency must be considered acceptable.These considerations can create bottlenecks in a SAN, which
will limit the performance of applications using it. However, much more frequently, devices outside the fabric create the bottleneck. Some things that do so
include:
■
CPU speed
■
PCI bus speed
■
Resource sharing
■
Application I/O profile
■
File system block access profile
140_SANs_07
8/14/01
3:38 PM
Page 271
Developing a SAN Architecture • Chapter 7
See Chapter 5, “The SAN Design Process,” for a detailed discussion of these
items. In short, if you know that you have performance limits outside the fabric,
there is no reason to design the fabric itself to avoid over-subscription. If the
devices attached to the fabric cannot or do not generate 1 Gbit/sec of sustained
traffic (they very rarely do), it is not necessary for the SAN to support that.The
built-in over-subscription will never become congestion in this case. It will
simply save you money, since you will not have to use as many ports for ISLs. Of
course, if you find subsequently that there is a need for more bandwidth, it is a
simple matter of connecting more ports to achieve additional bandwidth.
271
140_SANs_07
272
8/14/01
3:38 PM
Page 272
Chapter 7 • Developing a SAN Architecture
Summary
A fabric consists of one or more interconnected Fibre Channel switches. A SAN
includes one or more related fabrics and everything attached to them. Some
fabric topologies are better suited for general purpose use—such as the core/edge
topology—while other topologies—such as the cascade, ring, full mesh, and partial mesh—might be useful in more limited or special-case deployments.
Core/edge fabrics have many useful traits.They can scale to accommodate a
large number of ports, and do so easily. A resilient core/edge fabric can be scaled
“on the fly.” Switches can be added or replaced in the fabric without downtime.
This includes the core switches, as long as the core is resilient. Core/edge fabrics
also handle varying degrees of locality well.This topology is the most frequently
recommended by Brocade and is the best general-purpose choice.There are four
availability models in SAN architecture. In order of increasing availability, they are:
1. Single fabric, nonresilient
2. Single fabric, resilient
3. Dual fabric, nonresilient
4. Dual fabric, resilient
The last of the models—dual fabric, resilient—is always recommended for
applications where high uptime is a strong requirement.This kind of SAN consists of two usually identical, completely unconnected fabrics, neither of which
contains a single point of failure. It is also possible to build SANs with more than
two fabrics for even greater availability. (For example, the “triple fabric, resilient”
topology consists of three fabrics.)
While a tiered approach to SAN architecture can simplify management and
storage resource planning, the most effective approach to performance tuning a
SAN is to localize traffic within areas of the SAN. Applying the principle of
locality takes a certain amount of work on the front end of a project and adds to
the complexity of managing the growth of a fabric. However, when properly
applied, locality greatly enhances performance and scalability.
Using locality is one way to ensure that the ISLs in a fabric will not become
over-subscribed. Over-subscription within a fabric is not necessarily a bad thing. A
well-designed fabric can actually benefit from deliberate use of over-subscription,
which can drive down cost, improve manageability, and decrease network complexity. Over-subscription affects performance only when it becomes congestion.
140_SANs_07
8/14/01
3:38 PM
Page 273
Developing a SAN Architecture • Chapter 7
You should look for performance limitations outside the fabric—such as host
and storage limitations, or application I/O profiles—before building a low-oversubscription fabric.This will produce a topology that will provide the most
appropriate performance characteristics for your environment. However, if you
are unable to do this kind of in-depth analysis, there are a number of core/edge
topologies presented in this chapter from which you can choose a general purpose approach to SAN design.These topologies are well tested, reliable, scalable,
and perform well with most I/O profiles.
Solutions Fast Track
Identifying Fabric Topologies and SAN Architectures
A fabric consists of one or more interconnected Fibre Channel switches.
A SAN includes one or more related fabrics and everything attached
to them.
In a resilient core/edge fabric topology, two or more switches act as a
core to interconnect multiple edge switches.This is the best “generaluse” topology available, especially when combined with the dual-fabric
approach to SAN architecture.
In order to select the right topology, you must first decide the require-
ments for your SAN architecture.This includes redundancy and
scalability in addition to port count.
In general, the cascade, ring, full mesh, and partial mesh are best used
in architectures where the individual fabrics that comprise the SAN
will not change much.This could be true in a static, low-growth
environment, or in a “SAN islands” design.
The resilient core/edge topology is the best choice for general use and
for situations where SAN requirements are either unknown or might
change frequently.
The resilient core/edge topology can be combined with dual fabrics to
achieve maximum performance, reliability, and scalability.
273
140_SANs_07
274
8/14/01
3:38 PM
Page 274
Chapter 7 • Developing a SAN Architecture
Working with the Core/Edge Topology
The core/edge topology offers a number of key advantages over other
topologies. Core/edge fabrics are:
—Easy to scale without downtime.
—Capable of scaling to a large number of ports.
—Flexible in terms of their cost-to-performance ratios. (You can add
switches to the core with a clear knowledge of how doing so will
affect both cost and performance.)
—Easy to understand, manage, and performance-tune.
—Well-tested and reliable.
Several core/edge fabrics can be used as “cookie-cutter fabrics” when
design information is incomplete or might change frequently.
Determining Levels of Availability
There are four levels of availability that a SAN architecture might
employ.The dual-fabric, resilient approach is the most reliable and the
most frequently recommended.
In most cases, this approach is not more expensive to implement than
the other three approaches, and it might be less expensive in some cases.
This approach allows for the failure of anything up to and including an
entire fabric without application downtime.
Configuring Traffic Patterns
Tiered fabrics allow simplified management and storage resource plan-
ning, but are the worst-case scenario from the standpoint of locality.
Locality is the most effective approach to performance tuning in a SAN,
but it is frequently unattainable.
You should view locality as a “moving target,” since network complexity
increases over time. However, it is worth getting as much locality as is
practical into a SAN, since all SANs benefit in several ways from this
technique.
140_SANs_07
8/14/01
3:38 PM
Page 275
Developing a SAN Architecture • Chapter 7
Evaluating Performance Considerations
Over-subscription is never a bad thing in and of itself. It is only when
over-subscription becomes congestion that problems might arise.
Latency is almost never a driving consideration in real-world SAN
performance, since fabric latency is at least one order of magnitude
lower than typical disk subsystem latency. Exceptions to this rule include
clustering software and some highly performance-sensitive applications.
In almost all cases, considerations outside the fabric will limit perfor-
mance, such as CPU speed of hosts or the I/O profile of an application.
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: What is the difference between a fabric and a SAN? I have usually heard these
terms used interchangeably.
A: A SAN is a storage area network.This could be comprised of any underlying
technology, not just Fibre Channel fabrics.While certain traditional networking technologies are not fundamentally well suited to SAN construction
(for example, Gigabit Ethernet), a number of emerging technologies can also
be used to build enterprise-class SANs.Thus, a SAN is a fairly general term
and does not limit itself to one specific approach. A fabric, on the other hand,
is very specific. It is a set of interconnected Fibre Channel switches.The
terms are frequently used interchangeably because Fibre Channel fabrics are,
at this point, pretty much the only available production-level technology for
SANs. Right now, nearly all SANs are Fibre Channel fabrics.
Q: All of this sounds very complicated. I just want to build the thing! Is there a
fabric that I can just build?
275
140_SANs_07
276
8/14/01
3:38 PM
Page 276
Chapter 7 • Developing a SAN Architecture
A: Sure.You can build a dual-fabric, resilient core/edge SAN.You can pick one
of the designs from the core/edge target design sidebar earlier in this chapter,
and you will have a design that will probably do what you need. It might not
be the least expensive way to solve your design problem, but it is certainly the
approach that requires the least planning.We presented the more advanced
design material because many users want to take more control of their SANs;
not because you must apply all of it in order to get the SAN you need.
Q: I have heard that you should always try to minimize the number of hops
between hosts and their storage. Is this always true?
A: The best performance is always obtained by localizing traffic within a switch.
This is a zero-hop scenario. Once you go outside of a single switch and cross
an ISL, you can get to your destination in one hop. In most cases, getting the
best performance requires an additional hop.Two or more hop equal-cost
path networks make better use of FSPF load-sharing capabilities. Since bandwidth usually makes much more of a difference to performance than latency,
the added bandwidth from FSPF load sharing more than compensates for the
extra hop. From a practical standpoint, when using a core/edge design, this
means that you should always put the hosts and storage devices around the
edge, rather than locating some devices directly on the core.
140_SANs_08
8/14/01
5:26 PM
Page 277
Chapter 8
SAN
Troubleshooting
Solutions in this chapter:
■
The Troubleshooting Approach: The SAN
Is a Virtual Cable
■
Troubleshooting the Fabric
■
Troubleshooting Devices that
Cannot Be Seen
■
Troubleshooting Marginal Links
■
Troubleshooting I/O Pauses
; Summary
; Solutions Fast Track
; Frequently Asked Questions
277
140_SANs_08
278
8/14/01
5:26 PM
Page 278
Chapter 8 • SAN Troubleshooting
Introduction
A SAN is a complex system that can consist of multiple switches, hosts, storage
devices, routers, and hubs. A SAN can also be as simple as a single switch with
attached storage and hosts. A breakdown of the individual components yields a
range of subcomponents, from simple subcomponents, such as cables, to complex
subcomponents, such as switches. At a macro level, the fabric itself is considered a
component that might require troubleshooting. Switches are logically positioned
in the middle of the network between hosts and storage, and have visibility to
both storage and hosts.This visibility into both sides of the storage network
enables you to use switches to determine the cause of any malfunction in the
SAN.This chapter presents a structured process for identifying marginal or faulty
SAN components by helping you figure out where to start and then to methodically home in on the problem. Specific areas of focus include troubleshooting the
following symptoms and SAN components:
■
Fabric
■
“Missing” devices
■
Marginal links
■
Input/Output (I/O) interruptions
The context of your problem influences how to interpret the data output by
the variety of commands available in Fabric OS. For example, focus on the port
state information for switchShow output when you are troubleshooting a port
issue, and the switch status information from the same command when investigating
a fabric issue. We will cover the details of how to troubleshoot using Fabric OS
commands such as switchShow, errShow, portStatsShow, and other commands. Understanding host behavior and interpreting host information is also an
important part of the troubleshooting process we discuss in this chapter.
The Troubleshooting Approach:
The SAN Is a Virtual Cable
When first approaching troubleshooting, think of the SAN as a virtual cable.
Storage traditionally involved connecting a Small Computer Systems Interface
(SCSI) disk via a SCSI cable to a host; with this scenario, you focus on four
components: the storage, the Host Bus Adapter (HBA), the host’s OS, and the
cable/terminator.Troubleshooting a SAN is more challenging, but still has many
140_SANs_08
8/14/01
5:26 PM
Page 279
SAN Troubleshooting • Chapter 8
things in common with the traditional storage troubleshooting process.To the
operating system, the SAN provides a link to a disk, just as a traditional SCSI
connection would.
You can apply the same “tried-and-true” process of elimination used to
troubleshoot a direct-attach SCSI issue or Ethernet network issue to SAN
troubleshooting. At a macro level, if you consider the SAN a virtual cable, the
issue can reside in three possible areas: the host, the “cable,” or the storage.
Troubleshooting can work like a binary search when you start investigating these
areas. Start in the middle and determine whether you are “above” or “below” the
problem, and then keep dividing the suspect path until you resolve the problem.
When troubleshooting with a simple single-switch configuration, a single
host, and a single storage device, you need to focus on the HBA, the Gigabit
Interface Converter (GBIC), the host’s OS, the cable, the switch, and the storage.
Brocade fabrics run a single-image distributed operating system known as Fabric
OS. Fabric OS delivers functionality such as Name Server, Registered State
Change Notification (RSCN), Zoning, and security.These functions are part of
the SAN and are also variables in the troubleshooting equation. A large SAN can
consist of dozens of switches and is capable of growing to thousands of ports.
Knowing where in the SAN to initiate troubleshooting can be daunting.The
next section uses a typical SAN troubleshooting scenario—a host unable to “see”
its disks—to illustrate the method of resolving the problem by treating the SAN
as a virtual cable and working with a process of elimination.
A Typical Scenario: “I Cannot See My Disks”
We provide the scenario described in this section to introduce the troubleshooting process and to establish a framework with which you are familiar.
Some terms, commands, and concepts may seem foreign.This is okay.We address
everything discussed in this section in greater detail later in the chapter.
When a host cannot see its disks, one thing to check is whether that device is
logically connected to the switch by reviewing the output from the switchShow
command. If the device is not logically connected (that is, it does not show up as
an Nx_Port), you need to focus on the port initialization. Notice that port 15 in
Figure 8.1 indicates a logically connected device, as this port is connected as an
F_Port. Port 14 is an example of an unsuccessful device connection, as the device
connected to port 14 is connected as a G_Port. A G_Port indicates an incomplete connection to the fabric. Initially knowing that the missing device is not
logically connected eliminates the host and everything on that side of the data
279
140_SANs_08
280
8/14/01
5:26 PM
Page 280
Chapter 8 • SAN Troubleshooting
path from the suspect list, as depicted in Figure 8.2.This includes all aspects of
the host’s OS, the HBA driver settings and binaries, the HBA Basic Input Output
System (BIOS) settings, the HBA GBIC, the cable going from the switch to the
host, the GBIC on the switch side of that cable, and all switch settings related to
the host.That is quite a lot for one command! If the missing device is logically
connected to the switch, you need to check to see if the device is present in the
Simple Name Server (SNS).
Figure 8.1 Example of a Successful and Unsuccessful Device Connection
core2:admin> switchshow
switchName:
core2
switchType:
2.4
switchState:
Online
switchRole:
Subordinate
switchDomain:
5
switchId:
fffc05
switchWwn:
10:00:00:60:69:10:9b:5b
switchBeacon:
OFF
port 0: sw
Online
E-Port
10:00:00:60:69:11:f9:f7 "edge1"
(upstream)
port
1: sw
Online
E-Port
10:00:00:60:69:10:9b:52 "edge2"
port
2: sw
Online
E-Port
10:00:00:60:69:11:f9:f7 "edge1"
port
3: sw
Online
E-Port
10:00:00:60:69:10:9b:52 "edge2"
port
4: sw
Online
E-Port
10:00:00:60:69:12:f9:8c "edge3"
port
5: sw
Online
E-Port
10:00:00:60:69:12:f9:8c "edge3"
port
6: —
No_Module
port
7: —
No_Module
port
8: —
No_Module
port
9: —
No_Module
port 10: —
No_Module
E-Port
10:00:00:60:69:12:f9:8c "edge3"
port 11: id
Online
port 12: —
No_Module
port 13: —
No_Module
port 14: cu
Online
G-Port //incomplete fabric connection
port 15: id
Online
F-Port
50:06:04:82:bc:01:9a:0c
140_SANs_08
8/14/01
5:26 PM
Page 281
SAN Troubleshooting • Chapter 8
Figure 8.2 The SAN Virtual Cable
OK
Host
Storage
Problem
Virtual SAN Cable
The SNS is a directory service provided by the fabric. Initiators query the
Name Server much in the same way you would query a telephone directory
looking for a particular person or service. If a device is not in the Name Server, it
is essentially invisible to other devices in the fabric.When a device connects to
the fabric, that device will register itself with the Name Server.This is similar to
the situation in which you change neighborhoods and have your name listed in
the new telephone directory.When an initiator, which is normally an HBA,
enters the fabric, it queries the Name Server to identify all accessible devices and
obtain the addresses of these devices, just like you might scan your telephone
directory for a name. Some targets also will query the Name Server.Then the
initiator starts the process of establishing a connection with those devices for
which the Name Server provides addresses.
Check the Name Server for the presence of your missing device by issuing
the nsShow command on the switch to which the device is attached (see the
sample output in Figure 8.3).This will list all of the nodes connected to that
switch, allowing you to determine if a particular node is accessible on the network. An alternate method is to check the Name Server list in the WEB TOOLS
Graphical User Interface (GUI) on any switch in the fabric, as it contains a consolidated list of all devices in the fabric. Note that we started the process in the
middle of the virtual SAN cable, which is the fabric.This is the process we
described earlier as being like a binary search algorithm.You start in the middle
half of the data path, figure out if you are “above” the problem or “below”it and
keep dividing the suspect path in half until you identify the problem.
281
140_SANs_08
282
8/14/01
5:26 PM
Page 282
Chapter 8 • SAN Troubleshooting
Figure 8.3 nsShow Sample Output
ore2:admin> nsshow
The Local Name Server has 9 entries {
Type Pid
*N
COS
021a00;
PortName
NodeName
TTL(sec)
2,3;20:00:00:e0:69:f0:07:c6;10:00:00:e0:69:f0:07:c6; 895
Fabric Port Name: 20:0a:00:60:69:10:8d:fd
NL
051edc;
3;21:00:00:20:37:d9:77:96;20:00:00:20:37:d9:77:96; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
NL
051ee0;
3;21:00:00:20:37:d9:73:0f;20:00:00:20:37:d9:73:0f; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
NL
051ee1;
3;21:00:00:20:37:d9:76:b3;20:00:00:20:37:d9:76:b3; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
NL
051ee2;
3;21:00:00:20:37:d9:77:5a;20:00:00:20:37:d9:77:5a; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
NL
051ee4;
3;21:00:00:20:37:d9:74:d7;20:00:00:20:37:d9:74:d7; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
NL
051ee8;
3;21:00:00:20:37:d9:6f:eb;20:00:00:20:37:d9:6f:eb; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
NL
051eef;
3;21:00:00:20:37:d9:77:45;20:00:00:20:37:d9:77:45; na
FC4s: FCP [SEAGATE ST318304FC
0005]
Fabric Port Name: 20:0e:00:60:69:10:9b:5b
N
051f00;
2,3;50:06:04:82:bc:01:9a:0c;50:06:04:82:bc:01:9a:0c; na
FC4s: FCP [EMC
SYMMETRIX
5267]
Fabric Port Name: 20:0f:00:60:69:10:9b:5b
}
At this point, if the device is not present in the Name Server, you have
narrowed your search along the virtual SAN cable to the Name Server interface
140_SANs_08
8/14/01
5:26 PM
Page 283
SAN Troubleshooting • Chapter 8
between the storage.The missing device process defined in this section is summarized in flowchart form in Figure 8.4. Note that Figure 8.4 is an excerpt from the
complete missing-device troubleshooting process, which is shown in Figure 8.25.
Remember that we will go deeper into this missing-device troubleshooting
process and flowchart later in the chapter.
Figure 8.4 Flowchart Excerpt of Troubleshooting a Missing Device
(See Figure 8.25 for the Complete Flowchart.)
Storage
device not
visible to
host
Is the storage
device
present in
switchShow?
No
Issue between
storage device
and switch. Not a
host issue
No
Issue between
storage device
and switch. Not a
host issue
Yes
Is storage
device visible
in name server?
Where to Start and What Data to Gather
As stated in the previous section, SAN troubleshooting should begin in the
center of the SAN and proceed outward. Once you know where to start troubleshooting, the next question is how to proceed. Start the troubleshooting process by gathering a preliminary set of data, and then analyze this data to identify
where the problem resides: the host, the fabric, or the storage.Then gather additional data from the appropriate area and home in on the cause of the problem.
A plethora of data is available from the switches, hosts, and storage. Knowing
what data to look at and when to look at it is fundamental to the SAN
troubleshooting process.
283
140_SANs_08
284
8/14/01
5:26 PM
Page 284
Chapter 8 • SAN Troubleshooting
Take a Snapshot: Describe the
Problem and Gather Information
Start with a general description of the problem and identify as much supporting
detail as possible. At the very least, this should include a statement about what the
“bad” behavior is, and a statement about what you are doing or have done to
expose this behavior. Note that this is not the same as describing what you have
done that causes the behavior.You might be doing something correctly, like plugging in a disk array and adding it to a zone, yet it might affect something else in
the fabric if there is an underlying problem that is exposed whenever a zone
change occurs.
For example, an HBA responding incorrectly to an RSCN could fail when
the new zone configuration is enabled. An RSCN is a fabric service for which an
edge device optionally registers.When a device registers for an RSCN, it is
asking the fabric to send that device a notice anytime something in the fabric
changes. For example, when a new device is added to the fabric, any devices that
registered for RSCNs will receive a notice.The registered device receiving the
RSCN then checks the Name Server to see what has changed and takes appropriate action. For example, if the registered device is a host and a new disk drive
is added to the SAN, the host might create the necessary device operating system
structures so the new device is accessible to the user.
This information will help you with the problem resolution, and might be
necessary if you need to contact Brocade or any Brocade-authorized support
channel. Some examples of a general problem description include:
■
When I enable a switch zoning configuration with cfgEnable, storage
devices are no longer accessible to the host.
■
There are frequent pauses in I/O when I copy large files between arrays.
■
My edge device sometimes connects as an N_Port, and other times it
connects as a Node Loop (NL)_Port when I power it up.
■
The fabric segments and the following error message is logged (provide
error message in your description). It does this under normal operation, even
when I do not touch any device on the SAN.
Include the answers to the following questions with your problem description:
■
Can you recreate the problem on demand? If so, how? (Go into detail.)
■
Is the problem intermittent? If so, how frequently does it occur?
140_SANs_08
8/14/01
5:26 PM
Page 285
SAN Troubleshooting • Chapter 8
■
Has anything at all changed recently on the fabric? If so, what? (Provide
a complete list.)
■
Is the problem localized or fabric-wide? For example, is the problem
happening with other devices in the fabric, or just locally with a single
device attached to the switch?
■
Is this an initial install and the device was never working, or was the
device working and now it has stopped working?
Other information to record:
■
If there are any error messages, include them with the problem
description.
■
Firmware and driver versions for the affected HBA and storage devices.
■
Firmware and operating system versions for affected hosts and all
fabric switches.
■
External switch information, such as LED state.
■
External HBA and port information, such as LED state.
■
A diagram of the SAN configuration.
■
If long-distance links are present, include information about the length
and quality of the lines, and the mechanism being used to achieve the
distance (for example, “The line is 10 km long, and we are using Long
Wavelength [LWL] GBICs,” or “It is 80 km long, and we are using a
Dense Wave Division Multiplexor [DWDM] and the Extended Fabrics
product”).
Finally, gather supportShow information from the switches.The
supportShow command is a switch command used to gather information about
the switch and the fabric; it can provide valuable clues about what is happening
in your switch network. It is like a macro in that it executes a long list of switch
commands, which Brocade identifies as important for the troubleshooting process. Note that the commands that supportShow executes vary between Fabric
OS releases.The v2.4.1 supportShow command executes the following switch
commands:
■
version
■
uptime
■
tempShow
285
140_SANs_08
286
8/14/01
5:26 PM
Page 286
Chapter 8 • SAN Troubleshooting
■
psShow
■
licenseShow
■
diagShow
■
errDump
■
switchShow
■
portFlagsShow
■
portErrShow
■
mqShow
■
portSemShow
■
portShow
■
portRegShow
■
portRouteShow
■
fabricShow
■
topologyShow
■
qlShow
■
faShow
■
portCfgLport
■
nsShow
■
nsAllShow
■
cfgShow
■
configShow
■
faultShow
■
traceShow
■
portLogDump
One benefit of supportShow is that you do not have to repeatedly retrieve
various types of data, since most of the data you need is available from
supportShow in one place. As this command rapidly streams in a telnet window,
capture mode should be turned on prior to executing the command so that it
can be captured to a text file for later review.
140_SANs_08
8/14/01
5:26 PM
Page 287
SAN Troubleshooting • Chapter 8
NOTE
It is important to execute the supportShow command at the time the
problem is occurring, rather than waiting until the fabric is functioning
normally.
Due to the large volume of data created by supportShow, you might choose
to gather the supportShow data once and then selectively issue a subset of its
commands as part of your troubleshooting process.
Troubleshooting Tools
Many tools are available to the SAN troubleshooter. Many of these tools are
switch commands. Other tools involve viewing the switch LEDs, host information such as Solaris’ /var/adm/messages file, Fibre Channel analyzers, and
diagnostics available on many storage arrays. Rarely is it possible to use a single
tool to successfully troubleshoot a problem. It is more common to use several
tools to attain a successful resolution of a problem.
Using the Switch LEDs
A significant amount of information can be gathered just by looking at the
switch LEDs. At a rudimentary level, it is possible to identify that a device
has faulted or is not yet online by looking for a “fast yellow.” If the switch is
located in another room, you can get a visual real-time LED status using the
WEB TOOLS interface. Fast flickering green lights are a sign of a healthy SAN.
By physically observing the switches that comprise a SAN, it is possible to detect
patterns and identify a marginal or faulty component. For example, if you have a
situation in which you are trying to identify a device that is repeatedly toggling
online and offline, you can use the switch LEDs.
While observing a functional fabric, you can easily identify a potentially disruptive device by scanning for a port that goes offline (no LED light), sends light
(steady yellow), comes online (steady green), and then cycles through the same
steps—blank, yellow, green.You also want to look for correlations or patterns,
such as one device going offline followed by a group of devices going offline and
back online again.This situation is common in QuickLoop configurations when
the first device going offline is sending a Loop Initialization Primative (LIP),
which then causes the other devices to LIP.
287
140_SANs_08
288
8/14/01
5:26 PM
Page 288
Chapter 8 • SAN Troubleshooting
How to Identify a Healthy SAN Using the LEDs
A settled and healthy fabric should have solid green or fast flickering
green lights. A solid green light indicates an active link, while a fast flickering green light indicates I/O activity.
How to Identify a SAN Problem Using the LEDs
A yellow light or blinking yellow light indicates a problem with your
SAN. An LED that transitions from yellow to green, however, is not a
problem. A powered-off edge device, or edge device that is not yet
online, might cause the switch LEDs to blink yellow.
Another helpful use for the LEDs is for fabric “bring up.”When bringing up
a fabric, one sign to look for that indicates a fabric has reached convergence are
steady green lights.When the fabric is coming up, the Inter-Switch Links (ISLs)
go through initialization, which appear to the observer as flickering green and
yellow lights prior to the fabric fully converging. Once the fabric is converged,
the lights go to a steady green.Then, as I/O in the fabric begins, you will see
flickering green lights on the ISL ports and the edge device ports.
A slowly flashing switch power LED indicates that the switch failed the
Power-On Self-Test (POST) and is not able to come online. Refer to the associated switch manual for the location of the power LED.Table 8.1 lists the port
LEDs and their definitions (you can also find this table in the Brocade SilkWorm
2800 Hardware Reference Manual).
Table 8.1 Front Panel LED Port Indicators
Ports
LED Definition
No light showing
No light or signal carrier (no module, no cable) for media
interface
Receiving light or signal carrier, but not yet online
Steady yellow
Continued
140_SANs_08
8/14/01
5:26 PM
Page 289
SAN Troubleshooting • Chapter 8
Table 8.1 Front Panel LED Port Indicators
Ports
LED Definition
Slow yellow
(flashes two
seconds)
Fast yellow
(flashes a half
second)
Steady green
Slow green
(flashes two
seconds)
Fast green
(flashes a half
second)
Flickering green
Disabled (result of diagnostics, switchDisable, or
portDisable command)
Error, fault with port
Online (connected with external device over cable)
Online, but segmented (loopback cable or incompatible
fabric parameters)
Internal loopback (diagnostic)
Online and frames flowing through port
Switch Diagnostics
A robust set of switch diagnostics is available so you can validate the operational
level of a SilkWorm switch. Several of these diagnostics, such as
portLoopbackTest, are also helpful in the troubleshooting process. For example,
if you suspect a bad GBIC or switch port, you can use portLoopbackTest to
confirm your suspicion. Using portLoopbackTest for troubleshooting is discussed in the section “Troubleshooting Marginal Links” later in the chapter.The
supportShow diagnostic command in particular, discussed in detail later in this
chapter, is very helpful to the troubleshooting process.The Brocade Fabric OS
manuals provide detailed description regarding the usage of diagnostic commands.To see what diagnostic commands are available online, issue the command
diagHelp at the switch prompt.The following list of diagnostic commands is
available in the V2.4.1 Fabric OS:
■
ramTest System DRAM diagnostic
■
portRegTest Port register diagnostic
■
centralMemoryTest Central memory diagnostic
■
cmiTest CMI bus connection diagnostic
■
camTest QuickLoop CAM diagnostic
289
140_SANs_08
290
8/14/01
5:26 PM
Page 290
Chapter 8 • SAN Troubleshooting
■
portLoopbackTest Port internal loopback diagnostic
■
sramRetentionTest SRAM Data Retention diagnostic
■
cmemRetentionTest Central Mem Data Retention diagnostic
■
crossPortTest Cross-connected port diagnostic
■
spinSilk Cross-connected line-speed exerciser
■
diagClearError Clear diag error on specified port
■
diagDisablePost Disable Power-On-Self-Test
■
diagEnablePost Enable Power-On-Self-Test
■
setGbicMode Enable tests only on ports with GBICs
■
setSplbMode Enable 0=Dual, 1=Single port LB mode
■
supportShow Print version, error, portLog, etc.
■
diagShow Print diagnostic status information
■
parityCheck Dram Parity 0=Disabled, 1=Enable
■
spinFab ISL link diagnostic
■
loopPortTest L_Port cable loopback diagnostic
Helpful Commands
With dozens of switch commands at your disposal, it can be difficult to determine which command to use in a given situation. An annotated list of helpful
commands follows in this section, with additional commands highlighted as they
relate to specific issues discussed in following sections.This list of commands is a
starting point for gathering data and initiating your troubleshooting process.
While the information generated by these commands is also available in
supportShow, you will want to use individual commands as you advance
through the troubleshooting process. SupportShow creates a significant amount
of data and is helpful when you want to perform the original snapshot of the
configuration and environment (to report a problem to your switch supplier), or
you are not sure what data to capture.
140_SANs_08
8/14/01
5:26 PM
Page 291
SAN Troubleshooting • Chapter 8
NOTE
Although the switch commands are shown with various capitalization as
originally coded in Fabric OS, the commands are no longer case-sensitive
and can be entered with all lowercase if desired.
Entering the command help at the switch prompt generates a list of commands available to the user as shown in Figure 8.5. Entering the command help
<command> generates a help page (similar to UNIX man pages) for that specific command. Many commands differ by the extension show or dump (for
example, errShow and errDump). The difference is that show commands
require you to type a return between entries, while the dump commands stream
data to the screen without any pauses. Dump commands are used when you have
a facility for logging command output to a file. It might be necessary to execute
commands on more than one switch in the fabric, especially if the location of the
problem is unclear.
NOTE
As of Fabric OS 2.4.1, there is no time synchronization among the
switches, which can make troubleshooting a challenge if the clocks
between the switches are skewed. Before you begin troubleshooting
your fabric, you should make a note of any time skew so that you can
compensate for it when reading command outputs. You should also
make an effort to keep switch clocks set correctly during normal
operation to avoid this problem.
Figure 8.5 Use the help Command to See What Commands Are Available or
Type the help Command for Help About a Specific Command
dev172:admin> help
agtcfgSet
Set SNMP agent configuration
agtcfgShow
Print SNMP agent configuration
agtcfgDefault
Reset SNMP agent to factory default
.
Continued
291
140_SANs_08
292
8/14/01
5:26 PM
Page 292
Chapter 8 • SAN Troubleshooting
Figure 8.5 Continued
.
.
qlHelp
Print quick loop help info
routeHelp
Print routing help info
trackChangesHelp
Print Track Changes help info
zoneHelp
Print zoning help info
dev172:admin> help errShow
NAME
errShow - display the error log
SYNOPSIS
errShow
AVAILABILITY
all users
DESCRIPTION
This command displays the error log, prompting the user to type
return between each log entry. It is identical to errDump, except
.
.
.
SEE ALSO
errDump, uptime
The errShow Command
The errShow command provides a listing of up to 64 logged errors and is
helpful for identifying where a problem might reside. It sends messages to the
console and to the error log. Note that the error log is cleared after a reboot or
power cycle; if you want to maintain error logs that persist after reboots or power
140_SANs_08
8/14/01
5:26 PM
Page 293
SAN Troubleshooting • Chapter 8
cycles, consider using the syslog facilities of the switch to log errors to persistent
storage. See syslogdIpAdd, syslogdIpRemove, and syslogdIpShow for further
detail on how to set up persistent logging.
When examining errShow data, which can be quite wordy, look for trends
or patterns. For example, look for an excessive number of errors associated with a
specific port. In addition, watch for high error-count values, which indicate a
repeated error that has been logged many times. Logging error counts limits
errors that occur multiple times from consuming the space provided for the error
log. It is important to note that with every error, a severity level is associated. A
warning (error level 3) is just that—a warning. An error (error level 2) or critical
(error level 1) message is more severe and requires further attention.
An excerpt from the errShow help entry is provided in Figure 8.6. Please
refer to the help page or the Fabric OS manual for details on interpreting Diag
Err#, as the list of codes is lengthy. A Diag Err# usually indicates a problem
with hardware, so contact your switch supplier for further assistance.
In addition to software errors, errShow logs environmental issues, such as
over-temperature conditions, and equipment issues such as fan failures or power
supply failures. A detailed list of error messages, descriptions, probable causes, and
actions is maintained in the Fabric OS Reference Manual Version 2.4
(Publication Number 53-0001569-01).
Figure 8.6 Excerpt from the errShow help Entry
Each entry in the log has the same format:
Error Number
——————
taskId (taskName): Time Stamp (count)
Error Type, Error Level, Error Message
Diag Err#
Error Number
Starting from one. If there are more error than
the size of the log, only the most recent errors
are shown.
Task Id & Name
The ID and name of the task recording the error.
Continued
293
140_SANs_08
294
8/14/01
5:26 PM
Page 294
Chapter 8 • SAN Troubleshooting
Figure 8.6 Continued
Time Stamp
The date and time of the first occurrence of
the error.
Error Count
For errors that occur multiple times, the repeat
count is shown in parenthesis. The maximum count
is 999.
Error Type
An uppercase string showing the firmware module
and error type. The switch manual contains a
detailed explanation of each error type.
Error Level
Error Message
0
panic (the switch reboots)
1
critical
2
error
3
warning
4
information
5
debug
Additional information about the error.
Figure 8.7 is an example of an errShow message. The fabric is segmented,
meaning that the switch that generated this message is logically disconnected
from the SAN, and any devices in the SAN that are not directly connected to
this switch are inaccessible to this switch. Moreover, any devices located on this
switch are unable to access other devices in the fabric. The error level is a
warning (3). The task ID (0x10e2b7f0) can be cross-referenced by issuing the
telnet command “i” to obtain additional information on the task in question.
The Task Name is self-explanatory, and interpreting it is somewhat intuitive. For
example, tTransmit is the transmit task. The Task Name can be helpful in identifying the nature of the problem. Finally, the error message indicates that there is a
discrepancy between the zone information contained on this switch and the zone
information contained in the rest of the fabric. When the switch tried to join
the fabric with this conflicting information, the join request was denied; hence,
the segmented fabric. The message even identifies the zone that is causing the
140_SANs_08
8/14/01
5:26 PM
Page 295
SAN Troubleshooting • Chapter 8
conflict; in this case, it is the “red” zone. This zone should be checked and compared to the rest of the fabric, and if the zone information is different, either correct or delete it.
Figure 8.7 errShow Example
Task Id
Task Name
Error 04
-------0x10f6f4f0 (tTransmit): May 15 09:38:57 (6)
Error FABRIC -SEGMENTED, 3, port 0, zone conflict: content mismatch: red
Error Type
Warning
Port
Error Message
Level
The portErrShow Command
The portErrShow command is an effective command for troubleshooting
marginal ports. This command provides an error summary for all ports associated
with the switch and provides a status of all ports from a link integrity perspective.
The key to interpreting the statistics is looking for a very high number of errors
relative to the frames transmitted and frames received. For example if 2,000,000
frames have been received and only three Cyclic Redundancy Check (CRC)
errors have been logged, the CRC errors relative to the frames received is a very
low ratio and the associated port is not suspected as being marginal. On the
other hand, if 2,000,000 frames have been received and 10,000 CRC errors have
been logged, the CRC errors relative to the frames received is a high ratio and
the associated port should be examined further. A rough guideline is to look for
errors in excess of 0.5 percent of the total number of frames transferred.
Another important trend to watch is a steadily increasing number of errors.
You can track increasing errors by sampling every five or ten seconds and monitoring the delta between the samples. Simple Network Management Protocol
(SNMP) polling can be used to facilitate this. Also, the optionally licensed Fabric
295
140_SANs_08
296
8/14/01
5:26 PM
Page 296
Chapter 8 • SAN Troubleshooting
Watch product can be used to note changes in error rates over time and send out
an SNMP trap or error log entry. Streaming errors is a high-order indicator and
requires close monitoring—even if the error rate is less than one percent.While
the error count relative to frames transmitted or received might be low, a steadily
increasing number of errors indicates a marginal port.
The portErrShow statistics shown in Figure 8.8 were gathered from a
switch that had a marginal NL_Port (HBA), connected to port 6. It turned out
that the Gigabit Link Module (or GLM, a connector similar to a GBIC) was
failing and causing a degraded signal. Note how high the enc_in and CRC
errors are!
Figure 8.8 portErrShow Example
frames
tx
enc
rx
in
crc
err
too
too
bad
shrt long
enc disc link loss loss frjt fbsy
eof
out
c3 fail sync
sig
——————————————————————————————————
port 0: 2.9g 1.7g
0
12
0
0
0
0
0
0
2
1
0
0
port 1: 305m 3.0g
0
0
0
0
0
0
0
0
1
0
0
0
port 2: 1.2g 892m
0
0
0
0
0
0
0
0
556
27
0
0
port 3: 1.1m
25m
0
0
0
0
0
82
0
4
9
4
0
0
port 4:
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1.4k
1.4k
0
0
port 6: 668m 4.0g 6.0m 66m
0
0
236
51m
0
87
54
11
0
0
0
port 5: 9.5m 4.0g
The error statistics shown in boldface are the primary statistics on which to
focus.The following listing explains relevant statistics and associated definitions:
■
enc_in Received data: the number of 8b/10b encoding errors that have
occurred inside frame boundaries.This counter is generally a zero value,
although occasional errors might occur on a normal link and give a
nonzero result. (Minimum compliance with the link-bit error rate specification on a link continuously receiving frames would cause approximately one error every 20 minutes.) Reinitialization or reboots of the
associated Nx_Port can also cause these errors, resulting in a low-count
error count.
■
crc_err Received frames: the number of CRC errors detected. A CRC
error indicates that the contents of a frame are no longer valid.
140_SANs_08
8/14/01
5:26 PM
Page 297
SAN Troubleshooting • Chapter 8
Reinitialization or reboots of the associated Nx_Port can also cause
these errors, resulting in a low count.
■
too_long Received frames: the number of frames that were longer than
the maximum Fibre Channel frame size (such as a header with more
than a 2112-byte payload).
■
bad_eof The number of frames received with a badly formed
end-of-frame.
■
enc_out Receive link: the number of 8b/10b encoding errors recorded
outside frame boundaries.This number might become nonzero during
link initialization, but it indicates a problem if it increments faster than
the allowed link-bit error rate (approximately once every 20 minutes).
■
er_disc_c3 Receive link: the number of Class 3 frames discarded. Class
3 frames can be discarded due to timeouts or invalid or unreachable destinations.This quantity could increment at times during normal operation, but might be used for diagnosing problems in some situations.
NOTE
Steadily increasing errors between samples is a very strong sign that the
associated port is not functioning properly.
Marginal link troubleshooting and related troubleshooting commands are discussed in more detail in the “Troubleshooting Marginal Links” section later in
this chapter.
The switchShow Command
The switchShow command is another powerful command that has many uses
for the troubleshooting process. An excerpt from the switchShow help entry is
provided here. It is helpful for troubleshooting fabric as well as edge device connectivity issues.This command is likely to be one of the first commands you will
execute as part of your troubleshooting process.The nature of the problem will
dictate what switchShow data to focus on and how to interpret this data. As
shown in Table 8.2 and Figure 8.9, switchShow data is loosely organized into
three categories.
297
140_SANs_08
298
8/14/01
5:26 PM
Page 298
Chapter 8 • SAN Troubleshooting
Table 8.2 How switchShow Data Relates to the SAN Functional Areas
Fabric-Related
Edge Device-Related
Miscellaneous
switchState
switchRole
switchDomain
port state
port state
switchId
switchBeacon
switchType
switchName
Figure 8.9 switchShow Definitions
This switchShow command displays switch and port status information.
Some information varies with the switch model, e.g. number of
ports, and Domain ID values. The lines of the display show:
switchName
The switch's symbolic name.
switchType
The switch's model and revision numbers.
switchState
The switch's state: Online, Offline, Testing, Faulty.
switchRole
The switch's role: Principal, Subordinate, Disabled.
switchDomain
The switch's Domain ID: 0-31 or 1-239.
switchId
The switch's embedded port D_ID.
switchWwn
The switch's Worldwide Name.
switchBeacon
The switch's beaconing state (either ON or OFF).
The switch summary is followed by one line per port:
port number
The port number: 0-7 or 0-15.
module type
The port module type (GBIC or other):
— - no module present
sw - shortwave laser
lw - longwave laser
cu - copper
id - serial ID
port state
The port's state:
Continued
140_SANs_08
8/14/01
5:26 PM
Page 299
SAN Troubleshooting • Chapter 8
Figure 8.9 Continued
No_Card
- no interface card present
No_Module - no module (GBIC or other) present
No_Light
- the module is not receiving light
No_Sync
- receiving light but out of sync
In_Sync
- receiving light and in sync
Laser_Flt - module is signaling a laser fault
comment
Port_Flt
- port marked faulty
Diag_Flt
- port failed diagnostics
Lock_Ref
- locking to the reference signal
Testing
- running diagnostics
Online
- the port is up and running
The comment field may be blank, or may show:
Disabled
- the port is disabled
Bypassed
- the port is bypassed (loop only)
Loopback
- the port is in loopback mode
E_Port
- fabric port, shows WWN of attached
switch
F_Port
- pt-pt port, shows WWN of attached
N_Port
G_Port
- pt-pt but not yet E_Port or F_Port
L_Port
- loop port, shows number of
NL_Ports
if a port is configured as a long-distance port,
the long distance level is shown in the format of
"Lx", x being the long-distance level number.
See portCfgLongDistance for the level description.
When troubleshooting issues involve the fabric services or a switch’s ability to
participate in the fabric, the important parts of switchShow data to focus on are
switchState, switchRole, and switchDomain.
Port state is applicable from a fabric perspective for observing the state of
expansion ports (E_Ports). E_Ports associated with ISLs are the ports used to
299
140_SANs_08
300
8/14/01
5:26 PM
Page 300
Chapter 8 • SAN Troubleshooting
connect multiple switches together forming a fabric. Port state is also useful for
troubleshooting connectivity problems with end devices (F_Ports and FL_Ports).
In a running fabric, the switchState should always be online. If not, access to
and from the switch is not possible. It is possible that the switch may be in a transitory state as it comes online from a power cycle or reboot, so check again to
make sure this is not the case. It is also possible that the switch has been manually
disabled using the switchDisable command.
A switch can be operating as a principal, subordinate, or disabled, which is
indicated by the switchRole variable.There is only one principal switch in the
fabric, and if the principal fails, another switch will assume this role.The principal
switch facilitates the bring up of the fabric and assignment of domain IDs. A
switch domain ID is an address that defines the switch in a fabric. Domain IDs
are automatically assigned as part of the fabric initialization process by the principal switch. It is possible to manually assign a domain ID as well. SilkWorm
1000 series switches use the domains 0–31, and SilkWorm 2000 series switches
and beyond use the domains 1–239. If a switch is not a principal, it operates in a
subordinate switch role. If the switch role indicates disabled, access to and from
the switch is not possible and it is likely that someone disabled the switch by
typing switchDisable, or the switch was unable to obtain a domain ID.When a
switch is disabled, a comment of “unconfirmed” accompanies the domain ID
(Figure 8.10). Normally, a switch will be in disabled state after issuing the command switchDisable.The “unconfirmed” attribute could also be caused by a
problem with the fabric, which causes a switch to be unable to confirm its
domain ID even though the switch is enabled.When the switch is disabled, the
LEDs will blink yellow every two seconds and the port state will indicate disabled.
Figure 8.10 Switch Disabled and Unconfirmed Domain
core1:admin> switchshow
switchName:
core1
switchType:
2.4
switchState:
Offline
switchRole:
Disabled
switchDomain:
1 (unconfirmed)
switchId:
fffc01
switchWwn:
10:00:00:60:69:10:8d:fd
switchBeacon:
OFF
Continued
140_SANs_08
8/14/01
5:26 PM
Page 301
SAN Troubleshooting • Chapter 8
Figure 8.10 Continued
port
0: sw
Laser_Flt
Disabled
port
1: sw
In_Sync
Disabled
port
2: sw
In_Sync
Disabled
port
3: sw
In_Sync
Disabled
port
4: sw
In_Sync
Disabled
port
5: sw
In_Sync
Disabled
port
6: —
No_Module
Disabled
port
7: —
No_Module
Disabled
port
8: —
No_Module
Disabled
port
9: —
No_Module
Disabled
port 10: —
No_Module
Disabled
port 11: —
No_Module
Disabled
port 12: —
No_Module
Disabled
port 13: —
No_Module
Disabled
port 14: —
No_Module
Disabled
port 15: —
No_Module
Disabled
The SilkWorm 1000 series of switches uses the domain IDs 0–31, and the
SilkWorm 2000 series and beyond switches use the domain IDs 1–239. Normally,
a domain ID is automatically assigned when a switch joins the fabric; however,
there are circumstances that can result in domain ID conflicts.This can happen
when connecting two online switches that have already been assigned the same
domain ID.When two switches in a fabric have the same domain ID, the fabric
segments along an ISL that allows domain IDs to be unique in each segment.
The port state information generated by switchShow is pertinent to fabricrelated issues if an ISL port is affected. One issue that relates to ISLs involves the
port’s inability to fully initialize.While the port is online, it remains in a generic
port (G_Port) state since it could not initialize as an E_Port. Another issue that
affects ISLs occurs when the link is unable to initialize, resulting in the port not
coming online at all.This could be caused by a marginal link, an offline switch
connected to the other end of the ISL, or a fabric initialization issue. In either circumstance, it is incumbent upon the SAN administrator to establish that the port
is an ISL port or an edge device that is not connected, as there is no way to tell
the type of device connected until after the port initializes. Execute the commands
301
140_SANs_08
302
8/14/01
5:26 PM
Page 302
Chapter 8 • SAN Troubleshooting
portDisable and portEnable, providing the offending port number as an argument to try to reinitialize the port.
The Switch Name is assigned by the user and does not have to be unique in
the fabric. However, uniquely naming each switch can make your SAN administration easier.With some Fabric OS versions,WEB TOOLS might not function
properly if the Switch Name does not match the switch’s actual host name.You
assign a Switch Name with the switchName command.
The switchId value is the switch’s 24-bit Destination ID (D_ID) address in
the fabric.This is the Fibre Channel address that another switch would use to
send the frame to the switch itself, rather than to a device connected to the
switch.This value might appear in portLog data—for example, when the switch
probes an edge device for Name Server information.
Using the switchBeacon switch command, you can have the switch flash a
back-and-forth pattern (from left to right, and right to left) in yellow to identify
the switch.This is helpful if you are doing maintenance and need to identify a
switch that is positioned in a rack with many other switches. Finally, the
switchType information indicates the switch model and revision in the form
model.revision, as shown in Table 8.3.
Table 8.3 switchType Values and Associated Architecture
SwitchType Value
Switch Model
1
2
3
4
5
1000 series
2800
2400
20x0
22x0
Information in the port state section includes the port state, the type of
media, the World-Wide Name (WWN) of the connected device, and the
switch name if the attached device is a switch, private, phantom, and upstream or
downstream information.
The port state will typically be online or offline; however, as shown in Figure
8.4, a laser fault is also indicated when encountered.The type of interface media is
shown as well, indicating the type of GBIC used. SW is for shortwave GBICs, LW
is for long wavelength GBICs (for longer distances), and ID is for serial ID GBICs.
Serial ID GBICs are smart GBICs with serial number and state information.
140_SANs_08
8/14/01
5:26 PM
Page 303
SAN Troubleshooting • Chapter 8
A private device is normally a loop device that does not perform a Fabric
Login (FLOGI) and uses an 8-bit address. A phantom address is a 24-bit translated
address for an 8-bit device. A phantom is created for the embedded port so that
services and other devices within the SAN can communicate with the devices on
a private loop.The switch recognizes only device addresses of 24 bits in length.
Therefore, services on the switch that need to communicate with the private
devices need to have a 24-bit proxy for their 8-bit addresses. Each device that
wants to communicate with devices on a private loop needs to be “represented”
on the loop directly.This is done by creating a phantom device for each host that
wants to communicate with devices on the private loop.This phantom is acting
on behalf of each of the devices that wish to communicate to devices on the loop.
The terms upstream and downstream designate that particular switch’s position
in reference to the principal switch in the fabric.These paths are used in the process for assigning switch domain IDs. In Figure 8.11, notice that switch core1 is
the principal switch, and all “stream” designators are downstream. For switch
edge1, the path to the principal switch is upstream through port 2.There is also a
downstream path from switch edge1.This path is used by switch core2 to access
switch core1; hence, port 3 is designated as a downstream port.The principal
switch has no upstream ports.
The port state section of the switchShow output is very helpful in identifying edge-device connection issues. These issues can involve a range of problems, from missing devices to devices initializing with the wrong topology (for
example, a loop-configured device initializing as point-to-point topology). The
explanation of port states and associated comments is fairly straightforward.
When in doubt, check to see that the port is online, assuming a device is
attached, and that the topology is correct (F_Port or L_Port). If neither of these
values is present, you will need to do further analysis.
The nsShow Command
An excerpt from the nsShow help entry is provided in Figure 8.12. The most
important thing about nsShow output is whether the device in which you are
interested appears in the command output. If a device does not appear in the
Name Server, other devices will not be able to access it. There are some instances
where initiators bypass the Name Server and directly communicate with a device
by using an earlier obtained address or doing a table scan of addresses. This
behavior is considered suspect, as it is bypasses a standard methodology. Note that
hard zoning prevents such activities from occurring, ensuring that all devices
behave appropriately within the SAN.
303
140_SANs_08
304
8/14/01
5:26 PM
Page 304
Chapter 8 • SAN Troubleshooting
Figure 8.11 Upstream and Downstream Paths in Reference to
switchShow Output
switchName:
core1
switchType:
2.4
switchState:
Online
switchRole:
Principal
switchDomain:
1
switchId:
fffc01
switchWwn:
10:00:00:60:69:10:8d:fd
switchBeacon:
OFF
port 0: sw Laser_Flt
port 1: sw Online
E-Port 10:00:00:60:69:10:9b:52
port 2: sw Online
E-Port 10:00:00:60:69:11:f9:f7
port 3: sw Online
E-Port 10:00:00:60:69:10:9b:52
port 4: sw Online
E-Port 10:00:00:60:69:12:f9:8c
port 5: sw Online
E-Port 10:00:00:60:69:12:f9:8c
"edge2" (downstream)
"edge1" (downstream)
"edge2"
"edge3" (downstream)
"edge3"
switchName:
edge1
switchType:
2.4
switchState:
Online
switchRole:
Subordinate
switchDomain:
2
switchId:
fffc02
switchWwn:
10:00:00:60:69:11:f9:f7
switchBeacon:
OFF
port 0: sw No_Light
port 1: sw Online
E-Port 10:00:00:60:69:10:9b:5b "core2"
port 2: sw Online
E-Port 10:00:00:60:69:10:8d:fd "core1" (upstream)
port 3: sw Online
E-Port 10:00:00:60:69:10:9b:5b "core2" (downstream)
port 4: — No_Module
Principal
core1
edge1
core2
edge2
edge3
NOTE
If the device is not in the Name Server, it is most likely invisible to the
rest of the fabric and therefore inaccessible.
140_SANs_08
8/14/01
5:26 PM
Page 305
SAN Troubleshooting • Chapter 8
Figure 8.12 nsShow help Page
NAME
nsShow - display local Name Server information
SYNOPSIS
nsShow
AVAILABILITY
all users
DESCRIPTION
This command displays local Name Server information, which
includes information about devices connected to this switch,
and cached information about devices connected to other
switches in the fabric.
The message "There is no entry in the Local Name Server" is displayed
if there is no information in this switch, but there still may be
devices connected to other switches in the fabric. The command
nsAllShow shows information from all switches.
Each line of output shows:
*
an asterisk indicates a cached entry from another switch.
Type
U for unknown, N for N_Port, NL for NL_Port.
Pid
The 24-bit Fibre Channel address.
COS
A list of classes of service supported by the device.
PortName
The device's port Worldwide Name.
NodeName
The device's node Worldwide Name.
TTL
The time-to-live (in seconds) for cached entries, or
'na' (not-applicable) if the entry is local.
There may be additional lines if the device has registered any of
the following information (the switch automatically registers
SCSI inquiry data for FCP target devices): FC4s supported,
Continued
305
140_SANs_08
306
8/14/01
5:26 PM
Page 306
Chapter 8 • SAN Troubleshooting
Figure 8.12 Continued
(node) IP address, IPA, port and node symbolic names, fabric
port name, hard address and/or port IP address.
Often, the returned SCSI inquiry data is meaningful and indicates telling
information such as the vendor, model, and the firmware revision level of the
attached device, as shown in Figure 8.13. For HBAs, SCSI inquiry data occasionally is not returned and the Name Server entry is a bit sparse, so it is harder to
identify the device. Some vendors are starting to allow administrators to manually
populate this field to allow the textual information to be site-specific, such as
node names or locations.
Figure 8.13 The nsShow Output Explained
The Seagate disks support
Class 3 service and the
EMC supports Classes 2 & 3
Type Pid
An HBA—not much info
COS
PortName
NodeName
TTL(sec)
N
0a1000;
2,3;20:00:00:e0:69:40:13:19;10:00:00:e0:69:40:13:19;
na
NL
0a19cb;
3;21:00:00:20:37:26:b0:6c;20:00:00:20:37:26:b0:6c;
na
FC4s: FCP [SEAGATE ST39102FCSUN9.0G0D29]
NL
0a19cc;
3;21:00:00:20:37:26:84:22;20:00:00:20:37:26:84:22;
na
FC4s: FCP [SEAGATE ST39102FCSUN9.0G0D]
N
0a1b21;
2,3;50:06:04:84:35:46:b5:4d;50:06:04:84:35:46:b5:4d;
FC4s: FCP [EMC
N
0a1c21;
SYMMETRIX
2,3;50:06:04:84:3a:3b:1f:4d;50:06:04:84:3a:3b:1f:4d;
FC4s: FCP [EMC
SYMMETRIX
na
5265]
na
5265]
An EMC storage device with 5265 firmware
FCP = SCSI over Fibre Channel
It can be confusing understanding the difference between a device node
WWN and a port WWN. A device has only one node WWN and can potentially
have one or more port WWN(s).This way, it is possible to uniquely identify
140_SANs_08
8/14/01
5:26 PM
Page 307
SAN Troubleshooting • Chapter 8
multiple paths or interfaces to the same device. For example, today’s Just a Bunch
of Disks (JBOD) systems usually have two ports (A and B), and each port has an
associated port WWN.This enables two paths to connect to the same disk. How
do you know it is the same disk? The node WWN is the same for each path,
with each path having a unique port WWN. In Figure 8.14, if the entry for Port
ID (PID) 0a19cb is connected on both ports A and B, the node WWN stays the
same (20:00:00:20:37:26:b0: 6c), the A port would have a WWN of
21:00:00:20:37:26:b0: 6c, and the B port would have a WWN of
22:00:00:20:37:26:b0: 6c.
Figure 8.14 The Difference between Port WWN and Node WWN
A
Port WWN =
21:00:00:20:37:26:b0:6c
B
Node WWN = 20:00:00:20:37:26:b0:6c
Port WWN =
22:00:00:20:37:26:b0:6c
The use of node WWNs and port WWNs is not always strictly followed, and
the Fibre Channel specifications are not clear on their usage. A node WWN
sometimes is used to represent an entire system and all ports (Port WWNs) associated with that system.
The Name Server also provides information about a device’s PID. Knowing
how to decode a PID is helpful in translating a device’s SAN logical address into
a SAN physical location. If you know a device’s PID, you know the physical port
that device is attached to, the domain ID of the switch that device is attached to,
and whether that device is an N_Port or an NL_Port. Figure 8.15 explains this
decoding process further.
The topologyShow Command
The topologyShow command displays the fabric topology, as seen by the local
switch. topologyShow output consists of a list of all domains that are part of the
fabric, and for each of those domains, all the possible paths to reach these
domains from the local switch. In addition, topologyShow displays the total
number of switches in the fabric, and the domain ID of the local switch. It is also
helpful to issue the switchShow command to identify directly connected
switches. Look for E_Ports and the name of the switch located at the other end
of the E_Port to create a SAN topology. Perform a switchShow for every switch
307
140_SANs_08
308
8/14/01
5:26 PM
Page 308
Chapter 8 • SAN Troubleshooting
in the fabric. First, write down the name of the switch on which the command is
issued. For each E_Port on that switch, write down the name of the switch to
which the E_Port connects.Then draw a line between the switch on which the
command is being run and the switch that shows up on the other end of the
E_Port.The data in Figure 8.16 indicates that switch edge3 is directly connected
to switches core1 and core2.To identify direct-connect switches in the
topologyShow output, look for domain entries with a hop count of one.To
obtain additional information on the switches in the fabric, such as their IP
address, use the fabricShow command.
Figure 8.15 How to Interpret the Port Addressing
Port Addressing
0x XX 1Y ZZ where:
■
XX is a value between 0x1 to 0xef inclusive and indicates the domain id of the
switch to which the device is attached
■
The “1” will always be there in 2000 series switches
■
Y is the port number (0-F hex) that the device is attached to
ZZ is the AL_PA for a loop device or 00 for an F_Port
■
An example: 021500
XX=02 Domain_ID of the switch
Y=5 Port #
ZZ=00 If 00, then F_Port.
IF non-zero, then ALPA of the device on the FL_Port
SAN Profile
It is recommended that you create a profile of your fabric when it is functioning
normally so that you always have a baseline to compare the current state of your
SAN.You will want to create a profile before making any changes to the SAN,
such as firmware upgrades or additions or deletions of switches or edge devices.
This information can be captured from a logging facility within telnet and stored
as a text file.
140_SANs_08
8/14/01
5:26 PM
Page 309
SAN Troubleshooting • Chapter 8
Figure 8.16 Use topologyShow to Determine the Number of Online Switches
in the SAN
5 domains in the fabric; Local Domain ID: 3
Domain
Metric
Hops
Out Port
In Ports
Flags
Name
-------------------------------------------------------------------1
1000
1
0
0x00000002
D
"core1"
2
0x00000008
D
2
2000
2
0
0x00000000
D
"edge1"
2
0x00000000
D
3
0x00000000
D
1
0x00000000
D
4
2000
2
0
0x00000000
D
"edge3"
2
0x00000000
D
3
0x00000000
D
1
0x00000000
D
5
1000
1
3
0x00000001
D
"core2"
1
0x00000004
D
core1
edge1
core2
edge2
edge3
When you finish your maintenance or suspect a problem, take a new profile
and compare the baseline profile to your current profile. Any discrepancies
require further investigation. For troubleshooting purposes, a profile should consist of the following information extracted from a healthy SAN:
■
The number of domains in the fabric, which can be obtained from
topologyShow outputs.
■
The overall topology of the fabric, again from topologyShow and
switchShow outputs.
309
140_SANs_08
310
8/14/01
5:26 PM
Page 310
Chapter 8 • SAN Troubleshooting
■
The number of noncached Name Server entries for each switch in the
fabric, which can be obtained by issuing the command nsShow.
■
The total number of Name Server entries, which can be determined by
issuing the command nsAllShow.
You can also obtain this data by issuing the command supportShow for every
switch and then pulling the required data out of log. Another option is to automate the acquisition of data and then parse out the necessary fields. Figure 8.17
and Table 8.4 are examples of the necessary data collection and what a SAN
profile looks like.The data to collect is bolded in Figure 8.17 as well.
Figure 8.17 Data to Collect When Establishing a SAN Profile
BigSAN102:admin> nsShow
The Local Name Server has 2 entries {
Type Pid
N
COS
PortName
NodeName
TTL(sec)
661600;
3;50:00:60:e8:02:76:b9:04;50:00:60:e8:02:76:b9:04; na
FC4s: FCP [HITACHI OPEN-9
0112]
Fabric Port Name: 20:06:00:60:69:10:67:c4
N
661b00;
3;50:00:60:e8:02:76:b9:00;50:00:60:e8:02:76:b9:00; na
FC4s: FCP [HITACHI OPEN-9
0112]
Fabric Port Name: 20:0b:00:60:69:10:67:c4
}
BigSAN102:admin> nsAllShow
16 Nx_Ports in the Fabric {
641300 661600 661b00 6a1100 6b1000 6b1101 6b1600 6d1100
6d1200 6d1300 7215e1 761d01 761e00 771d00 771f00 781e00
}
BigSAN102:admin> topologyShow
26 domains in the fabric; Local Domain ID: 102
Output truncated. Make sure you capture all domain Ids in the fabric.
140_SANs_08
8/14/01
5:26 PM
Page 311
SAN Troubleshooting • Chapter 8
Table 8.4 Formatted SAN Profile
Switch
Local NS Entries
BigSAN100
BigSAN101
BigSAN102
BigSAN103
BigSAN104
BigSAN105
BigSAN106
BigSAN107
BigSAN108
BigSAN109
BigSAN110
BigSAN111
BigSAN112
BigSAN113
BigSAN114
BigSAN115
BigSAN116
BigSAN117
BigSAN118
BigSAN119
BigSAN120
BigSAN121
BigSAN122
BigSAN123
BigSAN124
BigSAN125
Total Nodes
Total Switches
0
0
2
4
1
1
4
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
16
26
311
140_SANs_08
312
8/14/01
5:26 PM
Page 312
Chapter 8 • SAN Troubleshooting
What Data Can a Host Provide?
A host can provide a significant amount of data to aid the SAN troubleshooting
process.Think again of the SAN as a virtual cable. A working virtual SAN cable
means that edge devices that are expected to communicate with each other are
successfully connected as N_Port or NL_Port (verify this with switchShow),
and that the devices are present in the Name Server (verify this with nsShow).
Assuming that zoning is properly configured, these edge devices should be able to
communicate with each other, just as if they are directly connected to each other
with a cable.
A host can indicate if devices are visible to that host. In a Windows environment, do this by running Disk Administrator; in a UNIX environment, do this by
issuing the format command. Many tools from HBA vendors are GUI-based and
allow for real-time, live viewing of connection status to storage devices. Some
examples of these tools are TROIKA’s SAN Command and JNI’s EZ Fibre. If the
devices do not show up at the host when these commands are issued, the next
step is to see why these devices are not visible to the host.The key to this involves
reviewing the host log files. For Solaris, the message log file is normally located in
the file /var/adm/messages.You can watch the SAN HBA events in real time,
by doing a tail –f /var/adm/messages. For a Microsoft environment, you can
use the Event Viewer to see the HBA-related activity.You might need to change
the verbosity levels or set the HBAs to debug mode to see detailed data in the
message logs. An example of a log from a Solaris host is provided in Figure 8.18 to
familiarize you with the data and how you might use it to assist in the troubleshooting process.
In Figure 8.18, the HBA recognizes and has visibility to seven JBOD disks. At
this point, you can conclude that the SAN virtual cable is working fine and that
the host has visibility to the devices at a SAN level.The next step is to see if the
devices are visible to the operating system. Based on the next set of error messages,
the indications are that the devices are not visible to the operating system and that
the HBA should be investigated further.The problem illustrated in Figure 8.18
occurred because the host HBA drivers were not configured to bind with any targets; hence, the disks were not presented to the operating system.To resolve this
issue, it is necessary to follow the HBA directions for binding SAN targets.
313
The devices return
The link comes
back up
The devices go
away
—an HBA issue ?
Now there looks
to be an issue
with the bindings
May 16 21:30:45 sun1 jnic: [ID 594302 kern.notice] jnic0: Target15 Lun0: Initialization failed: No fibre channel bindings provided.
May 16 21:30:45 sun1 jnic: [ID 473743 kern.notice] jnic0: Target14 Lun0: Initialization failed: No fibre channel bindings provided.
May 16 21:30:45 sun1 jnic: [ID 353184 kern.notice] jnic0: Target13 Lun0: Initialization failed: No fibre channel bindings provided.
May 16 21:30:45 sun1 jnic: [ID 232625 kern.notice] jnic0: Target12 Lun0: Initialization failed: No fibre channel bindings provided.
May 16 20:08:35 sun1 jnic: [ID 549483 kern.notice] jnic0: Port 0214EF (WWN 2000002037D97745:2100002037D97745) available.
May 16 20:08:35 sun1 jnic: [ID 121592 kern.notice] jnic0: Port 0214E8 (WWN 2000002037D96FEB:2100002037D96FEB) available.
May 16 20:08:35 sun1 jnic: [ID 900463 kern.notice] jnic0: Port 0214E4 (WWN 2000002037D974D7:2100002037D974D7) available.
May 16 20:08:35 sun1 jnic: [ID 172876 kern.notice] jnic0: Port 0214E2 (WWN 2000002037D9775A:2100002037D9775A) available.
May 16 20:08:35 sun1 jnic: [ID 887895 kern.notice] jnic0: Port 0214E1 (WWN 2000002037D976B3:2100002037D976B3) available.
May 16 20:08:35 sun1 jnic: [ID 861423 kern.notice] jnic0: Port 0214E0 (WWN 2000002037D9730F:2100002037D9730F) available.
May 16 20:08:35 sun1 jnic: [ID 475247 kern.notice] jnic0: Port 0214DC (WWN 2000002037D97796:2100002037D97796) available.
May 16 20:08:35 sun1 jnic: [ID 184835 kern.notice] jnic0: Link Up
May 16 20:08:29 sun1 jnic: [ID 846798 kern.notice] jnic0: Port 0214EF (WWN 2000002037D97745:2100002037D97745) removed.
May 16 20:08:29 sun1 jnic: [ID 944579 kern.notice] jnic0: Port 0214E8 (WWN 2000002037D96FEB:2100002037D96FEB) removed.
May 16 20:08:29 sun1 jnic: [ID 682426 kern.notice] jnic0: Port 0214E4 (WWN 2000002037D974D7:2100002037D974D7) removed.
May 16 20:08:29 sun1 jnic: [ID 355286 kern.notice] jnic0: Port 0214E2 (WWN 2000002037D9775A:2100002037D9775A) removed.
May 16 20:08:29 sun1 jnic: [ID 515504 kern.notice] jnic0: Port 0214E1 (WWN 2000002037D976B3:2100002037D976B3) removed.
May 16 20:08:29 sun1 jnic: [ID 832114 kern.notice] jnic0: Port 0214E0 (WWN 2000002037D9730F:2100002037D9730F) removed.
May 16 20:08:29 sun1 jnic: [ID 957663 kern.notice] jnic0: Port 0214DC (WWN 2000002037D97796:2100002037D97796) removed.
May 16 20:08:29 sun1 jnic: [ID 229332 kern.notice] jnic0: Link Down
5:26 PM
May 16 20:08:29 sun1 jnic: [ID 619166 kern.notice] jnic0: Loss of sync detected
The link was reset
8/14/01
Solaris 2.8 host
JNI HBA
Figure 8.18 Solaris Host SAN-Related Messages
140_SANs_08
Page 313
140_SANs_08
314
8/14/01
5:26 PM
Page 314
Chapter 8 • SAN Troubleshooting
When to Use portLog and Other Advanced Tools
The portLog debugging tool is a low-level tool for debugging the SAN.The
portLog facilities are available in two forms: portLogDump and portLogShow.
The help page for portLogShow is reasonably detailed and helpful for decoding
portLog entries.To effectively understand portLog data, you will need a solid
background in Fibre Channel fundamentals.Training on decoding portLog data is
available from Brocade (www.brocade.com/education_services).
An annotated example of a portLog entry is shown in Figure 8.19 to provide
some insight into how a portLog entry is decoded.You will most likely
encounter portLog data when entering the supportShow command, which calls
portLogDump, or if you are requested to obtain this data by Brocade support.
Figure 8.19 portLog Entry Example
Port 6
D_ID = name server
ELS -> PLOGI
21:01:30.216 tReceive
Rx3
6
116
22fffffc,00011600,00f2ffff,03000000
Task
Timestamp
Class 3
frame
received
Frame
payload
size
S_ID =
Domain 1
Port 6
For Fibre Channel developers and people who are intimately involved with
SANs, a programmer’s guide is available (Fabric Programming Guide Revision 2.1
Publication number 53-0001561-01Rev. A 4/11/00).The guide is available from the
Brocade Web site (www.brocade.com); however, a login and password are required.
Instructions for obtaining a login and password are posted on the Web site.
Another low-level debugging tool is a Fibre Channel analyzer. Companies
such as Finisar (www.finisar.com) and Xyratex (www.xyratex.com) manufacture
Fibre Channel analyzers. An analyzer is typically used in a development environment and rarely to debug production environments. Analyzers can generate a
tremendous amount of data. An analyzer is usually inserted into the SAN
between the switch and an edge device, or between two switches. Normally, a
140_SANs_08
8/14/01
5:26 PM
Page 315
SAN Troubleshooting • Chapter 8
detailed analysis and troubleshooting effort is required to identify where to insert
the analyzer into the SAN and what data the analyzer should look for. Again, an
extensive background in Fibre Channel is necessary to effectively use an analyzer.
In-Depth Troubleshooting with Fibre Channel Analyzers
Although configuring SANs is getting easier with each new generation
of equipment, it is often useful to have the appropriate tools for configuring and testing your SAN. As in standard Ethernet-based networks,
and even in local parallel SCSI bus installations, network sniffers and bus
analyzers are very handy tools to really understand what is going on.
Fibre Channel cable testers can be purchased for nominal amounts, link
activity analyzers for several hundred dollars, and full-blown protocol
analyzers for several thousand dollars.
Fibre Channel cable testers provide simple connectivity tests for a
cable; in the case of copper cables, they test for connectivity between
two ends of a cable. Similar optical tools are available for checking the
amount of light that is transmitted through an optical cable, and they
provide convenient diagnostic capabilities for cable integrity.
An affordable alternative to full-blown protocol analyzers is a link
activity analyzer. Link activity analyzers attach to Fibre Channel cables
and analyze basic activity on the link. Basic functionality includes LEDs
to indicate when traffic is being sent and received, as well as information such as MB/sec counters, online or offline information, error lights
for CRC errors, and optical signal quality indicators. These types of link
activity analyzers are ideal for isolating specific problem areas in a SAN,
and identifying questionable links or devices.
Finally, for the most information about what is happening on a
SAN, protocol analyzers are the best tools available. These tools will
record every bit of information that comes across a wire, and through
user software can play back activity, show errors, highlight questionable
transactions, and more. Ranging from simple two-channel analyzers
embedded in a PC to multichannel testers that can test all of the ports
of a Fibre Channel switch in a single box, these analyzers are invaluable
if you really want to know what is going wrong with your network.
These tools can be invaluable for debugging problems directly at
the source, and are often bundled with training and classes to help you
learn the basic protocol and debugging techniques. For many problems
Continued
315
140_SANs_08
316
8/14/01
5:26 PM
Page 316
Chapter 8 • SAN Troubleshooting
you encounter in a development environment, a protocol analyzer is the
only tool that will help you really see what is going on. However, in production environments it is unnecessary to invest in a full analyzer for
day-to-day operation.
Troubleshooting the Fabric
A problem with the fabric is a pervasive issue that often affects more than one
device.When a fabric issue is experienced in a resilient SAN, it might have no
impact on SAN functionality since the SAN redundancy compensates for the
marginal situation.Table 8.5 provides a high-level review of problematic fabric
symptoms and associated possible causes. Fabric issues are normally associated
with heterogeneous storage and server environments in which all devices have
not been tested as a system.
Table 8.5 Symptoms Indicative of a Fabric Problem
Symptom
Possible Causes
Multiple edge devices
are inaccessible from
multiple hosts
■
■
■
■
■
■
■
■
■
Incompletely initialized
ISLs: ISL port initializes
as a G_Port or does not
come online
■
■
Fabric segmentation (zone conflict, mismatched
fabric parameters)
Switch failure
Edge device timeout or communication conflict
when accessing the Name Server (FFFFFC) or
Fabric F_Port (FFFFFE)
Unconfirmed domain
Message Queue (MQ) issues
Hosts and/or storage attempted to access the
fabric prior to fabric convergence
Domain ID conflict
Port configuration conflict
No fabric license installed
Marginal link
Fabric initialization error
The remainder of this section identifies what tools to use and data to analyze
when a fabric issue is suspected. Symptoms are explained in further detail and
specific issue traits are identified.Where possible, workarounds or corrective
actions are specified.
140_SANs_08
8/14/01
5:26 PM
Page 317
SAN Troubleshooting • Chapter 8
What to Look for in a Malfunctioning Fabric
If a switch is unable to join the fabric, all devices on that switch become inaccessible to the fabric and possibly to each other.When edge devices time out or are
unable to properly communicate with fabric services, communication between
numerous edge devices is interrupted and some devices become inaccessible.
NOTE
When initially identifying a fabric issue, look for a large number of edge
devices to be behaving marginally or not communicating at all. See if
you can identify a pattern. Is the outage random throughout the fabric,
or can you correlate the outage to a particular switch? Does the outage
correlate to one particular host type or storage device?
Host Behavior
Hosts that are involved with a fabric problem exhibit a variety of symptoms, one
of which is that some or all edge devices become inaccessible.You can verify this
situation for UNIX hosts using the command format to see if any devices have
disappeared. For Microsoft Windows 2000, start up the Disk Management utility
and check if any devices have disappeared.The Solaris /var/adm/messages file
and the Microsoft Event Viewer might provide further insight into the issue. ISL
initialization issues normally are invisible to the host, as the fabric will reroute
around failed ISLs and ensure connectivity—unless the ISL failure results in the
SAN becoming segmented, in which case edge devices will become inaccessible
to the host. Another possible symptom on the server is reduced performance of
the application. In the event of an ISL failure, the fabric will reroute the traffic as
mentioned.When this happens, typically, the traffic will have to share ISLs with
more devices than normal, possibly resulting in reduced performance due to congestion on the ISLs. Utilities supplied by your HBA vendor can also be helpful in
identifying host SAN status.
SAN Profile
If you suspect a SAN issue, create a new SAN profile and compare your baseline
SAN profile to your newly created SAN profile. Any unexplained discrepancies
317
140_SANs_08
318
8/14/01
5:26 PM
Page 318
Chapter 8 • SAN Troubleshooting
require further investigation—whether one or more switches have dropped out,
or if there are several missing Name Server entries.
Switch LEDs
If you can observe the SAN switches while the problem is occurring, try to
detect an LED pattern. Focus on the ISL ports first. Any yellow lights (blinking
or steady) indicate that manual intervention is required. At this point, log in to
the switch with yellow lights and issue the command supportShow to extract
debugging information for further analysis. If the switch is disabled (all ports
blinking a slow yellow), issuing a switchEnable command will bring the switch
back into the SAN. If a port is yellow (blinking or steady), you can bring the
device back online by issuing the command portDisable and then a
portEnable on the yellow port. Issue the command switchShow to verify the
port state or a disabled switch.
The errShow Command
Start the troubleshooting process by reviewing errShow data for every switch in
the fabric. Fabric segmentation and Message Queue (MQ) errors are indicative of
an error that will cause the switch and its connected devices to become inaccessible
to the fabric. Fabric segmentation is also caused by zone conflicts, incompatible
fabric parameters, or domain conflict. Review the errShow as a starting point.
The switchShow Command
When investigating fabric issues, you need to look at switchShow for port state
information and for fabric-related information. Issue the switchShow command
on every switch in the fabric. Examine the port state section of the switchShow
data for incompletely initialized E_Ports, which will show up as G_Ports or as
ports that are not online. If the port does not reinitialize itself, then manually
reinitialize the ISL by executing the commands portDisable and portEnable,
providing the offending port number as an argument.
A fabric issue that has less impact involves incomplete ISL initialization. If ISL
initialization issues occur, it is usually during fabric bring up. ISL initialization
issues can also occur during a fabric reconfiguration, which is triggered when an
ISL is added or removed or when a switch is added or removed. If the SAN is
designed to be resilient, an incomplete ISL initialization minimally impacts the
fabric, since there are multiple ISLs connecting the switches and edge devices are
still able to communicate with each other. On the other hand, if the SAN is not
140_SANs_08
8/14/01
5:26 PM
Page 319
SAN Troubleshooting • Chapter 8
resilient, an ISL initialization problem may result in a segmentation of the SAN
and many devices may lose communications with the SAN.
Resilient topologies deliver at least two internal fabric routes and are considered more resilient because each topology is capable of sustaining a switch or ISL
failure while the remaining switches and fabric remain operational.This selfhealing capability is enabled by Fabric Shortest Path First (FSPF) and is depicted
in Figure 8.20.
Figure 8.20 In a Resilient SAN, an ISL Failure Does Not Affect Communication
A
B
failed ISL
A
B
C
Figure 8.20 also depicts the failure of an ISL in a cascade topology, which is
the SAN located on the left. Note that switches A and B are unable to communicate with the remaining switches when the ISL marked with the “X” fails.
However, a similar switch failure in a resilient topology SAN (located on the
right) does not sever communications between the remaining switches. If the ISL
fails, it is still possible for switch A to communicate with switch C, using several
paths, such as the path highlighted in Figure 8.20. In a resilient topology, an ISL
failure might go unnoticed unless some type of monitoring is used (such as
Fabric Watch, a separately licensed product available from Brocade). Additionally,
with the loss of an ISL, there may also be performance degradation due to a loss
of overall available bandwidth.
When reviewing the fabric-related information of switchShow, search for a
switch that is disabled or has an unconfirmed domain. An unconfirmed domain
indicates that the switch was unable to communicate with the principal switch in
the fabric to obtain a domain ID.To resolve either situation, issue the command
switchDisable followed by switchEnable to enable the switch to join the fabric.
319
140_SANs_08
320
8/14/01
5:26 PM
Page 320
Chapter 8 • SAN Troubleshooting
The topologyShow Command
The topologyShow information is straightforward.You have to issue the
topologyShow command on only one switch, unless that switch happens to be
disabled or segmented. If this is the case, the topologyShow data will indicate
the number of switches in your fabric as one, and you need to pick another
switch to obtain the topologyShow information. The number of domains
should equal the number of switches in the SAN.You can reference your SAN
profile to establish the expected number of switches in the SAN. If there is an
unexplained discrepancy, you most likely have a failed, segmented, or disabled
switch.You can use switchShow data to identify a disabled or segmented switch.
If a switch that is supposed to be part of the fabric does not show up in the
topologyShow output (the previous SAN profile helps here), the administrator
should identify the switch, log in to it, and try first a portDisable-portEnable
sequence on any of the ports that should be an E_Port. If this does not work, try
a switchDisable-switchEnable sequence.
The nsShow and nsAllShow Commands
Issue the command nsAllShow on any switch in the fabric to obtain the total
number of edge devices registered with the Name Server. Note that issuing the
nsAllShow command on a switch that is segmented or disabled will return
Name Server data for only the switch and not the entire fabric. If there is an
unexplained discrepancy between this number and the number of Name Server
entries recorded in your SAN profile, you will need to identify which switches
are associated with the missing Name Server entries. First, check to see if there
are a number of missing devices; if so, then it is likely that one of the switches has
segmented or is offline.This should have been seen in the prior step. If you are
unsure of what devices are missing, issue the command nsShow on each switch
in the SAN and compare the number of Name Server entries to your SAN profile. Next, attempt to correlate the missing Name Server entries. Are the missing
entries all associated with any particular switch or edge device? Once you rule
out a segmented or disabled switch, determine if the port associated with the
missing devices is online. If the port is not online, bring the port online by executing the commands portDisable and portEnable, supplying the questionable
port number as an argument to these commands.This should refresh the Name
Server with the missing port edge devices. If the missing Name Server device
port comes online, and it still does not register with the Name Server, then it
indicates that there is either a timeout or a conflict in communication between
140_SANs_08
8/14/01
5:26 PM
Page 321
SAN Troubleshooting • Chapter 8
the Name Server and the edge device in question. It is now time to work with
your switch supplier and edge-device supplier to resolve this complex problem.
Now that You Suspect a
SAN Issue: Digging Deeper
Now that you suspect a SAN issue, you will need to investigate further to identify the root cause.The use and context of each command follows, relative to
troubleshooting a SAN issue.Where possible, workarounds or corrective actions
are identified. Several commands must be run on each switch; this is something
that can be automated.The details for doing so are presented later in the book in
Chapter 9.
Timeout of Edge Devices during Fabric Bring Up
If the problem occurs after a SAN bring up or during reconfiguration, it is possible that the edge devices came online before the SAN is ready. If this is the
case, you will see flickering green and possibly flickering yellow lights on the ISL
ports as the SAN converges while the edge ports remain steady green.You will
also see messages on the switch console as edge devices attempt to FLOGI and
Port-to-Port Login (PLOGI). Normally this is acceptable; however, if the SAN
requires an extended period of time for bring up, devices might time out. Be
careful to differentiate between an edge device that successfully retries PLOGIs
and FLOGIs while the fabric converges, and do not interpret these retries as failures.When the fabric is completely up, most devices that time out will try again;
however, if they do not, a timeout failure is to be expected.
If you suspect a PLOGI/FLOGI timeout failure during fabric convergence,
you can confirm your suspicions by reviewing the host logs.You can determine
the SAN state, by issuing a topologyShow command and verifying that the correct number of domains are in the fabric. If the edge devices are not tolerant of
the time it takes the SAN to converge, they might time out their FLOGI or not
successfully interact with the Name Server. In either case, that device will be
inaccessible to the fabric. If you suspect this is happening with your SAN, investigate the edge device logs to conclusively determine that timeouts are occurring,
the type of timeout, and how long these timeouts last. If timeouts are occurring,
one resolution is to increase the timeout values in the fabric (Resource
Allocation Time Out Value [R_A_TOV] or Error-Detect Time Out Value
[E_D_TOV]) or with the edge devices.There might be other timeout values on
the edge device that might help prevent this issue; however changing timeout
321
140_SANs_08
322
8/14/01
5:26 PM
Page 322
Chapter 8 • SAN Troubleshooting
values is a complex procedure and it is suggested that you work with your switch
supplier and edge device supplier at that point.
Port Configuration Conflict
or Missing Fabric License
If your switch is not configured with a fabric license, it cannot join the fabric.
The port state section of the switchShow will indicate that the E_Ports are
unknown.When you issue the command licenseShow, you should see a fabric
license. If the switchShow data indicates unknown E_Ports and you do not have
a fabric license installed, you will not be able to join that switch into a fabric
until you acquire a fabric license from your switch supplier.The SilkWorm 2010
and 2100 switches are entry-level switches and are not configured with a fabric
license, but can be upgraded with a simple license key.These switches are designed
for switched loop connectivity using Brocade QuickLoop.They can have a single
E_Port for connecting another QuickLoop switch; however, if additional ISLs are
connected, they will not come online.The SilkWorm 2240 and 2250 are entry
fabric switches designed for small SANs or for the edge of a larger SAN.They
can also only support a single E_Port unless you upgrade them to a full fabric
license. Figure 8.21 provides an example of a properly installed fabric license.
Figure 8.21 Example of a Properly Installed Fabric License
core1:admin> licenseShow
SRzy9Sz9zeTS0zAG:
Web license
bbSz9eQb9zccT0AQ:
Zoning license
RdzdSRcSyzSe0eTn:
QuickLoop license
cSczRScd9RdTd0SY:
Fabric license
<------A Fabric license is properly installed
It is possible to prevent switches from connecting into a fabric by disabling
E_Port functionality.You might want to do this for security purposes to prevent
unauthorized switches from joining the SAN. If your E_Ports are unknowingly
disabled, it will not be possible to join the switch into a fabric.To verify the status
of the switch E_Ports, issue the command portCfgEport, as shown in Figure 8.22.
140_SANs_08
8/14/01
5:26 PM
Page 323
SAN Troubleshooting • Chapter 8
Note that switch port 0 E_Port capability is disabled and that you cannot use this
port as an E_Port.To disable or enable E_Port support for a port, use the
portcfgEport command.You might want to do this for security purposes, since
turning off an E_Port prevents someone from attaching a switch to your fabric
without first obtaining your approval.
Figure 8.22 E_Port Configured as Disabled Example
core1:admin> portcfgEport
Ports:
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
----------------------------------------------------------NO
-
-
-
-
-
-
-
-
-
-
-
-
-
Segmented Fabrics
A fabric can segment for a variety of reasons, including zone conflicts, incompatible
fabric parameters, and domain ID conflicts.This section helps you identify whether you
have fabric segmentation, and what type of fabric segmentation you are experiencing. A fabric might segment when you add a new switch to the fabric or upon
fabric reconfiguration or bring up.The segmented fabric error message will occur
on any switch to which the new switch is trying to connect.The new switch that
is trying to join the fabric will show the E_Ports as unknown output from the
switchShow command. If the fabric segments during a reconfiguration or bring
up, you will have to search for a switch with unknown E_Ports, which can be
determined by examining the switchShow output.You can also compare your
current SAN profile to your baseline SAN profile to identify the missing switch.
Zoning Conflict
A zone conflict and fabric segmentation can occur when introducing a single- or
multiple-switch fabric into an existing fabric. As these conflicts may affect the
connected online devices, the switches segment and await human intervention to
determine the proper resolution.There is no way to identify the correct configuration without first investigating the nature of the conflict. If there are conflicts, it
may be easier to clear the configuration on the conflicted switch and then have
that switch absorb the zone information when it becomes part of the fabric.
Typically, there are three conditions that will create a zone conflict:
■
Multiple zoning configurations enabled Enabling zoning on both
fabrics when they are connected will create a zone conflict. Only one
323
140_SANs_08
324
8/14/01
5:26 PM
Page 324
Chapter 8 • SAN Troubleshooting
zone configuration can be enabled in a single fabric at a time. An
example of this is if the Day configuration is enabled on one switch and
the Night configuration is enabled on the other.The administrator will
have to decide which one is appropriate and disable the other.
■
Zone definition type conflict This occurs when introducing a
single- or multiple-switch fabric into an existing fabric that has zoning
definitions already defined, but the definition type (in other words, alias,
zone) is in conflict. An example of this would be a definition of Red as a
zone defining one fabric, and Red as an alias definition on another
fabric.This is a definition conflict and will segment the fabric.
■
Zone definition content conflict This occurs when introducing a
single- or multiple-switch fabric into an existing fabric that has zoning
definitions (in other words, alias, zone) already defined, but the content
is in conflict.This is where the definition name and type match, but the
content is different. An example of this would be a Red zone defined on
both fabrics. On the first fabric, the Red zone was defined with domain
5, port 4, and the second fabric has the Red zone defined with domain
7, port 3. Both have a zone definition of Red, but the content is in conflict and will cause the fabric to segment. Again it will require that the
administrator determine which Red zone is correct and either update
the incorrect one or delete it. Once the fabrics merge, the proper Red
zone will be propagated to all the switches in the fabric.
The workaround for this situation involves correcting the conflicts or clearing
the zoning information on either the fabric or new switch, depending on which
zoning configuration you consider to be correct and want to keep.You can clear
a zoning configuration by issuing a cfgClear <configuration you want to
delete> command followed by a cfgDisable <active configuration you
want to delete> command.You should first save a copy of the zone configuration by issuing cfgShow and saving the output to a file (in case you mistakenly
delete the wrong configurations).The configUpload command is also useful for
this operation. Figure 8.23 shows a zone conflict error message.
Figure 8.23 Zone Conflict Error Message
0x10addf10 (tZone): May 15 09:37:01 (12)
Error FABRIC-SEGMENTED, 3, port 4, zone conflict
140_SANs_08
8/14/01
5:26 PM
Page 325
SAN Troubleshooting • Chapter 8
Incompatible Fabric Parameters
Certain system configuration settings are changed by issuing the command configure.The fabric parameter system configuration settings must be the same for
every switch in the fabric.The fabric will segment if there is a difference between
the parameters that exist in the fabric and the parameters on a switch that is
trying to join the fabric.The following parameters must be consistent with the
switch that is joining the fabric and the fabric:
BB credit: (1..27) [16]
R_A_TOV: (4000..120000) [10000]
E_D_TOV: (1000..5000) [2000]
Data field size: (256..2112) [2112]
Sequence Level Switching: (0..1) [0]
Disable Device Probing: (0..1) [0]
Suppress Class F Traffic: (0..1) [0]
SYNC IO mode: (0..1) [0]
VC Encoded Address Mode: (0..1) [0]
Core Switch PID Format: (0..1) [0]
Per-frame Route Priority: (0..1) [0]
Long Distance Fabric: (0..1) [0]
NOTE
The range of values and defaults for Fabric OS 2.4.1a are shown in the
list of parameters in this section. Fabric parameters are subject to
change, and you should consult the documentation of the Fabric OS version you intend to use.
Figure 8.24 shows an example of an incompatible fabric parameters error message occurring on switch edge1, and the resulting switchShow data from that
switch. It is necessary to track down the switch connected on ports 0 and 2 of
switch edge1 and compare the fabric parameters from that switch to those of
edge1. Once you identify the discrepancy, use the configure command to change
the discrepant fabric parameters of the joining switch to those of switch edge1.
325
140_SANs_08
326
8/14/01
5:26 PM
Page 326
Chapter 8 • SAN Troubleshooting
Figure 8.24 Incompatible Fabric Parameters
edge1:admin> switchShow
switchName:
edge1
switchType:
2.4
switchState:
Online
switchRole:
Subordinate
switchDomain:
2
switchId:
fffc02
switchWwn:
10:00:00:60:69:11:f9:f7
switchBeacon:
OFF
port 0: sw Online
E-Port (unknown)
port 1: sw Online
E-Port 10:00:00:60:69:10:9b:5b "core2" (upstream)
port 2: sw Online
E-Port (unknown)
port 3: sw Online
E-Port 10:00:00:60:69:10:9b:5b "core2"
port 4: -- No_Module
port 5: -- No_Module
port 6: cu Online
L-Port 7 public
port 7: -- No_Module
port 8: -- No_Module
port 9: -- No_Module
port 10: -- No_Module
port 11: -- No_Module
port 12: -- No_Module
port 13: -- No_Module
port 14: -- No_Module
port 15: -- No_Module
Error 01
-------0x10f6f4d0 (tTransmit): May 17 18:10:38 (8)
Error FABRIC-SEGMENTED, 3, port 0, incompatible flow control parameters
Edge1
0 1 2 3 4 5 6 7 8 9 10 1 12 12 14 15
?
Switch with incompatible
fabric parameters
Domain ID Conflict
A domain ID conflict can occur if you join a switch that is in the online state
into a fabric, and the joining switch domain ID conflicts with the domain ID of
140_SANs_08
8/14/01
5:26 PM
Page 327
SAN Troubleshooting • Chapter 8
a switch in the fabric. Normally, domain IDs are automatically assigned; however,
once a switch is online, the domain ID cannot change, as it would change the
port addressing and potentially disrupt critical I/O.The resolution for this
problem involves performing a switchDisable followed by a switchEnable on
the joining switch.This will enable the joining switch to obtain a new domain
ID as part of the process of coming online.The fabric principal switch will allocate the next available domain ID to the new switch during this process.
NOTE
Changing domain IDs can have an impact on port zoning entries. Be sure
to check to see if any port zoning entries exist for devices on a switch
before changing its domain ID, and update any affected zones to reflect
the change.
Message Queue Errors
An MQ error is a message queue error.You can identify an MQ error message by
looking for the two letters M and Q in the error message. MQ errors can result
in edge devices dropping from the Name Server or preventing a switch from
joining the fabric. MQ errors are rare and difficult to troubleshoot, and it is suggested that you resolve them by working with your switch supplier.When you
encounter MQ errors, execute the supportShow command to capture debug
information about the switch. A switch reboot will likely clear any associated
problems.Then forward the supportShow data to your switch supplier for
further investigation.
Troubleshooting Devices
that Cannot Be Seen
A host that is unable to access a SAN device is a more common SAN issue that
can arise. Again, consider the virtual SAN cable analogy to start the troubleshooting
process.We want to determine whether the SAN is the cause of the problem or
whether it is an edge device issue.To do this you need to work your way along the
virtual SAN cable to the edge device(s) that cannot be seen. Figure 8.25 depicts a
flowchart that outlines the process for troubleshooting a missing device.
327
Follow fabric
troubleshooting
procedure
328
Problem Identified
Yes
Is it a zoning
issue ?
Yes
Is storage device
visible in name
server ?
Yes
No
No
No
Timeout or Name
Server conflict Escalate
Problem Identified
Yes
Node
configuration
issue ?
Problem Identified
Yes
Is there a port
configuration
conflict?
No
No
Name Server
conflict - Escalate
Follow marginal
link
troubleshooting
procedure
5:26 PM
Yes
No
Is the storage
device
present in
switchShow?
8/14/01
Is it a fabric
issue ?
Storage Device
not visible to
host
Figure 8.25 Troubleshooting Devices that Cannot Be Seen
140_SANs_08
Page 328
140_SANs_08
8/14/01
5:26 PM
Page 329
SAN Troubleshooting • Chapter 8
What to Look for in the Fabric
The first step is to determine whether the missing device problem is a fabric issue.
A quick way to determine this is to establish if the problem is localized to just a
single missing device or multiple missing devices.You also want to ensure all
switches are online in the fabric.You can quickly check your fabric status by issuing
the command topologyShow to verify that the correct number of domains exist
in your fabric.You can verify that the missing device is a localized issue by entering
the command nsAllShow to establish the total number of devices in the fabric. If
you suspect a fabric issue, since multiple devices are missing, follow the fabric
troubleshooting process. If you suspect a missing device issue, since only one or
two devices are unaccessible, move on to the next section,“Are the Host and
Storage Visible via switchShow on Their Respective Switches?”
Are the Host and Storage Visible via
switchShow on Their Respective Switches?
Use the command switchShow on the switch to which the subject host is connected.Verify that the host port and the storage port are online. If both the
storage and the host port are online, move on to the next section, as the virtual
SAN cable is logically connected to both the storage and the host. If the port is
not online, your host or storage might be malfunctioning, you might have a link
initialization issue, or you might have a marginal link. If the edge port is not
online or is a G_Port, this is analogous to having a disconnected cable. A host
malfunction is a very broad term and can include problems such as incorrect or
improperly installed HBA drivers, HBA parameters, or a faulty HBA. A storage
malfunction can include an incorrect or improperly configured storage interface
or a faulty storage interface.
NOTE
A quick method of identifying the cause of a missing device is to visibly
inspect your switch LEDs. Any steady or flashing yellow lights indicate
that a port is not online and manual intervention is required.
Brocade SilkWorm switches by default automatically configure the appropriate
port topology based on the connecting port topology, which is either N_Port or
329
140_SANs_08
330
8/14/01
5:26 PM
Page 330
Chapter 8 • SAN Troubleshooting
NL_Port, or in the case of a switch, an E_Port.This functionality is invaluable for
SAN management, because it alleviates the SAN administrator from managing and
maintaining the configuration for potentially thousands of ports. In some situations,
it is necessary to configure a port for a particular topology by using one or more of
the commands portCfgEport, portcfgFAport, or portcfgLport to lock the
port into a certain state.This may help with an issue where the edge device supports multiple port topologies and does not initialize in the mode that is desired.
A switch or port might also be configured for QuickLoop. First, check to see
that the switch or port in question is configured correctly for the intended purpose. For example, if the attaching edge device is configured as an NL_Port and
the switch port is configured as an F_Port, there is a conflict and that edge device
might initialize as a G_Port. Initializing as a G_Port is just as bad as not initializing at all, as the associated device is essentially inaccessible.The G_Port, or
generic port, is a transitional state defined in the standards as a device transitions
to an F_Port or an E_Port. If the port connecting to your edge device is not
intended to be a QuickLoop port, you will need to reconfigure that port, or the
edge device might not initialize properly. If there is any conflict, resolve the conflict with the switch, by reconfiguring the port, or with the edge device and
move onto the next section, “Do the Devices Show Up in the Name Server?” If
the devices support both loop and fabric modes, utilize the fabric setting to get
the best performance and fault isolation.
See Figure 8.26 for the usage and examples of various port configuration
commands. Switch core1 is configured for QuickLoop, as evidenced by the
enabled entries in the QuickLoop mode column. Switch core1 port 8 is configured as a loop port, and no ports are configured as Fabric Assist (FA) ports.You
can also use the command qlShow to determine if the switch is configured for
QuickLoop. If the switch is in QuickLoop mode and no QuickLoop is required,
you can issue a qlDisable command to disable QuickLoop for the entire switch.
If QuickLoop is required, but is not needed for the port in question, use the
qlPortDisable <port #> command for the port that needs to be changed.
Figure 8.26 Port Configuration Examples
core1:admin> qlportshowall
PortNum QuickLoop Mode
Port State
0
Enabled
fabric
E PORT
1
Enabled
fabric
E PORT
Continued
140_SANs_08
8/14/01
5:26 PM
Page 331
SAN Troubleshooting • Chapter 8
Figure 8.26 Continued
2
Enabled
fabric
E PORT
3
Enabled
fabric
E PORT
4
Enabled
fabric
E PORT
5
Enabled
fabric
E PORT
6
Enabled
offline
7
Enabled
offline
8
Enabled
fabric
9
Enabled
offline
10
Enabled
fabric
11
Enabled
offline
12
Enabled
offline
13
Enabled
offline
14
Enabled
offline
15
Enabled
offline
core1:admin> portcfgLport
Ports:
0
1
2
3
4
5
6
7 8
9
10
11 12
13
14
15
-------------------------------------------------------Lock
-
-
-
-
-
-
-
- YES
-
-
-
-
-
-
-
Private -
-
-
-
-
-
-
- -
-
-
-
-
-
-
-
6
7 8
9
10
11 12
13
14
15
core1:admin> portcfgFAport
Ports:
0
1
2
3
4
5
---------------------------------------------------------
-
-
-
-
-
-
- -
-
-
-
-
-
-
-
If the port is not online or initializes as a G_Port, attempt to reinitialize the
port by executing the commands portDisable and portEnable, supplying the
port number in question as an argument to these commands. If this process works,
monitor the situation carefully. If the host port consistently does not come online
or comes up as a G_Port repeatedly, you might have a marginal link issue, a faulty
HBA, HBA driver, or some type of configuration conflict between the host and
the switch. At this point, you need to follow the process of troubleshooting a
331
140_SANs_08
332
8/14/01
5:26 PM
Page 332
Chapter 8 • SAN Troubleshooting
marginal link. If the link is not marginal, contact your switch supplier and HBA
supplier to assist with further troubleshooting.
Follow a similar process for the storage port. If the storage port is not online
or is a G_Port, this is analogous to a disconnected cable at the storage end.
Attempt to reinitialize the port by issuing a portDisable/portEnable. Next,
rule out a marginal link, faulty storage equipment, and configuration conflict
between the storage and the switch. If you are still unable to establish the root
cause, work with your switch supplier and your storage supplier to assist with further troubleshooting.
Do the Devices Show Up in the Name Server?
At this point, you have verified that the host and storage are logically connected
to the virtual SAN cable, and it is now necessary to confirm that the two edge
ports are able to communicate. Use nsShow on the switch to which the storage
is connected and the switch to which the host is connected to verify that these
edge devices are registered with the Name Server. If you intend to verify that an
Emulex HBA located on switch core1 port 8 is registered with the Name Server,
the data in Figure 8.27 would confirm this.
Figure 8.27 nsShow Example—Verifying that an Emulex HBA Is Registered
with the Name Server
core1:admin> nsShow
The Local Name Server has 2 entries {
Type Pid
N
COS
PortName
NodeName
TTL(sec)
011800;
2,3;10:00:00:00:c9:21:5f:a7;20:00:00:00:c9:21:5f:a7; na
NodeSymb: [35] "Emulex LP8000 FV3.02
DV5-4.52A7 "
Fabric Port Name: 20:08:00:60:69:10:8d:fd
N
011a00;
2,3;20:00:00:e0:69:f0:07:c6;10:00:00:e0:69:f0:07:c6; na
Fabric Port Name: 20:0a:00:60:69:10:8d:fd
}
If the devices in question are registered with the Name Server, it is possible
that you are experiencing a zoning mismatch or a host/storage issue. If one or
both devices are not registered with the Name Server, it is possible that there is a
140_SANs_08
8/14/01
5:26 PM
Page 333
SAN Troubleshooting • Chapter 8
timeout or communication issue between the edge device(s) and the Name
Server. Check with the edge device documentation to determine if there is a
timeout setting or parameter that may help. If this does not work, contact the
support organization for the product that appears to be timing out.
Rule Out Zoning Issues
It is easy to rule out a zoning mismatch if zoning is not enabled. Check to see if
zoning is enabled by issuing the cfgShow command. If the output states that no
configuration is in effect, zoning is not enabled. If zoning is enabled, it is possible
that the two edge devices are unable to communicate with each other due to
zoning conflicts.To confirm whether this is the case, review the active zoning
configuration.You can do this by again issuing the command cfgShow, as shown
in Figure 8.28. In this example, host1 can access disk1, and host2 can access
disk2, but host1 cannot access host2 or disk2, and host2 cannot access host1 or
disk1. Confirm that the specific edge devices that need to communicate with
each other are in the same zone. If they are not, and zoning is active, you need to
update your zoning configuration before the edge devices in question are able to
communicate with each other. For example, if host1 needs to get access to disk2,
it is necessary to update the zoning configuration to enable this access. Once the
zone changes are made via the command line or WEB TOOLS-based GUI, the
devices should be able to access one another; however, some operating systems
might require that you run a disk utility such as format or disk administrator.
It is also possible that some operating systems might require a reboot to allow
discovery of the new devices.
Figure 8.28 Zoning Example
core1:admin> cfgshow
Defined configuration:
cfg:
colors
red; yellow
zone:
red
host1; disk1
zone:
yellow
host2; disk2
alias: disk1
0,0
alias: disk2
0,1
alias: host1
1,14
alias: host2
1,15
Effective configuration:
Continued
333
140_SANs_08
334
8/14/01
5:26 PM
Page 334
Chapter 8 • SAN Troubleshooting
Figure 8.28 Continued
cfg:
colors
zone:
red
1,14
0,0
zone:
yellow
1,15
0,1
NOTE
If zoning is active, any devices that are not explicitly defined in a zone
together are not able to communicate with each other.
At this point, if you establish that there is no switch zoning mismatch, then
you have established that the SAN virtual cable is working and that it is likely a
host or storage issue. One possible host or storage issue that could be causing the
“missing” devices is a mismatch with the HBA or storage-based zoning; be sure
to check this first when troubleshooting the edge devices.
NOTE
Incorrect or incomplete zoning is one of the most common causes of
SAN communication problems. Checking for this is analogous to
checking to see if a “malfunctioning” computer monitor is plugged in.
Edge Device Not in the Name Server
Reaching this point implies that you have verified that the edge devices in question are connected to the switch, and that one or more of the edge devices are
not registered in the Name Server. Attempt to reinitialize the edge device(s) with
the Name Server by executing the commands portDisable and portEnable,
supplying the port number(s) in question as an argument to these commands. If,
after you do this, the devices successfully register with the Name Server, you have
resolved the problem. However, pay attention to this issue because if the problem
recurs, it indicates a complex problem that is best resolved by working with your
140_SANs_08
8/14/01
5:26 PM
Page 335
SAN Troubleshooting • Chapter 8
switch and edge device suppliers.You should also seek this type of assistance if
after issuing a portDisable/portEnable, the devices do not register with the
Name Server.This fact indicates a complex issue such as a communication conflict or timeout condition. Although edge devices should reconnect to the fabric
and register when the port is disabled, some older devices might time out and no
longer retry logging in. If this happens, you might need to reboot the device to
get that device to reset and log into the fabric and Name Server.
Troubleshooting Marginal Links
A marginal switch port is defined as a switch port that is either receiving a marginal
incoming signal, or the switch receiver is not functioning properly. A marginal
Nx_Port transmit can be caused by an Nx_Port failing optical component (GBIC
or GLM) or a cable issue. A failing Fx_Port receiver can be caused by a failing
switch optical component or a failing switch port, as depicted in Figure 8.29.
Figure 8.29 Marginal Port Elements
Potential Faults
A marginal Fx_Port (switch port)
is termed a marginal Fx_Port receive
A marginal cable or Nx_Port GBIC is
termed a marginal Nx_Port transmit
A point-to-point N_Port or loop
(NL_Port connection)
Corresponding switch is either
F_Port or FL_Port.
Marginal Point-to-Point/Fabric Device Links
The impact of a marginal port can be significant. For example, a large storage
device such as an HP XP512, an IBM Enterprise Storage Server, or EMC
Symmetrix port might be accessed by potentially dozens of hosts.The marginal
335
140_SANs_08
336
8/14/01
5:26 PM
Page 336
Chapter 8 • SAN Troubleshooting
behavior of this storage device has the potential to impact all devices that access
this storage port. Imagine that you are a part of a geographically distributed team
of six workers.The primary communication for this team is via telephone.
Assume that your telephone is functioning marginally (similar to a poor cellular
connection). Anyone who wants to call you will not be able to communicate
effectively with you. Conversely, anyone who you call will also be unable to
communicate effectively with you. If you are a team leader for this group, the
impact of your marginal telephone capabilities is significant, since many people
utilize you as a resource. Note that the others in the group are free to communicate with each other without experiencing any impact from your telephone
problems.The story can have a happy ending if you gain access to two telephones, and realizing the marginal nature of one telephone line, switch to the
working telephone. Note that many SANs are constructed in a similar fashion to
Figure 8.30, with dual paths between hosts and storage, and a single failure does
not result in an I/O failure. In applications where availability is key, dual- or even
triple-redundant fabrics are always recommended.
Figure 8.30 Dual-Fabric SAN Design
Server
Server
Server
Server
Fabric A
Fabric B
Tape drive
Tape drive
Data
Data
Data
Data
140_SANs_08
8/14/01
5:26 PM
Page 337
SAN Troubleshooting • Chapter 8
Marginal Loop Connections
While a marginal point-to-point link affects only devices that access the pointto-point device, the ramifications of a malfunctioning loop-connected device can
impact all devices in that loop. Extending the geographically distributed team
analogy further, imagine that the only way the team communicates is via teleconference.Whenever the team needs to communicate, everyone dials in to a conference call. Unfortunately, the teleconference is disrupted by your marginal
telephone link.What makes things even worse is that communication between
any other team members is impossible or very difficult. For example, it is very
difficult for one member to speak with another on the teleconference because
your marginal telephone continually creates static on the teleconference.
Brocade QuickLoop and Fabric Assist are unique Fibre Channel topologies
that combine aspects of arbitrated loop and fabric topologies.They are composed
of multiple private arbitrated loops (looplets) interconnected by a fabric. It can be
best described as a Private Loop Fabric Attach, as compared to Private Loop
Direct Attached (PLDA) or Fabric Loop Attachment (FLA).The FL_Port of each
looplet is hidden from the NL_Ports. QuickLoop is a logical PLDA that complies
with the FC-AL standard. Although NL_Port devices are attached to different
arbitrated loops interconnected by a fabric, the fabric and the physical device
locations are transparent. QuickLoop enables switches to be used in place of hubs
in environments where all attached devices are private devices. Fabric Assist mode
allows the configuration of a virtual private loop in which a private host can see
and access public or private targets anywhere on the fabric. Such a private loop is
called QuickLoop Fabric Assist mode zone. Fabric Assist mode enables private
hosts to access public or private targets anywhere on the fabric, provided they are
configured in the same Fabric Assist zone. A public target accessed by a private
host remains public, with full fabric functionality.
The nature of loops is such that the behavior of an unhealthy device on the
loop can adversely impact the behavior of the remaining devices on the loop. For
example, a marginal GBIC could degrade the signal to the point where the connected NL_Port (host or storage) device is no longer able to effectively communicate.This in turn causes the loop to reset.When a loop resets, so do the
individual hosts or storage devices connected to that loop. Under normal circumstances, a loop reset does not cause any harm. However, if a device is constantly
resetting, I/O flow can become severely restricted or halted.
Loop Initialization Primitives (LIPs) are part of a healthy loop and are used
for a variety of purposes—most commonly to signal other devices on the loop
337
140_SANs_08
338
8/14/01
5:26 PM
Page 338
Chapter 8 • SAN Troubleshooting
that a new device has been added, or that an existing device has left the loop.
When a loop or NL_Port resets, LIPs are generated. However, an excessive
number of LIPs will make a loop unstable.
The Fibre Channel standards community is making great strides in further
enhancing the functionality of loops. However, loops are starting to become a
legacy issue. It is important to note that Fibre Channel and SilkWorm switches
also support point-to-point topologies, which are not subject to the same disruptive behaviors that loops are.When a public device accesses a private device
(known as translative mode), the LIP is not propagated to that public device, nor
is that public device subject to disruption.
Nx_Port (Host/Storage) Behavior
with a Marginal Port in the Loop
When a marginal device disrupts the loop, a variety of symptoms can be present.
Performance for devices connected to the QuickLoop or devices accessing a
common device can be described as slow. Host logs (that is, /var/adm/messages,
eventlog, or syslog) might indicate that I/O is timing out or that the interface is
being reset.The switch LEDs should be green or a blinking green light. Green
lights mixed with yellow lights or flashing yellow lights indicate that the ports are
resetting themselves. Devices on the affected loop might FLOGI and/or PLOGI
repeatedly onto the fabric as part of a reset process initiated by the HBA.This
would show up on the console or telnet management session for the switch to
which the affected device was attached. N_Port devices are less susceptible to disruption for reasons stated earlier.
Marginal GBIC/Cable
You can use the er_enc_out statistic to identify a marginal GBIC. Active devices
(such as disks) normally clean up an encoding error as these errors are encountered, and mark the frame as having bad CRC. Any er_enc_out errors are
encoding errors outside a frame, and do not generate a CRC error. If a high
count (for example, several thousand) or incrementing counts of er_enc_out
errors are experienced on a particular port, this indicates that the signal is
marginal between the connected device’s transmit port and the switch’s receive
port. Because this situation is being recorded as encoding errors, the implication
is that there is no active device cleaning up the errors between the switch receive
and the connected device transmit.The diagnosis: marginal GBIC or cable on the
connected device.
140_SANs_08
8/14/01
5:26 PM
Page 339
SAN Troubleshooting • Chapter 8
Connected Device
Note that LIPs are normal in a healthy loop. An imbalance where the Lip_in
count is larger than the Lip_out count indicates that the associated connected
device is the originator of LIPs in the loop. A device that generates a large
number of LIPs might be malfunctioning.The switch will propagate LIPs in
accordance with the Fibre Channel specification. Propagated LIPs are recorded as
Lip_out.
Fault Isolation
Once a marginal port is identified, it is necessary to identify where the fault
resides. Figure 8.31 depicts a suggested fault isolation process. Fault isolation on a
loop is very difficult, which is one of the reasons why loops had limited success.
How the Switch Can Help: Fabric
Watch and QuickLoop Zoning
By virtue of being positioned between storage and host, the switch is a natural
resource for gathering statistics and troubleshooting. As shown earlier, the switch
can help mitigate the issues that arise when a marginal device disrupts a loop or
other N_Port devices.
Brocade Fabric Watch allows each switch to continuously monitor fabric elements for irregular conditions. Fabric Watch can assist in rapidly identifying and
escalating potential problems.This proactive management improves the overall
availability of the SAN. Specific to troubleshooting marginal links, Fabric Watch
can detect such failing port symptoms as excessive CRC errors and proactively
send an SNMP alert. It is also possible to telnet into the switch and quickly analyze statistics to identify the marginal port.
To minimize the impact of a marginal device in a loop, you can utilize
QuickLoop zoning or Fabric Assist to compartmentalize various host/storage
pairs. QuickLoop zoning or Fabric Assist prevents LIPs from propagating between
QuickLoop zones. In some respects, QuickLoop zoning turns one loop into multiple virtual loops. In Figure 8.32, a LIP generated by Host A in zone qlZone1
due to a marginal port does not propagate to qlZone2 or qlZone3.Without
QuickLoop zoning, a marginal port has the potential to limit or halt I/O for all
devices connected to the switch!
339
140_SANs_08
340
8/14/01
5:26 PM
Page 340
Chapter 8 • SAN Troubleshooting
Figure 8.31 Marginal Link Fault Isolation
START
Move suspected
marginal port cable
to another port on
the switch
Do the errors
stop or
symptoms go
away ?
No
Cable or
Nx_port
issue
Yes
Try a new cable
Switch port
or switch
GBIC is
marginal
Do the errors
stop or
symptoms go
away ?
Replace GBIC on
marginal port
Yes
BAD Cable
Run portLoopBack
test on marginal port
Does
portLoopback
test fail
Yes
BAD port
Replace
mother board
No
Replaced
GBIC is BAD
No
Follow Nx_port (i.e.
HBA, storage
interface)
troubleshooting
procedures
140_SANs_08
8/14/01
5:26 PM
Page 341
SAN Troubleshooting • Chapter 8
Figure 8.32 QuickLoop Zoning Example
qlZone2
qlZone1
c0
c1
Host A
b8
Host C
ba
e0
0,0
0,1
0,2
0,3
0,4
0,5
e1
e2
e3
Host B
b9
qlZone3
Overview of SilkWorm Port Error Statistics
Additional SilkWorm port statistics can be obtained by executing the following
telnet commands:
■
portShow <port #>
■
portStatsShow <port #>
Use portStatsShow for error statistics (such as CRC, encoding, bad End of
Frame [EOF], etc.), and use portShow for link-level and LIP statistics (such as
link failure, loss of sync, loss of signal, etc.).The portShow command offers similar statistics to portStatsShow. However, the statistics gathered by portShow
341
140_SANs_08
342
8/14/01
5:26 PM
Page 342
Chapter 8 • SAN Troubleshooting
are updated in software whenever a port interrupt is received, while the statistics
for portStatsShow are updated in hardware registers as they occur.The significance in this difference is that many errors, such as CRC errors, could occur
between interrupts.The hardware counters (portStatsShow) will capture these
between interrupt errors, while the software counters (portShow) might not.
Another difference between the two commands is that portShow provides LIP
statistics and link statistics (link failure, loss of signal, loss of sync), while
portStatsShow does not. A partial listing of relevant portShow statistics follows:
■
Lip_in Number of LIPs transmitted from the connected device to the
switch port. Does not apply to F_Port.
■
Lip_out Number of LIPs transmitted from the switch port to the
connected device. Does not apply to F_Port.
■
Lip_rx Type of LIP (F7, F8) last received by the switch from the
connected device. Does not apply to F_Port.
Troubleshooting I/O Pauses
I/O pauses happen, and both the SAN and edge device can and should tolerate
such events.The term I/O pause is somewhat generic. An I/O pause can be as
harsh as the powering down of a host or storage device while I/O is in transit,
which will cause I/O to cease. Alternatively, it can be as lightweight as a portlevel RSCN, which might be a problem for only the most latency-sensitive of
applications. Most HBAs currently pause I/O during RSCN processing; however,
updated drivers are expected to minimize this effect. Relative to the SAN, fabric
events can also cause a pause in I/O. A fabric event can be broken down into a
change, such as a switch reboot, and the resultant activity to respond to that
change. In the case of a switch reboot, not only are the devices connected to that
switch affected, but also devices connected to the fabric—even if the fabric is
resilient.This is because the fabric needs to reroute, which takes less than a
second, and because all devices connected to the SAN that have registered for
state change notification must process a global RSCN. Edge devices such as
HBAs and storage devices should be tolerant of such pauses in I/O. It is possible
to adjust the settings for these devices to accommodate longer or shorter delays
in I/O when a SAN event occurs. RSCNs are normal and key to SAN operation.
Several applications are very sensitive to latency and/or RSCNs, such as
video-on-demand and applications that are evolving into the SAN model, such as
140_SANs_08
8/14/01
5:26 PM
Page 343
SAN Troubleshooting • Chapter 8
tape backup. High latencies and large numbers of RSCNs can adversely affect
these applications. Storage vendors, switch vendors, application vendors, and HBA
vendors are working with the standards bodies (T11) as well as modifying their
product implementations to handle these types of exceptions.Table 8.6 lists
common events that cause fabric rerouting and/or fabric RSCNs.
Table 8.6 Fabric Events and Their Impact
Event
SwitchDisable Disabling a switch in the
fabric will require the fabric to reconfigure
and a new set of data path routes to be
established for the resulting downsized
fabric.
SwitchEnable The corresponding mode
to the disable. A new switch added to
the fabric will result in new route
calculations to allow for the added ports.
E_Port connection/disconnection Adding
or removing an ISL will cause a fabric RSCN.
A zone update, which occurs when you
execute a cfgEnable or cfgDisable
command.
Adding/removing a switch to/from
the fabric.
Generate
Global RSCN?
Will Result in
Reroute?
Yes
Yes
Yes
Sometimes
Yes
Sometimes
Yes
No
Yes
Sometimes
Troubleshooting fabric events and their adverse impact on applications and
the SAN is a complex process. If you suspect that a fabric event is adversely
affecting your SAN, work with your switch supplier for resolution.
343
140_SANs_08
344
8/14/01
5:26 PM
Page 344
Chapter 8 • SAN Troubleshooting
Summary
It can be helpful to think of the SAN as a virtual cable when it comes to troubleshooting, approaching the problem by breaking components down to a host,
the SAN virtual cable, and the storage.To the operating system, the SAN provides
a link to a disk, just as a traditional SCSI connection would.Troubleshooting a
SAN is more challenging, but still has many things in common with the traditional storage troubleshooting process. Switches are logically positioned in the
middle of the network between hosts and storage, and have visibility to both
storage and hosts.This visibility into both sides of the storage network enables
you to use switches to determine the cause of any malfunction in the SAN.
SAN troubleshooting should begin in the center of the SAN and proceed
outward. Once you know where to start troubleshooting, the next question is how
to proceed. Start the troubleshooting process by gathering a preliminary set of
data, and then analyze this data to identify where the problem resides: the host,
the fabric, or the storage. Next, gather additional data from the appropriate area
and focus in on the cause of the problem. A plethora of data is available from the
switches, hosts, and storage.
Many tools are available to the SAN troubleshooter. Several of these tools are
switch commands. Other tools involve viewing the switch LEDs, host information, Fibre Channel analyzers, and diagnostics available on many storage arrays. It
is rarely possible to use a single tool to successfully troubleshoot a problem. It is
more common is to use several tools in concert.
A fabric problem is a pervasive issue that can often affect more than one
device.When a fabric issue is experienced in a resilient SAN, it might have no
impact on SAN functionality, because the SAN redundancy compensates for the
marginal situation. However these “soft” errors can cause degradation in the performance of the enterprise application and thus require immediate attention.
Fabric issues are normally associated with large fabrics, which are defined as fabrics consisting of 10 or more switches and 100 or more edge devices.
A host that is unable to access a SAN device is a more common issue.This
type of issue is classified as a missing device. Use of the commands switchShow
and nsShow can quickly reveal the cause of the missing device. Missing device
issues are normally limited to a few devices. If more devices are involved, it is
likely a fabric issue.
The impact of a marginal port can be significant. For example, a large storage
device might be accessed by potentially dozens of hosts.The marginal behavior of
this storage device then has the potential to impact all devices that access this
140_SANs_08
8/14/01
5:26 PM
Page 345
SAN Troubleshooting • Chapter 8
storage port. A marginal link involves the connection between the switch and the
edge device. Isolating the exact cause of a marginal link involves analyzing and
testing many of the components that make up the link: switch port, switch
GBIC, cable, edge device GBIC, and the edge device.
I/O pauses do happen, and both the SAN and edge device can and should
tolerate such events.The term I/O pause is somewhat generic. An I/O pause can
be as severe as the powering down of a host or storage device while I/O is in
transit, which will cause I/O to cease. Alternatively, it can be as lightweight as a
port-level RSCN, which might be a problem for only the most latency-sensitive
applications. Relative to the SAN, fabric events can also cause a pause in I/O.
Calibrating your edge devices to handle I/O pauses and troubleshooting I/O
pauses is a complex process.
Solutions Fast Track
The Troubleshooting Approach:
The SAN Is a Virtual Cable
; Use the SAN’s visibility to both storage and hosts to start your trouble-
shooting process.
; The switchShow, nsShow, nsAllShow, errShow, and topologyShow
commands are extremely informational and invaluable to the troubleshooting process.
; The UNIX format command or HBA vendor-supplied utilities are also
helpful in troubleshooting.
; When you start the troubleshooting process, determine if the issue is
fabric related or device related. A fabric-related issue impacts many
devices, and a device issue impacts only a few devices.
Troubleshooting the Fabric
; A fabric issue impacts many devices. A logical switch outage, such as
segmentation or physical switch outage, can cause many devices to drop
out of the fabric. Problems with ISL initialization are also considered
fabric issues.
345
140_SANs_08
346
8/14/01
5:26 PM
Page 346
Chapter 8 • SAN Troubleshooting
; The quickest way to narrow your search of a fabric problem is to com-
pare your baseline SAN profile to your current SAN profile and investigate discrepancies.
; A SAN profile includes the number of devices per switch (nsShow),
number of devices in the fabric (nsAllShow), and number of switches
in the fabric (topologyShow).The errShow and switchShow commands are also helpful in tracking down fabric issues.
; Some fabric issues are caused by a mismatch in fabric service timeout
variables and the edge device timeout settings. Careful analysis of both
the fabric and the edge devices is necessary to resolve this complex issue.
Troubleshooting Devices that Cannot Be Seen
; The first thing to check is that the missing device is logically connected
to the SAN as indicated by switchShow output.
; Next, check to see that the device is present in the Name Server, using
the command nsShow. If the device is not in the Name Server, it is
invisible to the other devices in the fabric.
; Other common causes of missing devices are zone conflicts or
marginal links.
Troubleshooting Marginal Links
; Use portErrShow to establish if there are a relatively high number of
errors, such as CRC errors. Look for a steadily increasing number of
errors to confirm a marginal link.
; A marginal link can impact multiple devices. For example, a shared
storage device with a marginal link can cause communication problems
with all devices that access that shared storage.
; A marginal link can be caused by any of the components that make up
the link: switch port, switch GBIC, cable, edge device GBIC, and the
edge device.
140_SANs_08
8/14/01
5:26 PM
Page 347
SAN Troubleshooting • Chapter 8
Troubleshooting I/O Pauses
; I/O pauses happen, and both the SAN and edge device can and should
tolerate such events.
; An I/O pause can be as harsh as the powering down of a host or storage
device while I/O is in transit, which will cause I/O to cease.
Alternatively, it might be as lightweight as a port-level RSCN, which
might be a problem for only the most latency-sensitive applications.
Relative to the SAN, fabric events can also cause a pause in I/O.
; Several applications, such as video-on-demand and applications that are
evolving into the SAN model, such as tape backup, are very sensitive to
latency and/or RSCNs. High latencies and large numbers of RSCNs
can adversely affect these applications.
; Storage vendors, switch vendors, application vendors, and HBA vendors
are working with the standards bodies (T11) as well as modifying their
product implementations to handle these types of exceptions.
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: When I activate a zone change (cfgEnable), I notice a pause in I/O and
several of my hosts log warnings.What causes this?
A: When you issue a zone change, an RSCN is delivered to any host in the
fabric that registers to receive an RSCN.The pause you notice is the initiator
responding to the RSCN, which involves the initiator querying the Name
Server and resolving any changes to the fabric.
Q: If I exhaust my troubleshooting options and cannot resolve an issue after
reading this chapter, what should my next step be?
347
140_SANs_08
348
8/14/01
5:26 PM
Page 348
Chapter 8 • SAN Troubleshooting
A: Contact your switch supplier and request support. Provide the information
outlined earlier in this chapter. Of special importance is the supportShow,
which is ideally captured while the problem is happening.
Q: How can I tell if my fabric is segmented?
A: Normally, a segmented fabric will generate an error message on the switch
that segments.You can view errors by issuing the command errShow.
Q: How come my device inconsistently connects to the switch as either an
N_Port or an NL_Port ?
A: It is likely that there is a bug in the port initialization of either the edge
device or the switch. A short-term solution is to configure a port for a specific topology. For example, configure a port as an FL_Port by using the
command portcfgLport. Longer term, you should resolve this behavior by
escalating the problem to your switch supplier and your edge device supplier.
Q: What is a quick way to reinitialize to clear a fault or re-enable a link?
A: The commands portDisable and portEnable will cause a port to reinitialize
and potentially clear a fault. Doing so will cause the edge device to register
with the Name Server.
140_SANs_09
8/14/01
3:40 PM
Page 349
Chapter 9
SAN Implementation,
Maintenance, and
Management
Solutions in this chapter:
■
Installation Considerations
■
Automating Switch
Administration Activities
■
Brocade Zoning Considerations
■
Validating Your Fabric
■
SAN Maintenance
Summary
Solutions Fast Track
Frequently Asked Questions
349
140_SANs_09
350
8/14/01
3:40 PM
Page 350
Chapter 9 • SAN Implementation, Maintenance, and Management
Introduction
Once you have completed your SAN design, you can then focus on implementation, management, and maintenance.To arrive at a design requires a significant
data-gathering effort during which you establish the requirements that shape
your SAN: application drivers, availability, scalability, manageability, and performance.With these decisions made, you can then create a SAN architecture that
meets your needs.The process of deploying a SAN is iterative as you build, test,
and refine your original design. Once you have a SAN design, the next step is to
implement, maintain, and manage the SAN. SAN implementation is the process
of taking your design from paper to physical setup. Implementation is an ongoing
activity that is very visible during the middle stages of your SAN’s lifecycle. In
transitioning to a management and maintenance mode, you will periodically
implement changes such as SAN expansion, fabric upgrades, and node movement. SAN management and maintenance activities are also reactive, such as
replacing a failed switch or Gigabit Interface Converter (GBIC) optical module.
This chapter is organized similarly to how you would set up and run a SAN.
First, we discuss topics that require thought prior to implementation, such as
zoning, cabling, and installation decisions.Then we present topics such as how to
validate your fabric prior to transitioning to production. Finally, once you have
your fabric up and running, we discuss topics like managing your SAN with
automation and maintenance topics such as adding devices to your fabric and
fabric upgrades.
This chapter provides unique tips and ideas as you deploy and manage your
SAN, with a focus on practical techniques and tools that require minimal dependencies.You can directly apply the processes in this chapter to your SAN today.
There are a wide variety of SAN management packages that offer varying degrees
of functionality for SAN management, maintenance, and implementation, such as
VERITAS (SANPoint Control), Computer Associates (Unicenter TNG), BMC
(PATROL), Sun (HighGround), Hewlett-Packard (OpenView Storage Area
Manager), SANavigator (SANavigator), Prisa (Visual SAN), Micromuse (Netcool),
and IBM Tivoli Storage Network Manager (TSNM).
There are many choices for developing your own SAN management and maintenance infrastructure using a mix of commercial packages and your own scripts
and processes.Throughout this chapter, we provide examples of how to automate
certain SAN management activities.The scripts discussed are freely downloadable
and available for your use from the book’s Web site, found at www.syngress.com/
solutions.The level at which you use these scripts and the process outlined in this
140_SANs_09
8/14/01
3:40 PM
Page 351
SAN Implementation, Maintenance, and Management • Chapter 9
chapter depends on your technical skill level, what SAN management software you
deploy, and your company’s information technology practices.
Installation Considerations
Several decisions and considerations regarding your SAN solution are necessary
prior to installation. Upfront planning and review result in significant time savings.This section identifies the areas of SAN installation that require planning,
and the upfront decisions that you need to make. For example, it might be difficult to install or maintain an Ethernet connection for a remote SAN. In-band
management via Internet Protocol over Fibre Channel (IPFC) is an option that
addresses this issue.When you install your SAN, you want to be sure that the
switches are running the appropriate level of Brocade Fabric OS, which is not
always the latest release! You need to know which version of Fabric OS to use
prior to installation. Other installation considerations include setting switch
parameters and verifying that you have the necessary licenses to operate your
SAN in accordance with the design requirements.
How to Cable Your SAN for Ease of Operation
Installation time is when you should plan your cabling and implement a cable
layout scheme that is manageable, flexible, and maintainable. An effective cable
management scheme should not only enable ease of maintenance, but also be
aesthetically pleasing.While aesthetics might not seem like a necessary design
principal, it turns out that cable plans that “look nice” also usually turn out to be
the ones that are easier to manage.
The Inter-Switch Link (ISL) cabling plans in this chapter are optimized to
facilitate clean cable management, and this should be easily achievable as long as
normal cable management practices are followed. In particular, the cables used for
ISLs should be carefully labeled and bundled so that they cannot be mistaken for
host or storage cables. A well-managed ISL cable layout is shown in Figure 9.1.
While the layout depicted in Figure 9.2 might look fairly clean, it does have
some potential problems. If a switch in the middle of this set were to fail, it
would be difficult to replace it without shutting down the other three.This is
because the ISLs from the top switches run in front of the lower switches.The
configuration in Figure 9.1 clearly does not have this problem. Another problem
with the configuration in Figure 9.2 is the fact that the switches are stacked on a
shelf rather than being rack-mounted. Even if the ISLs were cleaned up, it would
still be difficult to remove a switch from the bottom of the stack.
351
140_SANs_09
352
8/14/01
3:40 PM
Page 352
Chapter 9 • SAN Implementation, Maintenance, and Management
Figure 9.1 An ISL Cable Layout That Is Easy to Maintain
Figure 9.2 An ISL Cable Layout That Is Difficult to Maintain
Figure 9.3 shows much the same configuration of switches shown in Figure
9.2, with a recommended cable layout scheme that is easy to maintain.The
Figure 9.3 switches are also racked.
NOTE
Ensure that ISLs run in front of only the switches to which they are connected. This will allow the switches to be removed without downtime for
the fabric.
140_SANs_09
8/14/01
3:40 PM
Page 353
SAN Implementation, Maintenance, and Management • Chapter 9
Figure 9.3 The Switches From Figure 9.2 Are Recabled to Enable
Ease of Maintenance
It is not always possible to use cables that are “cut to length” for the ISLs. If
you are using cables that are excessively long, it is desirable to take up the slack in
some manner, as shown in Figure 9.4. Ideally, do this away from the switch to
avoid clutter at the switch itself. However, be sure that you do not exceed the
bend radius specification of the optical cable.
Figure 9.4 Take Up Slack to Avoid Clutter
Figure 9.5 depicts a high-performance, 32-port configuration that uses six
switches.The switches are mounted with cable management above, below, and on
both sides. Management can also be used in between switches, if needed. Only four
of those switches have any ports available for user wiring.The other two are used
exclusively for ISLs. It is desirable to have all available user ports in one
contiguous block to ease cabling of edge devices and simplify troubleshooting
and monitoring. Figure 9.5 shows the ports available to the user (edge ports).
353
140_SANs_09
354
8/14/01
3:40 PM
Page 354
Chapter 9 • SAN Implementation, Maintenance, and Management
Figure 9.5 Six Switches Racked for Edge Wiring and ISL Wiring
User wiring goes
this way
ISL wiring goes
this way
6' x 19" Rack
Clearly, the ISL wiring can be bundled to the top, bottom, and right side of the
rack and kept completely separate from the user wiring that would run to the left.
The ISLs within this group should all be formed using 1 meter cables, if
you are using 1.5 U switches (such as the SilkWorm 2250), or 2 meter cables, if
you are using 2 U switches (such as the SilkWorm 2800).The length of the ISLs to
the other groups of switches will vary greatly depending on rack configuration and
should therefore be measured beforehand. Note that these cable lengths apply only
to the group depicted. For different SANs, different cable lengths might be
required.
The ISLs used to interconnect the switches in these configurations are
assumed to be semipermanent. It is useful to have these semipermanent ISLs
colored differently from the host/storage/other ISL cables. Most multimode
Fibre Channel cables are orange. Using gray, black, blue, or some other different
color for the ISLs should help to differentiate between ISL cables and edge
device cables.
Racking Considerations
If you are employing a dual-fabric SAN architecture, it is important that the
duality be employed throughout the SAN implementation as shown in the lefthand configuration of Figure 9.6. Deploying two fabrics that are part of a SAN
solution within the same rack makes that rack a single point of failure.The odds
140_SANs_09
8/14/01
3:40 PM
Page 355
SAN Implementation, Maintenance, and Management • Chapter 9
of a rack falling over of its own accord are low. However, it is possible to picture
a contract cable management worker on a ladder falling off and hitting a rack, or
a leak spraying water into a rack.The concept of dual fabrics is to avoid a single
point of failure. For high-availability fabrics, ensure that you have separate power
circuits available, as shown in the right-hand configuration of Figure 9.6. For dual
power supply switches, use separate circuits for the left and right power supplies.
If you are using single power supply switches, it is still important to use separate
circuits if your SAN is configured for high availability.This means connecting half
of the switches in your fabric to one circuit in such a way that if these switches
are powered off, the other half, which are connected to a different circuit, can still
comprise a working fabric.
Figure 9.6 Racking and Powering for High Availability
Separate Circuits
Rack A,
Circuit A
Rack B,
Circuit B
Hosts
SAN A
SAN B
Storage
355
140_SANs_09
356
8/14/01
3:40 PM
Page 356
Chapter 9 • SAN Implementation, Maintenance, and Management
In-Band or Out-of-Band Management?
For some situations, it is not possible or practical to dedicate an Ethernet connection for each switch. It is possible to manage Brocade switches via direct Ethernet
connections or via IPFC.When using Ethernet connections, it is only necessary to
configure the switch IP information and attach an Ethernet cable to each switch.
When using IPFC, you can use a single Ethernet connection to bridge to the other
switches via IPFC.To configure IPFC, it is necessary to configure the switches and
in some cases also to configure the Host Bus Adapters (HBAs) to run IPFC.There
are advantages and disadvantages to an Ethernet bridge to IPFC approach, as presented in Table 9.1.The flexibility does exist to do in-band management with a
single Ethernet connection. If you can do out-of-band management with Ethernet
connections to each switch, you will need to allocate an IP address, Ethernet port,
and cable for each switch.There is also the option of doing full IPFC-based management, with an IPFC-capable HBA connected to a switch that can talk to all
other switches in the fabric via IPFC.
Table 9.1 Advantages and Disadvantages of Using IPFC to Manage Your SAN
Advantages
Disadvantages
Fewer or no Ethernet
connections
Single point of management failure—only one
Ethernet path or IPFC path to fabric. If a switch
goes down anywhere in the fabric or gets
“switch disabled,” all management capabilities
stop at that point—no Brocade WEB TOOLS and
no telnet support.
This is no different than management via the
Ethernet ports. In that case, you would have the
management station and all the switches connected to an Ethernet switch or hub. If the
Ethernet cable goes bad, you cannot manage
any switch: single point of failure.
Static IP addresses—no Dynamic Host
Configuration Protocol (DHCP) support.
No easy gateways exist for routing IPFC like you
have on Ethernet unless you piece one together
using routed devices on a UNIX box or use some
kind of routing software for Windows NT.
Fewer Ethernet hubs/power
Remote management
140_SANs_09
8/14/01
3:40 PM
Page 357
SAN Implementation, Maintenance, and Management • Chapter 9
IPFC In-Band Guidelines
Figure 9.7 depicts an IPFC in-band configuration.The management station,
where the browser runs, does not need to have a Fibre Channel interface, or be
IPFC-capable: it only needs an Ethernet connection. Only one of the switches in
the fabric needs an Ethernet connection, which must be in the same subnet as
the management station. However, this is not strictly necessary.You can configure
the default gateway on the management switch and also add an appropriate static
route on the management station and all routers between it and the management
switch. This is a bit complex, and probably not worthwhile in many cases. In
addition, with this configuration correctly implemented, it is also possible to
telnet into every switch in the fabric from the management station.
Figure 9.7 A Five-Switch IPFC In-Band Setup
Management Station
IP: 192.168.164.109
Subnet: 255.255.255.0
GW:192.168.164.1
Same Ethernet IP can be used on all in-band switches.
Note: Ethernet IP cannot be [0.0.0.0] or None.
SW1
SW2
SW3
SW4
SW5
Management Switch
IP: 192.168.164.28
FC_IP: 172.17.50.1
GW: 192.168.164.1
(Gateway Switch)
IP: 192.0.0.1
FC_IP: 172.17.50.2
GW: 172.17.50.1
IP: 192.0.0.1
FC_IP: 172.17.50.3
GW: 172.17.50.1
IP: 192.0.0.1
FC_IP: 172.17.50.4
GW: 172.17.50.1
IP: 192.0.0.1
FC_IP: 172.17.50.5
GW: 172.17.50.1
Gateway on in-band switches must point to the first switch's FC_IP
address. All switches' FC_IP address must be in the same subnet. A
Static Route must be entered from the management
station pointing to the FC_IP Subnet.
357
140_SANs_09
358
8/14/01
3:40 PM
Page 358
Chapter 9 • SAN Implementation, Maintenance, and Management
A guide for setting up a five-switch SAN for in-band management follows.
You can adapt this guide to fit your SAN environment. A summary of the configuration is listed in Table 9.2.This summary highlights the relationship between
the IPFC and the Ethernet IP addresses:
■
All switches must have their Fibre Channel IP addresses on the
same subnet.
■
The management station and switch Ethernet port must have their IP
addresses on the same subnet.
■
The management station must have a static route to the IPFC subnet, or
the default gateway pointing to the Fibre Channel IP address of the switch
connected to Ethernet. Either solution will work (for example, on a
Solaris machine, route add IPFC mask IPFC MASK IPADDR metric 1).
■
The in-band managed switches not connected to Ethernet must have their
default gateway set to the IPFC address of the switch that is connected to
Ethernet. In Figure 9.7, switches 2, 3, 4, and 5 have their default gateway
set to [172.17.50.1].These switches must have their Ethernet IP addresses
set to an address that is different from the Ethernet IP subnet specified on
Switch 1 (SW1).The Ethernet IP address cannot be [0.0.0.0] or None.
The Ethernet IP address can be the same as illustrated in Figure 9.7, as
long as the switches are not connected to the IP network.
■
The gateway address on Switch 1 (the gateway switch) should be set to
the default gateway on the network. However, this is not required if
Switch 1 and the management station are on the same subnet.
Setting Switch Parameters
Before the switches are cabled together, certain parameters should be set.These
include the IP information and the switch name, which should be the same as the
host name that maps to the switch’s IP address. Set the IP address and switch name
of each switch to an appropriate and unique ID.The gateway and subnet mask
might also need to be set. See your network administrator for appropriate values.
If possible, have a contiguous block of addresses reserved for all Brocade
switches. It might also be beneficial to keep the last octet of these addresses
below 239. One popular way to administer Fibre Channel domain IDs is to have
them match the last octet of the IP address. For example, switch 192.168.62.100
would get domain ID 100. Since the highest valid domain ID is 239, this scheme
works only if the last octet of the IP address is 239 or lower.
172.17.50.1
172.17.50.2
172.17.50.3
172.17.50.4
172.17.50.5
192.168.164.28
192.0.0.1
192.0.0.1
192.0.0.1
192.0.0.1
Management
Switch 1
Switch 2
Switch 3
Switch 4
Switch 5
255.255.255.0
192.168.164.109
IPFC Address
Management
Station
Subnet Mask
Ethernet IP Address
172.17.50.1
172.17.50.1
172.17.50.1
172.17.50.1
Static Route:
route add
172.17.50.0
mask
255.255.255.0
192.168.164.28
metric 1
Gateway Switch
3:40 PM
192.168.164.1
192.168.164.1
Default Gateway Notes
8/14/01
Node
Table 9.2 Five-Switch IPFC Configuration Detail
140_SANs_09
Page 359
359
140_SANs_09
360
8/14/01
3:40 PM
Page 360
Chapter 9 • SAN Implementation, Maintenance, and Management
Switch Naming Tips
Having a well thought-out switch-naming convention enables easy identification of physical switches if a problem arises. Use a switch-naming
convention that scales across the organization, keeping in mind that the
SAN might start small but can be extended enterprise-wide over time. If
you have to change a switch name, it is very easy to do—just execute the
command switchName. Changing a switch-naming convention is more
difficult, as you will most likely have to change all the switch names in
the SAN affected by the naming convention change. For example, if you
evolved your SAN from a four-switch mesh to an eight-switch core/edge
topology, you might want to rename your switches with either the term
core or edge embedded in the name to reflect the role of the switch.
Consider using the following items when making up the switch name
field:
■
Incorporate an ID for the site or building where the switch is
located.
■
Add a component to identify the floor or room where the
switch is located.
■
Use the switch topology function (such as core or edge).
■
Add a component that shows to which organization or
project the switch belongs.
■
Include the rack ID in the name to further detail switch
location.
■
Embed the switch type into the switch name (such as the
SilkWorm 2800, 2250, or 2400).
■
If redundant fabrics are being used, select an ID for
complementary fabrics.
Example: CORE1_A_B6_230_R5 = core Switch 1, fabric A, building
6, room 230, rack 5
Note that switch names can be up to 19 characters long, must
begin with a letter or digit, and must consist of letters, digits, and
underscore characters. Spaces are not allowed.
140_SANs_09
8/14/01
3:40 PM
Page 361
SAN Implementation, Maintenance, and Management • Chapter 9
To set these parameters, execute the following steps:
1. If the switch has a serial port, connect to it with a serial cable and log in
as the administrator. If the switch has a control panel instead of a serial
port, use the buttons on the panel according to the Brocade Fabric OS
documentation to set the IP address, netmask, and gateway, and then
telnet in and perform the rest of the configuration as documented here.
2. To set the switchName parameter, use the switchName command:
switch:admin> switchName "switch1"
Updating flash ...
switch1:admin>
3. Type ipAddrSet. A menu will appear. Answer the questions appropriately. Note that step 3 is not necessary if you enter the IP address via the
front panel:
switch1:admin> ipAddrSet
Ethernet IP Address []: 192.168.163.110
Ethernet Subnetmask [255.255.255.0]: 255.255.255.0
Fibre Channel IP Address [none]:
Fibre Channel Subnetmask [none]:
Gateway Address []: 192.168.163.1
switch1:admin>
4. Connect the switch to the Ethernet and ping the address to verify that it
has been set correctly.
What Fabric OS Version Should I Use?
Deciding which version of Fabric OS to use can be a challenging process, especially if your SAN consists of multiple vendor edge devices or switches.The most
recent version of Fabric OS might not always be the best version to use. In some
cases, you might experience conflicting Fabric OS requirements, with multiple
vendors each specifying a different version of Fabric OS. Many switch suppliers
extensively test their SAN products with Brocade switches in varying configurations.To support their products and Brocade switches, they require that you run a
specific version of Fabric OS. One suggestion is to work with your switch supplier
and your SAN vendors to identify if there is an intersection of supported Fabric
361
140_SANs_09
362
8/14/01
3:40 PM
Page 362
Chapter 9 • SAN Implementation, Maintenance, and Management
OS versions. For example, your switch supplier might support Fabric OS versions
v2.2.2 and v2.4.1. If your storage vendor and HBA vendor support v2.2.2, your
choice would be to install Fabric OS v2.2.2. In some cases, there might not exist
an intersection of support requirements, at which point you might want to use the
version of Fabric OS recommended by your switch supplier or negotiate a support agreement with your SAN vendors. Another determining factor for running
a version of Fabric OS is availability of features or support.
There currently exist two Fabric OS trees, the v1.x tree for the SilkWorm
1000 series of switches and the v2.x tree for the SilkWorm 2000 series of
switches.The naming convention for v2.x Fabric OS is formatted as
dM.m.fp_t...t, with each variable replaced by the information specified in Table
9.3.The Fabric OS version used for examples in this book is v2.4.1.c.The software major version is 2, the minor version is 4, the maintenance version is 1, and
the patch version is c. Many features and enhancements have been added since
Fabric OS v2.0.Table 9.4 lists summaries of these feature and enhancement additions.The information in Table 9.4 can help you determine which Fabric OS is
right for you, should you have the option to choose.The key is to establish
which version of Fabric OS to run as part of the installation process.
Table 9.3 How to Decode a Fabric OS Version
Variable
Meaning
d
Deployment
indicator
M
m
f
Format
Definition
Lowercase Indicates the deployment target for the
letter
release. Does not indicate any functional changes. Normally the letter “v”.
Software
Number
Indicates a release that incorporates
major
significant functional changes to the
version
software, as compared to releases with
a lower software major version.
Generally follows architectural changes
in the core operating system or
hardware.
Software
Number
Indicates a release that incorporates
minor
added functionality within a major
version
software version.
Software
Number
Indicates a maintenance release for a
maintenance
minor software version. Usually indiversion
cates a release of bug fixes only.
(Brocade attempts to prevent functional
changes from occurring in software
Continued
140_SANs_09
8/14/01
3:40 PM
Page 363
SAN Implementation, Maintenance, and Management • Chapter 9
Table 9.3 Continued
Variable
Meaning
p
Software
patch
version
t…t
Special type
(nonproduction
release)
Format
Definition
maintenance versions.) Any functional
changes that do occur are clearly
documented.
Letter
Indicates a release that incorporates a
patch within a minor software maintenance version; otherwise, functionally
identical to the maintenance version.
Each patch incorporates all preceding
patches for the same maintenance
version; for example, v2.0.2c would
incorporate the patches implemented
in both v2.0.2a and v2.0.2b.
Letter(s),
Indicates a special nonproduction build
possibly
(“N” in the following definitions is the
followed
iteration of the build):
by number ■ An Alpha release is “..._alphaN”
(abbreviated to “aN” in bug lists).
For example, “_alpha3 or “a3”.
■ A Beta release is “..._betaN” (abbreviated to “bN” in bug lists). For
example, “_beta1” or “b1”.
■ A release candidate is “..._rcN”
(abbreviated to “rcN” in bug lists).
For example, “_rc2” or “rc2”.
Table 9.4 v2.x Fabric OS History with Feature and Enhancement Additions
Fabric OS
Version
New Feature
v2.2.0
■
■
■
■
Fabric Watch
Extended Fabrics
FA Management
Information Base (MIB)
FC-GS3 Management
Server
Enhancements
■
■
■
■
Fabric Shortest Path First
(FSPF) routing failover
enhancements
WEB TOOLS enhancements
— Switch Status
— Dramatically better Fabric
View
Switch Beaconing
Serial ID Gigabit Interface
Converters (GBICs)
Continued
363
140_SANs_09
364
8/14/01
3:40 PM
Page 364
Chapter 9 • SAN Implementation, Maintenance, and Management
Table 9.4 Continued
Fabric OS
Version
New Feature
v2.2.1
Enhancements
E_Port Enable / Disable
Simple Network Management
Protocol (SNMP) Access Control
Lists (ACLs)
■ Extended Fabrics
— Configurable on any switch
— License on portal switches
■
■
v2.2.2
■
■
■
■
v2.3.0
■
SilkWorm 6400 Support
—Group Definition Phase I
—ISL Topology Check /
Monitor
—SNMP Group Support
—New GETS for groups
Fabric Watch trap
enhancements
—Traps now include
thresholds
SCSI Enclosure Services
(SES) enhancements
—Config File Upload/
Download via SES
—Fabric OS Image Upload/
Download via SES
—SupportShow via SES
Fabric Access API v1.0
Switch Side
QuickLoop Fabric Assist
Mode
—Reengineering of
QuickLoop from Hub
Emulation to Virtual
Loops
—Industry-leading LIP
isolation
—Loop Hosts talk to
Fabric Targets
■
■
■
■
■
Fabric Watch
—Alarm Enable/Disable
—Threshold Reset to Defaults
WEB TOOLS
—Faster to load and run
—Support for QLFA zoning
—Support for ED5000 IOP
Mode
—Many enhancements
Management Server
—FC-GS-3 Platform Support
FA MIB v2.2
FC-GS3 Name Server
—More GET calls
—More Register calls
Continued
140_SANs_09
8/14/01
3:40 PM
Page 365
SAN Implementation, Maintenance, and Management • Chapter 9
Table 9.4 Continued
Fabric OS
Version
New Feature
■
v2.4.0
■
McData ED5000 switch
interoperability with these
constraints
—Limited to 31 switches
in fabric
—WWN-based zoning, no
hardware-enforced
zoning
—SilkWorm 1000 not
supported in ED5000
mode
—Zoning management
limitations
—All switches in fabric
must run 2.3+ED5000
mode
—No QuickLoop Fabric
Assist Mode in ED5000
mode gets
—No Alias Server or
Management Server
New features for
SilkWorm 6400
—Fabric Manager 1.0
—Group Definition
Phase 2
—Based on Management
Server technology
—Permits group management operations to be
done on one switch
rather than all
—Group SupportShow
Enhancements
365
140_SANs_09
366
8/14/01
3:40 PM
Page 366
Chapter 9 • SAN Implementation, Maintenance, and Management
Licenses
All switches must have fabric capability if you want to interconnect these
switches. Brocade WEB TOOLS, Fabric Watch, and Zoning licenses are also
desirable, but not required to build a fabric.You will need a QuickLoop license if
you intend to integrate private hosts into your fabric.The SilkWorm 20x0 and
the 22x0 switches offer varied fabric capabilities:
■
2010/2210 = No fabric license; loop switches
■
2040/2240 = Entry-fabric license; minimal switch-to-switch
connectivity (single ISL support)
■
2050/2250 = Full-fabric license; unlimited switch-to-switch
connectivity (multiple ISL support)
You should check each switch to verify that you have the licenses necessary to
build your SAN solution.The command licenseShow is used to determine what
licenses are installed on your switch, as shown in Figure 9.8. Note that a single key
can enable multiple features. If this is the case, you will not have a one-to-one
mapping of features and a license key. If you do not have the appropriate licenses,
you will need to contact your switch supplier to acquire the necessary licenses.
When acquiring a new license, it is necessary to supply the switch World-Wide
Name (WWN), which is available from the output of the switchShow command, and the switch serial number, which is available from the switch chassis.
Figure 9.8 Use licenseShow to Determine What Licenses Are Installed on
Your Switch
core1:admin> licenseShow
SRzy9Sz9zeTS0zAG:
Web license
bbSz9eQb9zccT0AQ:
Zoning license
RdzdSRcSyzSe0eTn:
QuickLoop license
cSczRScd9RdTd0SY:
Fabric license
140_SANs_09
8/14/01
3:40 PM
Page 367
SAN Implementation, Maintenance, and Management • Chapter 9
Automating Switch
Administration Activities
If you have to perform SAN administration activity more than once, consider
writing a script.You can use the Tcl/Tk-based Expect scripting language to
interface with the switch. In the future, you will also have the option to use the
Fabric OS Application Programming Interfaces (APIs) for automating switch
management functions. At the writing of this book, the Fabric OS APIs are available only to Brocade partners, with plans to make these APIs available to all
switch users in the future.We discuss the following topics in this section:
■
Fabric OS APIs
■
Expect scripting
Using Expect to interface with the switch is not as powerful or effective a solution as using the APIs. However, if you need to implement a solution now,
Expect is a good choice. Because of the power that the APIs deliver, further discussion is warranted to assist you with your planning. As we discuss the subjects
in this chapter, we provide examples of how to automate related functions by
using Expect.You can freely download and use the scripts mentioned in this
book by accessing the book’s Web site (www.syngress.com/solutions). Several
examples of the types of switch management functions you might want to automate follow later in this chapter:
■
Download new firmware to all of the switches in your fabric.
■
Reboot all of your switches at once or in a sequence.
■
Automate zone changes.
■
Facilitate troubleshooting.
Fabric OS APIs
The Fabric OS API is a programming interface to allow applications to access
fabric information and to perform control operations. Access to the switch functions is based on IP access either through out-of-band Ethernet or via IPFC from
a suitable HBA. Host-resident libraries and header files are required. Support for
Solaris,Windows 2000, and Hewlett-Packard HP-UX currently exists.
Application programs are compiled and linked to the library interfaces.The
library uses Remote Procedure Call (RPC) over a TCP/IP connection to a
367
140_SANs_09
368
8/14/01
3:40 PM
Page 368
Chapter 9 • SAN Implementation, Maintenance, and Management
switch to get information and perform control operations. A Perl interface is
planned. One of the benefits of using the APIs over scripts is that it simplifies
complex tasks into simple commands; commands that would require many lines
of scripting can potentially be a single command via the API.The API will be
rolled out for end-user support; however, the initial support is to third-party
management applications.
The application provider typically distributes host libraries and headers.Target
users are SAN management application providers with availability to all switch
users planned.The following companies provide SAN management applications:
■
VERITAS (SANPoint Control)
■
Computer Associates (Unicenter TNG)
■
BMC Software (PATROL)
■
Sun Microsystems (HighGround)
■
Hewlett-Packard (OpenView Storage Area Manager)
■
SANavigator (SANavigator)
■
Prisa (Visual SAN)
■
Micromuse (Netcool)
■
IBM Tivoli (TSNM)
The Fabric OS API is intended to provide the following operations:
■
Discovery applications can quickly discover the fabric topology
(switches, ports, and routes) and devices within the SAN.
■
Zoning provides full access to Brocade Zoning management facilities. A
transaction model with rollback manages multi-application access to
safeguard against concurrent access.
■
Switch and port management provides application control of individual switches and ports. Applications have access to Switch, Ports, Port
Statistics (PortStats), and Port Errors (PortErrors) objects for in-depth
information of critical SAN information. Obtain firmware versions from
all switches in your company.
■
Device management provides access to node and device objects that
provide information about the end points within the SAN.
■
Route management provides access to route control information to
assist users in discovering and managing routes within the fabric.
140_SANs_09
8/14/01
3:40 PM
Page 369
SAN Implementation, Maintenance, and Management • Chapter 9
Expect Scripting
Expect is a powerful tool for managing the switch in an automated fashion using
telnet commands; it not only automates applications such as telnet, ftp, passwd,
fsck, rlogin, and tip, but it is also used for testing them.The Expect home page
(http://expect.nist.gov) is an excellent source of information on Expect, the
foundation software required by Expect (Tcl,Tk) and Expect applications.
Another Web resource for Expect is the Tcl Developer Xchange, found at
www.scriptics.com. Expect is available for a variety of UNIX, Microsoft, and
Macintosh environments.
A Switch Management Wrapper Using Expect
As mentioned earlier, the scripts discussed in this book are available on the book’s
Web site (www.syngress.com/solutions). Although these scripts are not coding
works of art, they are a great foundation to build utilities for your switch
management.
A wrapper that allows you to execute a single command on a switch is provided as an example (Figure 9.9).The name of the script is run_sw_cmd. The
script takes two arguments: the command you wish to execute and the name of
the switch you want to execute the command on.This wrapper enables you to
run a switch command in an automated fashion.The hard part of the script and
the majority of lines for this program are focused on establishing a “connection”
(lines 1 through 62). Once the connection is made, it is very easy to just issue a
command to the switch, and it takes only two lines (lines 63 through 64) to do
this.The script is somewhat primitive since you need to set the user and password
information in the script. However, there is nothing to prevent modification of
the script to enable user and password arguments to be passed into the script.
Because only one telnet session with the switch is permitted at a time, you
cannot run an Expect script on a switch with an active telnet session. If you do,
the Expect script will not be able to gain a connection.
Figure 9.9 An Expect Script Wrapper for the SilkWorm Switch
Usage:
1
run_sw_cmd <command> <switch ip name>
#!/usr/local/bin/expect
2
3
# Author:
Chris Beauchamp, Brocade Communications
4
# Date:
06/01/01
Continued
369
140_SANs_09
370
8/14/01
3:40 PM
Page 370
Chapter 9 • SAN Implementation, Maintenance, and Management
Figure 9.9 Continued
5
6
proc telnetLogin {user passwd prompt} {
7
expect {
8
timeout
9
{puts "FAIL\nTelnet attempt for $user timed out\n"
10
return 1
11
}
12
eof
13
{puts "FAIL\nTelnet login prompt for $user never
happened\n"
14
return 1
15
}
16
# this is the case where we connect with the switch
17
"login:"
18
}
19
send "$user\r"
20
expect "Password:"
21
send "$passwd\r"
22
expect $prompt
23
return "0"
24
}
25
26
#
27
# main
28
#
29
30
# bail out if not enough args supplied
31
if {$argc != 2} {
32
puts "\nincorrect number of arguments supplied"
33
puts "
34
puts "\nexiting ..."
35
exit
36
}
\nusage: $argv0 <command> <switch>"
Continued
140_SANs_09
8/14/01
3:40 PM
Page 371
SAN Implementation, Maintenance, and Management • Chapter 9
Figure 9.9 Continued
37
38
set cmd [lindex $argv 0]
39
set switch [lindex $argv 1]
40
41
# change these values if you have different password or user
# requirements
42
set spasswd password
43
set suser admin
44
set sprompt admin>
45
46
set timeout 60
47
48
puts "telneting to switch"
49
spawn telnet $switch
50
set sw_spid $spawn_id
51
52
# exit since it was not possible to connect to the switch
53
catch {telnetLogin $suser $spasswd $sprompt} code
54
if {$code != 0} {
55
puts "unable to access switch"
56
exit
57
}
58
59
puts "switching context to switch telnet"
60
set spawn_id $sw_spid
61
62
# send the command
63
send "$cmd\r"
64
expect $sprompt
65
puts "\n"
66
return 0
371
140_SANs_09
372
8/14/01
3:40 PM
Page 372
Chapter 9 • SAN Implementation, Maintenance, and Management
You can also modify this script to read from a file of switches so that
you can execute the script on multiple switches. Alternatively, as shown in Figure
9.10, you can call the script from a UNIX shell script or Perl to obtain the
Fabric OS version from all switches in the fabric.
Figure 9.10 Integrating run_sw_cmd Expect Script with UNIX Shell Scripts
sun1# foreach switch ( core1 core2 edge1 edge2 edge3 )
? echo $switch
? run_sw_cmd version $switch | grep "Fabric OS:"
? end
core1
Fabric OS:
a2.4.1c
core2
Fabric OS:
a2.4.1a
edge1
Fabric OS:
a2.4.1a
edge2
Fabric OS:
a2.4.1a
edge3
Fabric OS:
a2.4.1a
Brocade Zoning Considerations
If you use switch-based zoning, you need to determine if you want to use hard
or soft Brocade Zoning, and how to manage your zones. A related zoning topic
that you also need to explore is where to zone.This section addresses these particular issues.
Brocade Zoning, which is an optionally licensed product, enables you to logically group devices into virtual SANs. Zoning is used to set up barriers between
different operating environments, to deploy logical fabric subsets by creating
defined user groups, or to create test and/or maintain areas that are separate
within the fabric. Zoning is an all-or-nothing operation: once a zone is enabled,
all devices must be defined in a zone, or each device will exist in a zone consisting of just that device, and that device will be inaccessible to other devices in
the fabric. In effect, this sets up an access by inclusion policy such that, by rule, a
host or storage device is not permitted to participate in the fabric until it is
positively included in at least one zone.With Brocade Zoning, you can define
multiple zoning configurations. However, only one zoning configuration is active
140_SANs_09
8/14/01
3:40 PM
Page 373
SAN Implementation, Maintenance, and Management • Chapter 9
at one time. It is possible to rapidly change zone configurations by just issuing
the cfgEnable <zone configuration> command.
One use of this capability is to facilitate policy-based management.This capability can be used in many ways. For example, a policy can be defined to provide
access to the tape library to Windows NT hosts during the day for continuous
backup, but migrate access to the UNIX hosts at the end of the day. Alternatively,
you might want to zone systems based on organizational structure.
Where to Zone?
It is possible to zone at various points in the SAN, such as the HBA or at the
host level, and you might even decide not to use switch-based zoning at all.You
might also want to use switch zoning in combination with other zoning
methods, such as using the HBA or storage controller to accomplish zoning, as
each might have a different level of granularity. As zoning is a component of
security, a combination of zoning at different locations in the SAN can be viewed
as an additional level of security. Many customers feel that you can never have
enough security.To provide context, see Figure 9.11 for the various zoning
methods and where these methods can be employed.
Much discussion surrounds the subject of where to zone. Major characteristics of zoning solutions include the need or lack of need for host resident software, zone configuration control, the ability of zoning to ease SAN management,
the ability to zone at a Logical Unit Number (LUN) level, and security. If you
use HBA zoning or a host resident zoning package, you need to install and maintain this software on all hosts that are part of the fabric. If one host is not running
the host resident software, your fabric is subject to illicit access or data corruption, as the fabric is unprotected without the resident software installed. Host resident or HBA zoning software is also subject to configuration changes at multiple
points, making management a challenge. Storage-based zoning, host resident
zoning, and HBA zoning normally are capable of LUN-level zoning, which is a
lower level of granularity than SAN switches can currently achieve.
Due to the inherent risk mentioned in zoning at this upper application layer,
it is advisable to supplement this solution with switch-based zoning as well to
prevent a newly attached device from accessing storage until it is properly configured. In this mode, the administrator configures zoning at both the host and the
switch. Doing so prevents any potential inappropriate data access if the host is not
configured properly.
As mentioned earlier, the SilkWorm zoning today cannot zone to the LUN
level.To do LUN-level zoning, you will need to choose an additional zoning
method. If you have multiple storage and HBA providers, it might be necessary
373
140_SANs_09
374
8/14/01
3:40 PM
Page 374
Chapter 9 • SAN Implementation, Maintenance, and Management
Figure 9.11 Where Zoning Can Happen in the SAN
Host Zoning
HBA Zoning
Hosts
SAN A
SAN B
Switch
Zoning
Storage Zoning
Storage
to learn, manage, and implement multiple zoning applications.The remainder of
this section focuses on switch-based zoning. At a minimum, you will likely want
to use switch zoning for the following reasons:
■
SilkWorm switches offer hard zoning, which is the most secure zoning
available in your SAN.
■
Switch zoning provides a single point of control—you need to manage
only one zoning interface as opposed to multiple HBA, storage, and host
zoning interfaces.
■
Switch zoning minimizes the impact devices have on each other by
limiting fabric activity such as Registered State Change Notification
(RSCN) to only those zone members affected by the RSCN or
limiting broadcast frames.
■
Some SAN devices can support only a limited number of device connections.With zoning, you can enforce the number of devices that exist
in a zone to align with the edge device connection limits.
140_SANs_09
8/14/01
3:40 PM
Page 375
SAN Implementation, Maintenance, and Management • Chapter 9
Hard Zoning or Soft Zoning?
Current Brocade SilkWorm switches support both hardware- and software-based
zoning. As there is not a setting to turn on one or the other, it is often a point of
confusion for administrators in terms of which one is being used.The type of
zoning you use depends on how the zones are defined. If you use a device
WWN or an Arbitrated Loop Physical Address (AL_PA) to define a zone object,
you are using soft zoning. If you use a device physical port number, in the form
(domain, port), you are using hard zoning (Figure 9.12).
Figure 9.12 Hard and Soft Zone Examples
core1:admin> cfgshow
Defined configuration:
cfg:
hard
green; yellow
cfg:
soft
red; blue
zone:
blue
jbod1; jbod2; softhost2
zone:
green
hardhost1; hardarray1
zone:
red
softjbod1; softjbod2; softhost1
zone:
yellow
hardhost2; hardarray2
alias: hardarray1
2,0
alias: hardarray2
3,9
Soft Zone
alias: hardhost1
0,8
alias: hardhost2
1,1
alias: softhost1
10:00:00:20:42:d9:78:31
alias: softhost2
20:00:00:50:37:d2:75:50
alias: softjbod1
21:00:00:20:37:d9:77:46
alias: softjbod2
21:00:00:20:37:d9:77:47
Effective configuration:
cfg:
hard
zone:
green
0,8
zone:
yellow
1,1
2,0
3,9
Hard Zone
375
140_SANs_09
376
8/14/01
3:40 PM
Page 376
Chapter 9 • SAN Implementation, Maintenance, and Management
The difference between hard and soft zoning is that hard zoning is enforced
at the Name Server and the Application-Specific Integrated Circuit (ASIC). Soft
zoning is enforced only at the Name Server.With hard zoning, each ASIC maintains a list of source port IDs that have permission to access any of the ports on
that ASIC, and the ASIC hardware itself will actually block inappropriate frames
from passing through it, dropping them if they attempt to talk outside their
zones.Your choice of zoning also influences how you maintain and operate your
SAN.
When a device requests a list of nodes from the Name Server, this is analogous
to calling a telephone directory service.When the Name Server responds to a
request, it returns nodes that the requesting device is allowed to access based on
zoning definitions.When you contact a telephone directory service, unlisted telephone numbers are not returned. However, if you know the unlisted party’s telephone number or randomly guess an unlisted telephone number, there is nothing
to prevent you from calling the unlisted party’s telephone number.With hard
zoning, even if the device is aware of and attempts to use an “unlisted” port ID,
the hardware will prevent communications from happening. Some edge devices
either cache port IDs or bypass the Name Server under certain circumstances and
will attempt to communicate with another device even though that device is not
in the Name Server. Normally, this type of behavior is the nature of the device.
For example, an initiator might not respond to an RSCN by design. An RSCN is
normally sent to tell the initiator that the zones have changed and some devices
that were previously being accessed are no longer available. If a device does not
respond to this RSCN, it will continue to access even addresses that have been
removed from its view of the Name Server. A malicious initiator might start scanning addresses to discover “live” ports. Hard zoning prevents initiators from
accessing devices under such circumstances. If you are using soft zoning, these
types of accesses are not prevented. Figure 9.13 shows the difference in security
between hard zoning and soft zoning. Note that with hard zoning, you have
protections at the Name Server and at the port, as depicted by the padlock icons.
140_SANs_09
8/14/01
3:40 PM
Page 377
SAN Implementation, Maintenance, and Management • Chapter 9
Figure 9.13 Hard Zoning Is More Secure than Soft Zoning
With Hard Zoning you have zoning enforcement at the hardware level and with the Name Server
Simple
Name
Server
With Soft Zoning you have zoning enforcement at the Name Server only
Simple
Name
Server
377
140_SANs_09
378
8/14/01
3:40 PM
Page 378
Chapter 9 • SAN Implementation, Maintenance, and Management
Hard Zoning and Soft Zoning Differences
When you zone by WWN (soft zoning), you have the flexibility of physically
moving that device anywhere within the fabric without redefining your zones.
This is because the device WWN has no dependencies on physical connection.
Currently with hard zoning, the zone definition is based on the physical location
of the edge device. If you move that edge device, you need to modify your zone
definitions, since the zone definition is no longer valid.When you replace a failed
device with a new device, you will need to modify your zone data with the new
device WWN if you are using soft zoning. It is not necessary to modify a zone
definition when replacing a hard-zoned device, since the device’s physical location is not changing.
Hard zones are easier to implement since you just need to know the switch
domain and port number of the device you want to zone.When you use soft
zoning, however, you need to obtain the device WWN and it is harder to visualize the relationship between zone definition and a physical device.
When you use hard zones, it is easier to replicate the zoning environment,
since the domain and port identifiers do not need to be changed.You might want
to replicate a zone environment when you implement the second fabric of your
dual-fabric solution. Replicating SAN environments using soft zones is not as
easy since re-entry of the unique WWNs associated with each SAN node is
required. Because domain IDs are subject to change, hard zoning definitions
might need to be redefined when a domain ID changes.
Zone Management
Zoning is a fabric-wide resource administered from any switch in the fabric,
which automatically distributes itself to every switch in the fabric. Zoning
administration can be managed via telnet commands,WEB TOOLS, or the
Fabric OS API to any switch in the fabric.You can use each of these zone
management interfaces standalone or in combination with each other.
The fabric provides maximum redundancy and reliability, since each switch
stores the zoning information locally and can distribute it to any switch added to
the fabric. For large zoning configurations or frequent zone changes, it might be
desirable to automate these operations. Downloading the zoning configuration
into a text file for manipulation and maintenance might also be desired.While
the zoning information is redundantly distributed throughout the fabric, you are
encouraged to make at least one backup copy of your zoning configuration by
using the command configUpload.The configUpload command saves not
140_SANs_09
8/14/01
3:40 PM
Page 379
SAN Implementation, Maintenance, and Management • Chapter 9
only the switch configuration, but also the zoning configuration information to a
file located on a specified host. Note that to enable a zone configuration with
configDownload, you need to first disable the switch (use the command
switchDisable).
Scripting Zoning Operations
You have the option to use scripting to automate certain zoning operations. For
example, you can create a script to automatically change a zoning configuration by
enabling a predefined zone configuration.You might want to do frequent zone
changes to virtually move a tape drive to different zones in a fabric as you perform
your backup. Scripting is also effective for changing zoning configurations based on
policy. For example, in a disaster recovery scenario, your policy might dictate to disable noncritical access to the SAN so that production systems can take over the
resources used by noncritical systems. By automating the zone change process, you
speed up the zone changes and minimize the potential for human error.
With multiple zoning configurations defined in your fabric, it is quite easy to
switch between configurations by issuing the cfgEnable <configuration>
command. If you need to change configurations frequently or based on policy, you
might consider writing a script to cfgEnable the appropriate configuration.The
script would be very similar to the script shown in Figure 9.9 (run_sw_cmd).
Another option to leverage scripting for your zoning operations is to automate your zone creation with a script. Such a script would also serve as a backup
to the zone configuration running in your SAN.You can modify this script to
add or delete zone objects.When you need to restore a zone or implement zone
changes, just execute the script.The script flow is as follows:
1. Log in to the switch.
2. Clear the existing zone objects.
3. Create the zone objects.
4. Enable the desired configuration.
Figure 9.14 is a code fragment of an Expect script that can be used to create
or modify a zone configuration.This script is called make_zone.You need to
modify the zone entries within the script.The script is based on the script
run_sw_cmd (Figure 9.9) and is also available on the book’s Web site
(www.syngress.com/solutions).The syntax might appear a bit awkward or confusing, since you need to “escape” the double quotes (“) with a backslash (\) so
that the double quotes are passed to the switch and not interpreted by the Expect
script.
379
140_SANs_09
380
8/14/01
3:40 PM
Page 380
Chapter 9 • SAN Implementation, Maintenance, and Management
Figure 9.14 A Zone Creation Expect Script
Usage: make_zone <switch>
Switch login code
.
.
.
# clear out the existing configuration
send "cfgclear\r"
expect $sprompt
# create your zoning objects
send "alicreate \"jbod1\",\"21:00:00:20:37:d9:77:46\"\r"
expect $sprompt
send "alicreate \"jbod2\",\"21:00:00:20:37:d9:77:47\"\r"
expect $sprompt
send "zonecreate \"red\",\"jbod1;jbod2;0,0;0,1\"\r"
expect $sprompt
send "zonecreate \"blue\",\"jbod1;jbod2;1,0;1,1\"\r"
expect $sprompt
send "zonecreate \"green\",\"2,0;2,1;3,0;3,1\"\r"
expect $sprompt
send "cfgcreate \"colors1\",\"red;green\"\r"
expect $sprompt
send "cfgcreate \"colors2\",\"blue;green\"\r"
expect $sprompt
# enable the desired configurations
send "cfgenable \"colors1\"\r"
expect $sprompt
puts "\n"
return 0
140_SANs_09
8/14/01
3:40 PM
Page 381
SAN Implementation, Maintenance, and Management • Chapter 9
Zoning Tips
The Brocade Zoning manual extensively documents the use of zoning.The following list of tips will guide you through your zoning implementation as a
supplement to the Brocade manuals:
■
Minimal unwanted interactions To minimize unwanted interactions
between devices and to facilitate fault isolation, limit the number of
HBAs/initiators in a zone to one.The exception is clustering applications where HBAs need to communicate with each other.
■
Heterogeneous environments To reduce challenges related to
operating system interoperability, zones can be created that fence off different operating systems. If a shared target such as a tape drive is needed,
an overlapping zone can be used while still protecting the different operating systems from each other.
■
Aliases Use aliases to define your zone members. If a zone member
changes, you need only to update the alias versus potentially changing
multiple zone definitions. Aliases also give meaningful names to a device,
much in the same way an IP name gives a meaningful name to an IP
address. Aliases can be used for either single devices or for a group of
multiple devices. From a service perspective it is the best method for
getting a textual list of what is attached to what port.
■
Addition of a new switch To avoid zone conflicts and fabric segmentation when a new switch joins a fabric, clear and save the zone on the
new switch prior to that switch joining the fabric. Do this with the
commands cfgClear, cfgDisable, and cfgSave. A new switch added to
the fabric automatically inherits the active zoning configuration information in the fabric and immediately begins enforcement.
■
Node and Port WWN When a zone member is specified by Node
Name, then all ports on that device are in the zone.When a zone
member is specified by Port Name, only that single device port is in the
zone. A device has one Node WWN and one or more Port WWN(s).
For flexibility, consider using the Node WWN for your zoning entries if
you must use soft zoning.
■
Zone changes When issuing the configDownload command to
enable a given zoning configuration, you should insert the keyword
381
140_SANs_09
382
8/14/01
3:40 PM
Page 382
Chapter 9 • SAN Implementation, Maintenance, and Management
“clear:” into the file immediately before the zoning lines.This will
ensure that the new zones take effect and that there is no segmentation.
■
Backup When you have finished your zoning implementation, make a
backup of your zoning data by using the command configUpload.
Validating Your Fabric
Prior to transitioning your fabric to production, it is important to validate that the
SAN you have implemented is ready.The time to identify and correct any problems is during the validation of your fabric and prior to transitioning to production. Doing so involves establishing your SAN profile, which we discuss in
Chapter 8,“SAN Troubleshooting.” After baselining your SAN profile, you need to
inject faults into the fabric to verify that the fabric and the edge devices are
capable of recovering.The next step involves generating an I/O load in the SAN
that approximates various application I/O profiles. Finally, you will want to run an
I/O load on your SAN while also doing fault injection to approximate a worstcase scenario—a failure in your SAN while your SAN is in production. After
completing the validation phase, you can then hand off the SAN to production.
In the next section, we cover baselining your SAN profile, fault injection,
running in I/O load, and using I/O generators.
Baseline Your SAN Profile
We discuss the SAN profile process in Chapter 8.You need a baseline of your
SAN so that you can quickly determine if the testing you execute results in any
discrepancies.To baseline a SAN you need to bring up the SAN and all edge
devices and verify that the fabric is stable and all devices are present and
accounted for. Using the commands nsShow, nsAllShow, topologyShow, and
switchShow, verify that the number of switches and devices you expect to be
present are indeed visible to the fabric. If there is a discrepancy, refer to Chapter 8
for troubleshooting guidance. Once you verify that the correct number of
switches and devices are present in the fabric, update your SAN profile form. In
addition to the SAN profile information identified for collection in Chapter 8,
you also need to identify the ISL ports in your fabric.You can do this by
reviewing the output of switchShow on each switch in the fabric. Figure 9.15
provides a graphical depiction of a fabric with a SAN profile example.
140_SANs_09
8/14/01
3:40 PM
Page 383
SAN Implementation, Maintenance, and Management • Chapter 9
Figure 9.15 SAN Profile Example (Profile)
Total Name
Server Entries
-------------9
Number
Switches
-------5
core1
name server entries:
isl ports:
number of ISL ports:
2
0: 1: 2: 3: 4: 5:
6
core2
name server entries:
isl ports:
number of ISL ports:
0
0: 1: 2: 3: 4: 5:
6
edge1
name server entries:
isl ports:
number of ISL ports:
7
0: 1: 2: 3:
4
edge2
name server entries:
isl ports:
number of ISL ports:
0
0: 1: 2: 3:
4
edge3
name server entries:
isl ports:
number of ISL ports:
0
0: 1: 2: 3:
4
core1
edge1
core2
edge2
edge3
383
140_SANs_09
384
8/14/01
3:40 PM
Page 384
Chapter 9 • SAN Implementation, Maintenance, and Management
The data extracted from the SAN shown in Figure 9.15 was extracted via a
shell script.This shell script in turn used the Expect script run_sw_cmd. For
large fabrics where you need to repeatedly capture a SAN profile, a script like
get_san_profile is a real time saver.This shell script is available on the book’s
Web site (www.syngress.com/solutions).
Fault Injection
Fault injection is the process of creating scenarios in the SAN that mimic potential faults. It is effective for uncovering marginal connections and malfunctioning
devices. Fabric and edge devices should gracefully recover after a fault is injected.
The process of fault injection and SAN verification is straightforward:
1. Capture a SAN profile baseline.
2. Inject a fault.
3. Compare the SAN profile baseline to a current SAN profile.
4. Check edge devices to verify that no devices have dropped off (are no
longer visible to the hosts or switches).
5. If there are any unexpected discrepancies, go to Chapter 8 for
troubleshooting guidance.
Fault injection should involve both the fabric and the edge devices. Power
cycling and resetting are typical fault injection activities for edge devices.You can
simulate an edge device going offline and online by doing a
portDisable/portEnable for a particular edge port. For the fabric, you have
several fault injection activities from which to choose:
■
Reboot a switch or power cycle a switch.
■
Disable and enable a switch (switchDisable/switchEnable).
■
Disable and enable ISL ports (portDisable/portEnable).
An exhaustive testing of the fabric and all edge devices is not usually warranted. However, spot-checking is useful prior to transitioning to production. For
edge devices, select one or two representative hosts and storage devices for the
edge device fault injection. Power cycle and/or reset these devices. After the
device recovers from being power cycled or reset, check the other edge devices
to verify that no devices, except the device being reset, are dropped. A dropped
device is a device previously visible to the edge device that is no longer accessible: for example, a disk device that was visible via the UNIX format command
140_SANs_09
8/14/01
3:40 PM
Page 385
SAN Implementation, Maintenance, and Management • Chapter 9
but is no longer visible via the format command after a fault injection is
considered dropped. A dropped device is considered an error that requires further
troubleshooting.
For the fabric, select two or three switches for reboot, power cycle, and disable/enable fault injection. If you are using a core/edge architecture, one of the
switches used for fault injection should be a core switch. For the ISL testing,
choose three to five ISLs spread across multiple switches to disable and then
enable. After each fault injection, capture a SAN profile. Compare this SAN profile to the baseline. Also check the edge devices to see that no devices drop out. If
there are no discrepancies, the SAN passes the test.
Running an I/O Load
It is very important to establish a stable SAN prior to moving to testing a SAN
with an I/O load. If the SAN is not stable prior to load testing, it becomes
difficult to establish a root cause if problems arise, since these problems can be
stability-related and/or load-related. Some problems only arise under load, such as
marginal links.When you do load testing in your SAN, you should run a variety
of load types, focusing on a load that is most similar to the type of I/O you
expect in your SAN. Once you are able to test the SAN with a variety of loads,
try doing so with fault injection.The level of fault injection during load testing
should be less intensive than the fault injection phase of testing. A suggested level
of fault injection testing to perform while the SAN is under load is as follows:
■
Reboot a switch.
■
Reboot/reset one storage device and one host (you can simulate this
situation by using the portDisable/portEnable command).
■
Disable and enable two or three ISLs that are each located on a
different switch.
This is the final test. If you can run I/O in your SAN while doing fault
injection and you are able to recover after the fault, it is time to move to production. Some faults might cause some I/O not to recover.This can happen because
the host driver is unable to recover I/O under certain circumstances (for
example, tape I/O), or because timeout values on the edge devices or in the
SAN require tuning. Adjusting timeout settings in the SAN is a complex process
that involves edge device and switch settings.Timeout settings are normally
configured in the HBA or storage device. Refer to the HBA or storage device
385
140_SANs_09
386
8/14/01
3:40 PM
Page 386
Chapter 9 • SAN Implementation, Maintenance, and Management
configuration documentation for the specifics of how to make these changes and
what the impact is of doing so.
Types of Load
I/O can be classified in three ways: either a read or a write, random or sequential,
and I/O size. A second-order description of I/O is whether the I/O is bandwidth-intensive. If the I/O is bandwidth-intensive, it is more meaningful to measure this I/O by throughput (in MB/sec). If the I/O is not bandwidth-intensive,
you normally measure this I/O in terms of I/O Per Second (IOPS).
Usually, I/O is a mix of reads and writes. However, some applications are very
biased. For example, a video server I/O activity will normally be almost 100
percent reads.
I/O can further be classified as random or sequential. Some examples of
random I/O are an e-mail server or an Online Transaction Processing (OLTP)
server. Sequential I/O is characteristic of decision support (such as data warehousing) or scientific modeling applications.
The third characteristic of I/O is the size of the I/O. I/O sizes typically range
from 2 KB to over 1 MB.
Table 9.5 lists the application I/O profiles that establish the typical magnitude
of application bandwidth consumption. For SAN design performance purposes,
I/O is classified by bandwidth utilization: light, medium, and heavy. It is important
to ultimately support test assumptions by gathering actual data when possible.You
can gauge the type of I/O activity in your existing environment by using I/O
measurement tools such as iostat (UNIX) or diskperf (Microsoft).
Table 9.5 Application I/O Profiles
Application
Bandwidth
Utilization
Read /
Write Mix
Typical
Access
Typical
I/O Size
OLTP, e-commerce,
e-mail, UNIX File
System (UFS),
Common Internet
File Services (CIFS)
OLTP (raw)
Light
80% read /
20% write
Random
8 KB
Light
Random
2 KB–4 KB
Customer Response
Management (CRM)
Light
80%
20%
85%
15%
Random
2 KB–4 KB
read /
write
read /
write
Continued
140_SANs_09
8/14/01
3:40 PM
Page 387
SAN Implementation, Maintenance, and Management • Chapter 9
Table 9.5 Continued
Application
Decision support,
high-performance
computing, seismic,
imaging, technical
computing
Video server
SAN applications:
serverless backup,
snapshots, thirdparty copy
Bandwidth
Utilization
Read /
Write Mix
Typical
Access
Medium to
Heavy
90% read /
10% write
Sequential 16 KB–
128 KB
Heavy
98% read /
2% write
Variable
Sequential > 64 KB
Heavy
Typical
I/O Size
Sequential > 64 KB
I/O Generators
You need an I/O generator to place a load in the SAN. Use the application I/O
profiles outlined in Table 9.5 as a guide for providing input to your I/O generator.You should run a light-bandwidth and a heavy-bandwidth I/O load in your
SAN for testing, tweaking one of these profiles to match your anticipated load
profile. An even better approach is to use the target applications for load testing.
However, doing so is often difficult or not possible.When deciding on which
tool to use for your testing, focus on the tool’s ability to do the following:
■
Generate variable I/O sizes
■
Generate sequential and random I/O
■
Generate a mix of reads/writes
■
Generate one or more process(es)/thread(s) per disk or LUN
For Microsoft environments, Iometer is a popular and robust tool that meets
these requirements. Iometer is available from Intel (http://developer.intel.com/
design/servers/devtools/iometer) and the tool is free. For UNIX environments,
finding a tool for testing I/O is more challenging.There are also public domain
tools available, such as IOzone (www.iozone.org). More flexible and powerful
tools are not in the public domain and you need to obtain these tools directly
from storage and host suppliers. vxbench is a very powerful and flexible tool
available from VERITAS that can generate the loads outlined in Table 9.5.You
will need to contact a VERITAS representative to obtain a copy of vxbench.
387
140_SANs_09
388
8/14/01
3:40 PM
Page 388
Chapter 9 • SAN Implementation, Maintenance, and Management
Tips for Generating an I/O Load
■
Many I/O utilities do both reads and writes. Writes can
be destructive and can cause loss of data. Make sure that
the storage you are writing to does not contain data that
you need.
■
For UNIX operating systems, use the raw devices to achieve
maximum bandwidth. If you use a “cooked” device (a device
with a file system), you will incur CPU overhead related to
the file system and anomalous results due to buffering.
■
To achieve maximum bandwidth, use sequential I/O with one
or two threads/processes per device if you are using a JBOD.
When using a RAID device, you need to use multiple threads,
starting with two and doubling your thread/process count
until you observe a reduction in bandwidth.
■
To achieve maximum IOPS, start with two threads/processes
per device and double your thread/process count until you
see a reduction in IOPS. The switch does not measure IOPS,
so you will need to use an external tool such as iostat (UNIX)
or diskperf (Microsoft) to establish your IOPS.
■
To observe the performance for a particular switch, issue the
command portPerfShow from a telnet session on that switch.
Many storage and host suppliers have internally developed I/O testing tools
for UNIX and Microsoft environments that are available if you ask for them. A
reliable standby in the UNIX environment is the tool dd, which is available from
many UNIX operating systems. With dd, it is possible to perform variable size
I/O and to generate one or more processes. However, generating random I/O or
a mix of reads and writes is difficult. An example of running a heavy-bandwidth
load using the UNIX utility dd and the Microsoft environment tool Iometer follows (Figures 9.16 and 9.17). For dd, it is necessary to only use one process per
disk to achieve maximal bandwidth.
140_SANs_09
8/14/01
3:40 PM
Page 389
SAN Implementation, Maintenance, and Management • Chapter 9
Figure 9.16 Generating a Heavy-Bandwidth Load Using a Shell Script and
the UNIX Utility dd
#! /bin/csh -f
# 100% reads to 3 disks
dd if=/dev/rdsk/c1t0d0s2 of=/dev/null bs=64k count=2000 &
dd if=/dev/rdsk/c1t1d0s2 of=/dev/null bs=64k count=2000 &
dd if=/dev/rdsk/c1t2d0s2 of=/dev/null bs=64k count=2000 &
# 100% writes to 4 disks
dd if=/dev/zero of=/dev/rdsk/c1t3d0s0 bs=64k count=2000 &
dd if=/dev/zero of=/dev/rdsk/c1t4d0s0 bs=64k count=2000 &
dd if=/dev/zero of=/dev/rdsk/c1t5d0s0 bs=64k count=2000 &
dd if=/dev/zero of=/dev/rdsk/c1t6d0s0 bs=64k count=2000 &
Figure 9.17 Generating a Heavy-Bandwidth Load Using the Tool Iometer—All
Seven Disks Are Selected
389
140_SANs_09
390
8/14/01
3:40 PM
Page 390
Chapter 9 • SAN Implementation, Maintenance, and Management
To generate maximum Fibre Channel bandwidth of 100 MB/sec you will
need multiple disks, if using a JBOD, and possibly multiple storage arrays, if the
array is not capable of sustaining 100 MB/sec (Figures 9.18 and 9.19). For
example, if you run a single disk in a JBOD, you are never going to hit 100
MB/sec—you will need to run a number of drives to saturate a link.
Figure 9.18 The Access Pattern Is 100 Percent Sequential Read with an I/O
Size of 64 KB
Figure 9.19 The Bandwidth Is 95+ MB/sec—Approaching Fibre Channel
Maximum
140_SANs_09
8/14/01
3:40 PM
Page 391
SAN Implementation, Maintenance, and Management • Chapter 9
SAN Maintenance
There are multiple maintenance functions that you will need to perform
throughout the life of your SAN. Some of these maintenance activities will be
planned, and others will happen unexpectedly. If your SAN is designed to be
resilient or redundant, unexpected maintenance should have minimal or no
impact on your SAN operations. Failed or malfunctioning devices are normally
the cause of unexpected maintenance.We provide a suggested process in this
section for each maintenance activity discussed, including verification procedures.
Other detailed SAN maintenance procedures are available from Brocade reference manuals as well as whitepapers.The focus of this section is to present an
overview of the tasks and actions necessary to do SAN maintenance that is
integrated with the methodologies and process defined in this book:
■
Maintaining a configuration log
■
Backing up and restoring a switch configuration
■
Bringing up a fabric
■
Expanding a fabric: merging fabrics, adding a switch, or replacing
a switch
■
Upgrading your fabric
■
Replacing or adding an edge device in the fabric
The Configuration Log: Key Information to
Gather and Maintain about Your SAN
A configuration log is an up-to-date compilation of information and configuration details about your SAN. Enough information about your SAN should exist
in your configuration log so you can recreate your SAN based on it.Whenever
you make a change to your SAN, you should also update your configuration log.
The configuration log can exist in hardcopy, softcopy, or both.There are some
aspects of your configuration log that are not printable, such as firmware. If you
maintain your configuration log in softcopy and this softcopy is stored in your
SAN, you should also maintain a hardcopy or disaster backup, in case a disaster
makes your SAN inaccessible. Having a softcopy of your configuration log
enables rapid searches and easy updates of your SAN configuration data.You will
need to access your log for a variety of reasons:
391
140_SANs_09
392
8/14/01
3:40 PM
Page 392
Chapter 9 • SAN Implementation, Maintenance, and Management
■
Disaster recovery
■
Troubleshooting
■
Recreating a switch whose configuration is destroyed
■
Planning SAN additions (for example, replacing your core switches with
large core switches)
■
Modifying or expanding a SAN design
■
Recovering accidentally deleted licenses
■
Recovering or reconfiguring a zoning configuration
The key to a successful configuration log is diligent updates.Without the
updates, your configuration log is not very useful. A suggested structure for your
configuration log resembles the following (Figure 9.20 shows a Microsoft
Windows Explorer view of an online configuration log):
1. Detailed diagrams of your SAN:
A. Switch topology
B. Host and storage connections
2. Firmware log of all SAN devices:
A. A table for all devices, listing the device and related firmware
B. A directory structure containing an entry for each device’s firmware
3. A log book where any additions, deletions, or modifications to your
SAN are logged
4. A directory structure for the switches:
A. A copy of each switch’s configuration is maintained. Use the command configUpload to save a switch’s configuration to a host.
B. supportShow information for each switch, captured after the SAN is
tested and verified.
5. Your SAN profile
6. A script directory for any scripts you create
7. A zoning directory for zoning configurations
140_SANs_09
8/14/01
3:40 PM
Page 393
SAN Implementation, Maintenance, and Management • Chapter 9
Figure 9.20 Explorer Screenshot of a Configuration Log
Backing Up and Restoring a Switch
Configuration
When you implement a new SAN, change your switch configuration, add a new
switch to your SAN, or replace a switch in your SAN, you should create a
backup of each switch’s configuration on a host.You do so with the command
configUpload, which generates an editable text file. You can then restore a
switch’s configuration with the command configDownload. The direction of
upload and download is relative to the switch. Sometimes is it confusing whether
you are backing up a configuration or restoring a configuration.To back up a
configuration, you upload to a host.To restore a configuration, you download
from a host.
You can also create a standard configuration profile, suitable for configuring all
switches in your SAN, by stripping out the switch-specific data from the switch
configuration file. A switch configuration profile enables you to perform rapid
configuration of your switch or switches for initial implementation, additions, or
replacements.The alternative is a manual and time-consuming configuration of
fabric parameters, SNMP information, and Fabric Watch information.The switch
configuration file can also be used as a backup for your zoning configuration or
as a zoning reconfiguration tool. If you ever lose a switch’s license information,
you can recover this information from the configuration backup data.When
replacing a switch, you can reference the switch configuration backup for IP
address information.
393
140_SANs_09
394
8/14/01
3:40 PM
Page 394
Chapter 9 • SAN Implementation, Maintenance, and Management
The configuration file is written as three sections.The first section contains
the switch boot parameters (otherwise known as the switch’s identity) and is preceded by the heading [Boot Parameters]. It has variables such as the switch’s
name and IP address.This section corresponds to the first few lines of output of
the configShow command.The second section contains general switch configuration variables, such as diagnostic settings, fabric configuration settings, Fabric
Watch setting, license key information, and SNMP settings.This section corresponds to the output of the configShow command (after the first few lines),
although there are more lines uploaded than shown by the command.The second
section is preceded by the heading [Configuration].The third section contains
the zoning configuration. It corresponds to the output of the cfgShow command and is preceded by the heading [Zoning].
To create a standard switch configuration profile, strip out the [Boot
Parameters] and [Zoning] headings and section data, leaving the
[Configuration] heading and data.You might also want to strip out QuickLoop
data from the configuration section if you are not running QuickLoop on every
switch.To restore or load a configuration, it is necessary to disable the switch
(switchDisable) before downloading the configuration information.To load a
configuration, use the command configDownload, specifying the standard profile or backup configuration file as the configuration file.
It is also possible to use the configuration file as a zoning backup and as a
zoning reconfiguration tool. By stripping out the boot parameters and configuration information from the configuration file and then downloading the resulting
zoning information, you can restore or change a zone configuration. If you are
using the configuration file for zoning configuration changes, you need to insert
the keyword clear: in the configuration file to clear out the existing SAN zone
configuration and prevent zone conflicts.
Bringing Up a Fabric
There are several instances, such as power failure, initial bring up, or fabric-wide
firmware upgrade, when you will need to bring up an entire fabric.The ideal
order of bring up is as follows:
1. Bring up the fabric.
2. Bring up the storage devices.
3. Bring up the hosts.
140_SANs_09
8/14/01
3:40 PM
Page 395
SAN Implementation, Maintenance, and Management • Chapter 9
This order stems from the fact that the host must have visibility to the
storage, especially during boot when devices are configured: for the host to have
visibility to the storage, the storage and SAN must be online.You can bring up
the storage first and then bring up the SAN. However, it is recommended that
you power up the SAN first. Unfortunately, this order is difficult to implement.
Powering off or disconnecting edge devices is frequently very time consuming or
very difficult to schedule. A more likely scenario involves bringing the SAN
down via a power cycle or a reboot of all switches in the fabric, and then
bringing the fabric back up while edge devices are powered on and connected.
An example of unplanned bring up is a power outage.When a power outage
occurs, the order in which hosts, storage, and switches come online is variable.To
bring up a fabric, use the following steps as a guideline:
1. Bring up the switches. Either power on the switches or issue the
command reboot to all of the switches in the fabric.
2. Verify the fabric. Once the fabric is up, you need to verify that all
edge devices and switches are present. Use the SAN profile to compare
the previous baseline of your fabric switch count and device count to a
current profile. If you see any discrepancies, follow the troubleshooting
procedures detailed in Chapter 8. Use topologyShow to verify that all
switches are online and use nsAllShow to verify that the correct
number of devices are present in the fabric. Even if you are able to execute the ideal order of bring up (fabric, storage, host), it is still necessary
to compare the baseline SAN profile to the current SAN profile, since it
is possible that all edge devices did not come back online. Note that this
is becoming less of an issue, especially with newer edge devices.
Expanding a Fabric: Merging Fabrics, Adding
a Switch, or Replacing a Switch
Merging two fabrics, replacing a switch, and adding a switch to a fabric are similar processes. It is important that the zoning configurations and fabric configuration parameters are consistent between the new switch or fabric and the existing
fabric. Execute the following steps when adding a switch or switches to
the fabric:
1. If necessary, update your SAN profile with the current state of the SAN.
2. Resolve any zone conflicts.
395
140_SANs_09
396
8/14/01
3:40 PM
Page 396
Chapter 9 • SAN Implementation, Maintenance, and Management
3. Resolve any switch configuration parameter conflicts and make any necessary switch-specific configuration changes such as port configuration
changes, enabling QuickLoop, SNMP, Fabric Watch settings, and other
configuration changes. If you have a standard switch configuration, you
can download this configuration with the command configDownload.
4. Resolve any domain ID conflicts or connect a disabled/powered-down
switch.
5. Verify that the new switch or switches are licensed consistently with the
existing fabric licensing scheme.
6. Check the new switch’s or switches’ Fabric OS version and, if possible,
make the Fabric OS version consistent for the whole SAN.
7. Verify that your SAN devices are minimally impacted by an RSCN. If
your SAN devices have difficulty handling RSCNs or your applications
are adversely impacted, consider stopping I/O on those devices.
8. Connect the new switch or switches to the existing SAN.
9. Enable or power up the new switches.
10. Connect your edge devices.
11. Capture a new SAN profile to verify that the correct number of edge
devices and switches are present in the fabric. If there are any disparities,
reference Chapter 8 for guidance on troubleshooting. Once the correct
number of switches and edge devices are accounted for, create a baseline
SAN profile for future reference.
12. Back up the configuration for the added switch or switches with the
command configUpload.
To avoid zone conflicts, it is simplest to clear out the zone configurations for
the new switch or switches by executing the commands cfgClear, cfgDisable,
and cfgSave on the switch(es) being added. If you are merging multiple fabrics,
select one of the fabrics as the active fabric; add the zone entries from the nonactive fabrics to the active fabric zoning configuration; and then clear out the nonactive fabric switches’ zone information. Once you add the “blank” switches into
the fabric, these blank switches will absorb the zoning configuration of the
active fabric.
Certain configuration parameters in the fabric must be the same.To review
your switch configuration parameters, issue the command configShow. You
140_SANs_09
8/14/01
3:40 PM
Page 397
SAN Implementation, Maintenance, and Management • Chapter 9
must resolve any conflicts in fabric configuration parameters before adding a new
switch to the fabric. For example, if there is a difference with the variable Error
Detect Timeout Value (E_D_TOV), it is necessary to either change this setting on
the new switches or the existing switches so that the value is consistent on all
switches that are going to be part of the same fabric.
You can compare fabric configurations from your new switch and the
existing fabric by examining the output from the command configShow. You
can upload your standard switch settings from a configuration file to ensure consistency of your switch configuration throughout the fabric.You can create and
restore switch configurations by using the commands configUpload and
configDownload. If a backup of your switch configuration is not current, execute a configUpload to capture a current backup of your switch configuration
and license information.When merging fabrics or adding a new switch to a
fabric, you need to check for domain ID conflicts and resolve these conflicts by
changing one of the conflicting domain IDs. If you bring a disabled switch or
powered-down switch into a fabric, you do not need to resolve domain ID conflicts, since the new switch will negotiate an acceptable domain ID.
When a disabled or powered-down switch joins the fabric, if there is a
domain ID conflict, the added switch will negotiate a new domain ID. If domain
IDs change, verify your zone definitions to identify and correct any hard zones
affected by the domain ID change. Recall that a domain and a port number
define a hard zone. Also, some edge devices might have dependencies on a device
port ID, which is a function of the domain ID. If the domain ID changes, it
might be necessary to reboot your host or have your host rescan for devices.
NOTE
What is a domain ID? The Fibre Channel specification Fabric Generic
Requirements (FC-FG), available from the Technical Committee T11 of the
National Committee for Information Technology Standards (NCITS) at
www.t11.org, defines the concept of a domain as “the highest or most
significant hierarchical level in the three-level addressing hierarchy.” A
SilkWorm switch is considered a domain. The domain number uniquely
identifies the switch in a fabric. Within a fabric, a domain is identified by
an address ranging from 1 to 239 (domain ID). The range of allowed
values varies depending on the switch model and other system settings.
SilkWorm switches automatically assign domain IDs as part of the switch
initialization process.
397
140_SANs_09
398
8/14/01
3:40 PM
Page 398
Chapter 9 • SAN Implementation, Maintenance, and Management
To maintain a full feature set across all switches in a single fabric, you should
run the same version of Fabric OS on all switches in that fabric. Before adding a
new switch, check the license information with the command licenseShow to
verify that a consistent license set exists with the new switch and the existing
fabric.When you add a new switch to a fabric, you should try to do so when
I/O is quiescent.You can verify if any I/O is occurring in your fabric by issuing
the command portPerfShow on each switch in your fabric. When you add a
new switch or switches, there will be a pause in any active I/O as the fabric
reconfigures and edge devices respond to RSCNs. If you successfully tested your
fabric with fault injections while generating an I/O load, you will have an accurate idea about how your fabric and edge devices will respond to the new
switch addition(s).
Upgrading Your Fabric
We describe both processes in the following two sections. A hot upgrade (also
called a rolling upgrade) requires that your edge devices be configured with redundant paths and software capable of managing path failover.With a hot upgrade,
you reboot one switch at a time for the new firmware to take effect.With a cold
upgrade, you reboot all of your switches at the same time.You would perform a
hot upgrade if you were unable to take down an entire fabric. A cold upgrade
should take your fabric down only for a few minutes as you reboot for the new
firmware to take effect.
Issues Applicable to Both
Hot and Cold Upgrades
The actual process of downloading firmware (firmwareDownload) does not
require you to take the switch down. For the new firmware to take effect, you do
need to reboot your switch.Wait until all switches are running new firmware
before configuring any new software features or zoning parameters. It is recommended that all switches be upgraded to the same firmware level, to support
all features in the current fabric; however, rolling upgrades are possible and
supported.When performing a rolling upgrade, note that the new functionality
might not be available on the switches until all switches are running the new
version of Fabric OS.
140_SANs_09
8/14/01
3:40 PM
Page 399
SAN Implementation, Maintenance, and Management • Chapter 9
NOTE
To minimize the reboot process, use the fastboot command after the
firmware download. This skips the Power-On Self-Test (POST) and goes
right to loading code and bringing up the switch. The fastboot time is
approximately 30 seconds compared to approximately two minutes if
POST is run during a normal reboot.
Performing a Hot Fabric Upgrade
1. If necessary, update your SAN profile with the current state of the SAN.
2. Make sure the switch has redundant paths for devices attached to it. If
possible, force the I/O path on the devices to fail over to a neighboring
switch using software provided on those devices.
3. Verify that there is no traffic on the switch, using the perfShow telnet
command if manual failover was possible in step 2.
4. Download the new firmware (firmwareDownload) onto the switch.
5. Reboot the switch for the firmware to take effect.When you take a
switch down, it will cause the fabric to reconfigure and you will see a
pause in any outstanding I/O as the fabric reconfigures.When the
switch re-enters the fabric, you will also see a pause in I/O as the fabric
reconfigures.
6. Capture a new SAN profile to verify that the correct number of edge
devices and switches are present in the fabric. If there are any disparities,
reference Chapter 8 for guidance on troubleshooting.
7. Re-enable the redundant paths for the attached devices.
8. Repeat steps 2 through 7 until all switches in the fabric have
been upgraded.
9. Configure any new software features or zoning parameters.
10. Capture a new SAN profile to verify that the correct number of edge
devices and switches are present in the fabric. If there are any disparities,
reference Chapter 8 for guidance on troubleshooting and resolving
the discrepancies.
11. Create a baseline SAN profile for future reference.
399
140_SANs_09
400
8/14/01
3:40 PM
Page 400
Chapter 9 • SAN Implementation, Maintenance, and Management
Performing a Cold Fabric Upgrade
1. If necessary, update your SAN profile with the current state of the SAN.
2.
Verify that there is no traffic on the switch, using the perfShow
telnet command.
3. Download the new firmware (firmwareDownload) onto all switches
in the fabric.
4. Reboot all switches in the fabric.
5. Capture a new SAN profile to verify that the correct number of edge
devices and switches are present in the fabric. If there are any disparities,
reference Chapter 8 for guidance on troubleshooting and resolving
these discrepancies.
6. Configure any new software features or zoning parameters.
7. Capture a new SAN profile to verify that the correct number of edge
devices and switches are present in the fabric. If there are any disparities,
reference Chapter 8 for guidance on troubleshooting.
8. Create a baseline SAN profile for future reference.
How to Automate firmwareDownload
Using the run_sw_cmd script as a base, you can automate the download of
firmware to a switch.You can then call this script from a “for” loop to download
to multiple switches.The following excerpt in Figure 9.21 is from a script called
fw_download, which is available on the book’s Web site (www.syngress.com/
solutions).
Figure 9.21 Excerpt from a Firmware Download Automation Expect Script
# using ftp version of fw download -
note requires cleartext password
set cmd "firmwareDownload
\"192.168.162.102\",\"root\",\"/book/v2.4.1c\",\"fooba
r\""
# send the command
send "$cmd\r"
expect $sprompt
140_SANs_09
8/14/01
3:40 PM
Page 401
SAN Implementation, Maintenance, and Management • Chapter 9
Replacing or Adding an Edge
Device in the Fabric
When you add a new device to the fabric, you need to do some work ahead of
time. Ideally, you will connect the new device to a switch with the highest concentration of devices that the new device is expected to access. Carefully consider
placing new devices on the core of a core/edge fabric, since taking up core
switch ports with edge devices limits your expansion capabilities. See Chapter 7
for more detail on device placement in the SAN. If you are using zoning, you
need to update your zoning configuration with information from the new
device. If you are doing AL_PA zoning or hard zoning and you replace a device
on the same port, you do not need to make any zoning changes. Remember that
an AL_PA is an arbitrated loop physical address. An AL_PA is an 8-bit address and
is used to identify a private loop device (for example, ef ). An AL_PA might also
be part of a public loop 24-bit address (for example, 0102ef ). AL_PAs are “soft,”
meaning the address is dynamically assigned, or AL_PAs are “hard,” meaning the
AL_PA is manually set.There might be some port configuration requirements
such as QuickLoop, or a port might require “locking” to a G_Port (F_Port or
E_Port) or L_Port. By default, Brocade SilkWorm ports are auto-configuring, the
switch ports will match the topology of the edge port, and “locking” a port
should not be necessary. Configuring a switch port for G_Port locks the switch
port topology to a point-to-point connection, and configuring for an L_Port
locks the switch port for a loop connection. If you are adding a new device,
establish whether any port configuration dependencies exist and make any necessary changes. Adding a new device should have minimal impact on any active
I/O. Devices within the new device’s zone might experience a pause in I/O as
they respond to the RSCN notifying the SAN that there has been a change
when the device is added.The following steps detail how to add or replace a
device in the fabric:
1. Choose a location for the device in the SAN when making an addition.
2. If necessary, update your SAN profile with the current state of the SAN.
3. Update your zoning configuration to reflect the new edge devices prior
to connection.This is extremely important as most devices perform
discovery during login, so the zones need to be in place.
401
140_SANs_09
402
8/14/01
3:40 PM
Page 402
Chapter 9 • SAN Implementation, Maintenance, and Management
4. If necessary, make any port changes such as locking a port to specific
topology or enabling QuickLoop.
5. Connect your new device or replace the existing device. If replacing a
server, be sure to remove it from the zone, and if using software-based
zoning, either reboot it or disconnect it from the SAN, as it might
potentially still access mounted and known storage subsystems.
6. Capture a new SAN profile to verify that the correct number of edge
devices and switches are present in the fabric. If there are any disparities,
reference Chapter 8 for guidance on troubleshooting. For example, if
there are 42 devices in the SAN before you connect your F_Port device,
expect 43 devices to exist after you connect the device.
7. Create a baseline SAN profile for future reference.
140_SANs_09
8/14/01
3:40 PM
Page 403
SAN Implementation, Maintenance, and Management • Chapter 9
Summary
Several decisions and considerations regarding your SAN solution are necessary
prior to installation. Installation time is when to plan your cabling and to implement a cable layout scheme that is manageable, flexible, and maintainable. An
effective cable management scheme not only enables ease of maintenance, but it
is also aesthetically pleasing. Key areas to pay attention to when cabling are ISL
cabling and taking up slack.When racking your switches, make sure to avoid
single points of failure.This means separating redundant fabrics into different
racks and powering resilient fabrics in such a way that a power failure does not
cause the fabric to fail. For some situations, it is not possible or practical to dedicate an Ethernet connection for each switch. It is possible to manage Brocade
switches via direct Ethernet connections or via IPFC. Using IPFC currently does
have some disadvantages, such as a single point of management failure; however,
this is no different from management via the Ethernet ports. In that case, you
would have the management station and all the switches connected to an
Ethernet switch or hub. If the Ethernet cable goes bad, you cannot manage any
switch. Other areas that require thought prior to installation are a switch-naming
convention, the version of Fabric OS to install, and ensuring that the correct
licenses have been purchased.You should have a well thought-out switch-naming
convention to enable easy identification of a physical switch in case a problem
arises. Choose your Fabric OS version before you implement your SAN.The
Fabric OS selection process might involve several of your SAN device providers,
since Fabric OS support levels vary between SAN device vendors.
If you have to do a SAN administration activity more than once, consider
writing a script.You can use the Tcl/Tk-based Expect scripting language to
interface with the switch. In the future, you will also have the option to use the
Fabric OS APIs for automating switch management functions.With scripting you
can automate many activities, such as downloading firmware, rebooting switches,
automating zone changes, and facilitating troubleshooting. A wrapper script that
enables the automation of SilkWorm switch commands is provided as an example
and as a tool to use in your SAN.You can use this script as is, or modify it to suit
your needs.
If you use switch-based zoning, you need to determine if you want to use
hard or soft Brocade Zoning and how to manage your zones. A related zoning
topic that you also need to explore is where to zone. Zoning enables you to logically group devices into virtual SANs. Zoning is used to set up barriers between
different operating environments, to deploy logical fabric subsets by creating
403
140_SANs_09
404
8/14/01
3:40 PM
Page 404
Chapter 9 • SAN Implementation, Maintenance, and Management
defined user groups, or to create test areas or maintenance areas that are separate
within the fabric. Zoning is an all-or-nothing operation: once a zone is defined,
all devices must be defined in a zone, or each device will exist in a zone consisting of just that device, and that device will be inaccessible to other devices in
the fabric. It is possible to zone at various points in the SAN, such as the HBA or
at the host level, and you might even decide not to use switch-based zoning at
all.You might also want to use switch zoning in combination with other zoning
methods such as using the HBA or storage controller to accomplish zoning.
Zoning offers many advantages, such as a single point of management, hard
zoning, and the ability to create virtual SANs. Hard zoning is the most secure
zoning available because it is enforced at the Name Server and at the ASIC.There
are differences between hard zoning and soft zoning, such as flexibility in moving
zoned devices in the fabric and ease of implementation. A domain and a port
number define a hard zone, while a device WWN defines soft zoning.You can
script repetitive zoning operations. Scripting large zoning configurations might
also be easier than using other zoning management interfaces, such as
WEB TOOLS.
Prior to transitioning your fabric to production, it is important to validate
that the SAN you have implemented is ready.The time to identify and correct
any problems is during the validation of your fabric and prior to transitioning to
production. Doing so involves establishing your SAN profile and injecting faults
into the fabric to verify that the fabric and the edge devices are capable of recovering. Generating an I/O load in the SAN that approximates various application
I/O profiles is also an important part of the SAN validation process.To approximate a worst-case scenario—a failure in your SAN while your SAN is in production—you will want to run an I/O load on your SAN while also doing fault
injection. Suggested fault injections involve rebooting or power cycling your
switches and edge devices. I/O can be classified in three ways: a read or a write,
random or sequential, and I/O size. A second-order description of I/O is
whether the I/O is bandwidth-intensive.When you do load testing in your SAN,
you should run a variety of load types, focusing on a load that is most similar to
the type of I/O you expect in your SAN. Once you are able to test the SAN
with a variety of loads, try doing so with fault injection.The level of fault injection during load testing should be less intensive than the fault injection phase of
testing.There are several publicly available load generators, such as Iometer. You
might also want to ask your storage or host provider for tools they use in their
load testing.
140_SANs_09
8/14/01
3:40 PM
Page 405
SAN Implementation, Maintenance, and Management • Chapter 9
There are multiple maintenance functions that you will need to perform
throughout the life of your SAN. A suggested process is provided for several
maintenance activities: creating a configuration log, backing up and restoring a
switch configuration, bringing up a fabric, expanding a fabric (merging fabrics,
adding a switch, or replacing a switch), upgrading your fabric, and replacing or
adding an edge device in the fabric. Other detailed SAN maintenance procedures
are available from Brocade reference manuals as well as whitepapers. A diligently
maintained configuration log can help you with disaster recovery, troubleshooting, recreating a switch whose configuration is destroyed, SAN design
modifications or expansion, recovering accidentally deleted licenses, and recovering a zoning configuration.When you implement a new SAN, change your
switch configuration, add a new switch to your SAN, or replace a switch in your
SAN, you should create a backup of each switch’s configuration on a host using
the command configUpload.
Solutions Fast Track
Installation Considerations
Ensure that ISLs run in front of only the switches to which they are
connected.This will allow the switches to be removed without downtime for the fabric.
When racking your switches, be sure to avoid single points of failure.
This means separating redundant fabrics into different racks and powering resilient fabrics in such a way that a power failure does not cause
the fabric to fail.
Carefully consider solely using in-band management of your switch.
Consider using both in-band and out-of-band management.
Have a well thought-out switch-naming convention to enable easy iden-
tification of a physical switch in case a problem arises.
If you intend to implement a large fabric, work closely with your switch
supplier to identify a version of Fabric OS that supports the size of SAN
you intend to build.
405
140_SANs_09
406
8/14/01
3:40 PM
Page 406
Chapter 9 • SAN Implementation, Maintenance, and Management
Automating Switch Administration Activities
If you have to do a SAN administration activity more than once,
consider automating the activity with a script.
Use Expect for automating your SAN administration activities today and
consider using Fabric OS APIs when they become available.
Take the Expect script example (run_sw_cmd) and modify it for your
SAN administration activities.
If you use Expect scripting, you need the supporting software. See the
following URL for Expect installation guidance: http://expect.nist.gov.
Brocade Zoning Considerations
Determine whether you want to use hard or soft zoning prior to
implementing your zone scheme.
Hard zoning is more secure than soft zoning, as hard zoning is enforced
at the Name Server and at the hardware level and will actually block
inappropriate access.
There are differences between hard zoning and soft zoning from a
maintenance perspective. For example, you need to update your zone
information if you replace a device that is part of a soft zone.
Consider using a script to build and maintain large zoning configura-
tions. Scripts can also be helpful to implement disaster recovery policies
that are implemented in zoning.
Validating Your Fabric
Baseline your fabric first so that you can quickly identify failures when
you validate your SAN. Use the SAN profile as your baseline.
You can automate many validation activities, such as taking your SAN
profile and fault injection.
The key to fault injection is to establish how your entire system behaves
when a fault is encountered.
140_SANs_09
8/14/01
3:40 PM
Page 407
SAN Implementation, Maintenance, and Management • Chapter 9
Key fabric fault injections include: switch power cycle, ISL
disable/enable, and switchDisable/switchEnable. Key edge device
fault injections include: reset or power cycle the edge device.You can
simulate this event by doing a portDisable/portEnable.
Running a load in your SAN can shake out issues like marginal links.
The final test is to run a load on your SAN while you do fault
injections. If your SAN is able to handle this test, you are ready
for production.
Pick a minimum of two load types for your SAN I/O testing: one that
approximates your SAN application load, and a load that is bandwidthintensive.
Ask your host supplier, HBA supplier, or storage supplier for the tools
they use for I/O testing in a UNIX environment. If you want to use a
Microsoft/Intel tool, Iometer is a good choice.
SAN Maintenance
If possible, use the “cold” upgrade process for fabric upgrades. It takes
only a few minutes of downtime.
When adding a switch to the fabric, clear out the zone information.
It is simplest to power down or disable your switch prior to connecting
that switch to an existing SAN. Doing so will avoid domain ID conflicts.
A diligently maintained SAN configuration log can help you with dis-
aster recovery, troubleshooting, recreating a switch whose configuration
is destroyed, SAN design modification or expansion, recovering accidentally deleted licenses, and recovering a zoning configuration.
Back up your switch configuration with the command configUpload
whenever you add or replace a switch.
Maintaining a baseline SAN profile is essential to many SAN mainte-
nance activities. Make sure you know how to create and maintain a
SAN maintenance profile.
The act of loading firmware does not impact SAN operations.The
process of activating does impact SAN operations because a reboot
is required.
407
140_SANs_09
408
8/14/01
3:40 PM
Page 408
Chapter 9 • SAN Implementation, Maintenance, and Management
Frequently Asked Questions
The following Frequently Asked Questions, answered by the authors of this book,
are designed to both measure your understanding of the concepts presented in
this chapter and to assist you with real-life implementation of these concepts. To
have your questions about this chapter answered by the author, browse to
www.syngress.com/solutions and click on the “Ask the Author” form.
Q: Do I have to take my SAN down to perform a fabric upgrade?
A: No. If you have your SAN configured such that all edge devices have dual
paths, it is possible to perform a “hot” upgrade.
Q: When I merged my fabrics, several disks were no longer accessible from the
hosts.What happened?
A: If you were using hard zoning, it is possible that the domain IDs of one of the
switches you merged changed.This change in domain ID might have invalidated some of your zones.
Q: Are rogue hosts a real threat to soft zoning?
A: Hosts that intentionally bypass the Name Server are as likely a threat to
security as a device that caches a Name Server entry or does not respond
to RSCNs.
Q: Why do I have to use a complex scripting language to manage my SAN?
A: You do not have to write any scripts to manage your SAN.WEB TOOLS
and other commercially available SAN management software is available
to perform a variety of SAN management tasks. Scripting is for users who
prefer the flexibility and power that scripting enbles for their SAN management tasks.
Q: When will the Brocade APIs be available for end-user use?
A: Check with your switch supplier.The current target date is planned for the
end of 2001.
140_SANs_AppFT
8/14/01
3:42 PM
Page 409
Appendix
Building SANs with
Brocade Fabric
Switches
Fast Track
This Appendix will provide you with a quick,
yet comprehensive, review of the most
important concepts covered in this book.
409
140_SANs_AppFT
410
8/14/01
3:42 PM
Page 410
Appendix • Building SANs with Brocade Fabric Switches Fast Track
❖ Chapter 1: Introduction to SANs
Overview of SANs
SAN technology evolved from direct-attach interconnects like Small
Computer Systems Interface (SCSI).
Fibre Channel supports SCSI, Internet Protocol (IP), and the Fibre Channel
Virtual Interface (FC-VI) Protocol.
The distance between Fibre Channel nodes can be as much as 10 km.
Fibre Channel supports copper, multimode optical, and single-mode
optical media.
SAN technology has moved from Fibre Channel Arbitrated Loop to full
Fibre Channel switch fabric.
Taming the Storage Monster
Data storage needs are increasing rapidly.
Requirements due to databases, e-mail, multimedia, and the Internet have
dramatically increased the required amount of storage for data.
Disk farms, storage arrays, and storage consolidation are the keys to solving
the storage problem.
Benefits of Building a SAN
Fibre Channel is ideal for supporting high-availability configurations and
business-critical back-end operations, due to the ability to set up redundant
networks and clusters.
SAN technology allows for storage consolidation and data pooling for more
efficient use of storage resources.
Backup windows are shrinking, and backup traffic on the LAN can be easily
reduced by using a SAN to reduce network congestion due to backup.
Block-level, high-speed access through SCSI-Fibre Channel Protocol (FCP)
can accelerate data access between storage and hosts, and can free up host
resources that would be occupied serving files and data through IP.
140_SANs_AppFT
8/14/01
3:42 PM
Page 411
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 1 Continued
Cluster protocol access through FC-VI frees up CPU cycles in hosts and
enables clustered database operations.
One of the major advantages of SAN technology is its long-distance
capability for disaster tolerance.
When to Deploy a SAN
The most important part of determining whether to deploy a SAN is to
focus on the actual business application that will be served with the
SAN deployment.
Speed and bandwidth requirements determine if the technology is right for
the application. Compared with other technologies, such as IP-based file
sharing and Network Attached Storage (NAS), the Fibre Channel protocol
provides for more usable bandwidth and faster data transfer.
A SAN is ideal for block-level access to shared storage.
Fibre Channel works well for centralized access to storage arrays, redundant
connections, clustered configurations, and disaster tolerance.
Steps to a Successful SAN Deployment
Data collection Evaluate the goals of the deployment to determine
options in achieving high availability, redundancy, fault tolerance, data consolidation, cost reduction, and so forth.
Data analysis Investigate the hardware and software options that support
those goals.
Architecture development Design and install a SAN testbed to set up
configuration and components. Select the software and hardware carefully to
avoid any interoperability problems.
Testing the prototype Test the configuration for interoperability, functionality, error handling, and fault tolerance.
Transition existing hardware in a controlled release to production
Stage the deployment by rolling out the setup gradually, making changes on a
limited basis to minimize risk.
411
140_SANs_AppFT
412
8/14/01
3:42 PM
Page 412
Appendix • Building SANs with Brocade Fabric Switches Fast Track
❖ Chapter 2: Fibre Channel Basics
The Architecture of SANs
A Fibre Channel SAN provides the advantages of increased speed, reliability,
and scalability.
Fibre Channel presently transmits at 1062.5 Gbit/sec over single- and multimode optical and copper cabling.
A SAN implemented using the Fibre Channel protocol incorporates the
benefits of a channeled connection and a network.
A SAN is constructed from three primary types of elements: initiating
devices, switches, and target devices.
A target device is a storage device on a SAN. Device enclosures like tapes,
JBODs, or RAIDs are the most common type of target device.
An initiating device is a device that actively seeks out and interacts with
target devices on the SAN.
Switches create the foundation of the Fibre Channel SAN. A group of interconnected switches is called a fabric.
Fibre Channel Protocol Basics
Fibre Channel is primarily used to transport the SCSI and IP protocols.
Devices are identified by an 8-bit Arbitrated Loop Physical Address
(AL_PA) in an arbitrated loop topology, and a 24-bit address for switched
fabric connections.
Frames start with a primitive Start Of Frame (SOF) and end with an End
Of Frame (EOF) primitive.
There are five Fibre Channel layers, designated FC-0 through FC-4.
The FC-0 layer is the physical media layer and includes the media selection
and connectors.
The FC-1 layer is the signal encoding and decoding layer.The FC-1 layer
uses 8b/10b encoding.
The FC-2 layer is the Fibre Channel protocol layer.
140_SANs_AppFT
8/14/01
3:42 PM
Page 413
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 2 Continued
The FC-3 layer is the Fibre Channel common services layer.The services
are servers in a Fibre Channel fabric that manage connections between
devices connected remotely through the switched fabric.
The FC-4 layer is the Fibre Channel ULP mappings layer.
Classes of Service
Classes of service specify what mechanisms are required for transmission of
different types of data.
Class 1—Acknowledged connection-oriented service.
Class 2—Acknowledged connectionless service.
Class 3—Unacknowledged connectionless service.
Class 4—Fractional bandwidth connection-oriented service.
Class F—Used for inter-switch communication.
Storage Network Topologies
There are three primary types of topologies in Fibre Channel: point-topoint, arbitrated loop, and switched fabric (also called point-to-point).
The primary use of the point-to-point topology is to connect devices
directly to switches or other bridge devices.
The arbitrated loop topology allows up to 127 devices in a ring formation
to share the bandwidth of a single line without a switch.
Fabrics allow thousands of devices to be interconnected.
Switches have three types of ports. FL_Ports are fabric loop ports that attach
arbitrated loops to the fabric. F_Ports are fabric ports that connect single
devices to the fabric in a point-to-point topology. E_Ports connect a switch
to another switch.
Fabric-attached devices have a three-part address.The first segment indicates
the physical switch, the second part indicates the physical port, and the last
part is the arbitrated loop address in a loop device or 0x00 for a fabric device.
413
140_SANs_AppFT
414
8/14/01
3:42 PM
Page 414
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 2 Continued
Fabric Services
Switches exchange information in their servers so that all individual switch
servers contain the same information.This creates distributed servers.
The fabric port is used to log a device into the fabric.The response frame
from login assigns the device its 24-bit address.
The Name Server is used as a database to register and store information
about all devices on the fabric.
The Fabric Controller at well-known address 0xFFFFFD provides state
change notification service to registered nodes. State change notification is a
service that notifies devices when a change in fabric topology occurs.
The Management Server provides information about the fabric without
stipulation as to zone.
❖ Chapter 3: SAN Components and Equipment
Overview of Fibre Channel Equipment
Understanding the features of your Fibre Channel equipment is key when
building a robust infrastructure.
A Fibre Channel network is comprised of cabling, GBICs, hubs, switches,
HBAs, and routers.
Fibre Channel shares much of the same terminology as Ethernet
networking, but the functionality of similarly named equipment is not
necessarily identical.
Cabling and GBICs
Copper cabling is almost always terminated with either an HSSDC or DB-9
male connector.
Multimode optical fiber is terminated using a variety of optical connectors,
including SC, LC, and MT-RJ.
Single-mode fiber is the most expensive media type, but preferable for
long distances.
140_SANs_AppFT
8/14/01
3:42 PM
Page 415
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 3 Continued
Single-mode fiber, because of its small diameter (9 µm), has the highest
transmission speed potential.
Copper cabling is available in two types: active and passive. Active copper
lines provide twice the distance of passive copper lines.
The HSSDC connector was specifically designed as a Gigabit copper connector, improving density and performance over the DB-9 style connector.
GBICs are removable transceivers used in all types of Fibre Channel devices,
including switches, hubs, and HBAs.
GBICs offer the option of interfacing with almost all types of connectors.
A Media Interface Adapter (MIA) converts DB-9 copper connectors to
optical SC connectors.
Using Hubs
Hubs serve as a very basic level for connecting different ports in a
network together.
Hubs can connect up to 127 devices together in an FC-AL loop.
Simple hubs contain no intelligence, just electrical connections.
Managed hubs provide a level of error tolerance and management features.
Managed hubs provide LIP isolation, automatic port bypass, signal retiming,
and management interfaces.
Fibre Channel LIPs can be a major source of problems in arbitrated loop
configurations.
To avoid an earlier generation of problems due to loop architecture, most
people are moving to switched fabric devices instead.
Using Switches and Fibre Channel Fabrics
Switches are classified into three categories: entry-level, scalable fabric, and
core fabric switches.
Entry-level switches are focused on small workgroups of 8 to 16 ports, usually are geared toward low cost, and deliver limited scalability and management. Fabric switches provide the capability to cascade switches together to
create larger fabrics.
415
140_SANs_AppFT
416
8/14/01
3:42 PM
Page 416
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 3 Continued
A core fabric switch is designed for interconnecting multiple edge switches
to form multihundred-port SANs.
HBAs are used to connect servers to the network.They map SCSI commands
in the operating system to Fibre Channel frames on the network. HBAs range
from low-end, loop-only devices to high-end, fabric multipathing adapters.
Major protocols supported by HBAs are SCSI-FCP for storage, IPFC for
networking, and VIFC for clustering.
HBAs either support 1 Gbit/sec or 2 Gbit/sec speeds, with current generation cards supporting 1 Gbit/sec, and emerging cards supporting both.
HBAs can be found in single one-port configurations or dual-port adapters
for higher density.
LUN masking enables control of access to devices in the network from
the HBA.
Persistent binding is the mapping of a Fibre Channel device into an operating system at a specific device location.
Dynamic discovery is the capability to dynamically add and remove drives
from your system without reboot.
HBA API support is an important feature that allows management of your
HBA by SAN management software.
Remote booting is the use of an HBA to boot an operating system image
across the SAN and is used to dynamically change hosts and enable ease of
disaster recovery.
Connecting Legacy Devices into Your SAN
Α Fibre Channel router, which is also known as a bridge, allows legacy parallel SCSI devices to attach to your Fibre Channel SAN.
A Fibre Channel router plugs into Fibre Channel on one side and a SCSI
bus on the other.
Frames are translated from SCSI-FCP to parallel SCSI bus signals
through routers.
140_SANs_AppFT
8/14/01
3:42 PM
Page 417
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 3 Continued
Routers provide many different features, including different numbers
of SCSI buses and different support for parallel SCSI protocols
and termination.
Advanced features include selective LUN presentation, extended copy support, and various management interfaces.
Selective LUN presentation is the capability of a router to mask the presence of devices to different hosts in the network and allow for better security and control over resources.
Extended copy support (third-party copy) allows software to directly back
up data on the SAN, saving CPU and network traffic.
Available management interfaces include telnet, SNMP, Ethernet, and
serial ports.
Bridging and Routing to IP Networks and Beyond
Fibre Channel-to-DWDM technology multiplexes Fibre Channel signals
onto higher bandwidth fiber for transmission over MAN distances (up to
100 km).
Use of DWDM is transparent to Fibre Channel switches, except for
buffer settings.
It is necessary to increase buffer credit settings to handle the long distances/
delays involved in MANs.
Fibre Channel can also be transported across IP networks like ATM and
Gigabit Ethernet.
FC_IP (not to be confused with IPFC) encapsulates Fibre Channel frames
in the IP protocol and can be used for remote backup and extending
SAN distances.
Fibre Channel Storage
Fibre Channel storage is important as the core of the data storage on
your network.
Fibre Channel storage ranges from simple JBOD devices to multiterabyte
storage arrays.
417
140_SANs_AppFT
418
8/14/01
3:42 PM
Page 418
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 3 Continued
A JBOD is a cabinet of independent disks, all connected into the Fibre
Channel network in a loop.
Hosts individually address disks in a JBOD.
RAID arrays provide additional protection and performance to your storage.
Different RAID levels are appropriate for different applications.
High-end storage arrays add support for multiple terabytes of data. Other
types of connections include parallel SCSI, ESCON, and FICON.
High-end arrays also generally include a large amount of cache, which is
used to speed up data access.
Selective LUN presentation is the ability of high-end storage to control
access by hosts to data and to ensure data integrity.
LUN export across multiple ports is used for redundancy and high availability, but requires dynamic multipathing software or drivers to work.
Snapshot backup volumes are used to enable backup on live databases and
data images.
❖ Chapter 4: Overview of Brocade
SilkWorm Switches and Features
Selecting the Right Switch
Identify your requirements for availability, port density, functionality,
and cost.
Decide whether you need an arbitrated loop or full-fabric environment.
Learn which switch functions best satisfy your requirements.
Consider what strategic direction you want to take, and whether your
current switches will scale easily to meet your needs.
Understanding the Brocade Fabric OS
Fabric OS is the operating system for all Brocade SilkWorm switches.
Key functions include auto-discovery, in-order frame delivery, zoning,
and others.
140_SANs_AppFT
8/14/01
3:42 PM
Page 419
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 4 Continued
Fabric OS provides the capability to work with other storage
management applications.
Using Optional Brocade Features
You can use Brocade Zoning to isolate devices into separate, virtual SANs.
Zoning is ideal for multiple customer environments where data security
is critical.
Extended Fabrics enables the benefits of Fibre Channel technology at distances up to 100 km.
Fabric Watch tracks switch and fabric events to help you optimize fabricwide performance and proactively identify problems before they happen.
QuickLoop integrates private loop-based devices into switched fabric
environments.
QuickLoop helps support legacy devices to protect existing investments
while also providing performance and reliability advantages.
WEB TOOLS is an advanced monitoring tool that sends alerts about fabric
events to help prevent downtime.
You can use a Web browser interface and Java plug-in to monitor switchedfabric SANs from any workstation in your enterprise.
Future Capabilities in the Brocade
Intelligent Fabric Services Architecture
The Brocade Intelligent Fabric Services Architecture includes the SilkWorm
family of fabric switches, advanced fabric services, open fabric management
tools, and enterprise-class security products.
ISL Trunking is an optional software product ideal for optimizing performance of Brocade 2 Gbit/sec Fibre Channel fabric switches.
Frame Filtering enables a variety of new capabilities for monitoring and
managing SAN fabrics and enhancing both security and reliability.
Secure Fabric OS is the most comprehensive SAN security architecture
available, addressing vulnerabilities in the SAN fabric and supporting
multiple authentication methods.
419
140_SANs_AppFT
420
8/14/01
3:42 PM
Page 420
Appendix • Building SANs with Brocade Fabric Switches Fast Track
❖ Chapter 5: The SAN Design Process
Looking at the Overall Lifecycle of a SAN
The SAN design process is a cycle.
This process consists of seven phases:
1. Data Collection
2. Data Analysis
3. Architecture Development
4. Prototype and Test
5. Transition
6. Release to Production
7. Maintenance
Whenever there is a fundamental change to the SAN, the cycle
should repeat.
Conducting Data Collection
Data collection is the foundation on which a SAN is built.
You should interview everybody who has an interest in the project.
During the interview process, create a technical requirements document.
Analyzing the Collected Data
There are several things that you need to get out of data analysis:
—The number of different fabrics that will make up the SAN solution
—The port count and performance characteristics of each fabric
—An estimate of the hardware required to meet these requirements
You might be able to localize traffic for better performance if you can create
well-defined groups.
Prepare an ROI proposition to justify your SAN project.
140_SANs_AppFT
8/14/01
3:42 PM
Page 421
Building SANs with Brocade Fabric Switches Fast Track • Appendix
❖ Chapter 6: SAN Applications
and Configurations
Configuring a High-Availability Cluster
HA clusters are used for redundant, fail-safe installations of mission-critical
business applications.
Clustering provides availability, manageability, and scalability.
Availability is the capability of a cluster to tolerate hardware, network, or
software errors.
The most common use of clustering is two servers configured to share
storage through Fibre Channel.
Redundant HBAs and switches should be used to provide fault tolerance.
The use of dynamic multipathing software, drivers, or HBAs can provide
higher levels of availability to your cluster.
Using a SAN for Storage Consolidation
Storage consolidation enables administrators to centralize storage resources.
Consolidation provides more efficient use of storage, enhances manageability, and improves accessibility.
Almost any layout of a storage network can be used for storage consolidation.
Consolidation requires attention paid to how operating systems treat
shared volumes.
In order to properly partition data in a consolidation environment, you need
to use fabric zoning, LUN masking on storage or the host, or software to
control permissions.
It is generally best to use fabric zoning even when also using another access
control product to achieve a more effective security model, and to provide
a “broadcast container,” which can increase the scalability and reliability of
a SAN.
An example of a typical storage consolidation setup is a shared SAN used to
provide data storage for a Web farm, where many servers read the same disks
to present data.
421
140_SANs_AppFT
422
8/14/01
3:42 PM
Page 422
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 6 Continued
Storage LUN masking is used to ensure that only specific hosts are allowed
access to specific logical units of a storage array.The advantage of storage
LUN masking is that the storage guarantees which host is allowed access to
any volume.
HBA LUN masking is also used to limit what storage a host can see,
and requires that every host in the network participate in the same
masking scheme.
Software partitioning provides another type of control over LUN presentation, but it generally requires upper-level software and demands that every
host in the network be loaded with that software.
Switch zoning, available in Brocade switches, provides a convenient way to
allocate storage to hosts, and to consolidate different departments into a
single company network.
Switch zoning does not currently support control at the LUN level, only at
the port and WWN levels. Upcoming products will add this capability. For
now, other access control techniques might need to be used in addition to
switch zoning to provide access control at the LUN level.
Storage LUN masking provides another way to control access to volumes in
a shared SAN.
High-end storage arrays provide the capability to specify the port or node
WWN of a host HBA, and specify which volumes in the array will respond
to requests.
By using storage LUN masking, you can ensure that only hosts with permission can read or write from a specified volume.
Storage LUN masking requires the participation of the storage only to
enforce permissions.
HBAs provide access control to volumes through LUN masking.
LUN masking controls which volumes an operating system can see through
a particular HBA.
HBA LUN masking requires the participation of all of the hosts in the
network to avoid contention for storage resources.
140_SANs_AppFT
8/14/01
3:42 PM
Page 423
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 6 Continued
LAN-Free Backup Configuration
Traditional backup systems used SCSI direct-attached tape storage.The
LAN-based client-server backup model, although an improvement, cannot
account for ever-increasing amounts of data through the LAN connection.
LAN-free backups using storage networks solve LAN-based problems by
offloading traffic from the LAN and increasing bandwidth.
SAN Server-Free Backup
Server-free backup is the use of a SAN to remove backup traffic from
a LAN.
Backup is done directly on the SAN for each device, rather than each host
being involved in data transfer.
Third-party copy provides an even more efficient way to transfer data to
tape, freeing a backup server from needing to directly access disks and copy
data to tape.
Making Your Enterprise Disaster Tolerant
Fibre Channel SANs are ideal for mirroring and accessing data across
large distances.
It is now possible to separate critical systems many miles apart.
Brocade switches provide extended credits on ISLs to enable high performance and reliable long-distance operation.
❖ Chapter 7: Developing a SAN Architecture
Identifying Fabric Topologies and SAN Architectures
A fabric consists of one or more interconnected Fibre Channel switches.
A SAN includes one or more related fabrics and everything attached
to them.
In a resilient core/edge fabric topology, two or more switches act as a core
to interconnect multiple edge switches.This is the best “general-use”
topology available, especially when combined with the dual-fabric approach
to SAN architecture.
423
140_SANs_AppFT
424
8/14/01
3:42 PM
Page 424
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 7 Continued
In order to select the right topology, you must first decide the requirements
for your SAN architecture.This includes redundancy and scalability in addition to port count.
In general, the cascade, ring, full mesh, and partial mesh are best used in
architectures where the individual fabrics that comprise the SAN will not
change much.This could be true in a static, low-growth environment, or in
a “SAN islands” design.
The resilient core/edge topology is the best choice for general use and for
situations where SAN requirements are either unknown or might change
frequently.
The resilient core/edge topology can be combined with dual fabrics to
achieve maximum performance, reliability, and scalability.
Working with the Core/Edge Topology
The core/edge topology offers a number of key advantages over other
topologies. Core/edge fabrics are:
—Easy to scale without downtime.
—Capable of scaling to a large number of ports.
—Flexible in terms of their cost-to-performance ratios. (You can add
switches to the core with a clear knowledge of how doing so will affect
both cost and performance.)
—Easy to understand, manage, and performance-tune.
—Well-tested and reliable.
Several core/edge fabrics can be used as “cookie-cutter fabrics” when design
information is incomplete or might change frequently.
Determining Levels of Availability
There are four levels of availability that a SAN architecture might employ.
The dual-fabric, resilient approach is the most reliable and the most
frequently recommended.
140_SANs_AppFT
8/14/01
3:42 PM
Page 425
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 7 Continued
In most cases, this approach is not more expensive to implement than the
other three approaches, and it might be less expensive in some cases.
This approach allows for the failure of anything up to and including an
entire fabric without application downtime.
Configuring Traffic Patterns
Tiered fabrics allow simplified management and storage resource planning,
but are the worst-case scenario from the standpoint of locality.
Locality is the most effective approach to performance tuning in a SAN, but
it is frequently unattainable.
You should view locality as a “moving target,” since network complexity
increases over time. However, it is worth getting as much locality as is practical into a SAN, since all SANs benefit in several ways from this technique.
Evaluating Performance Considerations
Over-subscription is never a bad thing in and of itself. It is only when oversubscription becomes congestion that problems might arise.
Latency is almost never a driving consideration in real-world SAN performance, since fabric latency is at least one order of magnitude lower than
typical disk subsystem latency. Exceptions to this rule include clustering software and some highly performance-sensitive applications.
In almost all cases, considerations outside the fabric will limit performance,
such as CPU speed of hosts or the I/O profile of an application.
❖ Chapter 8: SAN Troubleshooting
The Troubleshooting Approach:
The SAN Is a Virtual Cable
Use the SAN’s visibility to both storage and hosts to start your troubleshooting process.
425
140_SANs_AppFT
426
8/14/01
3:42 PM
Page 426
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 8 Continued
The switchShow, nsShow, nsAllShow, errShow, and topologyShow
commands are extremely informational and invaluable to the troubleshooting process.
The UNIX format command or HBA vendor-supplied utilities are also
helpful in troubleshooting.
When you start the troubleshooting process, determine if the issue is fabric
related or device related. A fabric-related issue impacts many devices, and a
device issue impacts only a few devices.
Troubleshooting the Fabric
A fabric issue impacts many devices. A logical switch outage, such as segmentation or physical switch outage, can cause many devices to drop out of
the fabric. Problems with ISL initialization are also considered fabric issues.
The quickest way to narrow your search of a fabric problem is to compare
your baseline SAN profile to your current SAN profile and investigate
discrepancies.
A SAN profile includes the number of devices per switch (nsShow),
number of devices in the fabric (nsAllShow), and number of switches in
the fabric (topologyShow).The errShow and switchShow commands are
also helpful in tracking down fabric issues.
Some fabric issues are caused by a mismatch in fabric service timeout variables and the edge device timeout settings. Careful analysis of both the
fabric and the edge devices is necessary to resolve this complex issue.
Troubleshooting Devices that Cannot Be Seen
The first thing to check is that the missing device is logically connected to
the SAN as indicated by switchShow output.
Next, check to see that the device is present in the Name Server, using the
command nsShow. If the device is not in the Name Server, it is invisible to
the other devices in the fabric.
Other common causes of missing devices are zone conflicts or
marginal links.
140_SANs_AppFT
8/14/01
3:42 PM
Page 427
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 8 Continued
Troubleshooting Marginal Links
Use portErrShow to establish if there are a relatively high number of
errors, such as CRC errors. Look for a steadily increasing number of errors
to confirm a marginal link.
A marginal link can impact multiple devices. For example, a shared storage
device with a marginal link can cause communication problems with all
devices that access that shared storage.
A marginal link can be caused by any of the components that make up the
link: switch port, switch GBIC, cable, edge device GBIC, and the edge device.
Troubleshooting I/O Pauses
I/O pauses happen, and both the SAN and edge device can and should
tolerate such events.
An I/O pause can be as harsh as the powering down of a host or storage
device while I/O is in transit, which will cause I/O to cease. Alternatively, it
might be as lightweight as a port-level RSCN, which might be a problem
for only the most latency-sensitive applications. Relative to the SAN, fabric
events can also cause a pause in I/O.
Several applications, such as video-on-demand and applications that are
evolving into the SAN model, such as tape backup, are very sensitive to
latency and/or RSCNs. High latencies and large numbers of RSCNs can
adversely affect these applications.
Storage vendors, switch vendors, application vendors, and HBA vendors are
working with the standards bodies (T11) as well as modifying their product
implementations to handle these types of exceptions.
❖ Chapter 9: SAN Implementation,
Maintenance, and Management
Installation Considerations
Ensure that ISLs run in front of only the switches to which they are connected.This will allow the switches to be removed without downtime for
the fabric.
427
140_SANs_AppFT
428
8/14/01
3:42 PM
Page 428
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 9 Continued
When racking your switches, be sure to avoid single points of failure.This
means separating redundant fabrics into different racks and powering
resilient fabrics in such a way that a power failure does not cause the fabric
to fail.
Carefully consider solely using in-band management of your switch.
Consider using both in-band and out-of-band management.
Have a well thought-out switch-naming convention to enable easy identification of a physical switch in case a problem arises.
If you intend to implement a large fabric, work closely with your switch
supplier to identify a version of Fabric OS that supports the size of SAN
you intend to build.
Automating Switch Administration Activities
If you have to do a SAN administration activity more than once, consider
automating the activity with a script.
Use Expect for automating your SAN administration activities today and
consider using Fabric OS APIs when they become available.
Take the Expect script example (run_sw_cmd) and modify it for your
SAN administration activities.
If you use Expect scripting, you need the supporting software. See the
following URL for Expect installation guidance: http://expect.nist.gov.
Brocade Zoning Considerations
Determine whether you want to use hard or soft zoning prior to implementing your zone scheme.
Hard zoning is more secure than soft zoning, as hard zoning is enforced
at the Name Server and at the hardware level and will actually block
inappropriate access.
There are differences between hard zoning and soft zoning from a maintenance perspective. For example, you need to update your zone information
if you replace a device that is part of a soft zone.
140_SANs_AppFT
8/14/01
3:42 PM
Page 429
Building SANs with Brocade Fabric Switches Fast Track • Appendix
Chapter 9 Continued
Consider using a script to build and maintain large zoning configurations.
Scripts can also be helpful to implement disaster recovery policies that are
implemented in zoning.
Validating Your Fabric
Baseline your fabric first so that you can quickly identify failures when you
validate your SAN. Use the SAN profile as your baseline.
You can automate many validation activities, such as taking your SAN profile and fault injection.
The key to fault injection is to establish how your entire system behaves
when a fault is encountered.
Key fabric fault injections include: switch power cycle, ISL disable/enable,
and switchDisable/switchEnable. Key edge device fault injections
include: reset or power cycle the edge device.You can simulate this event by
doing a portDisable/portEnable.
Running a load in your SAN can shake out issues like marginal links.The
final test is to run a load on your SAN while you do fault injections. If your
SAN is able to handle this test, you are ready for production.
Pick a minimum of two load types for your SAN I/O testing: one that
approximates your SAN application load, and a load that is bandwidthintensive.
Ask your host supplier, HBA supplier, or storage supplier for the tools they
use for I/O testing in a UNIX environment. If you want to use a
Microsoft/Intel tool, Iometer is a good choice.
SAN Maintenance
If possible, use the “cold” upgrade process for fabric upgrades. It takes only a
few minutes of downtime.
When adding a switch to the fabric, clear out the zone information.
It is simplest to power down or disable your switch prior to connecting that
switch to an existing SAN. Doing so will avoid domain ID conflicts.
429
140_SANs_AppFT
430
8/14/01
3:42 PM
Page 430
Appendix • Building SANs with Brocade Fabric Switches Fast Track
Chapter 9 Continued
A diligently maintained SAN configuration log can help you with disaster
recovery, troubleshooting, recreating a switch whose configuration is
destroyed, SAN design modification or expansion, recovering accidentally
deleted licenses, and recovering a zoning configuration.
Back up your switch configuration with the command configUpload
whenever you add or replace a switch.
Maintaining a baseline SAN profile is essential to many SAN maintenance
activities. Make sure you know how to create and maintain a SAN maintenance profile.
The act of loading firmware does not impact SAN operations.The process
of activating does impact SAN operations because a reboot is required.
140_SANs_index
8/14/01
3:43 PM
Page 431
Index
A
B
ACK frame, 43
active/active storage controllers, 198, 200
active/passive storage controllers, 198, 200
Adapters, Media Interface (MIAs). See
Media Interface Adapters (MIAs)
addressing, switched fabric, 48–49, 358, 361
administrative activities, automating,
367–372
Expect scripting and, 369–372
Fabric OS API and, 367–368
Alias Server, 50, 52, 134
analyzer, Fibre Channel, 287, 314–316
any-to-any connectivity, 268–269
APIs
Fabric OS API, 367–368, 378
HBA API, 101–103
application service providers (ASPs), data
sharing and, 12
Application-Specific Integrated Circuit
(ASIC), 80
applications
assessing need for SAN in interviews,
16–17
selecting during design phase, 165
switch management via, 94
arbitrated loop topology, 4, 5, 33, 39, 47–48
architecture, SAN. See SAN architecture
asynchronous transfer mode (ATM), 31
automatic device discovery, 133
Fabric OS and, 133
Host Bus Adapters (HBAs) and, 101
automatic path failover, 134
automatic port bypass, managed hubs and,
78
availability levels, SAN architecture and,
256–260
availability, high-availability (HA) cluster,
197
backups, configuration
switch configuration file, 393–394
zoning configuration, 382
backups, network
accelerating cycling of, 14
collecting information on in interviews,
167
LAN-free, 212–213
reducing network congestion from, 13
remote, 218–219
SAN-based server-free, 213–216
bad_eof error statistic, 297
bandwidth
assessing need for SAN, 17
Fibre Channel hub, 34
baseline SAN profiles, 308–311, 317–318,
382–384
block-level protocols, data access speed and,
14
bonded SC connectors, 72
bridges, Fibre Channel, 35, 60, 64–65,
106–109
extended copy support by, 108–109
management interfaces for, 109
number of SCSI buses on, 107
SCSI termination type and, 108
selective LUN presentation by, 108
types of SCSI ports available on, 108
broadcasting, IP over Fibre Channel (IPFC),
88, 96–97
Brocade Fabric Access Layer API, 137
Brocade Fabric Assist, 79, 138–139, 147, 337
Brocade Fabric Manager, 131
Brocade Fabric OS, 132–135, 228
adding switches with, 248
automatic device discovery, 133
command-line interface, 135
continuous port monitoring, 133
dynamic routing services, 134
431
140_SANs_index
432
8/14/01
3:43 PM
Page 432
Index
Fabric Access Layer API, 137
Fabric OS API and, 367–368, 378
Fibre Channel services provided by,
133–134
history of features and enhancements,
363–365
in-band interface, 135
Management Information Bases (MIB)
provided by, 135
switch beaconing and, 135
syslog daemon interface, 135
universal port support by, 133
version, selecting which to use, 361–365
Brocade Fabric Watch, 75, 138, 147, 339,
366
Brocade Intelligent Fabric Services
Architecture, 140–143
frame filtering and, 142
hardware-enforced zoning, 142
ISL Trunking, 140–142
performance analysis and, 143
Secure Fabric OS and, 143
Brocade QuickLoop, 79, 80, 138–139
isolating marginal port faults, 339
private loop devices and, 165–166, 337
Brocade Remote Switch, 218
Brocade Secure Fabric OS, 143
Brocade SilkWorm switches, 124–132, 146,
147
entry-level series, 126–128
licensing, 136, 322
Metropolitan Area Networking (MAN)
and, 219
port error statistics, 339, 341–342
scalable, 128–131
selecting most appropriate, 124–126
SilkWorm 1000, 300, 301
SilkWorm 12000 Core Fabric Switch,
126, 131–132
SilkWorm 2000, 126, 300, 301
SilkWorm 2010, 126, 127
SilkWorm 2040, 126, 127
SilkWorm 2050, 126, 128
SilkWorm 2200, 126
SilkWorm 2210, 127
SilkWorm 2240, 127
SilkWorm 2250, 128
SilkWorm 2400, 126, 128, 129
SilkWorm 2800, 126, 128, 129–130
SilkWorm 6400 Integrated Fabric, 126,
130–131, 256
SwitchType values, 302
zoning of, 373–375
See also switches, Fibre Channel
Brocade SOLUTIONware guides, 158, 196
Brocade Web site, 158
Brocade WEB TOOLS, 139–140, 147, 366,
378, 408
Brocade Zoning, 136, 146–147, 372–373
licensing for, 366, 372
private loop devices and, 165–166
buffer credits, switch port, 86–87
buffer-to-buffer flow control, 43
business goals, identifying, 153, 158
business requirements, identifying, 158–159
C
cable testers, 315
cabling, 61, 65–68, 164
copper, 61, 65–66
layout of, 351–354
multimode optical, 36, 61, 66–67
single-mode optical, 36, 61, 68–69
camTest command, 289
cascade topology, 236–237
compared with other topologies, 247
resiliency of, 257, 258
central memory diagnostics, 289
centralMemoryTest command, 289
cfgClear command, 324
cfgDisable command, 324, 343
cfgEnable command, 343, 347
cfgShow command, 286, 324, 333
140_SANs_index
8/14/01
3:43 PM
Page 433
Index
channel protocols, 31
channels, 31, 38
Chaparral products
Fibre Channel/SCSI bridges, 215
Network Storage, 14
character encoding, Fibre Channel, 36, 41
Class F service, 45, 50
classes of service, Fibre Channel, 37, 39,
43–45
Class 1, 43–44
Class 2, 44, 84
Class 3, 44, 84
Class 4, 44
Class F, 44, 50, 84
Cluster Server, Microsoft’s, 11, 200–202
clustering techniques, FC-VI standard and,
14–15
clusters, high-availability. See HighAvailability (HA) clusters
cmemRetentionTest command, 290
CMI bus connection diagnostics, 289
cmiTest command, 289
cold fabric upgrades, 398, 400
combination adapters, 98
command-line interface, Fabric OS, 135
complex core resilient core/edge topology,
244–245, 247
components, 61–65
attention to those in production during
SAN planning, 166
evaluating pre-existing, 165–166
redundant HA cluster, 199
validation of, 166
See also specific components
composite resilient core/edge topology,
245–246
Computer Associates Unicenter TNG, 350,
368
configDownload command, 393, 394
configShow command, 286, 394
configUpload command, 324, 382, 393
configuration logs, 391–393
433
configuration management software, HBA,
101
congestion, network, 13, 233, 270
connectors, Fibre Channel, 61–62, 69–73
D-B9 connectors, 69–70
high-density optical connectors, 72–73
HSSDC connectors, 70–71
SC connectors, 71–72
copper cabling, 61, 65–66
copper connectors, 62, 69–71
core/edge fabric, 228, 229–230, 242–246
adding edge switches to, 248–250
compared to other topologies, 247
complex core/edge, 244–245
resiliency of, 256, 257, 258
scaling without downtime, 248
simple core/edge, 244
target designs for, 253–256
upgrading core switches, 250–253
core switches, 81–82, 242–243, 250–253
adding new to core/edge fabric, 251
configuring new core/edge fabric,
251–252
core team, SAN
identifying people to include, 156–157,
193–194
interview process for, 157–176
costs
Brocade SilkWorm switches, 125, 126
cabling media, 65, 67, 68
cascade topology, 236, 237
complex core resilient core/edge topology, 245
full-mesh topology, 238
partial-mesh topology, 242
ring topology, 238
CPU speed, pre-SAN performance data
and, 170
CRC errors, 295
crc_err error statistic, 296–297
crossPortTest command, 290
Crossroads Systems, 14, 215
140_SANs_index
434
8/14/01
3:43 PM
Page 434
Index
D
D characters, 41
data access, increasing speed of, 14
data analysis, SAN design process and, 153,
194
port requirements, establishing, 182–187
Return On Investment (ROI)
proposition, 153, 159, 187–189
SAN grouping process, 178–182
data characters, 41
data collection, SAN design process and,
153, 156–177, 194
backup information, 167
business problem identification, 158
business requirement identification,
158–159
component testing needs, 166–167
components, identifying those in place,
165–166
components, identifying those in production, 166
current performance data, 168–172
design interview form, 175–176
host information, 160–162
initiator-to-target communications matrix,
167–168
maintenance downtime, 174–175
node information, 160–161
performance, determining future needs,
172–174
physical assessment of hardware, 176–177
processing collected data, 177–182
SAN-enabled applications needed, 165
SAN implementation downtime, 174
selecting people to interview, 156–157
storage device information, 162–163
storage facility information, 164
technical requirement identification,
159–160
timeline creation from, 175–176
data movers, 14, 214
data replication techniques, 218
data sharing, 12, 18–19, 203–212
file-level sharing, 19
LUN masking and, 210–211
resource sharing, 19
software management of, 211–212
switch zoning and, 208–210
volume-level sharing, 19
with Web farms, 206–207
See also switch zoning
data storage
consolidating with SANs, 9–10, 11–13,
203–212
increased need for, 8–9
sharing among multiple hosts, 12,
203–212
database servers, HA cluster configuration,
198–200, 226
DataCare SANsymphony, 12
DB-9 connectors, 62, 69–70
DB-9 serial cabling, 69, 70
dd tool, UNIX, 388–389
debuggers
Fibre Channel analyzer, 314–315
portLog, 314
protocol analyzers, 315–316
dedicated connection types, 43
Dense Wave Division Multiplexing
(DWDM), 109–110, 217, 219
design interview form, 176
Destination ID (D_ID), switch, 302
devices
adding to fabrics, 401–402
automatic discovery of, 133
collecting information on in interviews,
160–164
determining if existing require additional
hardware, 165–166
timeout at bring up, 321–322
troubleshooting missing, 279–283,
327–335
DiagErr# message, 293
diagClearError command, 290
140_SANs_index
8/14/01
3:43 PM
Page 435
Index
diagDisablePost command, 290
diagEnablePost command, 290
diagHelp command, 289
diagnostic switch commands, 289–308
errShow, 278, 292–295, 318
help, 291–292
nsAllShow, 286, 320, 329, 382, 395
nsShow command. See nsShow command
portDisable, 249, 302, 318, 330, 334–335
portEnable, 302, 318, 330, 334–335
portErrShow, 286, 295–297
portLoopbackTest, 289, 290
show vs. dump, 291
supportShow. See supportShow command
switchEnable, 251, 318
switchShow. See switchShow command
topologyShow, 307–308, 309, 320, 329
diagnostics, storage array, 287
diagShow switch command, 286, 290
disabled switches, 300
disaster tolerance, SANs and, 15–16, 31,
216–221
data replication and remote backup,
218–219
Metropolitan Area Networks (MANs)
and, 219–221
Disk Administrator,Windows, 312, 317
disk farms, 9
disk I/O performance, 169–172, 385–390
disk monitoring tools, 169–170
disk seek time, 171
diskmon feature,Windows NT, 169
diskperf -yd command, 172
diskperf -yv command, 172
diskperf utility, 172, 386
disks
storage on, 32, 33, 64, 111
troubleshooting missing, 279–283
distance requirements, SANs and, 17–18
distributed fabric services, 41
domain IDs, switch, 48, 300–301, 319
conflicting, 326–327
435
merging fabrics and, 397
setting, 358
downstream information, 302, 303, 304
downtime
determining acceptable for maintenance
and changes, 174–175
determining acceptable for SAN implementation, 174
scaling core/edge networks without,
248–253
drivers, determining if full fabric, 165–166
dual-fabric SANs, 248
HA clusters and, 199, 202
nonresilient, 257
resilient, 257
dual-port adapters, 98
DWDM. See Dense Wave Division
Multiplexing (DWDM)
dynamic discovery
Fabric OS and, 133
Host Bus Adapters (HBAs) and, 101
dynamic routing services, Fabric OS, 134
E
e-mail, data storage and, 8
E_Ports (ISLs), 48–49, 87
cable layout and, 351–354
incomplete initialization of, 318–319
ISL over-subscription ratio, 231–232
load sharing through, 134
port configuration conflicts and, 322–323
trunking, 140–142
edge devices. See nodes
edge ports, 243
edge switches, 243, 248–250
18-switch core/edge SAN design, 255
Electrically Erasable Programmable ReadOnly Memory (EEPROM), 74–75
Emulex HBA configuration utility, 101, 102
enc_in error statistic, 296
enc_out error statistic, 297
End Of Frame (EOF) primitive, 40
140_SANs_index
436
8/14/01
3:43 PM
Page 436
Index
end-to-end flow control, 43
er_disc_c3 error statistic, 297
er_enc_out statistic, 338
errDump command, 286
error logs, 292–295
error messages, 284, 285, 293, 324, 338
error statistics, 296–297, 341–342
errors
displaying logged, 292–295
MQ errors, 318, 327
errShow command, 278, 292–295, 318
Ethernet, 31
in-band/out-of-band switch management,
356
router management by, 109
Exchange Mail Server, 226
Expect scripting
firmwareDownload automation by, 400
switch management by, 369–372
zoning management by, 379–380
Extended Copy command, 108–109, 214,
215
Extended Fabrics, 136–137
Extreme SCSI, 169, 170
EZ Fibre, JNI’s, 312
F
F_Ports, 48–49, 87
Fabric Access Layer API, 137
Fabric Assist, 79, 138–139, 147, 337
Fabric Configuration Server, 51
fabric licenses, 322, 366
Fabric Login (FLOGI) frame, 50
Fabric Loop Attachment (FLA), 337
Fabric Manager, 131
Fabric OS API, 367–368, 378
fabric port count, 230
fabric segmentation, 318, 323–327, 348
fabric services, 36, 38, 41, 49–52, 85–86
Alias Server, 50, 52, 134
Fabric/Switch Controller, 51
Login Server, 38, 50
Management Server, 51, 85, 86, 133
Name Server. See Name Server
Registered State Change Notification
(RSCN), 51, 85–86, 284, 343–344,
347
switches and, 85–86
Time Server, 52, 86
Fabric Shortest Path First (FSPF), 90–91,
134, 228, 233, 256
Fabric/Switch Controller, 38, 50
fabric topologies, 229, 235
cascade topology, 236–237
comparison of properties of, 247
complex, 246
congestion and, 270
core/edge fabric, 242–246, 248–256
distance and, 234
factors affecting performance, 270–271
full-mesh topology, 239–240
over-subscription and, 270
partial-mesh topology, 240–242
resiliency of, 256
ring topology, 237–238
SAN architecture availability and, 257–260
scalability of, 236
vs. SANs, 275
fabric troubleshooting, 316–327
comparing SAN profiles to identify problem, 317–318
domain ID conflicts, 326–327
fabric information available from hosts,
317
fabric licenses and, 322
I/O pauses and, 343–344
incompatible fabric parameters, 325–326
MQ errors, 327
Name Server discrepancies, 320–321
port configuration conflicts, 322–323, 347
switch LEDs and, 288
symptoms indicative of fabric problem,
316
140_SANs_index
8/14/01
3:43 PM
Page 437
Index
timeout of devices at bring up, 321–322
topologyShow command and, 309, 320,
321, 329, 382, 395
zoning conflicts, 323–324
See also SAN troubleshooting
Fabric Watch, 75, 138, 147, 339, 366
Fabric Zone Server, 51
fabrics, 5, 34, 36, 49–52
adding edge devices to, 401–402
adding switches to, 395–398
bringing up, 321–322, 394–395
defined, 229
licenses for, 322, 366
merging, 395, 408
segmented, 318, 323–327, 348
timeout of devices at bring up, 321–322
troubleshooting. See fabric troubleshooting
upgrading, 398–400, 408
verification of, 382–383, 395
zone management and, 378–379
fabricShow command, 250, 251, 286
fan-in, 234
fan-out, 234
faShow command, 286
fastboot command, 399
fault injection techniques, 384–385
fault tolerance
of High-Availability (HA) clusters, 199, 200
of SANs, 10–11
faultShow command, 286
FC-0 Fibre Channel layer, 36, 40
FC-1 Fibre Channel layer, 41
FC-2 Fibre Channel layer, 36, 41
FC-3 Fibre Channel layer, 36, 41
FC-4 Fibre Channel layer, 36, 41–42
FC-AL. See Fibre Channel Arbitrated Loop
(FC-AL)
FC-GS-3 standard, 41, 50
FC-VI standard, 15
FC_IP (Fibre Channel across IP), 110
437
FCIA. See Fibre Channel Industry
Association (FCIA)
FCP/SCSI protocol, 96
Fiber Distributed Data Interface (FDDI),
31, 43
fiber-optic cables, 36, 61, 65
fiber-optic connectors, 62, 71–73
high-density optical connectors, 72–73
SC optical connectors, 71–72
Fibre Channel, 30–58
broadcasting via, 88
character encoding, 36, 41
classes of service, 37, 39, 43–45
cost of, 28
CPU requirements, 170
disk seek time requirements, 171
distances supported, 2, 39
HBA speed requirements, 171
history of, 2
hot-plug systems and, 104–105
interoperability of, 28, 58
layers, 36–37
PCI bus speed requirements, 170–171
protocols supported, 2, 31, 39, 42
RAID speed requirements, 171
RAM requirements, 171
resources on, 8
routing IP over to Gigabit Ethernet, 65,
110–111
routing over IP networks, 110
SCSI performance vs., 57
speed of, 14, 31, 38
standards, 35
switched fabric installation, 5, 6
topologies, 37, 39
transfer rates, 35
Fibre Channel analyzers, 287, 314–316
Fibre Channel Arbitrated Loop (FC-AL), 4,
5, 33, 39, 47–48, 60
Fibre Channel common services layer. See
FC-3 Fibre Channel layer
Fibre Channel disks, 33
140_SANs_index
438
8/14/01
3:43 PM
Page 438
Index
Fibre Channel Industry Association (FCIA),
8, 22, 35, 58
Fibre Channel layers, 36–37, 40–42
FC-0, 36, 40
FC-1, 36, 41
FC-2, 36, 41
FC-3, 36, 41
FC-4, 36, 41–42
Fibre Channel Management MIB, 93
Fibre Channel Protocol (FCP), 96
Fibre Channel protocol layer. See FC-2
Fibre Channel layer
Fibre Channel standards projects, 35
Fibre Channel Storage Area Network
(SAN). See SANs (Storage Area
Networks)
Fibre Channel-to-Dense Wavelength
Division Multiplexing (DWDM), 16,
109–110
Fibre Channel-to-Gigabit Ethernet bridges,
65, 110–111
FibreAlliance Management Information
Base (MIB), 92–93
FibreAlliance MIB, 109
fields, frame, 39–40
file-level sharing, 19
File Transfer Protocol (FTP), router management by, 109
firmware upgrades, 89, 398–400
cold fabric upgrade, 400
hot fabric upgrade, 399
scripting of, 400
FL_Ports, 48–49, 87
flow control, 43
format UNIX command, 312, 317
14-switch core/edge SAN design, 255–256
frame filtering, 142
frames, 37, 39–40, 41
classes of service and, 43–45
headers, 39–40
free ports, 243
full-mesh topology, 239–240, 247
fw_downloand script, 400
G
GBICs. See Gigabit Interface Connectors
(GBICs)
Get All Next (GA_NXT) request, 51
get_san_profile script, 384
Get_Time frame, 52
Get_Time_Response frame, 52
Gigabit Ethernet, 14, 30, 57
cost of, 28
routing Fibre Channel to, 110–111
Gigabit Interface Connectors (GBICs), 60,
61–62, 73–75, 121
Brocade SilkWorm switches and, 126
disadvantages of, 74, 75
GBIC ports on equipment, 74
serialized, 74–75
H
HA clusters. See High-Availability (HA)
clusters
hard zoning, 83–84, 303, 374, 375–378
hardware forwarding, 38
hardware, SANs, 21, 61–65
identifying pieces in place during design
phase, 165–166
initiating devices, 33
interconnecting devices, 33–34
physical assessment of in design process,
176–177
selecting, 21–22
target devices, 32–33
See also specific pieces
HBA. See Host Bus Adapters (HBAs)
HBA API, 101–103
headers, frame, 37, 39–40
heartbeat, network, 198, 202
help command, 291–292
Hewlett-Packard
LUN Manager product, 212
OpenView, 12, 350, 368
high-availability applications, 198–200
140_SANs_index
8/14/01
3:43 PM
Page 439
Index
high availability, ensuring with SANs,
10–11, 31
High-Availability (HA) clusters, 196–202,
257
active/active model, 198
active/passive model, 198
advantages of, 197
database servers and, 198–200
fault tolerance of, 199, 200
high-availability applications and, 198–200
Microsoft Cluster Server and, 200–202
redundancy and, 199
storage devices for, 200
zero-downtime failovers, 199
high-availability storage devices, 200
high-density optical connectors, 62, 72–73
high-end storage arrays, 113–114
LUN export across multiple ports by,
113–114
selective LUN presentation by, 113
snapshot backup volumes by, 114
High-Performance Parallel Interface
(HiPPI), 31, 38, 42–43
HighGround, Sun’s, 350
hop counts, 231, 276
Host Bus Adapters (HBAs), 33, 60, 64, 95
combination adapters, 98
configuration management software for, 101
connecting hosts to fabric with, 95
default LUN access permissions and, 100
drivers for, 172
dual in HA clusters, 199
dynamic discovery by, 101
Fabric Assist and, 139
fabric-capable, 98–99
HBA API and, 101–103
hot-plug systems and, 104–106
LUN mapping (persistent binding) and,
99–100
LUN masking and, 99
ports available on, 98
private (loop-based), 98
439
protocol access permissions, 100
protocols supported, 95, 96–97
QuickLoop and, 139
remote booting and, 103–104
speed requirements, 97, 171, 172
static discovery by, 101
storage partitioning by, 210–211
types of, 95–96
zoning, 373
host tier switches, 263
hosts, 33
checking for with switchShow command,
329
collecting information on in interviews,
160–162
troubleshooting information available
from, 287, 312–313, 317
hot fabric upgrades, 398, 399
hot-plug systems, 104–106, 126
hot-swappable components, 86, 125
HSSDC-2 connectors, 71
HSSDC connectors, 62, 70–71
hubs, Fibre Channel, 4, 34–35, 60, 63, 76–80
LIP process and, 77, 78–79
managed, 35, 60, 76–78
simple electrical, 34, 35, 60, 76
vs. switches, 57
I
I/O generators, 387–390
I/O load, 385–390
generation of, 387–390
types of, 386–687
I/O pauses, 342–343, 347
IBM DB2, 15, 226
in-band switch management, 356–358
incomplete ISL initialization, 318–319
InfiniBand technologies, 30, 57, 58, 71
initiating devices, 33
initiator-to-target communications,
mapping, 167–168
installation, SAN. See SAN installation
140_SANs_index
440
8/14/01
3:43 PM
Page 440
Index
Integrated Drive Electronics (IDE), 2
Intelligent Fabric Services Architecture,
140–143
frame filtering and, 142
hardware-enforced zoning, 142
ISL Trunking, 140–142
performance analysis and, 143
Secure Fabric OS and, 143
Intelligent Peripheral Interface (IPI), 42
Inter-Switch Links (ISL; E_ports), 48–49, 87
cable layout and, 351–354
incomplete initialization of, 318–319
ISL over-subscription ratio, 231–232
load sharing through, 134
port configuration conflicts and, 322–323
trunking, 140–142
interconnecting devices, 33–34
Internal Rate of Return (IRR) calculations,
189
Internet, data storage and, 9
Internet Protocol (IP), 2, 31, 39, 42, 96–97
Internet Service Providers (ISPs), 12–13
interoperability labs, 22–23
interviews, SAN design process and, 150,
153
backup information, identifying required,
167
business problem identification, 158
business requirements identification,
158–159
component testing needs, 166–167
current performance data, 168–172
design interview form, 175–176
host information, collecting, 160–162
identifying people to interview, 156–157,
193–194
implementation, determining acceptable
downtime for, 174
initiator-to-target communications matrix,
167–168
maintenance downtime, determining
acceptable, 174–175
performance, determining future needs,
172–174
SAN-enabled applications desired, 165
storage device information, 162–163
storage facility information, 164
technical requirement identification,
159–160
timeline creation from, 175–176
Iometer, Intel’s, 169, 170, 387, 389
iostat utility, Sun Solaris, 169, 386
IOzone, 387
IP addresses, setting switch, 358, 361
IP networks
routing Fibre Channel across, 110
routing over Fibre Channel to Gigabit
Ethernet, 110–111
IP protocol, 2, 31, 39, 42, 96–97
IP targets, 32, 33
IPFC protocol, 88, 96, 356–358
iSCSI, 57–58
ISLs (E_Ports). See Inter-Switch Links (ISLs)
J
Java-based Web pages, switch management
by, 93–94
JNI EZ Fibre, 312
Just a Bunch Of Disks (JBOD), 32, 33, 64,
111
K
K characters, 41
Key Distribution server, 50
L
LAN-based backup configurations,
212–213, 214
LAN-free backup configurations, 212–216
latency, 231
LC connectors, 72
LEDs, switch, 287–289, 318, 329
140_SANs_index
8/14/01
3:43 PM
Page 441
Index
legacy devices, connecting to SANs, 106
Legato NetWorker, 13
license, fabric, 322, 366
licenseShow command, 286, 322, 366
LIP (Loop Initialization Primitive), 77–78,
337–338, 339
LIP process, 77, 78–79
LIP storm, 79
Lip_in count, 339, 342
Lip_out count, 339, 342
Lip_rx count, 342
load testing, I/O, 385–390
locality, performance optimization through,
266–268
logical unit number. See LUN (Logical Unit
Number)
Login Server, 38, 50
logs, configuration, 391–393
logs, error, 292–295
loop environments, 4, 5, 33, 90
isolating marginal port faults, 339
LIP imbalances and, 339
LIP process and, 78–79
marginal GBICs and, 338
marginal loop connections, 337–338
marginal port behavior on disrupted, 338
migrating to switched fabrics, 79–80
Loop Initialization Primitive (LIP), 77,
337–338, 339
loop ports. See FL_Ports
loop zoning, 77
loopPortTest command, 290
LUN (Logical Unit Number)
access permissions, 100
high-availability (HA) clusters and, 200
LUN-level zoning, 373, 374
Microsoft Cluster Server configurations,
200–202
selective presentation of, 108
LUN Manager, Hewlett-Packard’s, 212
LUN mapping (persistent binding), 99–100
LUN masking, 122, 205, 226
441
HBA-based, 99
hot-plug systems and, 105
storage partitioning by, 210–211
M
make_zone script, 379–380
MAN technologies. See Metropolitan Area
Networking (MAN)
manageability, high-availability cluster, 197
managed Fibre Channel hub, 34–35, 60,
76–78
Management Information Base (MIB),
92–93, 135
Management Server, 38, 85, 86, 133
mapping, LUN, 99–100
marginal GBICs, 338
marginal loop connections, 337–338
marginal point-to-point/fabric device links,
335–336
marginal ports, 335
disrupted loops and, 338
fault isolation and, 339, 340
marginal GBICs, 338
marginal loop connections, 337–338
marginal point-to-point/fabric device
links, 335–336
portErrShow command, 295–297
masking, LUN, 122, 205, 226
HBA-based, 99
hot-plug systems and, 105
storage partitioning by, 210–211
media, cabling, 61, 65–68, 164
Media Interface Adapters (MIAs), 75
mesh topologies, 238
compared to other topologies, 247
full-mesh topology, 239–240
partial-mesh topology, 240–242
resiliency of, 256, 257, 258
metadata servers, 212
Metropolitan Area Networking (MAN),
15–16, 217, 219–221, 238
MIAs. See Media Interface Adapters (MIAs)
140_SANs_index
442
8/14/01
3:43 PM
Page 442
Index
Micromuse’s Netcool, 350, 368
Microsoft Cluster Server (MSCS), 11,
200–202
Microsoft Exchange databases, 11
Microsoft SQL Server, 226
Microsoft Windows Hardware Quality Lab
(WHQL), 202
migration process, 154
missing devices, troubleshooting, 279–283,
327–335
locating on Name Server with nsShow,
332–333
port configuration conflicts, 329–332
switchShow command and, 329–332
zoning mismatches and, 333–335
MQ errors, 318, 327
mqShow command, 286
MSCS. See Microsoft Cluster Server
(MSCS)
MT-RJ connectors, 72–73
multi-LUN devices, 208
multicast groups, 52
multimode optical cables, 36, 61, 66–67
multipathing software, 199
N
Name Server, 38, 41, 50–51, 85, 133–134
hard zoning and, 376
missing devices and, 280–283, 303–307,
332–335
nsAllShow command, 286, 320, 329, 382,
395
nsShow command, 281–282, 303–307,
320–321
names, switch, 360
Net Present Value (NPV) calculations, 189
NetBackup,VERITAS’, 13
Netcool, Micromuse’s, 350, 368
Network Attached Storage (NAS), 17, 18
network backups
accelerating cycling of, 14
collecting information on in interviews,
167
LAN-free configurations for, 212–213
reducing network congestion from, 13
remote, 218–219
SAN-based server-free, 213–216
Network Data Management Protocol
(NDMP), 214
network heartbeats, 198, 202
network protocols, 31
NetWorker, Legato’s, 13
node count, 230
node WWNs, 306–307
nodes, 230
adding to fabrics, 401–402
collecting information on in interviews,
160–164
missing from Name Server, 334–335
over-subscription of, 232
timeout of at bring up, 321–322
WWNs, 307
nonresilient dual-fabric SANs, 257
nsAllShow command, 286, 320, 329, 395
nsShow command, 286
fabric validation using, 382
troubleshooting fabrics with, 320–321
troubleshooting missing devices with,
281–282, 332–335
troubleshooting SANs with, 303–307
O
Open Fibre Control (OFC), 67
OpenView, Hewlett-Packard’s, 12, 350, 368
optical cabling, 36, 61, 65
optical connectors, 62, 71–73
Oracle Parallel Server, 15, 226
out-of-band switch management, 356
over-subscription, 231
140_SANs_index
8/14/01
3:43 PM
Page 443
Index
P
parallel bus SCSI, 2–4
parallel SCSI termination, 108
parityCheck command, 290
partial-mesh topology, 240–242, 247
PATROL, BMC’s, 350, 368
payback period, 189
payloads, 36, 39
PCI bus speed, pre-SAN, 170–171
PCI hot-plug systems, 104–106
perfmeter, Sun Solaris, 169
perfmon,Windows, 169
performance
Brocade Intelligent Fabric Services
Architecture and, 143
of cascade topology, 237
of complex core resilient core/edge topology, 245
determining future performance needs,
172–174
evaluating pre-SANs, 168–172
factors affecting, 270–271
of full-mesh topology, 240
of partial-mesh topology, 242
of ring topology, 237–238
Peripheral Component Interconnect (PCI)
hot-plug systems, 104–106
persistent binding (LUN mapping), 99–100
persistent logging, 293
phantom devices, 302, 303
PLOGI/FLOGI timeout failures, 321–322
point-to-point connections, 87
point-to-point topology, 37, 39, 45–47
port addressing, 307–308
port-based zoning, 83
port counts, 230
port indicators, LED, 287–289
port internal loopback diagnostic, 290
Port Login (PLOGI) frame, 50
port register diagnostics, 289
port WWNs, 306–307
443
portCfgEport command, 322–323, 330
portcfgFport command, 330
portCfgLport command, 286, 330
portDisable command, 249, 302, 318, 330,
334–335
portEnable command, 302, 318, 330,
334–335
portErrShow command, 286, 295–297
portFlagsShow command, 286
portLog debugging tool, 314
portLogDump command, 286, 314
portLogShow command, 314
portLoopbackTest command, 289, 290
portRegShow command, 286
portRegTest command, 289
portRouteShow command, 286
ports, 48–49, 50
automatic bypass of by managed hubs, 78
buffer credits available per switch port,
86–87
collecting information on in interviews,
160–161
configuration conflicts, 322–323, 348
determining number needed, 182–187
display of port state information, 297–303,
318
E_Ports. See E_Ports (ISLs)
edge, 243
F_Ports, 48–49, 87
FL_Ports, 48–49
free, 243
manual configuration of, 330
marginal, 335–342
monitoring of by Fabric OS, 133
number on Brocade SilkWorm switches,
125
number on HBAs, 98
number on routers, 107, 108
self-configuring, 87–88
types of SCSI on routers, 108
universal port support by Fabric OS, 133
portSemShow command, 286
140_SANs_index
444
8/14/01
3:43 PM
Page 444
Index
portShow command, 341, 342
portStatsShow command, 278, 341, 342
power supplies, redundancy and, 86, 125
primitives, 36, 39–41
principal switches, 300
Prisa’s Visual SAN, 350, 368
private devices, 302, 303
private Host Bus Adapters (HBAs), 98
Private Loop Direct Attached (PLDA), 337
private loop drivers, 165–166
Private Loop Fabric Attach (PLFA), 337
problem descriptions, troubleshooting and,
284–285
protocol analyzers, 315
prototypes, 21, 153–154
psShow switch command, 286
Q
qlDisable command, 330
qlPortDisable command, 330
qlShow command, 286, 330
QuickLoop, 79, 80, 138–139, 330, 337, 339,
366
QuickLoop CAM diagnostics, 289
R
R_RDY primitive, 43
racking, 354–355
RAID storage arrays, 32, 33, 64, 111–112
controller speed requirements, 171
switch zoning and, 208
RAM (Random Access Memory), 171
ramTest command, 289
redundancy, 86, 125, 126, 256–257
redundant fabric architecture, 175, 199,
256–257, 258–260
Registered State Change Notification
(RSCN), 51, 85–86
I/O pauses and, 343–344, 347
new device registration and, 284
reliability
cascade topology, 236, 237
complex core resilient core/edge
topology, 245
full-mesh topology, 240
partial-mesh topology, 241, 242
ring topology, 238
remote backups, 218–219
remote booting, 103–104
Remote Switch, 218
resiliency, SANs and, 126, 256
resilient core/edge fabric topology, 242–246
adding edge switches to, 248–250
defined, 229–230
upgrading core switches, 250–253
resilient dual-fabric SANs, 257
resource sharing, 19
Return Merchandise Authorization (RMA)
system, 173–174
Return On Investment (ROI) proposition,
153, 159, 187–189, 190
RFT_ID frame, 50–51
ring topology, 237–238
compared to other topologies, 247
resiliency of, 256, 257, 258
ROI proposition. See Return On
Investment (ROI) proposition
rolling upgrades, HA clusters and, 197,
198–199, 202
routers, Fibre Channel, 35, 60, 64–65,
106–109
extended copy support by, 108–109
management interfaces for, 109
number of SCSI buses on, 107
SCSI termination type and, 108
selective LUN presentation by, 108
types of SCSI ports available on, 108
run_sw_cmd script, 369, 372, 379, 384, 400
S
SAN architecture, 153, 230–231, 253–256
any-to-any connectivity and, 268–269
availability models, 256–260
core/edge fabric designs, 253–256
140_SANs_index
8/14/01
3:43 PM
Page 445
Index
development of, 153
Intelligent Fabric Services Architecture,
140–143
leveraging tiers, 261–265
localizing traffic, 266–268
SAN-based server-free backups, 213–216
SAN-based third-party copy data movers,
214, 215–216
SAN configurations, 196–221, 226
disaster tolerance solutions, 216–221
High-Availability (HA) clusters, 196–202
LAN-free backup configurations, 212–213
SAN-based server-free backups, 217
storage consolidation configurations,
203–212
SAN grouping, 178–182
SAN installation, 21, 350
creating baseline profile at, 382–383
ISL cable layout for, 351–354
racking considerations, 354–355
setting switch parameters, 358–359, 361
zoning considerations, 372–382
SAN lifecycle, 190
architecture development phase, 153
data analysis phase, 153, 177–189
data collection phase, 153, 156–177
maintenance, 155
migration process, 154
overview of, 151–155
prototypes, 153–154
release to production, 154–155
testing, 154
SAN maintenance, 155
adding edge devices to fabrics, 401–402
adding switches to fabrics, 395–398
bringing up fabrics, 321–322, 394–395
configuration log for, 391–393
downtime requirements for, 174–175
fabric upgrades, 398–400
merging fabrics, 395, 408
replacing switches on fabrics, 395–398
switch configuration backups, 393–394
445
SAN management, 350–351
automation of administrative activities,
367–372
in-band IPFC, 356–358
out-of-band, 356
SAN port count, 230
SAN profiles, 308–311, 317–318, 382–384
SAN troubleshooting, 278–316
creating problem descriptions, 284–285
creating SAN profiles, 308–311
Fibre Channel analyzers, 314–316
gathering supportShow information,
285–287
I/O pauses, 342–343
marginal links, 335–342
missing devices, 279–283, 327–335
portLog debugging tool, 314
SAN profiles, 308–311, 317–318, 382–384
scenario describing, 279–283
switch diagnostics and, 289–308
switch LEDs and, 287–289
using information from hosts for, 312–313
“virtual cable” approach to, 278–279
See also fabric troubleshooting
SAN verification
fault injection techniques, 384–385
I/O load and, 385–390
SAN profiles and, 382–383
SANavigator, 350, 368
SANergy,Tivoli’s, 12, 212
SANmark Conformance Documents
(SCDs), 35
SANmark suite, 8, 22, 35, 58
SANs (Storage Area Networks), 229
acceleration of backup cycles by, 14
applications best served by, 16–17
arbitrated loop topology, 4, 5, 33, 37, 39,
47–48
architecture overview, 30–36
availability levels, 257–260
cluster protocol access and, 14–15
configuration log for, 391–393
140_SANs_index
446
8/14/01
3:43 PM
Page 446
Index
configuration, typical, 7
creation of, 2–7
data access speeds and, 14, 31
deployment of, 20–24, 154–155, 174
design process, 153, 156–177, 187–189,
194
disaster tolerance with, 15–16, 31,
216–221
distances supported, 4, 17
fabrics vs., 275
high availability features of, 10–11, 31
High Availability (HA) clusters and,
196–202
initiating devices, 33
installing. See SAN installation
lifecycle of, 151–155
maintenance requirements, 155
managing data storage with, 9–10
Network Attached Storage (NAS) vs., 18
network congestion and, 13
point-to-point topology, 37, 39, 45–47
port requirements, 182–187
prototypes, 21, 153–154
release to production, 154–155
remote booting and, 103–104
resources on, 8
Return On Investment (ROI)
proposition, 153, 159, 187–189
scalability of, 31
speed of, 17, 31
storage consolidation and data pooling
through, 11–13, 203–212
storage servers vs., 31, 32
switched fabric topology, 37, 39, 48–49
target devices and, 32–33
testing, 21, 23, 154, 382–390
tiered, 261–265
when to deploy, 16–20
See also fabrics; Fibre Channel
SANsymphony, DataCare’s, 12
SC connectors, 62, 71–72
SCDs. See SANmark Conformance
Documents (SCDs)
scripting, automated
of firmwareDownload, 400
SAN profiles and, 384
switch operations, 369–372
zoning operations, 379–380
SCSI buses, 107
SCSI disks, 33
SCSI Enclosure Services (SES), 94
SCSI-FCP standard, 41
SCSI protocol, 2, 28, 31, 39, 41, 42, 57
SCSI-to-Fibre Channel routers, 65
Secure Fabric OS, 143
segmented fabrics, 323–327, 348
selective LUN presentation, 108, 113
serial ports
router/bridge management by, 109
switch management by, 91
Serial Storage Architecture (SSA), 3
serialized ID GBICs, 74–75
server cycles, increasing, 14–15
server-free backups, SAN-based, 213–216.
See also LAN-free backup configurations
services, fabric. See fabric services
SES. See SCSI Enclosure Services (SES)
setGbicMode command, 290
setSplbMode command, 290
shared file system technique, 206
signal retiming, managed hubs and, 78
signaling interface, Fibre Channel, 40
SilkWorm series switches. See Brocade
SilkWorm switches
simple electrical hubs, 34, 35, 60, 76
Simple Name Server, 133. See Name Server
simple resilient core/edge topology, 244,
247
single fabric, nonresilient SAN architecture,
257
single fabric, resilient SAN architecture, 257
single-mode fiber-optic cable, 36, 61, 68–69
140_SANs_index
8/14/01
3:43 PM
Page 447
Index
Single Point Of Failure (SPOF), 233–234
single-port adapters, 98
snapshot backup volumes, 114
SNIA. See Storage Networking Industry
Association (SNIA)
SNMP Management Information Base
(MIB), 92, 121, 135
SNMP polling, 295–296
SNMP (Simple Network Management
Protocol)
router/bridge management by, 109
switch management by, 91–93
soft zoning, 83, 375–378, 408
software
identifying existing in design phase,
165–166
selecting, 21–22, 165–166
storage partitioning, 205, 211–212, 226
Solaris, 169–170, 287, 312, 317
special character (K), 41
speed
assessing need for, 17, 122
auto-negotiation of by switches, 88
HBA capabilities, 97
spinFab command, 290
spinSilk command, 290
sramRetentionTest command, 290
ST connectors, 71
star topology network, 228, 242–244
Start Of Frame (SOF) primitive, 39
State Change Registration (SCR) frame, 51
static discovery, HBAs and, 101
Storage Area Networks (SANs), Fibre
Channel. See SANs (storage area networks)
storage arrays, 9, 11, 21, 287. See also RAID
storage arrays
storage consolidation, SANs and, 11–13,
203–212
partitioning software, 211–212
shared storage with Web farms, 206–207
switch zoning and, 208–210
447
storage devices, 64
collecting information on in interviews,
162–163
connecting legacy to SANs, 106
drivers for, 204
High-Availability (HA) clusters and, 200
high-end storage arrays, 113–114
individual disk drives, 33, 64, 111
Just a Bunch Of Disks (JBOD), 32, 33, 64,
111
RAID storage arrays, 64, 111–112, 208
storage consolidation SAN configurations,
203, 204
storage LUN masking, 105
storage network topologies, 37, 45–49
arbitrated loop topology, 37, 39, 47–48
point-to-point topology, 37, 39, 45–47
switched fabric topology, 37, 39, 48–49
Storage Networking Industry Association
(SNIA), 8, 103
Storage Networking World Conference, 8
storage partitioning
with LUN masking, 210–211
software for, 205, 211–212, 226
with switch zoning, 51
See also switch zoning
storage tier switches, 263
subordinate switches, 300
Sun Microsystems products
HighGround, 350, 368
Jira management standard, 109
Solaris, 169–170, 287, 312, 317
supportShow command, 285–286, 289, 290,
318
switch beaconing, Fabric OS and, 135
switch commands, 289–290
case sensitivity of, 291
diagnostic. See diagnostic switch commands
show vs. dump, 291
See also specific commands
switch configuration file, 393–394
140_SANs_index
448
8/14/01
3:43 PM
Page 448
Index
switch domain IDs, 300–301, 319, 326–327,
358, 361, 397
switch LEDs
device troubleshooting and, 329
fabric troubleshooting and, 318
SAN troubleshooting and, 287–289
switch ports, buffer credits per, 86–87
switch zoning, 83–84, 226, 372–382
adding new switches and, 381, 396–397
aliases and, 381
automatic scripting of activities, 379–380
backing up of configurations, 382
clearing zones, 381
conflicts with, 323–324
determining where to zone, 373–375
hard, 83–84, 374, 375–378
hardware enforced, 142
hot-plug systems and, 105–106
management of, 378–379
minimizing unwanted interactions, 381
mismatch of and missing devices, 333–334
Node and Port WWN and, 381
port-based, 83
soft, 83, 375–378, 408
storage partitioning and, 84, 205, 208–210
switchBeacon command, 302
switchDisable command, 251, 300, 343
switched fabric ports. See F_Ports
switched fabric topology, 5, 37, 39, 48–49
switchEnable command, 251, 318, 343
switches, Fibre Channel, 4, 33–34, 36, 60,
63, 80–83
adding new to fabrics, 395–398
adding new to zoned SAN, 381
auto-negotiation of speed by, 88
automated management of, 369–372
automatic configuration of by Fabric OS,
133–134
Brocade SilkWorm series of. See Brocade
SilkWorm switches
buffer credits available per switch port,
86–87
cable layout and, 351–354
classes of service and, 84
configuration backups for, 393–394
core. See core switches
Destination ID (D_ID), 302
disabled, 300
edge, 243, 248–250
equipment redundancy and, 86
exchange of information by, 49–50
fabric licenses and, 322, 366
fabric switches, 81
FSPF compliance, 90–91
host tier, 263
hubs vs., 57
in-band switch management, 356–358
IPFC broadcasting and, 88
loop operation capabilities, 90
management interfaces, 91–94
naming, 360
out-of-band switch management, 356
principal, 300
restoring configuration of, 393–394
selecting most appropriate, 124–126
self-configuring ports and, 87–88
setting parameters for, 358–359, 361
storage tier, 263
subordinate, 300
switched fabric topology and, 48–49
upgrading, 89
zoning and. See switch zoning
switchShow command, 278, 279, 286
fabric troubleshooting with, 307–308,
318–319
fabric validation using, 382
SAN troubleshooting with, 297–303,
307–308
troubleshooting missing devices with,
279–280, 329
SwitchType values, 302
syslog daemon interface, Fabric OS, 135
syslogIpaAdd command, 293
syslogIpRemove command, 293
140_SANs_index
8/14/01
3:43 PM
Page 449
Index
syslogIpSho command, 293
system DRAM diagnostics, 289
T
T11, 35
tape drives, 32, 33, 171
target devices, 32–33
Task Name, 294, 295
technical goals, identifying, 153
technical requirements document, 160, 190
technical requirements, identifying SAN,
159–160, 190
telnet, switch management by, 91, 92, 378
tempShow switch command, 285
10-switch core/edge SAN design, 253–254
termination, type of SCSI, 108
third-party copy backup systems, 214,
215–216
third-party copy devices, 14, 214, 215–216,
217
tiered SANs, 261–265
tiers, leveraging, 261–265
Time Server, 38, 41, 50, 52, 86
timeline, creation of development, 175–176,
190
timeout failures, edge device, 321–322
Tivoli products
SANergy, 12, 212
Storage Network Manager, 350, 368
Token Ring protocol, 31
too_long error statistic, 297
topologyShow command, 307–308, 309,
320, 321, 329, 382, 395
traceShow command, 286
traffic patterns
any-to-any connectivity and, 268–269
evaluating future traffic, 172–174
Fabric OS dynamic routing, 134
initiator-to-target mapping, 167–168
leveraging tiers, 261–265
localizing traffic, 266–268
pre-SANs performance data on, 169–172
449
SAN grouping process and, 178–182
transitioning, 154
transmission words, 36
Troika products
HBA driver, 172
SAN Command utility, 101, 102, 312
troubleshooting
fabrics, 316–327
missing devices, 279–283, 327–335
SANs (Storage Area Networks), 278–316
tools for, 287–316
20-switch core/edge SAN design, 254
U
ULP information, 36
ULP mapping, 42
unbonded SC connectors, 72
unconfirmed domains, 300–301, 319
Unicenter TNG, 350, 368
UNIX
dd tool for I/O generation, 388–389
format command, 312, 317
performance monitoring, 169–170
unmanaged Fibre Channel hub, 34, 35, 60
Unzoned Name Server, 51
upgrades
acceptable downtime for, 174–175
fabric, 398–400, 408
Fibre Channel switch, 89
firmware, 89, 398–400
rolling with HA clusters, 197, 198–199,
202
upstream devices, 302, 303, 304
uptime switch command, 285
V
/var/adm/messages file, Solaris’s, 287, 312,
317
VERITAS products
Cluster Server, 11
Dynamic Multipathing, 172, 199
140_SANs_index
450
8/14/01
3:43 PM
Page 450
Index
NetBackup, 13
SANavigator, 103
SANPoint control, 103, 350, 368
Volume Manager, 12, 212
vxbench, 387
version switch command, 285
Virtual Interface (VI) protocol, 2, 15, 31, 42,
97
Visual SAN, Prisa’s, 350, 368
volume-level sharing, 19
Volume Manager,VERITAS’, 12
Vxbench,VERITAS’, 387
W
Web-based router management, 109
Web-based switch management, 93–94
Web farms, storage sharing with, 206–207
Web sites
Brocade, 158
Expect, 369
Intel (Iometer), 387
IOzone, 387
router management through, 109
switch management through, 93–94
WEB TOOLS, 139–140, 147, 366, 378, 408
well-known addresses, 50, 51, 52
WHQL. See Microsoft Windows Hardware
Quality Lab (WHQL)
Windows 2000 Advanced Server, 200
Windows 2000 Data Center, 200, 202
Windows 2000 OS, 99
Windows NT, 99, 169, 200
wiring, 61, 65–68, 164
World-Wide Names (WWNs), 307, 366
World Wide Web, data storage needs and, 9
WWN ports, 306–307
WWN spoofing, 143
X
X3T11, 35
Z
zero-downtime failover, 199
zoning, switch. See switch zoning
140_SANs_BM
8/14/01
3:38 PM
Page 451
140_SANs_BM
8/14/01
3:38 PM
Page 452
Additional SAN Information
Resources from Brocade
To learn more about SANs and SAN technologies, you can visit the Brocade
Web site at www.brocade.com.This site contains a wide range of useful
resources designed for many types of audiences—from high-level managers
and other key decision makers to system administrators and network
architects. In particular, the Brocade SAN solution center and educational
Web pages offer a wealth of timely information.
Brocade SAN Solution Center
Accessible at www.brocade.com/san, the Brocade SAN solution center
includes a variety of information about Brocade products and SAN
technologies in the form of:
■
White papers
■
Product data sheets
■
Brocade SOLUTIONware configuration summaries
■
Standards updates
■
Industry news
■
Tools for understanding Return On Investments
Brocade Education Services
Accessible at www.brocade.com/education_services, Brocade education
provides information about the full range of Brocade educational
offerings, including:
■
Brocade SAN certification requirements
■
Course descriptions
■
Registration forms
■
Special promotions
140_SANs_BM
8/14/01
3:38 PM
Page 453
Brocade Networking
Storage Conference 2002
If you want to interact with the SAN experts, just plan to attend the second annual
Brocade Storage Networking Conference.Visit www.brocade.com/san to get the
latest information about registration, early-bird discounts, and previews of the agenda
and speakers.
140_SANs_BM
8/14/01
3:38 PM
Page 454
SYNGRESS SOLUTIONS…
AVAILABLE NOW
ORDER at
www.syngress.com
ASP Configuration Handbook: A Guide to ISPs
Internet Service Providers (ISPs) are confronted with shrinking profit margins,
fierce competition, and undifferentiated products. To successfully overcome
these new economic realities, ISPs have begun to expand beyond just offering
bandwidth and connectivity, by providing customers with “outsource services”
such as software applications and electronic commerce solutions This book is
the blueprint for a successful ISP-to-ASP transition.
ISBN: 1-928994-26-1
Price: $49.95 US, $77.95 CAN
AVAILABLE NOW
ORDER at
www.syngress.com
Administering Cisco QoS in IP Networks
Administering Cisco QoS for IP Networks discusses IP Quality of Service
(QoS) and how it applies to Enterprise and Internet Service Provider (ISP)
environments. It reviews routing protocols and quality of service mechanisms
available today on Cisco network devices and it will provide you with examples and exercises for a hands-on experience designed to give you the background to implement these capabilities in your network.
ISBN: 1-928994-21-0
Price: $59.95 US, $92.95 CAN
AVAILABLE DECEMBER 2001
ORDER at
www.syngress.com
Juniper Networks Internet Routing Guide
Upstart Juniper Networks has created a new generation of faster routers that
are extremely price competitive with similar Cisco models. Juniper Networks
Internet Routing Guide is a comprehensive, hands-on guide on how to install,
configure, and troubleshoot Junipers core router family. This book will immediately meet the demands of network engineers responsible for the installation
and configuration of Juniper Routers. It also covers the latest “M-Class” routers,
as well as configuration strategies, and the JUNOS software environment.
ISBN: 1-928994-76-8
Price: $69.95 US, $108.95 CAN
[email protected]