Latest trends and technologies in Storage Networking
Download
Report
Transcript Latest trends and technologies in Storage Networking
Latest trends and technologies
in Storage Networking
By: Gururaja Nittur
Advisor: Dr. Chung E Wang
Second Reader: Dr. Du Zhang
Scope of the Project
Study the new technologies in the storage
networking arena
–
–
–
–
Fibre channel protocol
NAS, SAN and iSCSI
Storage Virtualization
High Availability
Demonstrate high availability by writing a
DMP(Dynamic Multi Pathing) driver for Solaris
What is Storage Networking?
“The practice of creating, installing,
administering, or using networks whose
primary purpose is the transfer of data
between computer systems and storage
elements and among storage elements”
Why Storage Networks?
“The total amount of data being stored doubles every year. Also, more
than 90% of companies today would fail to survive a catastrophic
data loss. Businesses face a mission-critical need to protect, access,
and manage their ever-growing volume of storage assets”
Explosive growth of business data
Internet and Multimedia
High Availability
Management complexity
Why Storage Networks? (Contd..)
Fibre Channel (FC)
A serial, high-speed data transfer technology
Open standard, defined by ANSI and OSI
Data rate upto 100 MB/sec. (200 MB/sec. fullduplex)
Supports most important higher protocols like IP,
ATM, SCSI etc.
Does not have its own command set, but
facilitates data transfers between individual FC
devices.
Parallel Transmission
Set of data signals are sent simultaneously
through 8, 16 or even more wires.
Problems with parallel transmission
– data sent simultaneously over all the wires have to be
received simultaneously as well
– Total time = t+dt
• ‘t’ - time taken for the signals to reach receiver
• ‘dt’ – additional delay due to hardware inconsistencies
• ‘dt’ increases with cable length causing lesser frequency
Example - SCSI
• Bus length limitations
• Max bus speed is limited (~40 MB/sec in Ultra SCSI)
• Limited device count
Serial Transmission
Serial transmission uses single cable
– All signals are delayed the same and arrive at
the receiver in the same order in which they
were sent.
– Higher bus length
Examples
– SSA (Serial Storage Architecture by IBM)
– Fibre Channel
Current Technology
DAS using SCSI
Emerging Technologies
Network Attached Storage (NAS)
Storage Area Networks (SAN)
Storage over IP (iSCSI)
Network Attached Storage
Network Attached Storage
Storage device will have a built-in
network interface
NAS unit can be plugged directly into the
network to allow quick and easy access
Standard network protocols such as CIFS
and NFS can be used to share data
Network Attached Storage
NAS engine is usually SCSI for low-end
systems for cost reasons and Fibre
Channel for the enterprise systems
NAS is easy to install and relatively easy
to maintain
Network is used exclusively for data
transfer causing additional overhead
Backup using LAN is really a overhead
Storage Area Networks
Storage Area Networks
As much as 60% of the traffic on a std
corporate network is made up of
housekeeping actions like Backup
Storage Area Network has been fuelled
significantly by the desire to get this
housekeeping off the network
Primary interface for SAN infrastructure is
Fibre Channel
Storage Area Networks
SAN provides excellent performance and
easier management
SAN implementations are expensive due
to hardware costs
Better resource sharing could make up for
the initial investment
SAN is very flexible in that more storage
and servers can be added easily
iSCSI
Motivation
– GB Ethernet
iSCSI is a draft
standard protocol to
encapsulate SCSI
commands into
TCP/IP packets
Can be used to build
IP based SANs
Storage Virtualization
The research firm Gartner Group estimated that
80% of the storage costs is used up for managing
the installed storage
Switch and array management becomes very
difficult with increased storage hardware
Virtualization provides a logical view and eases
management.
Examples – Veritas Volume Manager, IBM
Tivoli etc.
Future of network storage
SAN islands connected by IP networks
Network Unified Storage (NUS)
– NAS + SAN on GB Ethernet networks
High Availability
Host
Single Point Failure
Disk
Host
Multi Pathing
Disk
Dynamic Multi Pathing
Increased disk availability
Load balancing
Identifies disks uniquely from different
hosts
Dynamic Multi Pathing
/dev/rdsk entries
c1t1d0
c2t1d0
….
cnt1d0
/dev/rdsk entries
c1t1d0
c2t1d0
….
cnt1d0
Host
Host
..
Disk
Implementation Details
Scan the disks listed in /dev/rdsk
If no UUID is present, generate a unique UUID
and stamp it in the disk’s private region
Add this device to a hash table hashed on UUID
Load this table to the kernel and write the ioctls
to update this info
Use an algorithm (Currently round robin) to
efficiently load balance the I/O requests.
If a path is bad for more than five I/O attempts,
mark it bad and do not use it for path selection.
Implementation Details
User code
– Read /dev/rdsk folder and generate a hashed list of
available disks
– Load this list to kernel. Also provide APIs to push
newly added/removed disks.
Kernel code
– Filter driver to choose the best path
– ioctls to do the following
•
•
•
•
LOAD_DISKS
NEW_DISK
MODIFY_DISK
GET_DISK_HANDLE …
Questions??