Accelerating Time to Results
KC ZHANG
Panasas Technical and Business Development Manager
[email protected]
Leader in Parallel Storage Systems
Agenda
Panasas introduction
Customer successes
Panasas solutions
Panasas
Availability
Slide 2
Panasas, Inc.
Panasas
Founded by Garth Gibson in 1999. First Customer Ship in 2003
The fastest supercomputer in the world runs Panasas
Primary Investors:
HQ – Silicon Valley
Market Focus:
o Energy
o Academia
o Government
o Life Sciences
o Manufacturing
o Finance
Technologies: parallel file system and parallel storage appliance
Worldwide support with over 25 global resellers
Slide 3
Partnering to meet customer needs
Application ISVs
Resellers
Standards Development
Slide 4
Recognized Product Innovation and Excellence
NAS Magic Quadrant “Visionary”
Best HPC Storage Product
Top 5 Vendors to Watch in 2009
Top Collaboration Between Government and Industry: Roadrunner, Los Alamos National Laboratory
Top Supercomputing Achievement: Roadrunner, Los Alamos National Laboratory
8 Panasas Customers Win HPCWire Awards in 2008!
6 Panasas Customers Win HPCWire Awards in 2007!
10 Disruptive New Storage Technologies Promise Big Changes
Slide 5
Panasas Powers RoadRunner
Slide 6
RoadRunner at a Glance
Slide 7
Petascale Red Infrastructure Diagram with Roadrunner Accelerated FY08
[Diagram labels: Roadrunner Phase 3 (1.026 PF), Roadrunner Phase 1 (70 TF) and Lightning/Bolt (35 TF) compute units (CU); IO units with IO nodes; IB 4X Fat Tree and Myrinet interconnects; 1GE, 4 GE per 5-8 TB, 10GE, NxGE and Nx10GE links; secure core switches; site-wide shared global parallel file system (Panasas); archive; FTAs; NFS and other network services, WAN. Scalable to 600 GB/sec before adding lanes.]
Slide 8
Leaders in HPC choose Panasas
ENERGY
SWIFTCOMPANY
Slide 9
The Common Themes
A. Very complex problems and simulations
B. Very large number of files being used concurrently
C. Very large number of concurrent users/servers
D. Consolidating Users and Clusters on one storage system
E. Any or all of the above
Panasas solves the most difficult storage problems while delivering very high reliability in an easy-to-use, appliance-like package.
Slide 10
Breaking Through the Bottleneck
Clusters = Parallel Compute
Parallel Compute needs Parallel IO
Linux Compute Cluster with Monolithic Storage (NFS servers): single data path to storage
Issues
o Complex Scaling
o Limited BW & I/O
o Islands of storage
o Inflexible
o Expensive
Linux Compute Cluster with Panasas Parallel Storage Clusters: parallel data paths
Benefits
o Linear Scaling
o Extreme BW & I/O
o Single storage pool
o Ease of Mgmt
o Lower Cost
Slide 11
What is Parallel Storage?
The architecture for scale-out file storage
NAS: Network Attached Storage (diagram label: NFS, File Server)
Clustered Storage: Multiple NAS file servers managed as one. Good aggregate performance. (diagram label: Clustered NFS, File Server)
Parallel Clustered Storage: File server not in data path. Performance bottleneck eliminated. (diagram label: Parallel NFS, File Server)
Slide 12
Panasas Storage Cluster: Built on Industry-Standard Components
Shelf Front: 1 DirectorBlade (DB), 10 StorageBlades (SB)
Shelf Rear: Integrated 10GE Switch, Battery Module (2 Power units)
Midplane routes GE, power
Slide 13
Performance and Scaling
DirectFLOW client
o Standard installable file system
o Supports all common Linux flavors
o Supports up to 12K clients
Panasas DirectFLOW® data path
DirectorBlade cluster
o Divides namespace into virtual volumes
o Allows metadata to scale (no bottleneck)
Demonstrated scalable performance
o 30+ GB/sec of sustained throughput from a single filesystem
Slide 14
Scalable NAS - NFS/CIFS
Scalable NFS/CIFS server
o Load automatically distributed across scalable DirectorBlade modules
o Scale to satisfy growing number of clients
o Any DirectorBlade module can access any file
o Slide in a new DB, instantly get more NFS ops/sec into the same data
Access same data from any protocol
o Integrates non-Linux devices into system
o 2+9 configuration typically best for NFS. Balances CPU ops/sec with disk ops/sec
Slide 15
Total Time in Hours to complete the job
[Bar chart: hours (axis 0 to 400) for Panasas, Other Vendor A, Other Vendor B]
Throughput of Reads & Writes (MB/sec)
[Bar chart: read rate and write rate in MB/sec (axis 0 to 60) for Panasas, Other Vendor A, Other Vendor B]
Aggregate Throughput for 24 Nodes
[Bar chart: aggregate read and write throughput in MB/sec (axis 0 to 1400) for Panasas, Other Vendor A, Other Vendor B]
Job Time Activity
[Chart: Processor Waiting on Data versus Computation for Panasas, Other Vendor B, Other Vendor A]
Data Set (same for all four charts)
• 23 Million Traces
• 139GB input dataset
• 234GB output depth migrated image gathers
• 247MB per depth slice, 970 depth slices
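A quick consistency check of the data set figures above, as a minimal Python sketch. It assumes the slide's MB and GB are binary units (1 GB = 1024 MB), which the slide does not state; the variable names are illustrative only.

```python
# Data set figures quoted on the slide
MB_PER_DEPTH_SLICE = 247
DEPTH_SLICES = 970

total_mb = MB_PER_DEPTH_SLICE * DEPTH_SLICES

# Assumption: binary units (1 GB = 1024 MB); the slide does not say which
total_gb = total_mb / 1024

print(f"{total_mb} MB is about {total_gb:.0f} GB of output")
```

Under that assumption the per-slice size and slice count agree with the quoted 234GB output volume.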
ActiveScale Operating System
DirectFLOW® Protocol
o Provides parallel data paths for maximum performance
PanFS™ Parallel File System
o Distributed and parallel file system
o Block management hidden behind object storage interface
o File management distributed across metadata managers
Designed to be managed by non-storage professionals
ActiveScan Predictive Media Management
o Continuous sweeps of all data and disk media in the StorageBlade
o If discrepancies are detected, the system proactively corrects the media defects
Predictive Disk Management
o Anticipates disk problems with automated, predictive failure analysis; data is moved prior to failure, to avoid reconstruction
Real-time monitoring of client load generation
o Identify performance bottlenecks among storage users
Slide 20
Horizontal Parity: Panasas ObjectRAID
Parity calculated and written to disk(s)
o Any failed disk can be reconstructed from the remaining disks
Panasas ObjectRAID is faster
o Uses multiple RAID controllers to run in parallel (“Parallel Reconstruction”)
Panasas ObjectRAID is more efficient
o Reconstructs only user data versus every sector on disk
800GB Blade reconstructed in 31 minutes at Los Alamos National Laboratory!
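The reconstruction idea above can be illustrated with generic XOR parity, as used in RAID 5. This is a minimal sketch only, not the ObjectRAID implementation; the function names and the three-block stripe are hypothetical.

```python
from functools import reduce


def xor_blocks(blocks):
    """Byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def make_parity(data_blocks):
    """Parity block written alongside the data blocks of one stripe."""
    return xor_blocks(data_blocks)


def reconstruct(surviving_blocks, parity):
    """Rebuild the single missing block from the survivors plus parity."""
    return xor_blocks(surviving_blocks + [parity])


# Hypothetical stripe spread over three blades, plus one parity block
stripe = [b"seis", b"mic ", b"data"]
parity = make_parity(stripe)

lost = 1  # pretend the second blade failed
survivors = [block for i, block in enumerate(stripe) if i != lost]
rebuilt = reconstruct(survivors, parity)
```

Because only the blocks that belong to stored objects need this treatment, a rebuild can skip unused sectors, which is the efficiency point the slide makes.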
Horizontal Parity
Slide 21
Unique: Vertical Parity
Solves media error problem regardless of drive density
“RAID” within an individual drive
Improves on internal ECC capabilities
Independent of horizontal array-based parity schemes
Seamless recovery from media errors by applying RAID schemes across disk sectors
Slide 22
Unique: Network Parity
Extends parity capability across the data path to the client or server node
Enables end-to-end data integrity validation
o Protects from errors introduced by disks, firmware, server hardware, server software, network components and transmission
o Client either receives valid data or an error notification
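The "valid data or an error notification" behaviour can be sketched as a client-side parity check. This is a minimal illustration under stated assumptions: XOR parity stands in for whatever scheme the product uses, and `IntegrityError` and `client_read` are hypothetical names.

```python
from functools import reduce


class IntegrityError(Exception):
    """Raised when data received by the client fails the parity check."""


def xor_blocks(blocks):
    """Byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def client_read(data_blocks, parity):
    """Return the stripe contents only if parity recomputed on the client matches."""
    if xor_blocks(data_blocks) != parity:
        raise IntegrityError("parity mismatch: data was corrupted before reaching the client")
    return b"".join(data_blocks)


# Hypothetical stripe and its parity as written by the storage system
stripe = [b"seis", b"mic ", b"data"]
parity = xor_blocks(stripe)

good = client_read(stripe, parity)

# Simulate a bit flip introduced somewhere along the data path
corrupted = [stripe[0], b"mic!", stripe[2]]
try:
    client_read(corrupted, parity)
    detected = False
except IntegrityError:
    detected = True
```

Checking at the client rather than at the storage device is what makes the validation end-to-end: an error introduced by the network or server software is caught as well.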
Slide 23
Manageability: Single Global Namespace
Panasas removes artificial, physical and logical boundaries
o Eliminates need to maintain mount scripts or move data
[Diagram comparing Traditional Storage Networks with a Panasas Storage Cluster; labels: Cluster 1, Cluster 2, Cluster 3, Single Global Namespace, Archived Files, Cluster 1 Results, Cluster 2 Results, Cluster 3 Results]
Slide 24
Automatic provisioning for easy growth
Online Provisioning
o Configure one DirectorBlade and all others obtain their configuration via DHCP on private port
o New storage is seamlessly integrated into the system
[Diagram labels: DHCP on Private Port; Reading Config; Setting IP Addrs; Matching Versions]
Growth without limitations
o Terabytes to Petabytes
o Single seamless namespace
Single Seamless Namespace!
Slide 25
Manageability: Automatic RAID configuration
Per File RAID
o RAID Layout is an Attribute Stored within the Object
System assigns RAID level based on file size
o < 64 KB RAID 1 for efficient space allocation
o > 64 KB RAID 5 for optimum system performance
Automatic transition from RAID 1 to 5
o No re-striping
Two level RAID MAP, Stripe width and depth
o Automatically optimizes stripe size
[Diagram labels: Small File, RAID 1 Mirroring; Large File, RAID 5 Striping]
Enables optimum system growth and reconstruction
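The size-based rule above can be written down as a small selection function. This is a sketch of the stated policy only; the function name is hypothetical, and the slide does not say which level a file of exactly 64 KB receives, so treating it as RAID 5 is an assumption.

```python
RAID1_THRESHOLD_BYTES = 64 * 1024  # the 64 KB boundary quoted on the slide


def raid_level_for(size_bytes):
    """Pick a RAID level from file size, following the per-file rule on the slide."""
    # Assumption: a file of exactly 64 KB is treated as large (RAID 5);
    # the slide only specifies "< 64 KB" and "> 64 KB".
    if size_bytes < RAID1_THRESHOLD_BYTES:
        return "RAID 1"  # mirroring: space-efficient for small files
    return "RAID 5"      # striping with parity: performance for large files


small = raid_level_for(4 * 1024)            # a 4 KB file
large = raid_level_for(10 * 1024 * 1024)    # a 10 MB file
```

Since the layout is an attribute of each object, a file that grows past the threshold can change level on its own without affecting other files.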
Slide 26
Manageability: Dynamic Load Balancing
1 StorageBlade Capacity
o Biases new data objects to new blades
o Dynamically moves data objects from filled blades as needed
2 StorageBlade Performance
o Data objects striped broadly for performance
o Dynamically moves objects from “hot” blades
3 DirectorBlade Performance
o Cluster design assigns new clients to least utilized DirectorBlades
Slide 27
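Assigning new clients to the least utilized DirectorBlade can be sketched as a simple minimum-load choice. This is an illustration only: the slide does not say how utilization is measured, so counting attached clients is an assumption, and `assign_client` and the blade names are hypothetical.

```python
def assign_client(clients_per_blade):
    """Attach a new client to the blade with the fewest clients and return its name."""
    # Assumption: "least utilized" is approximated by the current client count
    blade = min(clients_per_blade, key=clients_per_blade.get)
    clients_per_blade[blade] += 1
    return blade


# Hypothetical cluster with three DirectorBlades and their current client counts
load = {"db1": 3, "db2": 1, "db3": 5}

first = assign_client(load)   # goes to the emptiest blade
second = assign_client(load)  # that blade is still the emptiest
```

A production system would likely weigh CPU and operation rates as well, but the selection principle is the same.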
Proven Panasas Scalability
Storage Cluster Sizes Today (e.g.)
o Boeing: 50 DirectorBlades, 500 StorageBlades in one system (plus 25 DirectorBlades and 250 StorageBlades each in two other smaller systems).
o LANL RoadRunner: 100 DirectorBlades, 1000 StorageBlades in one system today, planning to increase to 144 shelves next year.
o Intel has 5,000 active DF clients against 10-shelf systems, with even more clients mounting DirectorBlades via NFS. Release 3.2 will allow them to deploy up to 12,000 clients against a single system.
o BP uses 200 StorageBlade storage pools as their building block
o Most customers run systems in the 100 to 200 blade size range
Slide 28
Fast Deployment
Panasas Appliance Model
o Deploy solutions in hours and days vs. weeks and months
o Ireland's most powerful computer (#117 in the world) was installed in three hours and powered up in just one day, thanks to a rapidly deployable computing platform from Silicon Graphics and Panasas.
http://biz.yahoo.com/prnews/090205/sf67219.html?.v=1
Slide 29
ActiveScale 3.2 Released Sept 2008
Performance
10 GE switch => 50% improvement in shelf performance
Multi-core client performance tuning
InfiniBand connectivity
RAID-10 volumes to optimize N-1 workloads
Reliability
Complete HA feature set with addition of NFS/CIFS failover
Industry-leading data integrity with Vertical Parity and Network Parity
Manageability
Snapshots
NDMP support for easy backups
Slide 30
Summary
Parallel storage provides high performance for faster survey turnaround and more complex algorithms
o 10s of GB/s in production seismic processing data centers
o 50% performance increase per shelf with 10Gb Ethernet
Scalability to support more complex data acquisition and larger clusters
o Deployed on a single shelf on survey vessels
o 12,000 core clusters in production today
o 4PB+ systems in production today
Proven across the E&P industry
o All major ISVs: Landmark, Paradigm, Schlumberger
o Operating on 6 continents for Service Cos., NOCs, Majors and Independents
Panasas is proven to cost-effectively increase processing throughput!
Slide 31
Thank You
For more information, call Panasas at:
1-888-PANASAS (US & Canada)
00 (800) PANASAS2 (UK & France)
00 (800) 787-702 (Italy)
+001 (510) 608-7790 (All Other Countries)
张克诚
13701026265
Slide 32