Storage Decisions 2003

Transcript Storage Decisions 2003

Hosted by
Case Study - Storage Consolidation
Steve Curry
Yahoo Inc.
About Yahoo!
Quick Stats
 300+ million registered users




2 billion page request per day
25 countries, 14 languages
500TB data on disk
1PB data on tape
Yahoo! Storage Operations
Responsibilities
• All US storage administration
• Data archiving / backups
• US/Global storage architecture / standards
• 2nd tier support for global operations
• Tool development
• 24/7 global issue/outage response
• Reporting
Case Study #1 – Y! Photos/Briefcase
Case Study #1
• Online photo album
• Online file storage
Case Study #1 – Y! Photos/Briefcase
Legacy Architecture
• Cheap… *repeat* cheap JBODs
• A single host supports each JBOD array
• A/B mirror for redundancy
• FreeBSD OS
• 150TB of content
• Custom apps
Case Study #1 – Y! Photos/Briefcase
…Legacy Architecture
Advantages
• Low-cost hardware
• Extremely distributed
Disadvantages
• Not very scalable
• Management headache
• No longer meets reliability requirements
Case Study #1 – Y! Photos/Briefcase
…Legacy Architecture
Management Issues
• Management is per host (over 160 storage hosts)
• Synchronous mirror between A/B pair
• No “Hot-Swap” support
• Single spindle performance
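To make the A/B mirror point concrete, here is a minimal sketch of what a synchronous mirrored write between a pair of hosts could look like. It is not Yahoo!'s custom application code: the `StorageHost` class, the `mirrored_write` function, and the host names are all hypothetical, and the hosts are in-memory stand-ins. The sketch only illustrates why every write depends on both hosts of a pair being healthy.

```python
class StorageHost:
    """In-memory stand-in for one storage host (hypothetical, for illustration)."""

    def __init__(self, name):
        self.name = name
        self.online = True
        self.files = {}

    def write(self, path, data):
        self.files[path] = data


def mirrored_write(host_a, host_b, path, data):
    """Synchronous mirror: the write succeeds only if both A and B store it."""
    # Check both sides first so a failed write never leaves A ahead of B.
    for host in (host_a, host_b):
        if not host.online:
            raise IOError(f"{host.name} is offline; write to {path} rejected")
    host_a.write(path, data)
    host_b.write(path, data)
    return True


# Hypothetical A/B pair.
host_a = StorageHost("photos-a")
host_b = StorageHost("photos-b")
mirrored_write(host_a, host_b, "album/1.jpg", b"jpeg bytes")
print(host_a.files == host_b.files)  # True
```

Under this assumed model, taking either host of a pair offline blocks writes for that pair, which is one way to read the per-host management burden and the lack of hot-swap support listed above.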
Case Study #1 – Y! Photos/Briefcase
This… x 12! Single-tier, single-spindle performance.
Case Study #1 – Y! Photos/Briefcase
Consolidation Plan
NAS or SAN?
Requirements
• Reliability
• Scalability
• Reduce management overhead
Considerations
• Current hardware investment
• Application support
Case Study #1 – Y! Photos/Briefcase
Network Attached Storage Solution
Management
• Filers are heavily deployed
• Smart appliance
• Suite of tools already developed for filers
Advantages
• RAID redundancy
• Multi-spindle performance
• Takes advantage of existing hardware
• Ease of application port
Case Study #1 – Y! Photos/Briefcase
…Network Attached Storage Solution
Disadvantages
• Initial cost of deployment (cutover, SCSI vs. IDE)
• Lots of JBODs to get rid of! ;-)
Case Study #1 – Y! Photos/Briefcase
New Architecture
• NAS solution
• FreeBSD app servers
• Load balanced
• 10 storage hosts
• Point-in-time snapshots
• Dedicated SAN backup fabric
• Distributed-farm model
Case Study #1 – Y! Photos/Briefcase
Simple 2-tier model: scalable, redundant, multi-spindle RAID performance, hot-swap support.
Case Study #1 – Y! Photos/Briefcase
Consolidation Wins!
• Cost considerations
• Performance
• Backups
• Management
• High availability
• Hot swap
Case Study #2 - Data Mining
Case Study #2
• Global data mining
• Global log collection
Case Study #2 - Data Mining
Current Architecture
• DAS attached arrays
• Custom scripts
• Stacker type tape libraries
• Single-tier disk storage
Case Study #2 - Data Mining
Management Issues
• Large storage host count
• Many small tape libraries
• No redundancy
• Does not scale for future requirements
Case Study #2 - Data Mining
Storage Requirements
• High write performance
• Data growth 2TB per day!!
• Store data on disk for 30 days
• Archive to tape
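The growth and retention figures above imply a rough size for the disk tier. The sketch below works through that arithmetic. The 2TB per day and 30 days come from the slide; the decimal unit convention (1TB = 10^12 bytes) and the function names are assumptions, and the result ignores RAID overhead, snapshots, and write peaks.

```python
# Figures from the slide: 2 TB of new data per day, kept on disk for 30 days.
DAILY_GROWTH_TB = 2
DISK_RETENTION_DAYS = 30

# Assumption (not from the talk): decimal units, 1 TB = 10**12 bytes.
BYTES_PER_TB = 10**12
SECONDS_PER_DAY = 24 * 60 * 60


def disk_tier_capacity_tb(daily_growth_tb, retention_days):
    """Steady-state data held on disk before the oldest day moves to tape."""
    return daily_growth_tb * retention_days


def average_write_rate_mb_s(daily_growth_tb):
    """Average ingest rate in MB/s if writes were spread evenly over a day."""
    return daily_growth_tb * BYTES_PER_TB / SECONDS_PER_DAY / 10**6


print(disk_tier_capacity_tb(DAILY_GROWTH_TB, DISK_RETENTION_DAYS))  # 60
print(round(average_write_rate_mb_s(DAILY_GROWTH_TB), 1))  # 23.1
```

So the disk tier has to hold about 60TB of usable data at steady state, and the same 2TB per day also has to flow out to tape. Real log traffic is bursty, so the roughly 23 MB/s average is a floor rather than a sizing target.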
Consolidation Considerations
• Reduce host management
• Create a multi-tier storage architecture
• Consolidate to one large tape library
• Increase write performance
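A multi-tier architecture needs a rule for deciding which tier holds a given piece of data. The sketch below shows one such rule based on the 30-day disk retention stated in the requirements. The function name, the date-based interface, and the choice to treat data exactly 30 days old as tape-bound are assumptions for illustration, not details from the talk.

```python
from datetime import date, timedelta

DISK_RETENTION_DAYS = 30  # from the slides: data stays on disk for 30 days


def storage_tier(created, today, retention_days=DISK_RETENTION_DAYS):
    """Return 'disk' while data is inside the retention window, else 'tape'."""
    age_days = (today - created).days
    # Assumption: data that is exactly retention_days old is already archived.
    return "disk" if age_days < retention_days else "tape"


today = date(2003, 9, 30)  # arbitrary example date
print(storage_tier(today - timedelta(days=1), today))   # disk
print(storage_tier(today - timedelta(days=45), today))  # tape
```

A scheduled job could apply a rule like this to each day's logs to pick what moves to the consolidated tape library.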
Case Study #2 - Data Mining
• Common Y! model
• Multi-tier storage
• Scalable