Storage Decisions 2003
Download
Report
Transcript Storage Decisions 2003
Hosted by
Case Study - Storage
Consolidation
Steve Curry
Yahoo Inc.
About Yahoo!
Quick Stats
300+ million registered users
2 billion page request per day
25 countries, 14 languages
500TB data on disk
1PB data on tape
Hosted by
Yahoo! Storage Operations
Hosted by
Responsibilities
All US storage administration
Data archiving / backups
US/Global storage architecture / standards
2nd tier support for global operations
Tool development
24/7 global issue/outage response
Reporting
Case Study #1 – Y! Photos/Briefcase
Case Study #1
• Online photo album
• Online file storage
Hosted by
Case Study #1 – Y! Photos/Briefcase
Legacy Architecture
Cheap… *repeat* cheap JBOD’s
Single host support JBOD array
A/B mirror for redundancy
FreeBSD OS
150TB of content
Custom apps
Hosted by
Hosted by
Case Study #1 – Y! Photos/Briefcase
…Legacy Architecture
Advantages
• Low cost hardware
• Extremely distributed
Disadvantages
• Not very scalable
• Management headache
• No longer meets reliability requirements
Hosted by
Case Study #1 – Y! Photos/Briefcase
…Legacy Architecture
Management Issues
• Management is per host (over 160 storage hosts)
• Synchronous mirror between A/B pair
• No “Hot-Swap” support
• Single spindle performance
Case Study #1 – Y! Photos/Briefcase
Hosted by
This… X 12! Single tier, single spindle performance.
Hosted by
Case Study #1 – Y! Photos/Briefcase
Consolidation Plan
NAS or SAN?
Requirements
• Reliability
• Scalability
• Reduce management overhead
Considerations
• Current hardware investment
• Application support
Hosted by
Case Study #1 – Y! Photos/Briefcase
Network Attached Storage Solution
Management
• Filers are heavily deployed
• Smart appliance
• Suite of tools already developed for filers
Advantages
• RAID redundancy
• Multi-spindle performance
• Takes advantage of existing hardware
• Ease of application port
Hosted by
Case Study #1 – Y! Photos/Briefcase
…Network Attached Storage Solution
Disadvantages
• Initial cost of deployment (cutover, SCSI –vs- IDE)
• Lot’s of JBOD’s to get rid of! ;-)
Hosted by
Case Study #1 – Y! Photos/Briefcase
New Architecture
NAS solution
FreeBSD app servers
Load balanced
10 storage hosts
Point in time snapshots
Dedicated SAN backup fabric
Distributed-farm model
Case Study #1 – Y! Photos/Briefcase
Hosted by
Simple 2 tier model. Scalable, redundant, multispindle RAID performance, hot-swap support.
Hosted by
Case Study #1 – Y! Photos/Briefcase
Consolidation Wins!
Cost considerations
Performance
Backups
Management
High availability
Hot swap
Case Study #2 - Data Mining
Case Study #2
Global data mining
Global log collection
Hosted by
Case Study #2 - Data Mining
Current Architecture
DAS attached arrays
Custom scripts
Stacker type tape libraries
Single-tier disk storage
Hosted by
Case Study #2 - Data Mining
Management Issues
Large storage host count
Many small tape libraries
No redundancy
Does scale for future requirements
Hosted by
Case Study #2 - Data Mining
Hosted by
Storage Requirements
High write performance
Data growth 2TB per day!!
Store data on disk for 30 days
Archive to tape
Consolidation Considerations
Reduce host management
Create a multi-tier storage architecture
Consolidate to one large tape library
Increase write performance
Case Study #2 - Data Mining
• Common Y! model
• Multi-tier storage
• Scalable
Hosted by