Choosing a Computing Architecture
Download
Report
Transcript Choosing a Computing Architecture
Choosing a Computing
Architecture
Chapter 8
Architectural
Requirements
Scalability
Manageability
Flexibility
User
Budget
Availability
Extensibility
Integration
Business
Technology
Strategy for Architecture
Definition
Obtain existing architecture plans
Obtain existing capacity plans
Document existing interfaces
Prepare capacity plan
Prepare technical architecture
Document operating system requirements
Develop recovery plans
Develop security and control plans
Create architecture
Create technical risk assessment
Hardware Architecture
Involve all experts
New technology
Old technology
Networking
Hardware Architectures
Robust
Available
Reliable
Extensible
Scalable
Supportable
Recoverable
Parallel
VLM
64-bit
Connective
Open
Hardware Architectures
SMP
Cluster
MPP
NUMA
Hybrids use SMP and MPP
Evaluation Criteria
Determine the platform for your needs
SMP
Clusters
NUMA
MPP
High
Scalability
Low
High
Maturity
Low
Parallel Processing
Parallel daily
operations
Shared resources
- Memory
- Disk
- Nothing
Loosely or tightly
coupled
Database
Application
Hardware
Operating system
Making the Right Choice
Requirements differ from operational
systems
Benchmark
- Available from vendors
- Develop your own
- Use realistic queries
Scalability important
SMP
Communication by shared memory
Disk controllers accessible to all CPUs
Proven technology
CPU
CPU
CPU
CPU
Common bus
Shared memory
Shared disks
SMP
Benefits:
- High concurrency
- Workload balancing
- Moderate scalability
- Easy administration
Limitations:
- Memory (cluster for improvements)
- Bandwidth
NUMA
Loosely coupled shared memory
CPU
CPU
CPU
CPU
CPU
CPU
Shared bus
Shared
memory
Disk
Nonuniform
memory access
Shared
memory
Disk
NUMA
Benefits:
- Fully scalable, incremental additions to
disk, CPU, and bandwidth
- Performs better than MPP
- Suited for Oracle server
Limitations:
- The technology is new and less proven
- You need new tools for easy system
management
- NUMA is more expensive than SMP
Clusters
Node 1
CPU CPU CPU
Node 2
CPU CPU CPU
Node 3
CPU CPU CPU
Shared
memory
Shared
memory
Shared
memory
Common high-speed bus
Common high-speed bus
Clusters
Shared disk, loosely coupled
Dedicated memory
High-speed bus
Shared resources
SMP node
Benefits:
- High availability
- Single database concept, incremental
growth
Limitations:
- Scalability, internode synchronization needed
- Operating system overhead
MPP
CPU
CPU
CPU
CPU
Memory
Memory
Memory
Memory
Disk
Disk
Disk
Disk
MPP
A shared nothing architecture
Many nodes
Fast access
Exclusive memory on a node
Low cost per node
Scalable
nCUBE configuration
MPP Benefits
Unlimited incremental growth
Very scalable
Fast access
Low cost per node
Good for DSS
MPP Limitations
Rigid partitioning
Cache consistency
Restricted disk access
High memory cost per nodes
High management burden
Careful data placement
Windows NT
Architecture based on the client-server model
Benefits:
- Include built-in Web services
- Scalability
- Ease of management and control
Limitations:
- Not as secure
- Cannot execute programs remotely
- Lack linear scalability beyond four processors
- Addressing space for applications is limited to
two gigabytes
Architectural Tiers
Tiered structures:
- Modular
- Logical separation
Distributed structures:
- Two-tier
- Three-tier
- Four-tier (and more)
Middleware
Technologies for integration
Gateway
Database Server
Requirements
Robust
Available
Reliable
Extensible
Scalable
Supportable
Recoverable
Parallel
Parallelism
Database
Query
Load
Index
Sort
Backup
Recovery
Further Considerations
Optimization strategy
Partitioning strategy
Summarization strategy
Indexing techniques
Hardware and software scalability
Availability
Administration
Server Environments
Operational
servers
•Open DBMS
•Network, relational,
hierarchical
•Mainframe
proprietary DBMS
•Oracle, IMS, DB2,
VSAM, Rdb, Non
Stop SQL, RMS
Warehouse
servers
•Open DBMS
•Relational
•General purpose and
warehouse-specific
DBMS
•Oracle, Informix,
Sybase, IBM DB2,
NCR/AT&T Teradata
Red Brick
Data mart
servers
•Open DBMS
•Relational and
multidimensional
•General purpose
and warehouse
specific DBMS
•Oracle, Oracle
Express, Arbor
Essbase, MS SQL
Server, NT
Parallel Processing
A large task broken into smaller tasks:
Concurrent execution
One or more processors
Elapsed time
Processor 1
Parallel
Processor 1
Processor 2
Processor 3
Processor 4
Not parallel
Parallel Database
Increased speed
Improved scalability
Performance gains
- Availability
- Flexibility
- More users
Processor 1
Processor 2
Processor 3
Processor 4
Parallel
Parallel Query
SQL code split among server processes.
SubQuery
Query
SubQuery
SubQuery
Parallel Load
Bypass SQL processing to speed throughput.
Parallel Processing
Reduces the time to create
Allocates memory in cache efficiently
Runs simultaneously from any node
- Offline
- Online
Recovery Runs simultaneously from redo logs
Summaries Uses the CREATE TABLES AS
SELECT statement
Index
Sort
Backup
Summary
This lesson discussed the following topics:
Outlining the basic architecture
requirements for a warehouse
Highlighting the benefits and limitations of
all the different hardware architectures