Data warehouse

Download Report

Transcript Data warehouse

Unit 5
Organizing Data and Information
Learning Outcomes
 Understand records in a database
 Understand components of a DBMS
 Identify database models and types
 Define
 Database concepts
 Structures & record keeping
 Data languages
 Development tools
 Security and admin
Principles of Information Systems, Eighth Edition
2
Data Management
 Without data and the ability to process it, an
organization could not successfully complete most
business activities
 Data consists of raw facts
 To transform data into useful information, it must first
be organized in a meaningful way
Principles of Information Systems, Eighth Edition
3
The Hierarchy of Data
 Character: a group of 8bits forming a byte
 Can be an uppercase letter, lowercase letter, numeric
digit, or special symbol
 Field: group of characters, typically a name, number,
or combination of characters that describes an aspect
of a business object or activity
 Record: collection of related data fields
 File: collection of related records
 Database: collection of integrated and related files
Principles of Information Systems, Eighth Edition
4
The Hierarchy of Data (continued)
Figure 5.1: The Hierarchy of Data
5
Definitions
 Database management system (DBMS): group of
tools that support database management and it’d
performance in an organisation
 Data independence: data is independent from the
program that creates it. This means programs and data
developed in for 1 app are compatible with programs
and data from another app.
 Data redundancy: duplication of data in separate
files – can lead to inconsistency.
Principles of Information Systems, Eighth Edition
6
Database Approach
Principles of Information Systems, Eighth Edition
7
Keys*
 Key: field or set of fields in a record that is used to
identify the record
 Primary key: field or set of fields that uniquely
identifies the record
 Foreign Key: A non-primary key attribute in the table
that is a primary key in the other table.
Principles of Information Systems, Eighth Edition
8
Data Views
 Physical View: Where is data stored physically?
 Drive, disk, surface, track, sector, Record
 Tape, block, record number (key)
 Used by programmers
 Logical View: What data is needed? How do users need
the data to be arranged?
 Organize factors into tables for application use
 Provide a way for tables to cross-reference
*
Principles of Information Systems, Eighth Edition
9
Accessing File Structures
 Sequential file structure e.g. payroll
 Data is stored and accessed in a sequence according to a key
field
 Good for periodic processing of a large volume of data, but
updating with new transactions can be troublesome
 Direct file structure e.g. reservations
 Data stored randomly and accessed directly using an index of
keys and related storage addresses.
 Indexed Sequential file structure
 Records stored sequentially on direct access medium (e.g.
hdd) with address for each record.
 Index keeps track of addresses for speed. Sequential access
for large amounts of data
O’BRIEN
10
Record Processing
 Online vrs Batch
Principles of Information Systems, Eighth Edition
11
DBMS Components
1.
2.
3.
4.
5.
6.
Data dictionary: Stores definitions of data elements, and data
characteristics e.g. data type, amount of storage: txt 30
Data Languages:

Data definition language: Specifies content and structure of
database and defines each data element e.g keys, relationship

Data manipulation language: allows users to access and
change data and produce reports. E.g. SQL, QBE
Application Development Tools: Programs to develop apps to
use the database, e.g. CASE tools
Software Security: protect database from hacking &
unauthorised access.
Archiving, backup & recovery systems: in case database is
lost/crashes, it can be recovered from backups or archives.
Report Writers: tools to specify a report format for retrieving data
Principles of Information Systems, Eighth Edition
12
Sample data dictionary report
Principles of Information Systems, Eighth Edition
13
Structured Query Language (SQL)
Examples of SQL Commands
Principles of Information Systems, Eighth Edition
14
Database Models
1. Hierarchical
2. Network
3. Relational
4. Multidimensional
5. Object-oriented
6. Object-relational
Principles of Information Systems, Eighth Edition
15
Hierarchical Database Model
 May have more than one offspring, but only one parent
 Treelike
 One-to-many relationship
 Used for structured, routine types of transaction processing
Laudon
16
Network database model
 More complex
 Many-to-many relationship
 More flexible but doesn’t support ad hoc requests well
Principles of Information Systems, Eighth Edition
17
Relational Database Model
 Data elements stored in simple tables
 Can link data elements from various tables
 Very supportive of ad hoc requests but slower at
processing large amounts of data than hierarchical or
network models
Principles of Information Systems, Eighth Edition
18
The Relational Database Model
Principles of Information Systems, Eighth Edition
19
The Relational Database Model
(continued)
Principles of Information Systems, Eighth Edition
20
The Relational Database Model
(continued)
Principles of Information Systems, Eighth Edition
21
Multidimensional DB model
 A variation of the
relational model
 Cubes of data and
cubes within cubes
 Popular for online
analytical
processing (OLAP)
applications
Principles of Information Systems, Eighth Edition
22
Object-oriented DB Model
 Data descriptions &





procedures that act on data
are grouped together.
Key technology of
multimedia web-based
applications
Good for complex, highvolume applications
Can store more data types
Objects can be reused
Faster data accessed
Principles of Information Systems, Eighth Edition
23
Object-Relational DB Model
 Combines features of object-oriented and relational
databases
 Standard relational database with facilities for new
data types (e.g. video) and operations to be added.
Principles of Information Systems, Eighth Edition
24
Types of Databases
1. Operational DB – to support business
operations
2. Analytical BD – Summarized data 4 analysis
3. Data Warehouse*
4. Distributed - data may be spread across several
smaller databases in different locations .
5. End-user –developed by managers and other
users on their own PCs
6. External/on-line – pay for access to external
statistical online databases
Principles of Information Systems, Eighth Edition
25
Data Warehouses, Data Marts, and
Data Mining
 Data warehouse: Logical collection of data gathered
from many operational DBs to support business and
decision making.
 Data mart: subset of a data warehouse
 Data mining: automated discovery of patterns, trends
and rules from data in data warehouse to predict
business strategy.
Principles of Information Systems, Eighth Edition
26
Data Warehouses, Data Marts, and
Data Mining (continued)
Figure 5.17: Elements of a Data Warehouse
Principles of Information Systems, Eighth Edition
27
Advantages of DBMSs
 Data sharing
 Reduced data redundancy (no duplicates)
 Improved data consistency (more integrity)
 Data independence (data independent of the
application using it.)
 Improved data admin & control – better overall
protection
 More emphasis on data as a resource
Principles of Information Systems, Eighth Edition
28
Problems with DBMSs
 Concurrency problem – data used at same time - lock
 Resource problems – costly initially
 Security problems - cos data all in one place
Principles of Information Systems, Eighth Edition
29