Relational Databases

Download Report

Transcript Relational Databases

IS605/606: Information Systems
Instructor: Dr. Boris Jukic
Relational Database Systems in
Enterprise Information Systems
File systems


File systems rely on coded file management programs to access,
insert and modify their content
As such, file systems are plagued by two main problems
 Structural dependence refers to the fact that if a file structure is
changed (such as deletion or addition of a field), the related file
management programs have to be modified accordingly
 Data dependence refers to the fact that the changes in data
characteristics, such as changing a field from integer to decimal
(or even just changing the length of the field), will cause the
related file management programs to be changed
 Finally, there is a problem of data redundancy
Data Redundancy

When the same data is stored in more than
one location (in multiple files or multiple fields
within one file) It may lead to:

Data integrity (inconsistency) problems


may be caused by either data entry errors or failure to
update all multiple copies of the same data
Data anomalies: modification, insertion and
deletion
Data Redundancy in File Systems and
Resulting Anomalies
Product ID
001
001
001
002
002
003
003


Product Name Product Description
Product Price Customer ID Customer Name Customer Location Quantity Sold
PlainSounder AM/FM Radio, 2 speakers
$51
C1
Chrysler Corp
Germany
15000
PlainSounder AM/FM Radio, 2 speakers
$51
C2
Circuit City
California
18400
PlainSounder AM/FM Radio, 2 speakers
$51
H1
Hyundai
Korea
26500
BetterBox
Radio, CD, 6 Speakers
$123
G1
GM
Michigan
23650
BetterBox
Radio, CD, 6 Speakers
$123
F1
Ford Motors
Michigan
14000
UltraX
CD, MP3, 8 Speakers
$270
H2
Honda
Japan
7500
UltraX
CD, MP3, 8 Speakers
$270
B1
BMW
Germany
2560
Modification anomaly: if PlainSounder model description changes
Insertion Anomaly: if a new customer (Toyota for example) is
added to the list of those who buy BetterBox product
Database Systems

Database Systems achieve data independence and
structural independence



If data type of as filed is changed or a field is eliminated or
a new one added, the existing management programs
(queries) do NOT have to be modified
If properly designed, databases have a low level of
redundancy, eliminating most of the insertion,
deletion and modification anomalies
Logically related data instead of physically
separated and unrelated files
Four (Logical) Data Models

Hierarchical Model (Legacy)


Standard tree-like structure
Network Model (Legacy)

More than one parent allowed

Relational Model

First truly data and structurally independent model
 No predetermined navigational maps as in two older
models
 The Database technology of choice
Object Model


Tables become objects
RELATIONAL DATABASE
ADVANTAGES

Database advantages from a business
perspective include





Increased flexibility
Increased scalability and performance
Reduced information redundancy
Increased information integrity (quality)
Increased information security
Relational Database Management System


In RMDBS, all data appears to be stored in a
collection of tables (or relations), which are
independent of one another, but can be
linked through common entries in one of the
tables' columns or fields (controlled
redundancy)
Relational Schema: The graph depicting
relationship types between tables
Relational Schema
Reduced Data Redundancy
Products
Customers
Product Id
Product Name
Product Description
Product Price
001
PlainSounder
AM/FM Radio, 2 speakers
002
BetterBox
Radio, CD, 6 Speakers
$123.00
003
UltraX
CD, MP3, 8 Speakers
$270.00
Customer
ID
Customer
Name
Customer
Location
Product
ID
C1
Chrysler
Germany
001
C2
Circuit City
California
001
H1
Hyundai
Korea
001
G1
GM
Michigan
002
F1
Ford Motors
Michigan
002
H2
Honda
Japan
003
B1
BMW
Germany
003
$51.00
Tables in RDBMS





Tables: Logical constructs containing individual entity sets.
Tables are always two-dimensional: rows and columns
 each row represents a single entity (or entity instance) from
the entity set
 each (uniquely named) column represents one attribute
 each row-column intersection results in a single data value
Each table must have a primary key : An attribute uniquely
identifying each row (entity), satisfying the entity integrity
conditions. Null value (no entry) is not permitted for a
primary key.
The order of rows and columns within the table is irrelevant
Foreign Key is an attribute in one table whose values must
either match the value of a primary key in another table or
be set to null (no value). These conditions are known as
referential integrity constraint.
Relationships within the relational
database:
•
•
One-to-one relationships:
One-to many relationships:
•
Examples:
•
•
•
professor - class
department - employee
Many to-many relationships
•
•
Examples: parts – product, student – class, …
it is recommended to break it into a set
(usually two) of one-to-may relationships
through a so called composite (bridge) entity
Enterprise data planning
•
A large component of the business
informational needs can be captured by
the mapping of all entities the
organizations need to keep track of and
the relationships among them
•
E-R (Entity-Relationship) modeling is a
standard technique that provides a simplified
picture of the relationship among entities.
Keys and Relationships: transaction
processing system example
RELATIONAL DATABASE
ADVANTAGES

Database advantages from a business
perspective include





Increased flexibility
Increased scalability and performance
Reduced information redundancy
Increased information integrity (quality)
Increased information security
Increased Flexibility

A well-designed database should:



Handle changes quickly and easily
Provide users with different views
A database has only one physical view


Physical view – deals with the physical storage of
information on a storage device such as a hard disk
A database can have multiple logical views

Logical view – focuses on how users logically access
information to meet particular business needs
Increased Scalability and Performance

A database must scale to meet increased
demand, while maintaining acceptable
performance levels


Scalability – refers to how well a system can
adapt to increased demands
Performance – measures how quickly a system
performs a certain process or transaction
Reduced Information Redundancy

Databases reduce information redundancy


Redundancy – the duplication of information or
storing the same information in multiple places
Inconsistency is one of the primary problems
with redundant information
Increase Information Integrity
(Quality)

Information integrity – a measure of the quality of information

Integrity constraints – rules that help ensure the quality of
information
 Operational integrity constraints – rules that enforce
basic and fundamental information-based constraints
 Business-critical integrity constraints – rules that
enforce business rules vital to an organization’s success
and often require more insight and knowledge than
operational integrity constraints
Increased Information Security

Information is an organizational asset and
must be protected

Databases offer several security features
including:



Passwords – provide authentication of the user
Access levels – determine who has access to the
different types of information
Access controls – Determine types of user
access, such as read-only access
DATABASE MANAGEMENT
SYSTEMS

Database management systems (DBMS) –
software through which users and application
programs interact with a database
INTEGRATING INFORMATION
AMONG MULTIPLE DATABASES

Organizations typically maintain multiple
systems, each with its own database

Integration – allows separate systems to
communicate directly with each other
INTEGRATING INFORMATION
AMONG MULTIPLE DATABASES

Forward integration – takes information entered into a given system
and sends it automatically to all downstream systems and processes
INTEGRATING INFORMATION
AMONG MULTIPLE DATABASES

Backward integration – takes information entered into a given system
and sends it automatically to all upstream systems and processes
INTEGRATING INFORMATION
AMONG MULTIPLE DATABASES

Building a
central
repository
specifically
for
integrated
information