Database Concepts - classes.ischool.syr.edu

Download Report

Transcript Database Concepts - classes.ischool.syr.edu

THE RELATIONAL DATABASE
MODEL & THE DATABASE
DEVELOPMENT PROCESS
Fact of the Week:
According to a Gartner study in ‘06, Microsoft
SQL server had the highest annual growth of
any proprietary DBMS at 28% increasing
market share from 15.6% to 17.4%
Learning Objectives





Describe the Relational Model
Define relational terms and understand the
terminology in practice.
Understand these relational terms through practice
Explain the System Development Life Cycle (SDLC)
Explain the Database Life Cycle (DBLC)
Fundamental Axioms of DBMSs





Users communicate with computer applications
(websites, etc)
Computer applications communicate with DBMSs.
Users do not communicate with DBMSs directly.
As a result DBMSs although they can be used
interactively, are not used interactively.
A DBMS is not a replacement for sound database
design principles.
First of many examples for the semester

https://labjobs.ischool.syr.edu
Website
You
Web Server (IIS)
Web Browser
Application
Components
User Interface
and Business
Logic
(HTML/C#)
Db using
Relational
Model
SQL Server
Data access
logic and
Data (SQL)
The Relational Database Model



Many DBMS applications implement the Relational
Model, but none of them enforce it. This permits rookie
database designers to shoot themselves in the foot. And
many have (including yours truly).
The Relational Model has: (Codd’s 3 rules)
1)Data Independence


2) Data Consistency


Clear separation between data and metadata
Minimal redundancy; the data adopts the “DRY” principle
3) Easy to use

You don’t have to understand the implementation to use it.
You can build a poorly-designed DB in a DBMS
Do you see problems with this database design?
Think Codd: Independence? Consistency? Ease-of-Use?
Asset#
1
2
3
4
5
6
7
8
Resource Sign-out Sheet
Item
Category Employee Phone Date Out
The ABC's of Excel
Book
Dave Smith
3321
9-Jun
InFocus Projector
Hardware Sally Jones
4539
2-Jun
Prog. for Dummies
Book
Art Wilson
9091
20-Jun
Learning Perl
Book
Keith Way
3382
Dell Laptop #1
Hardware Sally Jones
4539
20-Jun
Windows 2000 Server Solftware
Office 2000 Premium Software Dave Smith
3321
12-Jun
Dell Laptop #2
Hardware
Change Sally Jones’s phone #? How many ‘Software’ in the database?
Add New Employee Bob Smith? Delete Dave Smith (no longer works here)
Activity: Relational Terminology
Identify Each of
These :
 Table
 Relation
 Row
 Column
 Tuple
 Attribute
 Physical Domain
 Logical Domain
Asset
Item
Category Asset#
The ABC's of Excel Book
1
InFocus Projector
Hardware
2
Prog. for Dummies
Book
3
Learning Perl
Book
4
Dell Laptop #1
Hardware
5
Windows 2000 Server Software
6
Office 2000 Premium Software
7
Dell Laptop #2
Hardware
8
Tables are “Buckets for yer data”
“Products
Appear on
Orders”
“Customers
Place
Orders”
Orders
Customers
Tables:
• Specialized
• Order doesn’t matter
• Contain real items
Products
DBMS : Physical Domain
Different “flavors” of DBMSs use different data types.
DBMS: Logical Domain / Constraints


Default Value – a value entered into an attribute for
a row when one isn’t specified.
Check Constraint – an expression which must be
evaluated prior to the insertion of a row. Eg.
Employee_hourly_wage >= 0

Unique Constraint – ensures duplicate values are not
inserted into a column. (Secondary Keys should have unique
constraints)

Lookup table – a separate table containing all of the
acceptable values for a given column, typically
varchar (Think drop down list) The column you’re trying to
constrain is a FK to the lookup table.
Activity: The Relational Table
Relation Name?
Attributes?
Physical Domain of Columns?
Logical Domain of Columns?
Candidate Keys?
The Primary Key Constraint




Special selected constraint (you choose it)
Enforces entity integrity on the table.
Must be data unique for each row
Should be a candidate key
Good PK / Bad PK
Good candidate key choices?
 Customer Name?
 Email Address?
 Name and DOB?
 SSN?
 Customer Selected value?
 Random Unique #?
 Sequential Unique #?
 Last two are examples of
surrogate keys

The best PK’s
 Don’t
change …
ever!
 Have no external
meaning ...
 Do not compromise
security…
 Do not hinder
performance …
Activity: Which of these is a good PK?
Candidate keys? Best primary key? Why? Should a Surrogate key be used?
The Foreign Key constraint


The foreign key is a column added to one relation so
that it can be associated with another relation.
Foreign keys must have referential integrity – their
values must come from the corresponding PK column
in the relation.
Example of FK: The Lookup Table
Asset
Item
Category Asset#
The ABC's of Excel
Book
1
InFocus Projector
Hardware
2
Prog. for Dummies
Book
3
Learning Perl
Book
4
Dell Laptop #1
Hardware
5
Windows 2000 Server Software
6
Office 2000 Premium Software
7
Dell Laptop #2
Hardware
8
CategoryID
Book
Hardware
Foreign Key
Software
Activity: Find the keys

Candidate? Primary? Foreign? Secondary? Surrogate?
One more time.
Tables
Rule for joining tables
Columns: Physical Domain? Logical Domain? Alllow Null?
Keys: Candidate? Primary? Foreign, Surrogate, Secondary?
The Natural Join of PK-FK at Work
Natural
Join
FK Example: Implementation of a 1-M Relationship
FK Example: Implementation of a M-N Relationship
This M-M Relationship
has been resolved into
two 1-M relationships
Example:Null and Flags
Nulls typically cause
problems in Varchar and
Numeric, and bit fields
Employee
EID
101
102
103
104
105
Ename
ETermDate EBenefitPlan
Willie Survive
BluePoint
Mike Rophone
01/01/04
Curt Tens
02/13/04
Sara Doctorintahaus
OrangePoint
Dustin Dawind
103 and 105 are
null for different
reasons!
Null makes sense
for this column
Employee
EID
101
102
103
104
105
Ename
ETermDate EBenefitPlan
Willie Survive
BluePoint
Mike Rophone
01/01/04 Term
Curt Tens
02/13/04 Term
Sara Doctorintahaus
OrangePoint
Dustin Dawind
Op-Out
Flags used to
represent
different status
Data Models: Abstraction Levels
Conceptual
Logical
Internal
Highly Abstract
Hardware and Software
Independent
Somewhat Abstract
Hardware Independent
Software Dependent
External
Physical
Not Abstract (Concrete)
Hardware and Software Dependent
Systems Development Lifecycle
SDLC / DBLC
II
Analysis
III
Design
IV
Implementation
Resources
I
Planning
Logical
Model
Conceptual
Model
Physical
Model
Time
Internal / External
Model
V
Maintenance &
Support
THE RELATIONAL DATABASE
MODEL & THE DATABASE
DEVELOPMENT PROCESS
Questions?