MANAGING DATA RESOURCES
Download
Report
Transcript MANAGING DATA RESOURCES
FILE ORGANIZATION
• BIT: Binary Digit (0,1;Y,N;On, Off)
• BYTE: Combination of BITS which
represent a CHARACTER
• FIELD: Collection of BYTES which
represent a Fact
• RECORD: Collection of FIELDS
which reflect a TRANSACTION
• FILE: A Collection of Similar
RECORDS
Managing Digital Data
The layout of a personnel file in traditional file organization.
FILE ORGANIZATION
• DATABASE: An Organization’s Electronic
Library of FILES
• Primary FIELD or Key: Field in Each
Record Uniquely Identifies
THIS Record For
RETRIEVAL
UPDATING
SORTING
TRADITIONAL FILE ENVIRONMENT
(FLAT FILE)
•
•
•
•
•
DATA REDUNDANCY
PROGRAM / DATA DEPENDENCY
LACK OF FLEXIBILITY
POOR SECURITY
LACK OF DATA SHARING & AVAILABILITY
Different information making up a student record retained in three different sites.
DATABASE MANAGEMENT SYSTEM
(DBMS)
SOFTWARE TO CREATE & MAINTAIN DATA
*** ENABLES BUSINESS APPLICATIONS TO
EXTRACT DATA
*** INDEPENDENT OF SPECIFIC COMPUTER
PROGRAMS
DBMS
Managing Digital Data
• Traditional Files vs. Databases: Pros and
Cons
– Traditional File Advantages
• Simplicity
• Efficiency
• Customization
– Database Advantages
•
•
•
•
Reduced data redundancy
Application/data independence
Better control
Flexibility
COMPONENTS OF DBMS:
• THE SCHEMA: describe the structure of the
Database.
DATA DEFINITION LANGUAGE:
• Defines Data Elements in Database
• and Used to construct the schema
DATA MANIPULATION LANGUAGE:
• Manipulates Data for Applications
• DATA DICTIONARY:
– Maintains all information supplied by the
developer when constructing the schema
*
DBMS
• Data Definition Language (DDL)
Data definition language to create a schema in NOMAD
Components of Database
Management Systems
• Data Manipulation Language (DML)
– Used to query the database
Figure 8.14 A Paradox query by example
DATA DICTIONARY
A typical data dictionary for a staff file
ADVANTAGES OF DBMS:
• REDUCES COMPLEXITY
• REDUCES DATA REDUNDANCY
• CENTRAL CONTROL OF DATA DEFINITIONS
• REDUCES PROGRAM / DATA DEPENDENCE
• REDUCES DEVELOPMENT / MAINTENANCE COSTS
• ENHANCES SYSTEM FLEXIBILITY
• INCREASES ACCESS / AVAILABILITY OF
INFORMATION
*
DBMS
Database Models
HIERARCHICAL DATA MODEL
– Records are related hierarchically –
each category is a subcategory of the next level up
– Disadvantages of hierarchical databases
• To retrieve a record, a user must start at the root and
navigate the hierarchy.
• If a link is broken, the entire branch is lost.
• Requires considerable data redundancy
ROOT
FIRST
CHILD
2nd
CHILD
Employer
Compensation
Ratings
Salary
Job
Assignments
Pension
Benefits
Insurance
Health
NETWORK DATA MODEL
• VARIATION OF HIERARCHICAL MODEL
• Allows a record to be linked to more than one
parent
• USEFUL FOR MANY-TO-MANY
RELATIONSHIPS
NETWORK
1
NETWORK
A
NETWORK
2
NETWORK
B
NETWORK
C
RELATIONAL DATA MODEL
• Consists of tables; links among entities are
maintained with foreign keys
• DATA IN TABLE FORMAT
– RELATION: TABLE
– TUPLE: ROW (RECORD) IN TABLE
– FIELD: COLUMN (ATTRIBUTE) IN TABLE
HOURS * RATE
ABLE
$ 40.50 $ 10.35
BAXTER $ 38.00 $ 8.75
CHEN
$ 42.70 $ 9.25
DENVER $ 35.90 $ 9.50
TOTAL
$ 419.18
$ 332.50
$ 394.98
$ 341.05
TYPES OR RELATIONS
ONE-TO-ONE:
STUDENT
CLASS
ONE-TO-MANY:
STUDENT
A
MANY-TO-MANY:
ID
STUDENT
B
CLASS
1
STUDENT
C
CLASS
2
STUDENT STUDENT
A
B
STUDENT
C
Relational Operations
• Data Manipulation
– Select : is the selection of records that meets certain condition.
– Project: is the selection of certain columns from a table.
– Join
: is the joining of data from multiple tables.
• Structured Query Language (SQL)
– International standard DDL and DML for
relational DBMS.
– Advantages of using SQL
Users do not need to learn different DDLs and DMLs.
SQL can be embedded in widely used 3rd generation
languages, increasing efficiency and effectiveness.
Programmer not forced to rewrite statements since SQL
statements are portable.
Database Models
Advantages and disadvantages of database models
Database Architecture
• Shared Resource and Client/Server
Systems
– Four basic client/server models
• Applications run at a server
• Applications run on local PCs
• Applications run on both the local PCs and the
server
• Applications and key elements of the database
are split between the PCs and the server
Database Architecture
• Distributed Databases
– Replication :Full copy of the entire database is
stored at all sites
A replicated database: each computer holds a copy of the entire database
Database Architecture
– Fragmentation: Parts of database are stored where they
are most often accessed
A fragmented database: each computer holds only the part of the database that is most
frequently accessed by the local users
Web Databases
• Databases on the Web
–
–
–
–
Catalogs
Libraries
Directories
Client lists and profiles
• When linking a database to the Internet,
consider
• Which application to use
• How to ensure Web surfers do not interfere with database
updates
• How to maintain security
Data Warehousing
• Data warehouse
– Collection of data that supports management
decision making
– Phases in Building a Data Warehouse
• Extraction Phase
• Cleansing Phase
• Loading Phase
• Data Mining
– Selecting, exploring, and modeling data to discover
unknown relationships
OPERATIONAL,
HISTORICAL DATA
Data
Warehousing
INTERNAL
DATA
SOURCES
DATA WAREHOUSE
DATA
ACCESS &
ANALYSIS
EXTRACT,
TRANSFORM
INFORMATION
DIRECTORY
EXTERNAL
DATA
SOURCES
QUERIES &
REPORTS
OLAP
DATA MINING
Data-Mining
Potential applications of data-mining
DATABASE ADMINISTRATION
• DEFINES & ORGANIZES DATABASE
STRUCTURE AND CONTENT
• DEVELOPS SECURITY PROCEDURES
• DEVELOPS DATABASE DOCUMENTATION
• MAINTAINS DBMS
*
Ethical and Societal Issues
A Too-Risky Info Highway
• Out of Hand -- Out of Control
– DBMSs allow organizations to collect, maintain,
and sell vast amounts of private personal data
easily.
• Where is the Information Going?
– Many consumers provide information daily without
being aware of where it is actually going.
– http://www.publicrecordfinder.com/
• The Upside
– Database technology enables better and faster
services.