Transcript Lec. notes

MIS 3150 Data and Info
Management
Lecture 1 - Database fundamentals
Arijit Sengupta
Structure of this quarter
ISOM
MIS3150
1. Design
0. Intro
Database
Fundamentals
Conceptual
Modeling
Relational
Model
2. Querying
Query
Languages
Advanced
SQL
3. Applications
4. Advanced
Topics
Java DB
Applications –
JDBC
Transaction
Management
Data
Mining
Normalization
Newbie
Users
Designers
Developers
Professionals
Today’s buzzwords
ISOM
•
•
•
•
•
•
•
Organizational data cycle
Database
Database Management Systems
Data Independence
Centralized Database Systems
Client-Server Database Systems
Distributed Database Systems
Getting used to Pilot
ISOM
•
•
•
•
•
•
Content – lecture notes, etc.
Discussion boards
Dropboxes – for assignments
Groups – form the project groups
Quizzes and exams
Grades
• Finally… Elluminate!
Before we begin – Elluminate!
ISOM
• In the Elluminate tab, find the
Elluminate link for today’s lecture
• When you click the link, a small jws
application will download and start
(might take a few minutes)
• Remember – do not click the
microphone button if you are
using the lab computers.
Elluminate practices
ISOM
•
•
•
•
•
Asking e-questions
Providing e-feedback
Using chat
Using polls and poll results
Using the whiteboard
Poll Question 1
ISOM
How would you evaluate your current
knowledge of databases?
A. Very little or no knowledge
B. Some knowledge (e.g., Access)
C. Good knowledge (used an
enterprise DBMS before)
D. Expert (work/worked in database
industry)
Poll Question 2
ISOM
How would you evaluate your knowledge of
SQL?
A.Little of no knowledge
B.Have written some very basic SQL with
only Select-from-where
C.Have written mid-level SQL with joins
and aggregate functions
D.Expert – have used nested subqueries
and other advanced features
Using the whiteboard
ISOM
Write one thing that you hope you will learn in this course.
Objectives of today’s lecture
ISOM
• Know common database terminology
• Understand requirements and uses of
data
• Know the differences between databases
and flat files
• Realize the importance and need for
databases in problem-solving
• Understand the different types of
databases and their differences
Data: A Resource
ISOM
• The Success of an organization depends
on efficient use of its resources:
 Buildings, factories, equipment
 Technical know-how
 Human resources
 Data
• Data: An important organizational
resource
Electronic Data
ISOM
• Why?
Large volume in a small space
Ease of sharing
Ease of use
Data analysis
• How?
File-based system versus databases
Legacy (File-based) Systems
ISOM
•
•
•
•
•
•
•
•
Uncontrolled data redundancy,
Data inconsistency
Poor data sharing
Difficult to keep up with changes
Record format Vs. user requirements
Programs Vs. record format
Low productivity
High maintenance cost
So what is a database?
ISOM
• According to Oxford English
Dictionary:
“A structured collection of data held in
computer storage; esp. one that
incorporates software to make it
accessible in a variety of ways”
• So does it make pretty much every
collection of data a “database”?
The Database
ISOM
• The data itself PLUS The data
definitions (metadata), applications,
queries and visualizations
Database Approach
ISOM
• Non-redundant collection of logically related
facts
 representing some aspect of the real world
 the data itself plus the data definitions
•
•
•
•
•
Permits sharing
Consistent representation for each piece of data
Avoids (minimizes) redundancy
Allows different user views
Users are isolated from most changes
Why Databases?
ISOM
• Independence from representation
formats
• Control redundancy and consistency
• Ensure integrity/security
• Better scalability
• Allow ad hoc access
• Better maintenance
• Better concurrency
Data Independence
ISOM
• Does data have to be part of programs?
• Do we need to change one if the other
changes?
• Three-tier architecture of databases
View
What the users see
Conceptual
How we model data
Physical
How data is stored
Access Flexibility
ISOM
•
•
•
•
•
Easy to ask ad-hoc questions
No need for separate codes
User-friendly interface
Command-based (e.g., SQL)
Graphical (e.g., QBE)
Data Integrity
ISOM
• Ensures that the stored data are
consistent and correct
• Easy to define global rules
customer_age > 21 years
number_of_credits < 18
• Can allow multiple users to access
data without compromising on data
integrity
Data Security
ISOM
• Access definition
Global
Local
• Uniform access authorization
Data Redundancy
ISOM
•
•
•
•
•
Data need not be replicated
Less wastage of storage space
Less data anomaly
Reduced and controlled redundancy
Tighter control of replicated data
Standardization
ISOM
• Everybody talks the same talk.
Less chance of misunderstanding
Easier to interpret other’s data
• Easier to merge
Useful when several organizations
combine to form one.
Productivity and
Maintenance
ISOM
• Increase in productivity
 User-friendly interface
 Independence from specific data structure
• Easier maintenance
 Less code to maintain
• The DBMS is the bulk of the code.
• Ad-hoc queries make it possible to make do with
much less code.
 The vendor makes revisions of the DBMS.
• Economy of scale
Productivity vs. Maintenance
ISOM
100
90
80
70
60
50
40
30
20
10
0
Prod-DB
Prod-FL
Mnt-DB
Mnt-FL
Start
1st week 2nd week 3rd week 4th week
Disadvantages of Databases
ISOM
•
•
•
•
•
Software complexity
Processing inefficiency
Need for co-ordination
Organizational impact
Risk
DBMS
ISOM
• A Specialized piece of software that
sits between the data and its users.
Database
Management
System
Intension + Extension
Data
DBMS Functions and Users
ISOM
• Four major uses of a DBMS package
 Database Development, Interrogation, Maintenance,
and
 Application Development
• Automated tools for design, query, and
application development
• Database users
 Database administrators ( DBAs )
 Database designer
 End Users
Summary
ISOM
• Data is essential for an organization
• A Database is usually the most
effective way of storing and
organizing data
• File-based Vs. database systems
• Database system properties
• Types of database systems