2.MakingDatabaseSyst..

Download Report

Transcript 2.MakingDatabaseSyst..

Making Database Systems Usable
University of Michigan
H.V. Jagadish, Adriane Chapman, Aaron Elkiss,
Magesh Jayapandian, Yunyao Li, Arnab Nandi, Cong Yu
Mar 27, 2008
Christiano Santiago
1
Agenda
 Users Expectations
 The Persistence of Pain
 The Painless Future
 Conclusion
 Q&A
Mar 27, 2008
Christiano Santiago
2
Users Expectations
Database interactions
 Ability to query in a more sophisticated way,
using complex query semantics
 More precise and complete answers
 Expectation of structure in the result set
 Create and update databases
Mar 27, 2008
Christiano Santiago
3
The Persistence Of Pain
Mar 27, 2008
Christiano Santiago
4
Higher Level Presentation Data Model
Painful Relations
Mar 27, 2008
Christiano Santiago
5
Higher Level Presentation Data Model
Painful Relations
Simple concept versus Implementation
X
Flight info
Mar 27, 2008
Christiano Santiago
6
Higher Level Presentation Data Model
Painful Relations
What are the fligth departure times from Detroit to Beijing?
SELECT s.departure_time
FROM schedule AS s,
flight_info AS f, airports AS d, airports AS a
WHERE s.id = f.schedule_id
AND f.fid = d.id
AND d.city_name = “Beijing”
AND f.tid = a.id
AND a.city_name = “Detroit”
Mar 27, 2008
Christiano Santiago
7
Higher Level Presentation Data Model
Painful Relations
Problems
 Large number of table
 Poorly named entities
 Poorly named fields
 Joins are not intuitive
 Recursive self-joins
 SQL is too onerous
 Dependency on DBA (please don’t leave!)
Mar 27, 2008
Christiano Santiago
8
Multiple Ways to Access Data
Painful Options
Mar 27, 2008
Christiano Santiago
9
Multiple Ways to Access Data
Painful Options
Too much functionality and too many options
X
Mar 27, 2008
Christiano Santiago
10
Multiple Ways to Access Data
Painful Options
Barry Schwartz, The tyranny of choice.
Scientific American, April 2004, pp. 71-75
Mar 27, 2008
Christiano Santiago
11
Multiple Ways to Access Data
Painful Options
Perspectives
 Design systems for customized value
 Care about how well uses can get their job
done
 Forms-based interfaces have been the
primary means to interact with databases
 Provide simple querying tools for novice
users and powerful tools for experts
(customization)
Mar 27, 2008
Christiano Santiago
12
Precision and Recall
Unexpected Pain
Mar 27, 2008
Christiano Santiago
13
Precision and Recall
Unexpected Pain
X
Mar 27, 2008
Christiano Santiago
14
Precision and Recall
Unexpected Pain
Search engines error acceptance
 Expectations are set correctly
 Usually top few results do contain something
relevant
 If no relevant results were found, search it
again
 The web is so huge that no one knows
exactly what is out there
Mar 27, 2008
Christiano Santiago
15
Mar 27, 2008
Christiano Santiago
16
Precision and Recall
Unexpected Pain
Unable to query
 Systems may impede users from querying the way they want
 Scenario: world traveler with infinite flexibility and many
destinations to visit
X
Mar 27, 2008
Christiano Santiago
17
Precision and Recall
Unexpected Pain
Unable to query
 User’s versus form designer’s mental model of the data.
 Users should be worried about how. They just need the
what.
 Challenge #1: it’s not easy to provide a straightforward
and comprehensive way of specifying what a user wants
when a query requires aggregation or multiple complex
joins.
 Challenge #2: performance, security access level, data or
program errors may prevent users from running a query.
Mar 27, 2008
Christiano Santiago
18
Precision and Recall
Unexpected Pain
Unexpected results
 (Unexpected results) + (no explanation) = frustration
Example 1
• The client record shows he has 2 cars
but
• There is only 1 car related to this client in the
database.
# of cars was a field
in the Client table,
with no referential
Integrity with
CarOwner table
Mar 27, 2008
Christiano Santiago
19
Precision and Recall
Unexpected Pain
Unexpected results
 (Unexpected results) + (no explanation) = frustration
Example 2
User is looking for cheap flights provided by her
favorite airline:
• Los Angeles for $75
• Boston for $100
$400 was less
than half the
• San Francisco for $400 (?)
average price
for a ticket to
San Francisco
Mar 27, 2008
Christiano Santiago
20
Precision and Recall
Unexpected Pain
Unexpected results
 Empty & non-empty results
 Users may need to know where a result
comes from
 Users may need to know why a result is
returned
 System’s usability can be significantly
affected when no explanation can be given
Mar 27, 2008
Christiano Santiago
21
Errors In Query Construction
Unseen Pain
Mar 27, 2008
Christiano Santiago
22
Errors In Query Construction
Unseen Pain
Database Users versus Computer Scientists
X
Mar 27, 2008
Christiano Santiago
23
Errors In Query Construction
Unseen Pain
Mar 27, 2008
Christiano Santiago
24
Errors In Query Construction
Unseen Pain
Perspectives
 Query purpose: provide information that the
user doesn’t have
 How do I provide a clear and precise “from”
and “where” clause, if I don’t know the
possibilities?
 WYSIWYG
Mar 27, 2008
Christiano Santiago
25
Errors In Query Construction
Unseen Pain
WYISWYG
Assisted Querying using Instant-Response Interfaces
Arnab Nandi & H. V. Jagadish - University of Michigan
Mar 27, 2008
Christiano Santiago
26
Errors In Query Construction
Unseen Pain
WYISWYG
 Google Earth approach
 Can you image finding a location in a map
using coordinates instead of zoom and
rotation?
Mar 27, 2008
Christiano Santiago
27
Database Creation
Birthing pain
Mar 27, 2008
Christiano Santiago
28
Database Creation
Birthing pain
Corporate Market versus Average User
X
Mar 27, 2008
Christiano Santiago
29
Database Creation
Birthing pain
Perspectives
 Users want to store their data into databases
but this is not an easy task





Database schema
Table structure
Normalization
Relationships
Constraints
Mar 27, 2008
Christiano Santiago
30
Database Creation
Birthing pain
Perspectives
 An everyday database



No clear view of required database structure
No comprehensive design at starting point
Structure grows as time passes
 Structure grows incrementally
 Need to support both structured and
unstructured information
Mar 27, 2008
Christiano Santiago
31
The Painless Future
Mar 27, 2008
Christiano Santiago
32
The Painless Future
Geographic Model
 Joins between data sets using geographic or
spatial location as the basis
Mar 27, 2008
Christiano Santiago
33
The Painless Future
Network Model
 Graph or network representation of data.
Mar 27, 2008
Christiano Santiago
34
The Painless Future
Multidimensional Model
 Data as points in multi-dimensional space
Mar 27, 2008
Christiano Santiago
35
The Painless Future
Tabular Model
 People is used to seeing data represented in
simple two-dimensional tables
Mar 27, 2008
Christiano Santiago
36
The Painless Future
Presentation Data Models
Perspectives
 Data manipulation through presentation data model versus view
updates
 Consistency among view options: geographic and textual views
in synchrony
 Data provenance: “why” and “where” related to result sets
 Direct data manipulation


Point-and-click, drag-and-drop, etc.
New few number of algebra operators to support it
 Schema-later (unstructured) and heterogeneous database
design
Mar 27, 2008
Christiano Santiago
37
Conclusion
 Databases are extremely difficult for most
people to interact with.
 We must rethink the architecture of the
database systems as a whole.
 Presentation model as a distinct layer above
usual logical data model.
Mar 27, 2008
Christiano Santiago
38
Questions & Answers
Mar 27, 2008
Christiano Santiago
39