Transcript Databases

Course information
CS 848–Winter 2003
Topics in Databases: Information Integration
Instructor: Grant Weddell
Office: DC3109; Phone: ext 4463;
Email: [email protected]
CS848: Topics in Databases: Information Integration
Course information (cont’d)
Organization: The first part of the course will consist of lectures that
will be given by the instructor. The second part of the course will
consist of seminars to be delivered by students. The lectures and
seminars will take place in DC3313 on Tuesdays from 4 to 6pm. Our
first class will take place on January 7th, 2003.
Credit: Students will be graded on their class participation, on their
seminar and on a project to be negotiated with the instructor near the
start of the course. Note that class participation will entail reading
assignments.
Course web site: db.uwaterloo.ca/~gweddell/cs848
CS848: Topics in Databases: Information Integration
Information integration
SQL Server
Global Schema
Subsystem1
Open, Scan, etc.
Subsystem2
SQL
XQuery
SQL
Disk Files
XML Server
Subsystem3
Another SQL Server
CS848: Topics in Databases: Information Integration
Course focus
Query optimization in information integration:
Translation of queries on a global schema to query plans
that are able to efficiently marshal the needed data by
communicating in appropriate (sub)languages with other
servers. [Ullman, 1999], [Levy et at., 1995]
CS848: Topics in Databases: Information Integration
General observations
 Need a sufficiently powerful notion of a database.
 Need a sufficiently powerful way of capturing
database schema.
 Can SQL and the relational model suffice?
 Structured databases, semi-structured databases,
low level APIs, …
CS848: Topics in Databases: Information Integration
Observations (cont’d)
 GAV/LAV/GLAV [Lenzerini, 2002]
 Doall vs intelligent locals.
 Steps in optimization.
 Cost based optimization; extra logical aspects.
 Grow/shrink.
CS848: Topics in Databases: Information Integration