Validation - Berkeley Database Research
Download
Report
Transcript Validation - Berkeley Database Research
XML, Java, and the future of the
Web
Jon Bosak, Sun Microsystems
Last revised 1997.03.10
Presented and Annotated by
Byung-Hoon Kang
([email protected])
RoadMap :
Part1: Why XML ?
Part2: Example applications
HTML and SGML
SGML and XML
SGML vs. XML vs. HTML
Example applications envisioned in 1997
Database interchange problem
Why solution is not feasible in HTML
XML solution envisioned in 1997
Part3: Why CSS/XSL ?
What are style sheet and why?
Style sheet languages
Why two style sheets language for XML ?
Conclusion: XML,DTD,CSS and XSL
HTML and SGML
SGML allows documents to describe their own grammar
HTML hardwires a small set of tags: a single SGML
specification
HTML
Extensibility: new tag ?
application change?
Yes
Needed
SGML
Yes
Independent
Structure: deep nested ?
No
Yes
Validation: structural validity ?
No
Yes
SGML and XML
SGML :
Contains many optional features not needed for Web applications
Cost/benefit ratio unattractive to current vendors of Web
browsers
DTD is required all the time
XML :
A simplified subset of SGML for Web applications
Retain SGML’s extensibility, structure, and validation
DTD is optionally needed for validation
SGML vs. XML vs. HTML
HTML
Yes/No
SGML
XML
Yes
Yes
Extensibility: new tag ?
Structure: deep nested ?
No
Yes
Yes
Validation: structural validity ?
No
Yes
Yes
DTD is required for parsing ?
No
Yes
Opt.
Browser dependency ?
High
Low
Low
Cost/Benefit ?
Good
Poor
Med
When were you born ?
‘92
‘86
‘98
RoadMap :
Part2: Example applications
Example applications envisioned in 1997
Database interchange problem
Why solution is not feasible in HTML
XML solution envisioned in 1997
Part3: Why CSS/XSL ?
What are style sheet and why?
Style sheet languages
Why two style sheets language for XML ?
Conclusion: XML,DTD,CSS and XSL
XML applications : envisioned in 1997
to mediate between two or more heterogeneous databases.
to distribute a significant proportion of the processing load
from the Web server to the Web client.
to present different views of the same data to different
users.
to tailor information discovery to the needs of individual
users.
Database interchange: the universal hub
The information tracking system for a home health care
agency
Need to key-in all of patient medical histories and billing
data from a variety of doctors, hospitals, pharmacies,
and insurance companies.
1. Log into the hospital's Web site.
2. Become an authorized user.
3. Access the patient's medical records using a Web
browser.
4. Print out the records from the browser.
5. Manually key in the data from the printouts.
Database interchange: the universal hub
A real solution would look more like this:
1. Log into the hospital's Web site.
2. Become an authorized user.
3. Access the patient's medical records in a Web-based
interface that represents the records for that patient with
a folder icon.
4. Drag the folder from the Web application over to the
internal database application.
5. Drop it into the database.
Why solution is not feasible in HTML ?
HTML tag set is too limited
to represent or differentiate between the multitude of database
fields
hard to automate the process
HTML is incapable of representing the variety of structures
in those documents.
HTML lacks mechanism in checking the data for structural
validity
XML solution envisioned in 1997
1. Form a standards consortium for Health Care Markup
Language and its HCML.DTD
2. Adopt a single industry-wide interchange format that
serves as the single output format.
3. Use any standard application to validate and process
HCML marked-up data with HCML.DTD
Similar to the steps in traditional SGML usage
Still have the same problem of standardization.
Is XML advantageous than SGML in doing steps 1, 2 or 3
?
RoadMap :
Part3: Why CSS/XSL ?
What are style sheet and why?
Style sheet languages
Why two style sheets language for XML ?
Conclusion: XML,DTD,CSS and XSL
What are style sheets and Why?
Style sheets
Describe how documents are presented on screens, in
print or in audio.
Detach the formatting information from data source in
document
Why: want to influence the presentation of documents
without sacrificing device-independence or
adding new HTML tags.
Stylesheets
DSSSL (Document Style Semantics and Specification
Language) for SGML
DSSSL-online : the subset of DSSSL for SGML online.
CSS (cascading style sheets) :
provides a style mechanism well suited to the relatively low-level
demands of HTML
currently used a little bit
XSL (extensible style language):
needed to present XML document on the browser.
transformational functionality
Why two Style Sheet languages?
CSS XSL
Can be used with HTML? yes
no
Can be used with XML ? yes
yes
Transformation language? no
yes
Syntax ?
XML
CSS need to be used to style HTML documents
XSL, on the other hand, is able to transform documents
CSS
For example, XSL can be used to transform XML data
into HTML/CSS documents
Both languages can be used to style XML documents
Conclusion: XML, DTD, CSS and
XSL
To validate XML structural input : DTD
To exchange data using XML: None or DTD for structural
validation
To standardize the data exchange model: Still need to form
a consortium
To render XML in XML-support-browser : CSS or XSL
To render XML in HTML-support-browser : Use XSL to
transform XML into HTML/CSS