The Making of the Queen's Manuscript

Download Report

Transcript The Making of the Queen's Manuscript

Digital Philology
from XML & XSLT to
XPath 2.0 & XQuery
in Medieval Transcriptions
The Arts & Humanities Research Council
The Roberts Fund
CNRS
Agence Nationale de la Recherche
Atelier Villejuif, LAMOP, Paris
mai 2008
Un laboratoire pour la
philologie numérique
– une bourse gagné des
Fonds Roberts par Mansfield
en 2007 afin de préparer les
outils XML pour les
chercheurs ingénieurs en
Moyen Français
www.python.org
xmlfr.org
UTF-8 is an
8-bit Unicode
Transformation
Format
It forgives
non-accented
characters but
You must
manage things
when your
web page
has
accented
characters
eg Georgian Commas
• ii •
While an ampersand or une esperluette
we show as an html entity
&
November 2007 TEI P5 Announced
Full elements need an opening and a closing tag
while an ‘empty’ element is complete in one tag.
A full element is opened <name> and closed </name>
While
An empty element is complete within itself:
<abbr expan="n"/>t
November 2007 TEI P5 Announced
Our empty elements, most of which Python converts from
simple rules:
tie<abbr expan="n"/>t
<cb n="b"/> here marking the start of column b
<lb n="CDAM.369c:01"/>
<layout ruledLines="38"/>
<pb n="241" side="r"/> page break, here marking the start of fol. 241r
The only one you will need to type in will be:
soubz<space/>rire
will show as
'soubz rire' in the diplomatic transcription and as
'soubzrire' in the modern edition
XPath 2.0
in EditiX
EditiX and Oxygen have built-in transform
Processors so that you can apply
XSLT scripts to your XML transcription
at any time during your tagging work
You cannot change the HTML webpage. It is frozen. However, it does
highlight for you any potential problems
XML + XSL
= HTML Web Page
Our Portfolio of Essential Files
transcription.xml the text you are transcribing.
4431.dtd is our Document Type Description which lists in alphabetic
order the elementsand their attributes which we have chosen from
the TEI P5 guidelines
mat/table.xsl is our main transform program for the whole manuscript
and for individual books.
It differs from our specialised XSLT programs
(which include the XSLTransform: glossary.xsl for rendering a table of glosses
from all those we have typed-in inline).
Please be careful when downloading XSLTransform
files because some systems (especially on Windows)
add a further file extension of .xml after the .xsl.
classes.css is our full list of class definitions for our cascading style-sheet
which renders font size and colour on the final web version.
javascript.js is CM's JavaScript or ECMAScript code for the interactive
glosses and name pop-up windows.
fleur.png is the triangular red notes pointer image for the web versions
of the online Harley transcription:
get.html is the pilot version of our Glossary File and system which
passes a link to the DMF2 at our partner laboratory ATILF.
But for the ‘grand public” we need a way out of the building-site
A Native XML Database
Native, so we do not have to do any
additional coding beyond our TEI P5 XML
http://sligachan.lib.ed.ac.uk:8080/exist/admin/admin.xql
arnoul.vjf.cnrs.fr/actes
Future Funding
HERA Joint Research Programmes
http://www.heranet.info/Default.aspx?ID=274
National Contacts:
France - CNRS Bruno Laurioux
[email protected]
United Kingdom - AHRC
Christelle Pellecuer [email protected]
charlie.mansfield à vodafone.net
FiN