Consonant Characters and Inherent Vowels

Download Report

Transcript Consonant Characters and Inherent Vowels

New i18n Developments
at the W3C
An overview of the W3C
Internationalization Activity
Richard Ishida
W3C
Plan

W3C Overview

Recent I18n Activity organizational changes

ITS Working Group

GEO Working Group

I18n Core Working Group
W3C Overview
W3C Overview
 What?
Mission:
Lead the
technical evolution of
the Web
and ensure its
interoperability
Created and led by:
Sir Tim BernersLee, inventor of
the WWW.
Keywords:
consensus and
vendor neutrality
W3C Overview
 Who?
ERCIM
MIT
3 Hosts
Keio
W3C Overview
 Who?
ERCIM
MIT
3 Hosts
14 Offices
60 Staff (20
locations)
Keio
W3C Overview
 Who?
3 Hosts
14 Offices
60 Staff (20
locations)
360 Members
40 Liaisons
• HP, Microsoft, Sun, WebMethods, Sony, Fujitsu,
Software AG, IBM, Apple, Elisa, …
• HP, Microsoft, Sun, WebMethods, Sony, Fujitsu,
Software AG,
• IBM, Apple, Elisa, …
• Nokia, Siemens, Vodaphone, DoCoMo, T-Online, …
• Academia Sinica, FhG, MIT, CSIRO, EUnet, ETRI,
ERCIM, …
• Boeing, ChevronTexaco, Agfa, DaimlerChrysler,
Elsevier, …
• and many more…
W3C Overview
 How?
Architecture
4 Domains
Interaction
Technology & Society
Web Accessibility
W3C Overview
 How?
4 Domains
32 Working
Groups
13 Interest
Groups
7
Coordination
Groups
Architecture
Interaction
XML Core
XML Query
XML Schema
XSL
Internationalization
WS Addressing
WS Choreog.
WS Descn.
XML Protocol
Compound docs
XForms
Device Independence
SVG
HTML
Multimodal Interaction
CSS
Symm
Timed Text
Voice Browser
Technology & Society
Web Accessibility
Patent Policy
Policy & Outreach
P3P Spec.
RDF Data Access
Semantic Web BPD
Key Mng.
Education & Outreach
Authoring Tools
Evaluation & Repair
Protocols & Formats
User Agent Accessibility
Web Content Acc.
Recent I18n Activity
Organizational Changes
I18n Activity Changes
 New I18N Activity structure
Chairs
I18N Core WG
I18N WG
Reviews
CharMod
Locale info
Addison Phillips,
Quest software
GEO WG
Articles
Guidelines
Tests
Tutorials
Core TF
WS TF
GEO TF
Richard Ishida,
W3C
ITS WG
NEW
I18N IG
member only
XML tag set
Guidelines
Yves Savourel,
Enlaso
I18N IG
public
Martin Dürst,
Aoyama Gakuin Uni.
I18n Activity Changes
 New I18N Activity Team changes
Martin Dürst
Felix Sasaki
Internationalization Tag Set
(ITS) Working Group
http://www.w3.org/International/its/
ITS Working Group
 Provide a set of bidi tags.
✓
The title says "W3C ,‫ "פעילות הבינאום‬in Hebrew.
✗
Using the bidi algorithm only
The title says "‫פעילות הבינאום‬, W3C" in Hebrew.
Markup (characters ordered as in memory!)
"<span dir="rtl">‫ ם ו א נ י ב ה ת ו ל י ע פ‬, W 3 C</span>"
ITS Working Group
 Allow for identification of non-translatable text.
At the VISTA console, submit a job to print. (Refer to “Submitting a Job” in
Chapter 5.)
At the operator control panel, make sure the
printing system is in Make-Ready mode. The
MAKE-READY/RUN indicator should not be lit.
Press the START button to sound the horn. The
MAKE READY / RUN indicator flashes.
At the third beep, press the START button again.
The START indicator remains lit and paper
<para> movement begins.
Press the START button to sound the horn. The
MAKE-READY/ RUN indicator flashes.
When the web reaches minimum print speed, the test pattern prints.
</para> Press the MAKE-READY/RUN button to place the printing system in Run
mode and start printing the live test pages. The MAKE-READY/ RUN indicator
should be lit.
ITS Working Group
 Allow for identification of non-translatable text.
Von der VISTA-Konsole aus einem Druckauftrag übermitteln. (Siehe hierzu
“Auftrag übergeben” in Kapitel 5.)
Am Steuerpult prüfen, ob der Make-ReadyModus aktiv ist. (Die Anzeige MAKEREADY/RUN darf nicht leuchten).
START drücken, so dass die Hupe ertönt und die
Anzeige MAKE READY / RUN blinkt.
Beim dritten Ton erneut START drücken. Die
Anzeige START leuchtet konstant, und der
<para> Papiertransport läuft an.
Press the
<span translate="no">START</span>
button to sound the horn. The
<span translate="no">MAKE-READY/ RUN</span>
indicator flashes.
</para>
ITS Working Group
 Guidelines for format developers
Volcanic eruptions have
literally devastated large
inhabited areas. During the
1914 eruption of
Sakurajima in Kyushu, 687
houses in Kurokami were
buried in hot ash. What
remained of this shrine
gate, previously five meters
tall, was left as a reminder.
Kurokami maibutsu gate
(腹五社神社黒神埋没鳥居),
Sakurajima Island.
<image src="kk-torii.jpg" height="180" width="240"
caption="Kurokami maibutsu gate (腹五社神社黒神埋
没鳥居), Sakurajima Island."/>
ITS Working Group
 Guidelines for format developers
Volcanic eruptions have
literally devastated large
inhabited areas. During the
1914 eruption of
Sakurajima in Kyushu, 687
houses in Kurokami were
buried in hot ash. What
remained of this shrine
gate, previously five meters
tall, was left as a reminder.
Kurokami maibutsu gate
(腹五社神社黒神埋没鳥居),
Sakurajima Island.
<image src="kk-torii.jpg" height="180" width="240">
<caption>Kurokami maibutsu gate
(<span xml:lang="ja">腹五社神社黒神埋没鳥居</span>),
Sakurajima Island.</caption>
</image>
Other things: bidi markup, abbr, styling, etc.
Guidelines, Education &
Outreach (GEO)
Working Group
http://www.w3.org/International/geo/
GEO Working Group
 Available articles
General

International & multilingual Web sites

Monolingual vs. multilingual Web sites

Serving XHTML 1.0
Characters & encoding

Character encodings

Document character set

Who uses Unicode?

An Introduction to Multilingual Web Addresses

Language tags in HTML and XML

The HTTP Charset parameter

Checking HTTP headers

Checking the character encoding using the
validator

CSS character encoding declarations

Multilingual form encoding

HTML, XHTML, XML and control codes

Unexpected characters or blank lines

Setting encoding in web authoring applications

Setting 'charset' information in .htaccess

Missing characters and glyphs

Non-English tags
Language

Why use the language attribute?

Using HTTP and meta for language
information

Language tags in HTML and XML

2-letter or 3-letter language codes

Styling using the lang attribute

Setting language preferences in a browser



When to use language negotiation
Apache MultiViews language negotiation set
up
Accept-Language used for locale setting
Bidirectional text

What you need to know about the bidi
algorithm and inline markup

Bidi space loss

XHMTL/HTML & bidi formatting codes vs.
markup

CSS vs. markup for bidi support
Resource identifiers

An Introduction to Multilingual Web Addresses
Local formats

Displaying formatted dates

Select box sorting

Dates and time
Miscellaneous

Display capabilities

Script direction & languages

Introduction to Indic Scripts

Ruby
On the way

Global gateway pull-downs

XSLT for multilingual output

Upgrading to Unicode

Using the link element
GEO Working Group
 Tutorials








Now available
Using language information in XHTML, HTML
and CSS
Character sets & encodings in XHTML, HTML
and CSS
In preparation
Declaring Language in HTML and HTML
Creating Bidirectional XHTML/HTML Pages
Multilingual Web Addresses
An Introduction to Writing Systems
GEO Working Group
 Site improvements

Site search

'Site map'

Topic index

Techniques index

News filters & RSS feeds

Translation process

Redesign, liaising with WAI (in
preparation)

'Getting started' text

We need your feedback
GEO Working Group
 Distinguish between text-processing language and
primary language metadata

Text-processing language



the language of a specific range of text
used for processing such as text-tospeech, styling, etc.
can indicate only ONE language at a
time
The French
word for cat
is chat.
This is French text.
GEO Working Group
 Distinguish between text-processing language and
primary language metadata

Text-processing language




the language of a specific range of text
used for processing such as text-tospeech, styling, etc.
can indicate only ONE language at a
time
The French
word for cat
is chat.
This is French text.
Primary language metadata



describes the language(s) of the
document as a whole
not a list of all languages used in the
document
The French
word for cat
is chat.
could be more than one language
This is an English document.
GEO Working Group
 Primary language is associated with the intended
audience of the document
Japanese phrase book
hello
今日は
thank you
ありがとう
goodbye
さようなら
English
Greetings
Bienvenue
This Web page
contains an
enormous
amount of
useful stuff.
Cette page
contient une
quantité
énorme de
choses
utiles.
We hope you
will enjoy
Nous
espérons
que ceci
vous plaira.
MyOrganization
Go here
Go there
Go
elsewhere
English & French
Go back
French
Bonjour cher
lecteur.
Voici des choses
que vous pouvez
lire. Nous les
avons traduits pour
que vous les
compreniez;
malgré le fait que
la navigation reste
en anglais.
GEO Working Group
 Ways of indicating language information
HTTP Content-Language header
Language attribute on html tag
Content-Language meta tag
Language attribute on embedded
element
HTTP/1.1 200 OK
Date: Wed, 05 Nov 2003 10:46:04 GMT
Server: Apache/1.3.28 (Unix) PHP/4.2.3
…
Content-Type: text/html; charset=utf-8
Content-Language: en
<html lang="en">
<head>
…
<meta http-equiv="Content-Language" content="en" />
…
</head>
<body>
<p>The French word for <em>cat</em> is
<em lang="fr">chat</em>.</p>
…
</body>
</html>
I18n Core Working Group
http://www.w3.org/International/core/
I18n Core Working Group
 Publications
Character Model for the World Wide Web
Fundamentals
REC!
Resource Identifiers
CR
Normalization
WD
International Resource Identifiers (IRIs)
Proposed Standard
I18n Core Working Group
 Reviews
•
UTR#23
•
SSML
•
XML 1.0 3e
•
XML ID
•
XSLT 2.0,
•
SVG 1.2
•
XQuery 1.0 Serialization
•
XHTML 2.0
•
TAG Web Arch document
•
VoiceXML 2.1
•
CSS3
•
Media Types in Binary Data in XML
•
CCXML
•
SOAP 1.2
•
CSS 2.1
•
WSDL 2.0
Check out the Radar chart
I18n Core Working Group
 XHTML2 review insights
• Use of quotes
• Elimination of title
• Abbreviations
• Entities
• etc.
Thankyou
http://www.w3.org/International/