OCLC and FRBR

Download Report

Transcript OCLC and FRBR

Lisbon
9 October
2008
OCLC and FRBR
Janifer Gatenby
Research Integration and Standards
OCLC, Leiden
Agenda
WorldCat coverage
Implementations of FRBR
• Manifestations and Works
• Authorities
FRBR web and data services
Research projects and the future
WorldCat: Multi-lingual resource
Multilingual WorldCat
1998:
37.5 records
23.9 m
2007:
86.0 records
45.3 m
French
German
2.3 m
2.2 m
5.3 m
9.8 m
Spanish
Japanese
1.6 m
.8 m
3.3 m
1.8 m
Russian
Chinese
.8 m
.7 m
1.5 m
2.0 m
Italian
Latin
.7 m
.3 m
1.4 m
1.0 m
Portuguese
Dutch
.3 m
.2 m
.8 m
2.5 m
Hebrew
.2 m
.5 m
Total Records
English
1998
36%*
Total Records: 110,861,114
2007
47%*
English and Non-English: 2008 April
*Percentage of Non-English Records
Holdings: 1,300 million
Libraries: 69,000
Articles: 60 million & growing
50%
50%
English
Non-English
FY09 Data Loading
Library or Group
Records
National Library
of Israel
1.2m
UNB Canada
GBV
Comments
Hebrew, Arabic and Cyrillic
records in vernacular. 788k
new records added.
Coming:
National Library
of China
1m
19.2m
33.7 million holdings
11.2 million new records
added
Bibliothèque
Nationale de
France
U. Québec
Target load for 2008 / 2009 300-500 million
Unity UK
Implementations of FRBR
WorldCat
FictionFinder
WorldCat Identities
WorldCat is FRBRised
http://www.worldcat.org
WorldCat and FRBR
WorldCat is “FRBRised”
• 110 million records, 85 million works (1.3 manifestations per
work)
How achieved
• Algorithm
• Table of uniform titles to ignore
• Xref tables (large)
• Data mining WorldCat
• VIAF matching techniques
FRBR Web and data services
xISBN
xOCLC
xISSN
WorldCat API
Work Cluster Service
WorldCat xISBN & xISSN
• Web services to find all related editions of a book or serial
• Easily incorporated into library catalogs, Web sites, and other library
applications
• See http://worldcat.org/devnet for more info
100+ ISBNs for Sorcerers Stone
32 English (US and UK)
9 Spanish
3 Russian, German, Finnish , Latin
2 Chinese, Czech, French, Korean, Norwegian, Persian, Polish,
Portuguese, Romanian, Turkish, Welsh,
1 Afrikaans, Albanian, Armenian, Basque, Bengali, Georgian, Galician,
Gaelic, Ancient Greek, Greek, Gujarati, Hindi, Hungarian, Icelandic,
Italian, Japanese, Latvian, Lithuanian, Malayalam, Sherpa,
Slovenian, Swedish, Thai, Ukranian, Urdu
16 Audio
59 Book
ISSN History Tool
http://worldcat.org/xissn/titlehistory?issn=0888-5885
xOCLC
http://xisbn.worldcat.org/xisbnadmin/xoclcnum/index.htm
• Submit OCLC number
• Receive related identifiers sorted by holdings occurrence
• total WorldCat records: 110,861,884
• total records with ISBNs: 22,739,250 (20.51%)
• total records with LCCNs: 12,279,387 (11.07%
http://xisbn.worldcat.org/xisbnadmin/xoclcnum/stat.htm
WorldCat API
Integrate WorldCat data
FRBR work clusters
Accepts: SRU & OpenSearch
Outputs: MARCXML, Dublin Core, RSS, Atom
WorldCat Grid Developers’ Network:
http://www.worldcat.org/devnet/blog/
http://worldcat.org/devnet/wiki/SearchAPIDemos
http://www.worldcat.org/blogs/
Work Cluster Service
Data export service
• provision of work level identifiers associated with a member’s subset of
WorldCat
SRU record update – real time exposure
PiCarta
NCC
PUSH
L
O
G
SRU UPDATE M21
WorldCat
WorldCat identifiers,
FRBR work id
synchronisation
GGC
Research Projects and the Future
VIAF
DAI
Identities Hub
GLIMIR
VIAF
Invitation to extend: 2008
DAI – Digital Author Identifier
SURF requires all researchers to have
Identification which is registered in
The Dutch National Union Catalogue
Authority file
Investigating loading Dutch
Authority file to VIAF
Identities Hub (OR / RLG partners project)
End user editing Phases
• Merging identities
• Move selected works from one
identity to another
Additional authority information
under consideration
• Institution affiliations
• Subject areas
• Birth and death dates
• Create new identity
• Email addresses
• Edit an identity
• Co-authors
Coming 2009: WorldCat Identities API – LCCN & name search
GLIMIR
Global Library Manifestation
Identifier
• No one single manifestation
identifier
• ISBN, ISSN, ISMN (music), ISRC
(sound recordings), V-ISAN
(audio-visual)
Only 30% of WorldCat
resources have an
international
identifier
Identifiers are the key to navigation
among web sites for resources &
information about resources; & thus
the key to mash-ups
Outside use of OCLC identifiers….
A subsidiary of the US ISBN Agency
Linking inwards
Permalinks
• Simple URLs permitting direct access into
WorldCat
WorldCat API
• Identifiers give direct access to metadata
and thence enriched content & full text
www.worldcat.org/oclc/225507364
View point of libraries
All resources identified in a global scheme
• Consolidate hits for better ranking
Possible to do cross database links more reliably; mash-ups
Improved quality of WorldCat
• Better exposure
• Facilitates cataloguing
Family of Identifiers
ISCI
WCat Identities, VIAF
FRBR clusters
ISNI, ISIL
ISTC, ISWC, ISAN
Work
Expression
Manifestation
Manifestation
Authors
Expression
Manifestation
Manifestation
Subjects,
Dewey +
GLIMIR
ISBN, ISSN, ISMN, VISAN
Linked content at the right level
Biographies, affiliations
Reviews, evaluation,
lists, prizes
Work
Expression
Manifestation
Manifestation
Authors
Expression
Manifestation
Manifestation
Subjects,
Dewey +
Full text links, usage
statistics, cover art,
holdings
Obrigado