The Chandra Bibliography Database
Arnold Rots, Sherry Winkelman,
Sarah Blecksmith, John Bright
Chandra Data Archive Operations Group, CXC/SAO
Stéphane Paltani
Observatoire de Marseille
CXC
Summary
Existing capability
Identifiers and automatic linking
Extension of the database
Attributes
Database design
Database maintenance
Services
Conclusion and coming attractions
This presentation is adapted from a paper given at ADASS XIII
Pages 9-13 are less relevant and provided FYI only
The interface on pages 15 and 16 is an old version, provided for illustrative
purposes only; please try the prerelease, using the URL on page 18
CXC
2004-01-12
Chandra Users Committee: Chandra Data Archive
2
Existing Capability
On the part of the archive:
– Links from datasets (observations)
to articles in the ADS
– Scattered links to some specific
articles
On the part of the ADS:
– Links from articles (bibcodes) to
datasets in data center archives
– General project tags
This is very valuable, but also very
labor-intensive
3
Existing Capability
4
Identifiers and Automatic Linking
The ADS, the data centers, and US journal editors have reached
an agreement that will enable authors to insert these links
directly in a manuscript
Central to such linking are IVOA-compliant dataset identifiers
– Namespace: ivo:
– Authority Id: ADS
– Data collection
– Dataset
ivo://ADS/Sa.CXO#214
ivo://ADS/Sa.CXO#M31mosaic
We will provide services that will enable users to insert these IDs
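As a rough illustration of how such an identifier breaks down into the four parts listed on this page, the following Python sketch splits one with plain string operations. The function name `parse_dataset_id` and the field names are illustrative assumptions, not part of any CXC or ADS service, and the mapping of the path to the data collection and of the fragment to the dataset is inferred from the two examples shown.

```python
from typing import NamedTuple


class DatasetId(NamedTuple):
    namespace: str   # e.g. "ivo"
    authority: str   # e.g. "ADS"
    collection: str  # e.g. "Sa.CXO"
    dataset: str     # e.g. "214"


def parse_dataset_id(identifier: str) -> DatasetId:
    """Split an identifier of the form ivo://AUTHORITY/COLLECTION#DATASET."""
    namespace, sep, rest = identifier.partition("://")
    if sep != "://" or namespace != "ivo":
        raise ValueError(f"not an ivo:// identifier: {identifier!r}")
    resource, _, dataset = rest.partition("#")
    authority, _, collection = resource.partition("/")
    if not (authority and collection and dataset):
        raise ValueError(f"incomplete dataset identifier: {identifier!r}")
    return DatasetId(namespace, authority, collection, dataset)


print(parse_dataset_id("ivo://ADS/Sa.CXO#214"))
print(parse_dataset_id("ivo://ADS/Sa.CXO#M31mosaic"))
```

Both examples from this page parse the same way; only the dataset part differs (an observation number in one case, a named dataset in the other).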
5
Extension of the Database
Originally, the database contained only journal articles and
conference proceeding papers that can unambiguously be
connected with specific observations, plus an amorphous
collection of papers that are “Chandra-related”
Extension of subject categorization:
– Referring to specific observations
– Referring to published results
– Predicting Chandra results
– Referring to instrumentation, software, or operations
– Other
Inclusion of all other types of publications (except preprints!)
6
Attributes
Subject
– Observations, instruments, software, operations
Kind of publication
– Book, journal, proceedings, thesis, circular, review, newsletter, internal
Type of publication
– Article, abstract, memo, data, erratum, article (abstract only available),
title only, electronic
Number of citations
Keywords (standard ApJ as well as custom)
A variety of other items
– Date of publication, refereed or not, etc.
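One way to picture these attributes is as a single record per publication. The Python sketch below is only an assumption about how they might be grouped: the class and field names are hypothetical, the enumeration values are copied from the lists on this page, and the bibcode is a made-up placeholder rather than a real paper.

```python
from dataclasses import dataclass, field
from enum import Enum


class Kind(Enum):
    BOOK = "book"
    JOURNAL = "journal"
    PROCEEDINGS = "proceedings"
    THESIS = "thesis"
    CIRCULAR = "circular"
    REVIEW = "review"
    NEWSLETTER = "newsletter"
    INTERNAL = "internal"


class PubType(Enum):
    ARTICLE = "article"
    ABSTRACT = "abstract"
    MEMO = "memo"
    DATA = "data"
    ERRATUM = "erratum"
    ABSTRACT_ONLY = "article (abstract only available)"
    TITLE_ONLY = "title only"
    ELECTRONIC = "electronic"


@dataclass
class PublicationRecord:
    bibcode: str
    subjects: set          # any of: observations, instruments, software, operations
    kind: Kind
    pub_type: PubType
    citations: int = 0
    keywords: list = field(default_factory=list)
    refereed: bool = False


# Hypothetical entry; the bibcode below is a placeholder, not a real paper.
record = PublicationRecord(
    bibcode="2003XYZ...123..456A",
    subjects={"observations"},
    kind=Kind.JOURNAL,
    pub_type=PubType.ARTICLE,
    refereed=True,
)
print(record.kind.value, record.pub_type.value, record.citations)
```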
7
Database Design
[Diagram: entity-relationship schema with the tables ObsId, BibTable, Subjects, Observation Catalog, Datasets, Keywords, URLs, Proposals, DatasetObsIds, and Std Keywords, joined by relationships with cardinalities 1, 0+, 1+, and 0,1]
8
Database Maintenance
Management of new entries
through a dedicated database:
– Automatic filling of BibWork
– Attributes filled in through GUI
– Migrate entries to BibTable upon
completion
– The Datasets and DatasetObsIds
tables are shared with the main
database
[Diagram: tables BibWork, ObsId, Datasets, DatasetObsIds]
Automatic updating of number of
citations
Automatic check on validity of
bibcodes
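The slide does not say how the bibcode check is done, and a real check would presumably involve asking the ADS whether the bibcode exists. As a minimal sketch of the purely syntactic part only, the Python below tests the standard 19-character bibcode layout (year, journal, volume, qualifier, page, author initial). The function name and the example bibcode are hypothetical.

```python
import re

# 19-character ADS bibcode layout: YYYYJJJJJVVVVMPPPPA
BIBCODE_RE = re.compile(
    r"\d{4}"             # YYYY: publication year
    r"[A-Za-z0-9&.]{5}"  # JJJJJ: journal abbreviation, padded with dots
    r"[A-Za-z0-9.]{4}"   # VVVV: volume, padded with dots
    r"[A-Za-z0-9.]"      # M: qualifier (e.g. L for a letter)
    r"[A-Za-z0-9.]{4}"   # PPPP: page, padded with dots
    r"[A-Z.:]"           # A: first letter of the first author's last name
)


def looks_like_bibcode(text: str) -> bool:
    """Return True if text has the shape of a bibcode (syntax only)."""
    return BIBCODE_RE.fullmatch(text) is not None


# Placeholder bibcode, made up for illustration.
print(looks_like_bibcode("2003XYZ...123..456A"))
print(looks_like_bibcode("2003XYZ"))
```

A string that passes this test can still be a bibcode that does not exist, so this would only be a cheap first filter ahead of any lookup.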
9
Database Maintenance Interface
Filling the database
10
Database Maintenance Interface
Checking the paper
11
Database Maintenance Interface
Set attributes
12
Database Maintenance Interface
Establish proposal links
13
Services
Exchange of information with ADS: harvesting of Bibcode –
Dataset Identifier pairs in both directions
Provide access to datasets through either a Dataset Identifier or
a Bibcode
Provide information to ADS on Bibcodes that are not related to
specific observations
Provide access to publications through queries from our archive;
see next page and:
http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi
Derive metrics through queries (standardized as well as custom;
see Paul Green’s presentation)
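The first two services rest on the same data: a set of Bibcode and Dataset Identifier pairs that can be read in either direction. The following Python sketch shows one possible in-memory form of that idea; the class `LinkIndex` and its methods are hypothetical, the real system is a relational database, and the bibcode is a made-up placeholder.

```python
from collections import defaultdict


class LinkIndex:
    """Bidirectional index of (bibcode, dataset identifier) pairs."""

    def __init__(self):
        self._by_bibcode = defaultdict(set)
        self._by_dataset = defaultdict(set)

    def add(self, bibcode, dataset_id):
        self._by_bibcode[bibcode].add(dataset_id)
        self._by_dataset[dataset_id].add(bibcode)

    def datasets_for(self, bibcode):
        # .get() avoids creating empty entries for unknown keys
        return sorted(self._by_bibcode.get(bibcode, ()))

    def bibcodes_for(self, dataset_id):
        return sorted(self._by_dataset.get(dataset_id, ()))

    def pairs(self):
        """All pairs, e.g. for exchange with another service."""
        return sorted(
            (bibcode, dataset_id)
            for bibcode, ids in self._by_bibcode.items()
            for dataset_id in ids
        )


index = LinkIndex()
# Placeholder bibcode; the dataset identifiers are the examples from page 5.
index.add("2003XYZ...123..456A", "ivo://ADS/Sa.CXO#214")
index.add("2003XYZ...123..456A", "ivo://ADS/Sa.CXO#M31mosaic")
print(index.datasets_for("2003XYZ...123..456A"))
print(index.bibcodes_for("ivo://ADS/Sa.CXO#214"))
```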
14
Services
Literature search from the archive (shown here is an old version)
15
Services
(Previous page)
A simple query example: find all
publications related to Chandra
Crab observations
This returns 4 articles – be aware
that there may be more (e.g.,
meeting abstracts) that could not
be traced to specific observations
The bibcodes link to the abstracts
in the ADS
(This page)
The link to the ADS provides more
details on all four papers
16
Database Metrics (Oct 2003)
                  All articles                                     Refereed only
Category          1999  2000  2001  2002  2003   Total  No. Cit.   Total  No. Cit.
Observations        53   284   485   485   352    1659      5639     712      5597
Refer to obs.        9    94   333   499   322    1257      5300     897      5231
Instr., etc.        34   141   124    69    18     386      1362     354      1355
Predict result      11    67    21    14    21     135       306      22       296
Unclassified        15    90    70    29    40     244       663     118       650
Total              122   676  1033  1097   753    3681     13270    2103     13129
Reviewed          1011  2507  2735  2758  1859   10870
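Counts like those in this table amount to a grouped tally over the publication records. The Python sketch below shows the idea on a few hypothetical records; the function name `tally` and the sample data are assumptions for illustration and do not reproduce the actual database queries.

```python
from collections import Counter


def tally(records):
    """Count publications per (category, year) and per category."""
    per_cell = Counter()
    per_category = Counter()
    for category, year in records:
        per_cell[(category, year)] += 1
        per_category[category] += 1
    return per_cell, per_category


# Hypothetical (category, publication year) records, not real database rows.
sample = [
    ("Observations", 1999),
    ("Observations", 2000),
    ("Instr., etc.", 1999),
]
per_cell, per_category = tally(sample)
print(per_cell[("Observations", 1999)], per_category["Observations"])
```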
17
Conclusion
We have developed a comprehensive database that is capable
of tracking all mission-related publications and preserving all
relevant information
Added to this are a database and GUI that make maintenance
(i.e., data entry) as painless as possible
Services include cross-linking with the ADS, a powerful literature
search from the Chandra archive, and metrics
The entire package is reasonably mission-independent, and we
are happy to provide it to other data centers
Try the new interface at:
http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi
18
Coming Attractions
FITS keyword database:
an interactive web-based tool that allows users to look up the meaning
and use of all keywords used in CXC FITS files, and to construct
compliant headers
Special Requests:
a web-based tool that allows users to make special data requests,
backed up by a database that tracks the status of these requests
– Request for previous data versions
– Request for special processing
– Request for data on physical medium
– Request for custom database query
– Anything else (reasonable)
19