Science Overview Architectural Status (http://www.astrogrid.ac.uk

Download Report

Transcript Science Overview Architectural Status (http://www.astrogrid.ac.uk

Science Overview
Architectural Status
(http://www.astrogrid.ac.u
k)
Multi-wavelength showing the jet in M87: from top to
bottom – Chandra X-ray, HST optical, Gemini mid-IR, VLA
radio
Nicholas Walton
IoA, Cambridge
credits: “NASA / Chandra X-ray Observatory / Herman Marshall (MIT)”,
“NASA/HST/Eric Perlman (UMBC), “Gemini Observatory/OSCIR”, “VLA/NSF/Eric
Perlman (UMBC)/Fang Zhou, Biretta (STScI)/F Owen (NRA)”
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p1
Printed: 12/12/01
Drivers: The Sociology of
Astronomy

Continuing Collectivization
 Facility
class (common user) instruments
 Central
development of supporting s/w (e.g. Iraf)
 Calibrated
archives and access tools (e.g. IPAC)
 Information
services (e.g. ADS, NED, astro-ph)
 Consortium
projects (e.g. MACHO, SLOAN,
VISTA)

Evolving Developments
 Inter-operable
 Data
archives, joint queries (e.g. MAST)
mining (exploration and analysis tools)
 Information
discovery tools
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p2
Printed: 12/12/01
Drivers: Technological Change I

The Growth of Data
 Significant
increase in number and size of
telescopes
In the optical:
ESO's 4x8-m VLT, Gemini's 2x8-m
 In the x-ray: XMM-Newton, Chandra
 In the mm: ALMA

 Significant
increase in size and multiplex
capabilities of associated instrumentation and
detectors, e.g.:
In the optical: VISTA will have a Gpixel array
 In the radio: e-Merlin with data rates of 320 Gbps will
generate

N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p3
Printed: 12/12/01
Drivers: Technological Change II

The Growth of Data Archives
 Many
observatories hold multi-TB archives
 New
initiatives set-up to support new observing
capabilities (e.g. TeraPix)
 In


the optical, need to ingress all-sky survey's
Whole sky at 0.1 arcsec/pix is 100TB
Increasing Importance of Archival Data
 Time
on expensive facilities (e.g. HST) only
awarded if the archive has been searched before
hand
 This
trend is continuing, driving observatory and
user
demand
for Meeting,
more
archival
data
N A Walton: Science
Overview:
AstroGrid Collaboration
Dec accessible
13-14, 2001
p4
Printed:
12/12/01
Empowering Science Driven
Observational Astronomy

Break down the 'wavelength' barriers
 e.g.
Greater focus on science driven proposals
encouraging the use of data sets from across a
wide range of wavelengths

Increase Coordination across countries
 e.g.
The UK joins ESO: improved knowledge
transfer

Increase Access
 e.g.
East European countries may not be able to
fund a major new telescope but can contribute
to, and access 'Virtual Observatories' p5 Printed: 12/12/01
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
Astronomical Drivers: Enabling New
Science

Linking the near and distant Universe
Comparison of rest frame samples to study
evolution of galaxies
 Combination of UV, optical and IR datasets


Creating the 'Digital Sky'


Temporal data measures motions in the Galactic
centre, probes the creation of our Galaxy
The search for extra-solar planets

The planet-transit technique using federated survey
data for millions of stars
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p6
Printed: 12/12/01
Astronomical Drivers: New Era of Surveys

SuperCOSMOS (UK: now till 2002)
 Based

on Schmidt plates - Science database ~2TB
Sloan Digital Sky Survey (US: now till 2005)
 Dedicated
CCD survey telescope: Science database
~10TB

UKIDSS (UK: from 2003)
 IR

VISTA (UK: from 2005)
 IR

camera on 4-m UKIRT: Science database ~30TB
camera on 4-m VISTA: Science database ~300TB
LSST (US: from 2007-8?)
 Dedicated
~8-m telescope: All sky/few
nights:~5000TB/yr
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p7
Printed: 12/12/01
Astronomical Drivers: Pre-Discovery
Mining

Investigating the progenitors of sources that
show variability
 Dark
matter revealed by microlensing
events
 Planets
revealed by stellar variability
 Formation
of neutron stars revealed by
GRB's
 Death
of massive stars revealed by Type II
SN
The progenitor of SN1999gi
is <9 M: found from
mining pre-discovery HST
images.
(Smartt et al, 2001, ApJ, 556, L29)
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p8
Printed: 12/12/01
Astronomical Drivers: Rare Objects

Huge data sets open up the possibility to
find rare objects
 Those

that are hard to find:
Brown dwarfs, have unique red colours
 Those
that are intrinsically rare
High-z quasars, stand out due to
suppression of their blue colour by the Ly-
forest
Hi-z QSO's found from SDSS multi-colour data:

the shaded area is domain of QSO's, solid line is
track for increasing z (Fan et al, 2001, ApJ, 121, 31)
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p9
Printed: 12/12/01
Astronomical Drivers: New Objects

Large datasets open the possibility to
discover new objects
 Those
that have been missed before
because they are extremely rare or
short-lived
 Those
that have previously been
misclassified, revealed as outliers in new
parameter space correlations
DPOSS group, during searches for
high-z quasars, have discovered
peculiar objects, this one perhaps
a BAL QSO (Djorgovski et al, 2001,
PASP, 225, 52)
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p10
Printed: 12/12/01
Astronomical Drivers: Cosmology thru SN
Ia
AstroGrid will enable rapid typing
of search fields via galaxy photo-z's
in support of UK/EU groups, e.g.
EC RTN SN network.
Issue: large scale image deconvolution
matching candidates to z
Example SCP data
from Perlmutter, 2001
snap.lbl.gov/pubdocs/
SAUL.pdf
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p11
Printed: 12/12/01
Astronomical Drivers: The X-ray Universe
AstroGrid will enhance the ability to
classify new X-ray objects and speed
object sample creation
Issue: federating large scale multi-λ data
The ELIAS team is using Chandra,
XMM, ISO, SCUBA, optical imaging
and VLA data to study a number of
fields: see
www.roe.ac.uk/~omar/survey.html
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p12
Printed: 12/12/01
Astronomical Drivers: WD's and Dark
Matter
AstroGrid products will enable
the federation of multi-epoch
data to increase the number of
White Dwarf's known. With IR
data cool objects will be found
impacting on the Dark Matter
problem.
Issue: TB's of multi-epoch, multicolour data, finding moving objects
WD0346+246, moving at 1.3''/year,
was discovered by Hambly et al.,
(2000, Nature, 403, 57). It is cool,
but has blue colours in the IR due to
collision absorption by H2
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p13
Printed: 12/12/01
Astronomical Drivers: Solar Complexity
Solar-B will be launched
in 2005, aimed at studying
the interaction between
the Sun's magnetic field
and its corona
Solar-B will be supported
by complementary space/
ground data.
Science often targets
events with many multi-λ
observations required
Image from Marshall Space
Flight Centre, NASA
science.msfc.nasa.gov/ssl/pad/solar/Beyond_Solar-B.htm
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p14
Printed: 12/12/01
AstroGrid: framework for
applications
Enable future
development and
deployment of AstroGrid
and External (e.g NVO)
tools:
Automated detection of outliers
in SDSS two colour data
(Connolly et al, 2001, AJ in press)
A NVO prototype of an automated
discovery tool for arcs developed
by A Szalay
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p15
Printed: 12/12/01
AstroGrid: a Typical Challenge
• Problems in matching multi-λ survey data:
Differences in angular resolution, s/n ratios, backgrounds, etc
(Djorgovski et al, 2001, astro-ph/0108346)
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p16
Printed: 12/12/01
Astronomical e-Science Initiatives
AstroGrid
www.astrogrid.ac.uk
European Grid of Solar Observations
www.mssl.ucl.ac.uk/gri/egso
Astrophysical Virtual Observatory
AstroVirtel
www.eso.org/avo
ecf.hq.eso.org/astrovirtel
National Virtual Observatory
www.us-vo.org
The Digital Sky
www.cacr.caltech.edu/SDA/digital_sky.html
The Virtual Sky Project
virtualsky.org
Sloan Digital Sky Survey: SkyServer
skyserver.fnal.gov/en
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
Skyview
p17
Printed: 12/12/01
skyview.gsfc.nasa.gov
Towards Phase-B: The Science
Case

Rapid development of science requirements
 Emphasis

Development of 'use cases'

Example: Formation of large scale structure




on user input via consultation
Construct unbiased cluster of galaxies sample over z to test various
cosmological models of galaxy formation
Need to operate on large data sets, constructing simulated surveys to
test for selection effects
Generate predictions of observed sample properties and compare with
observed samples
Development of 'White Paper' developing
AstroGrid's role for Phase-B and in the
context of larger VO picture
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p18
Printed: 12/12/01
Developing the Science
Requirements


Recognising the specific need to support UK science
and associated key datasets was the backdrop to the
genesis of the AstroGrid program
Focus on community involvement
 Development
of a few scenarios for AstroGrid,
highlighting there capabilities and application to science
areas
 Invite

comment from community
Interaction channels will be via
 The
PS visiting astronomy groups
 Presentations,
 Development

both face to face and on-line
of a suitable questionnaire
Organise 2nd AstroGrid Workshop: Mar/Apr 2002.
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p19
Printed: 12/12/01
Use Case Development: practical
req's

In addition to development of the science
requirements, generic use cases are being
developed, e.g.:
 Log
on remotely, operate remotely
 Donate
 Fix
a small catalogue to the grid
the WCS in a FITS image
 Find
observations via instrument footprint
 Synthesis
 Matching

colour data for a set of objects
objects by position with errors
As the science requirements are formalised, use
cases will be derived from them
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p20
Printed: 12/12/01
A Baseline Architecture for
AstroGrid
Architecture must address Top Level Design
Issues:
Ease of use, emphasising a uniform
appearance
 Functionality
 Speed of Response
 Relevance
 Cost
 Reliability
 Scalabilty
 Outreach
 Level of data-provider buy in

N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p21
Printed: 12/12/01
Architecture: Quality Assurance

AstroGrid federation capabilities will
characterise quality parameters, e.g.:
 Positional
accuracy →reference frames
 Photometric
accuracy →std colour systems
 Completeness
 Temporal
→weight maps
resolution
 Wavelength
resolution and coverage
 Photometric
system
 Image
distrotions
 Artifacts
 Catalogues
→criteria used to ID objects, etc.
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p22
Printed: 12/12/01
Architecture: Data Inclusion Model
AstroGrid will potentially enable any data sets to
be accessed through AstroGrid services.
However the data must be adequately described.
Data
any
restricted
AstroGrid
any
(described)
How
any
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
restricted
restricted
p23
Printed: 12/12/01
Unified
Modelling
Language
(UML)
representatio
n of the
query
architecture
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p24
Printed: 12/12/01
Architecture Overview
Major elements in the query form a circular flow:
> a command flow involving user initiation
> an inner 'grid' loop with bulk data flows
- Client s/w
WP-A3
<=======<
- Query Engine
WP-A2
|
|
- Resource Management WP-A2
<====<====< |
- Resource Mapping/index WP-A2
|
| |
- Data Harvesting
WP-A2
|
| |
- Data Services
WP-A4
|
| |
- Results Management
WP-A4
|
| |
- Data Mining
WP-A4
>====>====> |
- Visualisation
WP-A3
|=======^
Noted above are WP areas leading expansion of the architecture in
that area
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p25
Printed: 12/12/01
Architecture Overview: UML view
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p26
Printed: 12/12/01
Detailing the Architecture





The Data flow diagrams (DFD's) show a functional
decomposition
They do not say how functions are distributed
across hardware or software
Grid 'layer' diagrams are developed later when the
design has been matched to the analysis
The DFD's emphasise the flow of data (s/w
distribution), the transformations of data (s/w reuse) and the nature of data between
transformations (interoperability)
Flows described in a Data Dictionary
Transforms described in Class-ResponsibilityN A Walton:
Science Overview: AstroGrid Collaboration
Dec 13-14, 2001
p27
Printed: 12/12/01
Collaboration
(CRC)Meeting,
notes

UML representation
of
the 'query' process
Diagram to be read
with:
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
 CRC
Notes
 Data
Dictionary
p28
Printed: 12/12/01
UML representation of
the 'Harvest' process
Diagram to be read
with:
 CRC
Notes
 Data
Dictionary
p29
Printed: 12/12/01
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
Extract from the Data Dictionary

Query
 A list

of constraints on data to be extracted from the Grid
Authorisation
 Something
that indicates that a user is allowed to use a
resource up to some certain level

Table meta-data
 Describe
a table of data sufficiently that the table can be
understood by Grid s/w and transformed as necessary

Formal catalogue-extract
 Tabular
data in AstroGrid favoured form. The catalogue
extracts need to be merged and transformed. A rich
format such as XML is required.
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p30
Printed: 12/12/01
Extract from the 'query' CRC

Query-form display
 Displays
forms to capture queries, passes contents of
form to Grid, passes user's credentials to the Grid.

Query scheduler
 Determines
whether a query can be run immediately;
whether is must be run in batch; whether it needs prior
approval for use in resources.

Resource authoriser
 Determines
whether a given user is allowed to use a
given resource at a particular level. Probably needs to be
built from scratch
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p31
Printed: 12/12/01
The NVO Model: A Comparison
Usual 'Grid' fabric, resource
& collective layers added to
with astro-specfic user &
connectivity layers
NVO focussed on (mainly)
optical and near-IR large
scale survey data sets.
NVO layer model, figure taken from
nvo-proj.pdf at www.us-vo.org
N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
p32
Printed: 12/12/01
Concluding
Remarks



N A Walton: Science Overview: AstroGrid Collaboration Meeting, Dec 13-14, 2001
AstroGrid is a
major new UK
funded e-science
initiative
In partnership with
EU centres it will
play a lead role in
the realisation of a
European Virtual
Observatory
AstroGrid is poised
to significantly
enhance the
opportunities of the
UK astronomical
researchp33 Printed: 12/12/01