No Slide Title

Download Report

Transcript No Slide Title

The University of Washington
eScience Institute
This afternoon:
 Phyllis Wise, Provost
 Ed Lazowska, Computer Science &
Engineering
 Dan Fay, Microsoft Research
 Martin Savage, Physics
 David Baker, Biochemistry
 Andy Connolly, Astronomy
eScience: Computational Science
for the 21st Century
Ed Lazowska
Bill & Melinda Gates Chair in
Computer Science & Engineering
Interim Director, eScience Institute
November 2008
http://eScience.washington.edu/
Theory
Experiment
Observation
Theory
Experiment
Observation
Theory
Experiment
Observation
Theory
Experiment
Observation
Computational
Science
Protein interactions
in striated muscles
Tom Daniel lab
QCD to study
interactions of
nuclei
David Kaplan lab
Gas
Stars
Study of dark matter
Dark Matter
Tom Quinn lab
Theory
Experiment
Observation
Computational
Science
eScience
eScience is driven by data
 Massive volumes of data from sensors and networks
of sensors
Apache Point telescope,
SDSS
15TB of data
(15,000,000,000,000 bytes)
Large Synoptic Survey
Telescope (LSST)
30TB/day,
60PB in its 10-year
lifetime
Large Hadron Collider
700MB of data
per second,
60TB/day, 20PB/year
Illumina Genome
Analyzer
~1TB/day
Regional Scale Nodes of the
NSF Ocean Observatories
Initiative
2000 km of fiber optic cable
on the seafloor, connecting
thousands of chemical,
physical, and biological
sensors
The Web
20+ billion web pages
x 20KB = 400+TB
One computer can
read 30-35 MB/sec
from disk => 4 months
just to read the web
Point-of-sale terminals
eScience is about the analysis of data
 The automated or semi-automated extraction of
knowledge from massive volumes of data
There’s simply too much of it to look at
The technologies of eScience
 Sensors and sensor networks
 Databases
 Data mining
 Machine learning
 Data visualization
 Cluster computing
at enormous scale
eScience will be pervasive
 Computational science has been transformational, but
to some extent it has been a niche
As an institution (e.g., a university), you didn’t need to
employ it broadly in order to be competitive
 eScience capabilities must be broadly available and
broadly practiced
If not, the institution will simply cease to be competitive
The University of Washington
eScience Institute
 Mission
Help position the University of Washington at the forefront
of research both in modern eScience techniques and
technologies, and in the fields that depend upon these
techniques and technologies
 Strategy
Increase the sharing of expertise and facilities
Bootstrap a cadre of Research Scientists
Add faculty in key fields
Make the entire University more effective
 Launched July 1 with $1 million in permanent funding
from the Washington State Legislature
Sought, and need, $2 million
Steering Committee
 Appointed by Provost
Phyllis Wise
Ed Lazowska, CSE and eScience
Institute Interim Director
Mary Lidstrom (chair), Vice
Tom Ackerman,
Provost for Research
Atmospheric Sciences
Matt O’Donnell, Engineering
Ginger Armbrust,
Tom Quinn, Astronomy
Oceanography
Chance Reschke, eScience
Tom Daniel, Biology
Institute Technical Coordinator
David Goodlett, Medicinal
Mani Soma, EE and Office of
Chemistry
the VP for Research
Terry Gray, UW Technology
Werner Stuetzle, Arts &
Ron Johnson, CTO
Sciences
David Kaplan, Physics
Peter Tarczy-Hornoch,
Richard Karpen, Arts &
Biomedical & Health
Sciences
Informatics
Activities
 Direction-setting interviews with UW research
leaders regarding technology needs
 124 interviews thus far
 Top researchers of all ages in all fields
 Technology needs, in priority order
1. Data management facilities
•
Storage, backup, security
2. Shared expertise
•
3.
4.
5.
6.
Data management specifically, technology in general
Computing power and high-bandwidth network access
Data collection and analysis
Communication and collaboration technologies
Shared laboratories and pricing
 Initial staffing
Research Scientist recruited for cluster computing
Chance Reschke
Research Scientist being recruited for data management
Consulting model developed
Jeff Gardner as “TeraGrid Champion”
Data management consultancy under development
Overall coordination coming on-board
Erik Lundberg
First faculty search underway
Werner Stuetzle chairing search committee
 Laying the groundwork for broadly shared facilities
Data center space coordination and planning
UW Tower scheduled to come online in late 2009
~600KW for research computing
EPIC
Intelligent use of the research allocation in UW Tower
Coordinated, cost-effective compute and storage solutions for
the UW eScience community
 Active exploration of alternative approaches to
facilities
Amazon Web Services
Google/IBM cloud
Microsoft Dryad and Azure
 Participation in proposal preparation
Moore Foundation Sequencing Center
NSF Data Net - The GRADD Collaboration
NSF Track 2d (with PNNL, PSC, CMU)
 Community building
Web site for general information
http://eScience.washington.edu/
SIG for eScience technical staff
http://staff.washington.edu/reschke/escience-sig/SIG.pdf
Monthly technical “brown bag lunch”
Regular discussions with research groups across campus
regarding their eScience needs
We can help you (some currently,
better shortly) with …
 Facilities
 Proposals
 Data management issues
See posters
Email [email protected]