Transcript xldb2 xldb1

Welcome!
http://www-conf.slac.stanford.edu/xldb07
One Day, Three Goals
1. Identify trends and major roadblocks related to building extremely large databases
2. Bridge the gap between users trying to build extremely large databases and database vendors
3. Understand if and how open source projects like the LSST Database can contribute to the previous two goals in the next few years
Things We Talked About
* Valuable data discarded due to scalability limits and cost
* Substantial commonalities between science & industry (pattern discovery, multi-d aggregation, unpredictable query load, procedural language needs, …)
* Industry leading scale, science leading complexity of analytics
* Parallel, shared-nothing architectures on commodity clusters are becoming very popular
* Roadblocks: funding problems, vendor-users disconnect, science-academia disconnect
* Rebuilding, not reusing software
* Gap between needs and what vendors offer is widening
* Structured and unstructured data coming together
* MapReduce popular, but lacks efficient joins
Things We Decided

* Conduct another workshop in ~1 year, 2-3 days, @SLAC
– Don’t expand size much
– By-invitation only
– Focus on experience sharing, commonalities that can be developed into community-wide requirements
* Try to set up a smaller workshop and/or working group(s)
– In particular science – db academics
http://xldb.slac.stanford.edu/display/XLDB/SciDB
* Set up shared infrastructure
– Initially wiki, possibly test-bed environments
* Try to define a standard benchmark focused on data-intensive queries
http://www-conf.slac.stanford.edu/xldb08
Two Days, Three Goals
1. Continue to understand major roadblocks related to extremely large databases with an emphasis on complex analytics
2. Continue bridging the gaps within the XLDB community including science, industry, database researchers and vendors
3. Build the open source SciDB community
It Is All About Ad-hoc Discussions
* You are expected to speak up too
– But no sales pitches, please
* Discussions are not electronically recorded
* Detailed report will be released
– Once OK’ed by workshop participants
Attendance – Rough Breakdown
                                      xldb1  xldb2
Data-intensive scientific users          23     25
Data-intensive industrial users          11     12
Vendors, incl. startups                  16     12
Academia, db research & programmers       3     13
Total                                    53     62
If this group won’t make a difference, who will?
1. Big science
2. Big industries
3. All major DBMS vendors
4. Very promising startups
5. World-class DB researchers
6. Superstar DB programmers
Dinner
* Location
– Sheraton Palo Alto
– Driving directions available
* Reception
– 7:00 pm – 7:30 pm
* Dinner
– 7:30 pm – 10:00 pm
– Buffet
* Cost
– Free
– Except possibly the valet parking
Make sure you wear your XLDB2 badge
BIG Thanks to Our Sponsors
Agenda
http://www-conf.slac.stanford.edu/xldb08/agenda.htm