metadata_stonjek_short-3D - Indico

Download Report

Transcript metadata_stonjek_short-3D - Indico

Distributed Databases
for ATLAS
Stefan Stonjek
04-Jul-2006
Stefan Stonjek: LCG 3D
1
Outline

LCG 3D Project


COOL


Distributed Deployment of Databases for LCG
Conditions Database Project
Other database applications
04-Jul-2006
Stefan Stonjek: LCG 3D
2
Client access patterns


Main applications: Reconstruction, Simulation
Access Patterns








Read from file: experiment data, physics model
Read from database: file catalogue, geometry, conditions
Write to file: reconstructed data, simulated data
Write to database: file catalogue
Some minor applications (calibration) write to conditions database
Geometry, conditions: high volume; file catalogue: low volume
No need for instantaneous replication
Model for conditions database: write only at Tier-0 and replicate to Tier-1s and from
there to Tier-2s
•
•
Geometry change less, can be deployed by files
File catalogue is a more localized issue, not covered in this talk
•
04-Jul-2006
file catalogue is local for two but distributed for two other experiments
Stefan Stonjek: LCG 3D
3
Tiers, Resources and Level of Service

Different requirements and service capabilities for different tiers

Tier1 Database Backbone





Tier2




Medium volume, often only sliced extraction of data
Asymmetric, possibly only uni-directional replication
Part time administration (shared with fabric administration)
Tier3/4 (eg Laptop extraction)



High volume, often complete replication of RDBMS data
Can expect good network connection to other T1 sites
Asynchronous, possibly multi-master replication
Large scale central database service, local dba team
Support fully disconnected operation
Low volume, sliced extraction from T1/T2
Need to deploy several replication/distribution technologies


Each addressing specific parts of the distribution problem
But all together forming a consistent distribution model
04-Jul-2006
Stefan Stonjek: LCG 3D
4
Possible Service Architecture
O
T0
M
M
- autonomous
reliable service
T1- db back bone
- all data replicated
- reliable service
O
T2 - local db cache
T3/4
-subset data
-only local service
M
O
M
Oracle Streams
Cross vendor extract
MySQL Files
Proxy Cache
04-Jul-2006
Stefan Stonjek: LCG 3D
5
Possible distribution technologies

Vendor native distribution,Oracle replication and related technologies






Table-to-Table replication via asynchronous update streams
Transportable tablespaces
Little (but non-zero) impact on application design
Potentially extensible to other back-end database through API
Evaluations done at FNAL and CERN
Combination of http based database access with web proxy caches close to
the client

Performance gains



reduced real database access for largely read-only data
reduced transfer overhead compared to low level SOAP RPC based approaches
Deployment gains



04-Jul-2006
Web caches (e.g. squid) are much simpler to deploy than databases and could remove
the need for a local database deployment on some tiers
No vendor specific database libraries on the client side
“Firewall friendly” tunneling of requests through a single port
Stefan Stonjek: LCG 3D
6
Multi Tier Computing for LHC
T2
T2
T2s and T1s are inter-connected
by the general purpose research
networks
T2
T2
T2
T2
GridKa
IN2P3
T2
T2
Any Tier-2 may
access data at
any Tier-1
Dedicated
10 Gbit links
Brookhaven
TRIUMF
ASCC
T2
Nordic
Fermilab
T2
RAL
CNAF
T2
T2
T2
04-Jul-2006
PIC
SARA
T2
T2
Stefan Stonjek: LCG 3D
7
COOL

COOL: conditions database toolkit
Allow easy handling of condition data on a relational
database
 Is not relational itself


Data devided into
Folders (read: subdetector)
 Channels
 IOVs (interval of validity)


Still some performance issues
04-Jul-2006
Stefan Stonjek: LCG 3D
8
COOL performance



COOL performance testing done in a way which is
optimal for COOL but not close to a real world
scenario
ATLAS should provide number about the planed
COOL / conditions database usage
Because of internal COOL structure


Do not use many folders
Loop over Folder, Channel, IOV

04-Jul-2006
Sounds strange
Stefan Stonjek: LCG 3D
9
ATLAS


ATLAS is using the COOL system
Sometimes ATLAS has to keep using old
version of COOL
COOL had no backward compatibility
 New versions are sometimes unstable


ATLAS has to learn how to use COOL in an
optimal way
04-Jul-2006
Stefan Stonjek: LCG 3D
10
Conclusion


3D, COOL and ATLAS are on a good way
One has to ensure that they work well together
Need COOL tests which reflect ATLAS usage
pattern
 Nee ATLAS information how ATLAS wants to use
COOL (with numbers)

04-Jul-2006
Stefan Stonjek: LCG 3D
11