Steps towards an SDMX implementation

Download Report

Transcript Steps towards an SDMX implementation

The SDMX framework: key issues and the business case
Gabriele Becker
Bank for International Settlements
SDMX Conference: Towards Implementation of SDMX
Washington, 11 January 2007
Views expressed are those of the presenter and not necessarily those of the BIS.
1
Overview
 SDMX vision
 The statistical production chain
 Components of the SDMX framework
 Analyse the relationships
 Conclusions
2
The SDMX vision
 Facilitating data and metadata exchange
 Efficient use of technologies and standards
 Reduce reporting burden
 Enhance availability of statistical data and metadata
for the users
 Data reporting = data dissemination = data sharing
3
Statistical processing chain in an NSO or an NCB
 Requirements specification
 Data collection design
 Data collection from reporting agents
 Data quality control
 Data compilation
 Data dissemination (internal, respondents, public, int.
organisations)
 Data analysis (data discovery, navigation, search)
4
SDMX components
 Information model for data and metadata
 (Draft) content oriented Guidelines
 Meta Data Common Vocabulary
 Cross Domain Concepts
 Two syntaxes to exchange data and metadata (EDI, XML)
 SDMX Registry standards and registry interfaces
 SDMX Tools
 How do these relate to the statistical processing chain?
5
EDI and XML
syntax expressions
and tools
SDMX registry
and registry
interfaces
Content oriented
Guidelines
Information
Model
1
Requirements specification


2
Data collection design




3
Data collection from reporting agents




4
Data quality control



5
Data compilation


6
Data dissemination (to respondents, public, int.
organisations)
Data analysis (data discovery, navigation, search within the
organisation)








The statistical processing chain and the
SDMX components
7
6
Some initial conclusions
 Information model “is everywhere” in the statistical
processing chain
 Content Guidelines ”are everywhere” as well
 Technology (syntaxes, tools and registries) are enablers
when it comes to exchanging or sharing data
 How can SDMX impact the individual steps in the
statistical processing chain?
7
Requirements specification & data collection design
 Information model
 Forces us to consider metadata from the start
 Helps to organise the data when specifying new data
collections
 Examples for efficient structuring of data later today
 Applying the (same) information model from the start leads to
for economies of scale
 Content Guidelines
 Potential re-use of existing code lists
 Use of registries to share code-lists and structures
 Learning effect across the statistical organisation(s)
 First data structure definition is the most difficult ….
 SDMX Tools to help defining data structures
8
Data collection from reporting agents
 Standard formats (SDMX-EDI or SDMX-ML)
 Reflect the Information Model and Content Guidelines
 Support collection of data and metadata
 Data structures and code lists exchanged in computer readable
formats
 Support for automation
 Stepwise implementation using SDMX tools …
9
Stepwise implementation
 Currently “push” models prevail
 Reporting agents send data (files) to the collecting agency
 Can start with simple tools (eg EXCEL SDMX creator)
 Automation possibilities for reporting agents and for
collection agency already for “push models”
 Generalisation of SDMX file creation
 Creation of SDMX-EDI or SDMX-ML out of the database
 Conversion between SDMX-EDI and SDMX-ML via tools
 Move to “pull model” and use of registries possible any
time
10
Data quality control and compilation
 Standard reporting formats allow for automated checking
 Check formulas may be derived from the code lists
 eg in the case of hierarchical code lists
 SDMX supports move towards metadata driven
processing
 Generic processing systems based on information (data) model
 Decreasing marginal cost for new data collections
11
Data dissemination
 Commonalities with data collection
 Changed roles: collector (NSO / NCB) becomes provider
to international organisations or the public
 Same SDMX formats as used for data collection
 Disseminate
 Data and metadata
 Data (and meta data) structure definitions
 Push and pull models
 Use of SDMX registries
 Automation
12
Data analysis (discovery, navigation, search)
 Any navigation system is metadata driven
 SDMX information model forces us to be clear about
the metadata and the data structure from the start
 Generalised navigation system based on the model
can easily incorporate new data collections (= new
data structures based on the information model)
 Registries help to find data from remote sources
 Generic tools (stylesheets) for easy data rendering
 Economies of scale and better service for users
13
Conclusions: Information model
 SDMX is not only for IT experts
 SDMX can only be applied if statistical experts apply
the information model in their work
 Define data and metadata structures
 SDMX information model and content guidelines can …
 Affect the complete statistical processing chain
 Influence the design of our processing systems
 Influence the design of our navigation systems
14
Conclusions: technical standards and tools
 Technical standards allow efficient and generalised
application of SDMX information model
 Technology (eg registry technology) is an enabler to
make SDMX work better
 A stepwise introduction of SDMX is possible using the
SDMX tools
 SDMX is a framework
15
SDMX: Benefits
 Speed up movement of data through the processing
chain
 Reduce “time to users” for data and metadata
 Increase level of automation
 Reduce risk of errors
 Enhance facilities for metadata exchange
 Easier to ship metadata with the data
 Better understanding of data
 Reporting = dissemination = data sharing
 Reduction of reporting burden
16
Practical steps
 Review SDMX information model
 Sample data structures used by others
 Relate to your own data model as implemented in your
organisation and systems
 There will be commonalities …
 Experiment with the SDMX tools to understand and build data
structures for your own data
 Review the technical standards and formats
 Understand relationship between the information model and its
practical implementation as SDMX-EDI or SDMX-ML
 Assess impact on your own organisation
17
Towards SDMX Conformance
 Apply information model for data structure definitions
 Use cross domain concepts as much as possible when
defining data structures
 Re-use existing (internationally shared) code lists
 Use SDMX-EDI or SDMX-ML for data and meta data
exchange (push model)
 Publish data structure definitions
 Offer data in SDMX-ML on (public) website
 Offer data via SDMX registry (pull model)
18
Thank you!
Gabriele Becker
Head, Statistical Information Systems
Monetary and Economic Department
Bank for International Settlements
Basle, Switzerland
[email protected]
19