Data in a nutshell.

Download Report

Transcript Data in a nutshell.

1
Data in a nutshell
Open data workshop, Lebanon 23/5/2014
Michalis Vafopoulos
NTUA www.publicspending.net & www.vafopoulos.org
Welcome to the data era
5
6
The era of Open
budgets, spending, registries, contracting…
7
The Transparency program in
Greece (2010-2014)
o A revolution in open government
o ex-ante reporting of every state
decision
o paradigm shift for 40K public servants
8
The Transparency program in
Greece
o manifests the value of
procrastination principle (again)
o strong rival to the Clientelistic state
o The new version under beta testing
(delivery: in 10 days!)
9
publicspending.net
2011: I believed that the Transparency
program is the open data “gold” (&
persuaded 7 more people)
10
publicspending.net
2012: …with some dust and rocks in a
deep goldmine
11
2013: time to chisel some jewelry
2014: open data everywhere
12
Indexing, searching, global
comparisons
Indexing, searching, global
comparisons
Indexing, searching, global
comparisons
Indexing, searching, global
comparisons
Indexing, searching, global
comparisons
Where the money goes in Greece
Data: Open, big, linked
Open: access
…everyone to use and republish as she wishes
Big: scale
high volume, velocity and variety
Linked: use
Publish once, use as many times
Is it working?
• Current Employee Names, Salaries, and
Position Titles
• The Open Database Of The Corporate
World
• Crime map
• NHS efficiency savings: the role of
prescribing analytics
• where public money goes worldwide
Examples
Can you find the famous persons born in
Beirut before 1900?
In Paris, Athens, … ?
Examples
#anoixtigenia, @vafopoulos
Examples
#anoixtigenia, @vafopoulos
Why Open Data
o more & better information
o objective and processable information
for economic/political “dialogue”
• to promote competition
• to decrease cost
• to judge the efficiency of policy
mixtures
• to enable participation
24
(initial) scope OGD
to provide an objective &
intermediate layer of information
that will enable citizens, journalists,
business people and politicians to rediscover their own “stories” from
data.
25
LOD in Greece: why it is important
• quality of information during
economic crisis
• transparency & efficiency in funding
development
26
Issues
o how can we initiate the virtuous
cycle of creation?
demonstrate LOD’s added value
o how to get the most out of data?
local & global interconnections
27
In few words,
Apps, Apps, Apps…..
28
Public Spending in Greece &
worldwide publicspending.net
• the first LOD App in Greece
• daily updates
• open spending linked data, endpoint &
visualizations
29
Insights in Global Public Spending
Open but Effective?
o Who really gets the public money?
o For what? From whom?
o Can we compare them?
o Is public spending effective?
o <your question goes here>
31
Useful economic open data
1. The full cycle of public money
2. Uniform Company names
3. Compatible Payment categories
32
1. The full cycle of public money
Prices
33
Follow Public Money all the Way
Vocabulary (fpm)
oA compact and minimal way to
model the flows of public money
oFrom budget to spending
including business information
and prices
34
Useful economic open data
1. The full cycle of public money
2. Uniform Company names
3. Compatible Payment categories
35
2. Not uniform Company names
The problem:
different names
for the same
company
“Oracle” in the Australian
public spending
36
Reconciling Company names: the
CORFU technique
Rodríguez, Jose María Álvarez, Ordoñez de Pablos, Patricia, Vafopoulos, Michalis N. and Labra, José Emilio
3. Compatible Payment categories
The problem:
Spending decisions are using
different (or not any!)
classification schemes (e.g. CPV,
UNSSC, NAICS)
38
Compatible Payment categories
Transforming classification
schemes or literal descriptions to
CPV, expanding:
The MOLDEAS project
Methods On Linked Data for E-procurement Applying Semantics
39
40
Reconciling Company names:
the Forbes Global 2000 companies
Compatible Payment categories
Going global: AUSTRALIA
43
the 5 stars of open linked data
★make your stuff available on the Web (whatever
format)
★★make it available as structured data
(e.g. excel
instead of image scan of a table)
★★★non-proprietary format (e.g. csv instead of
excel)
★★★★use URLs to identify things, so that
people can point at your stuff
★★★★★link your data to other people’s data
to provide context
http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/
Linked data =
internet + http +
RDF
Linked Data Principles
1. Use URIs as names for things
2. Use URIs so that people can look up
(dereference) those names.
3. When someone looks up a URI,
provide useful information.
4. Include links to other URIs so that
they can discover more things.
Web as a database
• Linked Data makes the web
exploitable as ONE GIANT HUGE
GLOBAL DATABASE!
• Is there any query language like sql?
SPARQL…
What are we planning?
• LOD for the main economic
activities (insurance, banking)
•
•
Law for open data by default in
ALL public organisations
Open data education (open
generation)
References
o
o
o
Vafopoulos, Michalis N., Rodríguez, Jose María Álvarez, Meimaris, Marios, Xidias,
Ioannis, Klonaras, Michailis and Vafeiadis, Giorgos, Insights in Global Public
Spending (May 12, 2013). Available at SSRN: http://ssrn.com/abstract=2264958 or
http://dx.doi.org/10.2139/ssrn.2264958
Vafopoulos, Michalis N., The Web Economy: Goods, Users, Models, and Policies (July
26, 2012). Michalis Vafopoulos (2012) "The Web Economy: Goods, Users, Models,
and Policies", Foundations and Trends® in Web Science: Vol. 3: No 1-2, pp 1-136.
http://dx.doi.org/10.1561/1800000015. Available at SSRN:
http://ssrn.com/abstract=2117855
ALVAREZ, J. and LABRA, J. 2012. Towards a pan-european e-procurement platform
to aggregate, publish and search public procurement notices powered by Linked
Open Data: the MOLDEAS approach. International Journal of Software Engineering
and Knowledge Engineering. 22, 3 (2012), 365–383.
49
More info
•
•
•
•
•
Twitter: @vafopoulos
[email protected]
www.Vafopoulos.org
www.publicspending.net
www.Youtube.com/websciencegr