
2016 | OriginalPaper | Chapter

StatSpace: A Unified Platform for Statistical Data Exploration

Authors: Ba-Lam Do, Peter Wetz, Elmar Kiesling, Peb Ruswono Aryan, Tuan-Dat Trinh, A Min Tjoa

Published in: On the Move to Meaningful Internet Systems: OTM 2016 Conferences

Publisher: Springer International Publishing


Abstract

In recent years, the amount of statistical data available on the web has been growing fast. Numerous organizations and governments publish data sets in a multitude of formats and encodings, using different scales, and providing access through a wide range of mechanisms. Due to such inconsistent publishing practices, integrated analysis of statistical data is challenging. StatSpace tackles this problem through semantic integration and provides uniform access to disparate statistical data. At present, it incorporates more than 1,800 data sets published by a variety of data providers including the World Bank, the European Union, and the European Environment Agency. StatSpace transparently lifts data from raw sources, maps geographical and temporal dimensions, aligns value ranges, and allows users to explore and integrate the previously isolated data sets. This paper introduces the constituent elements of the StatSpace architecture – i.e., a metadata repository, URI design patterns, and supporting services – and demonstrates the usefulness of the resulting Linked Data infrastructure by means of use case examples.
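The kind of harmonization the abstract describes (mapping geographical and temporal dimensions and aligning value ranges) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from StatSpace: the `Observation` type, the `harmonize` function, the `ISO3_TO_ISO2` lookup table, and the sample figures are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical lookup table (assumption): maps ISO 3166-1 alpha-3 codes to
# alpha-2 codes. A real platform would hold such mappings in its metadata.
ISO3_TO_ISO2 = {"AUT": "AT", "DEU": "DE", "VNM": "VN"}


@dataclass(frozen=True)
class Observation:
    area: str     # harmonized area code (alpha-2 in this sketch)
    year: int     # harmonized temporal value (calendar year)
    value: float  # value expressed in base units


def harmonize(raw_area: str, raw_period: str, raw_value: float,
              scale: float = 1.0) -> Observation:
    """Bring one raw observation onto a shared area, time, and value scale."""
    code = raw_area.strip().upper()
    area = ISO3_TO_ISO2.get(code, code)   # alpha-3 -> alpha-2 when known
    year = int(raw_period[:4])            # "2014" and "2014-01-01" -> 2014
    return Observation(area, year, raw_value * scale)


# Two made-up providers reporting the same made-up figure in different shapes.
a = harmonize("AUT", "2014", 8.5, scale=1_000_000)  # millions, alpha-3 code
b = harmonize("at", "2014-01-01", 8_500_000)        # units, alpha-2 code
print(a == b)
```

Once both observations share the same area code, year, and scale, they compare as equal and can be combined or plotted together, which is the practical effect of integrating previously isolated data sets.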


Footnotes
1
e.g., http://data.worldbank.org/indicator, accessed August 30, 2016.
 
2
e.g., https://www.ons.gov.uk, accessed August 30, 2016.
 
7
https://sdmx.org/, accessed August 30, 2016.
 
9
All prefixes used in this paper can be looked up at http://prefix.cc.
 
14
The expenditure dimension consists of four code lists, i.e., classification of individual consumption by purpose (COICOP), classification of functions of government (COFOG), classification of purposes of non-profit institutions serving households (COPNI), and classification of outlays of producers by purpose (COPP).
 
15
For instance, the top concept in the age code list is total (i.e., http://statspace.linkedwidgets.org/codelist/cl_age/Total), which is split into age groups such as 0–4, 5–9, ..., 105–109 (type: age-group) and special values, e.g., 70+, 75+, 80+ (type: age-plus). Each age group is split into individual ages (type: age-individual), and each special value is split into age groups.
 
17
https://www.w3.org/TR/r2rml, accessed August 30, 2016.
 
20
http://c3js.org/, accessed August 30, 2016.
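
The age code list hierarchy described in footnote 15 can be sketched as a small lookup function. This is an illustrative sketch only: the function names `age_groups` and `children` are hypothetical, labels use an ASCII hyphen instead of the en dash, and only the three special values named in the footnote (70+, 75+, 80+) are included, although the footnote presents them as examples.

```python
def age_groups(start=0, stop=110, width=5):
    """Five-year age-group labels from `start` up to, but excluding, `stop`."""
    return [f"{lo}-{lo + width - 1}" for lo in range(start, stop, width)]


def children(concept):
    """Return the direct narrower concepts of a concept in the age code list."""
    if concept == "Total":
        # The top concept splits into age groups and special "plus" values.
        return age_groups() + ["70+", "75+", "80+"]
    if concept.endswith("+"):
        # type age-plus: splits into the age groups it covers
        return age_groups(start=int(concept[:-1]))
    if "-" in concept:
        # type age-group: splits into individual ages
        lo, hi = map(int, concept.split("-"))
        return [str(age) for age in range(lo, hi + 1)]
    # type age-individual: a leaf with no narrower concepts
    return []


print(children("0-4"))
print(children("80+"))
```

Walking `children` from "Total" downwards reproduces the three levels named in the footnote: age-plus values resolve to age groups, and age groups resolve to individual ages.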
 
Metadata
Title
StatSpace: A Unified Platform for Statistical Data Exploration
Authors
Ba-Lam Do
Peter Wetz
Elmar Kiesling
Peb Ruswono Aryan
Tuan-Dat Trinh
A Min Tjoa
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-48472-3_50