Skip to main content

2016 | OriginalPaper | Buchkapitel

Are Names Meaningful? Quantifying Social Meaning on the Semantic Web

verfasst von : Steven de Rooij, Wouter Beek, Peter Bloem, Frank van Harmelen, Stefan Schlobach

Erschienen in: The Semantic Web – ISWC 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

According to its model-theoretic semantics, Semantic Web IRIs are individual constants or predicate letters whose names are chosen arbitrarily and carry no formal meaning. At the same time it is a well-known aspect of Semantic Web pragmatics that IRIs are often constructed mnemonically, in order to be meaningful to a human interpreter. The latter has traditionally been termed ‘social meaning’, a concept that has been discussed but not yet quantitatively studied by the Semantic Web community. In this paper we use measures of mutual information content and methods from statistical model learning to quantify the meaning that is (at least) encoded in Semantic Web names. We implement the approach and evaluate it over hundreds of thousands of datasets in order to illustrate its efficacy. Our experiments confirm that many Semantic Web names are indeed meaningful and, more interestingly, we provide a quantitative lower bound on how much meaning is encoded in names on a per-dataset basis. To our knowledge, this is the first paper about the interaction between social and formal meaning, as well as the first paper that uses statistical model learning as a method to quantify meaning in the Semantic Web context. These insights are useful for the design of a new generation of Semantic Web tools that take such social meaning into account.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
Notice that the official semantics of RDF [13] is defined in terms of a Herbrand Universe, i.e., the IRI dbr:London does not refer to the city of London but to the syntactic term dbr:London. Under the official semantics graphs G and H are therefore not isomorphic and they do not denote the same models. The authors believe that RDF names refer to objects and concepts in the real world and not (solely) to syntactic constructs in a Herbrand Universe.
 
3
A detailed proof for this, and for (1) is shared as an external resource at http://​wouterbeek.​github.​io/​iswc2016_​appendix.​pdf.
 
4
Or, equivalently, we must design a code which exploits the information that IRIs carry about their meaning to store the dataset efficiently.
 
5
The Pitman-Yor process itself does not specify which new meaning we should choose, only that a new meaning should be chosen. This distribution on meanings in \(\mathcal Y\) is inspired by the Dirichlet-Multinomial model.
 
7
Datasets with fewer than 1, 000 statements are not included in order to get a clear picture of what happens in case we have sufficient data to refute the null, as indicated by our observations from Fig. 1. A zoomed out version of Fig. 2, scaling to \(\log (p)\) values of \(-300,000\) is available at https://​goo.​gl/​r3uxpA, but is not included in this paper because its scale is no longer suitable for print.
 
Literatur
1.
Zurück zum Zitat Beek, W., Rietveld, L., Bazoobandi, H.R., Wielemaker, J., Schlobach, S.: LOD laundromat: a uniform way of publishing other people’s dirty data. In: Mika, P., et al. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 213–228. Springer, Heidelberg (2014) Beek, W., Rietveld, L., Bazoobandi, H.R., Wielemaker, J., Schlobach, S.: LOD laundromat: a uniform way of publishing other people’s dirty data. In: Mika, P., et al. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 213–228. Springer, Heidelberg (2014)
4.
Zurück zum Zitat Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley-Interscience, Hoboken (2006)MATH Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley-Interscience, Hoboken (2006)MATH
5.
Zurück zum Zitat Cyganiak, R., Wood, D., Lanthaler, M.: RDF 1.1 concepts and abstract syntax (2014) Cyganiak, R., Wood, D., Lanthaler, M.: RDF 1.1 concepts and abstract syntax (2014)
6.
Zurück zum Zitat Ding, L., Finin, T.W.: Characterizing the semantic web on the web. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 242–257. Springer, Heidelberg (2006)CrossRef Ding, L., Finin, T.W.: Characterizing the semantic web on the web. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 242–257. Springer, Heidelberg (2006)CrossRef
7.
Zurück zum Zitat Dragisic, Z., Eckert, K., Euzenat, J., Faria, D., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A.O., Lambrix, P., et al.: Results of the ontology alignment evaluation initiative 2014. In: Proceedings of the 9th International Conference on Ontology Matching, vol. 1317, pp. 61–104 (2014) Dragisic, Z., Eckert, K., Euzenat, J., Faria, D., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A.O., Lambrix, P., et al.: Results of the ontology alignment evaluation initiative 2014. In: Proceedings of the 9th International Conference on Ontology Matching, vol. 1317, pp. 61–104 (2014)
9.
Zurück zum Zitat Farrugia, J.: Model-theoretic semantics for the web. In: Proceedings of the 12th Internaional Conference on WWW, pp. 29–38. ACM (2003) Farrugia, J.: Model-theoretic semantics for the web. In: Proceedings of the 12th Internaional Conference on WWW, pp. 29–38. ACM (2003)
10.
Zurück zum Zitat Gottron, T., Knauf, M., Scheglmann, S., Scherp, A.: A systematic investigation of explicit and implicit schema information on the linked open data cloud. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 228–242. Springer, Heidelberg (2013)CrossRef Gottron, T., Knauf, M., Scheglmann, S., Scherp, A.: A systematic investigation of explicit and implicit schema information on the linked open data cloud. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 228–242. Springer, Heidelberg (2013)CrossRef
11.
Zurück zum Zitat Halpin, H.: Social Semantics: The Search for Meaning on the Web. Semantic Web and Beyond, 13th edn. Springer, Heidelberg (2013)CrossRef Halpin, H.: Social Semantics: The Search for Meaning on the Web. Semantic Web and Beyond, 13th edn. Springer, Heidelberg (2013)CrossRef
12.
Zurück zum Zitat Halpin, H., Thompson, H.: Social meaning on the web: from Wittgenstein to search engines. IEEE Intell. Syst. 24(6), 27–31 (2009)CrossRef Halpin, H., Thompson, H.: Social meaning on the web: from Wittgenstein to search engines. IEEE Intell. Syst. 24(6), 27–31 (2009)CrossRef
13.
Zurück zum Zitat Hayes, P.J., Patel-Schneider, P.F.: RDF 1.1 semantics (2014) Hayes, P.J., Patel-Schneider, P.F.: RDF 1.1 semantics (2014)
14.
Zurück zum Zitat Huang, Z., van Harmelen, F.: Using semantic distances for reasoning with inconsistent ontologies. In: Sheth, A.P., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 178–194. Springer, Heidelberg (2008)CrossRef Huang, Z., van Harmelen, F.: Using semantic distances for reasoning with inconsistent ontologies. In: Sheth, A.P., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 178–194. Springer, Heidelberg (2008)CrossRef
15.
Zurück zum Zitat Oren, E., Delbru, R., Catasta, M., Cyganiak, R., Stenzhorn, H., Tummarello, G.: Sindice.com: a document-oriented lookup index for Open Linked Data. Int. J. Metadata Semant. Ontol. 3(1), 37–52 (2008)CrossRef Oren, E., Delbru, R., Catasta, M., Cyganiak, R., Stenzhorn, H., Tummarello, G.: Sindice.com: a document-oriented lookup index for Open Linked Data. Int. J. Metadata Semant. Ontol. 3(1), 37–52 (2008)CrossRef
16.
Zurück zum Zitat Pitman, J., Yor, M.: The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator. Ann. Probab. 25(2), 855–900 (1997)MathSciNetCrossRefMATH Pitman, J., Yor, M.: The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator. Ann. Probab. 25(2), 855–900 (1997)MathSciNetCrossRefMATH
17.
Zurück zum Zitat Sauermann, L., Cyganiak, R.: Cool URIs for the semantic web (2006) Sauermann, L., Cyganiak, R.: Cool URIs for the semantic web (2006)
18.
Zurück zum Zitat Stoilos, G., Stamou, G., Kollias, S.D.: A string metric for ontology alignment. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 624–637. Springer, Heidelberg (2005)CrossRef Stoilos, G., Stamou, G., Kollias, S.D.: A string metric for ontology alignment. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 624–637. Springer, Heidelberg (2005)CrossRef
19.
Zurück zum Zitat Theoharis, Y., Tzitzikas, Y., Kotzinos, D., Christophides, V.: On graph features of semantic web schemas. IEEE Trans. Knowl. Data Eng. 20(5), 692–702 (2008)CrossRef Theoharis, Y., Tzitzikas, Y., Kotzinos, D., Christophides, V.: On graph features of semantic web schemas. IEEE Trans. Knowl. Data Eng. 20(5), 692–702 (2008)CrossRef
Metadaten
Titel
Are Names Meaningful? Quantifying Social Meaning on the Semantic Web
verfasst von
Steven de Rooij
Wouter Beek
Peter Bloem
Frank van Harmelen
Stefan Schlobach
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46523-4_12