Skip to main content
Top
Published in: Earth Science Informatics 1/2015

01-03-2015 | Research Article

SEM+: tool for discovering concept mapping in Earth science related domain

Authors: Jin Guang Zheng, Linyun Fu, Xiaogang Ma, Peter Fox

Published in: Earth Science Informatics | Issue 1/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The amount of Earth Science related domain concepts and vocabularies encoded in popular Semantic Web languages such as OWL and SKOS grows rapidly as more and more domain scientists realize the power of Semantic Web Technologies. The interlinking between these concepts will enable the possibility of performing data integration and identity recognition, which is crucial in developing applications that use data from multiple sources. In this paper, we discuss a new tool for performing concept mapping called SEM+. In SEM+, we designed the Information Entropy based Weighted Similarity Model to compute semantic similarity between entity data and suggest possible linking. We also adopted a blocking approach to group possible matching entities into one block and therefore reduce the computation space. We performed evaluations on SEM+ using the Integrated Ocean Observatory System ontology and the Marine Metadata Interoperability ontology and discussed the results and new findings.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. ACM Press, Addison-Wesley Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. ACM Press, Addison-Wesley
go back to reference Bechhofer S (2009) OWL: web ontology language. Encyclopedia of Database Sytems. Springer US. 2008–2009. Miles, Alistair, and José R. Pérez-Agüera. Bechhofer S (2009) OWL: web ontology language. Encyclopedia of Database Sytems. Springer US. 2008–2009. Miles, Alistair, and José R. Pérez-Agüera.
go back to reference Benjelloun O, Garcia-Mollina H, Menestrina D, Su Q, Whang S, Widom J (2009) Swoosh: a generic approach to entity resolution. VLDB J 18(1):255–276CrossRef Benjelloun O, Garcia-Mollina H, Menestrina D, Su Q, Whang S, Widom J (2009) Swoosh: a generic approach to entity resolution. VLDB J 18(1):255–276CrossRef
go back to reference Berners-Lee T, Hendler J, Lassila O (2001) The semantic web. Sci Am 284(5):28–37CrossRef Berners-Lee T, Hendler J, Lassila O (2001) The semantic web. Sci Am 284(5):28–37CrossRef
go back to reference Chaudhuri S, Ganti V, Motwani R (2005) Robust identification of fuzzy duplicates. In Prof. of ICDE, pp. 865–876 Chaudhuri S, Ganti V, Motwani R (2005) Robust identification of fuzzy duplicates. In Prof. of ICDE, pp. 865–876
go back to reference Cruz IF, Antonelli FP, Stroe C (2009) Agreementmaker: efficient matching for large real-world schemas and ontologies. PVLDB 2(2):1586–1589 Cruz IF, Antonelli FP, Stroe C (2009) Agreementmaker: efficient matching for large real-world schemas and ontologies. PVLDB 2(2):1586–1589
go back to reference Dong X, Halevy Y, Madhavan J (2005) Reference reconciliation in complex information spaces. In Proc. of SIGMOD, pp. 865–876 Dong X, Halevy Y, Madhavan J (2005) Reference reconciliation in complex information spaces. In Proc. of SIGMOD, pp. 865–876
go back to reference Duan S, Fokoue A, Srinivas K, Byrne B (2012) A clustering-based approach to ontology alignment, In Proc. of ISWC Duan S, Fokoue A, Srinivas K, Byrne B (2012) A clustering-based approach to ontology alignment, In Proc. of ISWC
go back to reference Elfeky M, Elmagarmid A, Verykios V (2005) Tailor: a record linkage tool box. In Proc. of SIGMOD, pp. 85–96 Elfeky M, Elmagarmid A, Verykios V (2005) Tailor: a record linkage tool box. In Proc. of SIGMOD, pp. 85–96
go back to reference Euzenat J (1994) Brief overview of T-Tree: the TROPES taxonomy building tool, in: 4th ASIS SIG/CR Workshop on Classification Research, Columbus (OH, US), pp. 69–87 Euzenat J (1994) Brief overview of T-Tree: the TROPES taxonomy building tool, in: 4th ASIS SIG/CR Workshop on Classification Research, Columbus (OH, US), pp. 69–87
go back to reference Haq BU (ed) (2007) The geological time table, 6th edn. Elsevier, Amsterdam Haq BU (ed) (2007) The geological time table, 6th edn. Elsevier, Amsterdam
go back to reference Jean-Mary Y, Shironoshita E, Kabuka M (2009) Ontology matching with semantic verification. In Proc. of Web Semantics: Science, Services and Agents on the World Wide Web Jean-Mary Y, Shironoshita E, Kabuka M (2009) Ontology matching with semantic verification. In Proc. of Web Semantics: Science, Services and Agents on the World Wide Web
go back to reference Jimenez-Ruiz E, Grau B (2012) LogMap: logic-based and scalable ontology matching, In Proc. of ISWC Jimenez-Ruiz E, Grau B (2012) LogMap: logic-based and scalable ontology matching, In Proc. of ISWC
go back to reference Klyne G, Carroll JJ (2006) Resource description framework (RDF): concepts and abstract syntax Klyne G, Carroll JJ (2006) Resource description framework (RDF): concepts and abstract syntax
go back to reference Lin D (1998) An information-theoretic definition of similarity. In Proc. of 15th International Conference of machine Learning (ICML) pp. 296–304 Lin D (1998) An information-theoretic definition of similarity. In Proc. of 15th International Conference of machine Learning (ICML) pp. 296–304
go back to reference Newcombe H, Kenedy J (1962) Record linkage: making maximum use of the discriminating power of identifying information. Commun ACM 5(11):563–566 Newcombe H, Kenedy J (1962) Record linkage: making maximum use of the discriminating power of identifying information. Commun ACM 5(11):563–566
go back to reference Nguyen K, Ichise R, Le B (2012) SLINT: a schema-independent linked data interlinking system. In Ontology Matching (OM 2012) Nguyen K, Ichise R, Le B (2012) SLINT: a schema-independent linked data interlinking system. In Ontology Matching (OM 2012)
go back to reference Raskin RG, Pan MJ (2005) Knowledge representation in the semantic web for earth and environmental terminology (SWEET). Comput Geosci 31(9):1119–1125CrossRef Raskin RG, Pan MJ (2005) Knowledge representation in the semantic web for earth and environmental terminology (SWEET). Comput Geosci 31(9):1119–1125CrossRef
go back to reference Rong S, Niu X, Xiang E, Wang H, Yang Q, Yu Y (2012) A machine learning approach for instance matching based on similarity metrics. In Proc. Of ISWC Rong S, Niu X, Xiang E, Wang H, Yang Q, Yu Y (2012) A machine learning approach for instance matching based on similarity metrics. In Proc. Of ISWC
go back to reference Sarawagi S, Bhamidipaty A (2002) Interactive deduplication using active learning. In Proc. of KDD, pp. 269–278 Sarawagi S, Bhamidipaty A (2002) Interactive deduplication using active learning. In Proc. of KDD, pp. 269–278
go back to reference Shannon C (1948) A mathematical theory of communication. Bell Sys Techn J 27(3):379–423CrossRef Shannon C (1948) A mathematical theory of communication. Bell Sys Techn J 27(3):379–423CrossRef
go back to reference Shvaiko P, Euzenant J (2013) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng Shvaiko P, Euzenant J (2013) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng
go back to reference SKOS (2007) simple knowledge organisation for the web. Cat Classif Q 43.3–4: 69–83. SKOS (2007) simple knowledge organisation for the web. Cat Classif Q 43.3–4: 69–83.
go back to reference Stumme G, Madche A (2011) FCA-Merge: bottom-up merging of ontologies, In the 7th International conference on artificial Intellligence (IJCAI), pp. 225–230 Stumme G, Madche A (2011) FCA-Merge: bottom-up merging of ontologies, In the 7th International conference on artificial Intellligence (IJCAI), pp. 225–230
go back to reference Tang J, Li J, Liang B, Huang X, Li Y, Wang K (2006) Using Bayesian decision for ontology mapping. J Web Semantics Sci, Serv Agents World Wide Web, pp. 243–262 Tang J, Li J, Liang B, Huang X, Li Y, Wang K (2006) Using Bayesian decision for ontology mapping. J Web Semantics Sci, Serv Agents World Wide Web, pp. 243–262
go back to reference Thor A, Rahm E (2007) Moma – a mapping-based object matching system. In Proc. of CIDR, pp. 247–258 Thor A, Rahm E (2007) Moma – a mapping-based object matching system. In Proc. of CIDR, pp. 247–258
go back to reference Volz J, Bizer C, Gaedke M, Kobilarov G (2009) Discovering and maintaining links on the web of data. In Proc. of ISWC, pp. 650–665 Volz J, Bizer C, Gaedke M, Kobilarov G (2009) Discovering and maintaining links on the web of data. In Proc. of ISWC, pp. 650–665
go back to reference Walker JD, Geissman JW (2009) 2009 GSA geologic time scale. GSA Today 19(4–5):60–61CrossRef Walker JD, Geissman JW (2009) 2009 GSA geologic time scale. GSA Today 19(4–5):60–61CrossRef
go back to reference Yancey W (2002) Bigmatch: a program for extracting probable matches from a large file for record linkage. Statistical research report series rrc2002/01, U.S. Bureau of Census Yancey W (2002) Bigmatch: a program for extracting probable matches from a large file for record linkage. Statistical research report series rrc2002/01, U.S. Bureau of Census
Metadata
Title
SEM+: tool for discovering concept mapping in Earth science related domain
Authors
Jin Guang Zheng
Linyun Fu
Xiaogang Ma
Peter Fox
Publication date
01-03-2015
Publisher
Springer Berlin Heidelberg
Published in
Earth Science Informatics / Issue 1/2015
Print ISSN: 1865-0473
Electronic ISSN: 1865-0481
DOI
https://doi.org/10.1007/s12145-014-0203-1

Other articles of this Issue 1/2015

Earth Science Informatics 1/2015 Go to the issue

Premium Partner