Skip to main content
Top
Published in: Journal of Intelligent Information Systems 1/2012

01-02-2012

Learning non-taxonomical semantic relations from domain texts

Authors: Janardhana Punuru, Jianhua Chen

Published in: Journal of Intelligent Information Systems | Issue 1/2012

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Ontology of a domain mainly consists of concepts, taxonomical (hierarchical) relations and non-taxonomical relations. Automatic ontology construction requires methods for extracting both taxonomical and non-taxonomical relations. Compared to extensive works on concept extraction and taxonomical relation learning, little attention has been given on identification and labeling of non-taxonomical relations in text mining. In this paper, we propose an unsupervised technique for extracting non-taxonomical relations from domain texts. We propose the VF*ICF metric for measuring the importance of a verb as a representative relation label, in much the same spirit as the TF*IDF measure in information retrieval. Domain-relevant concepts (nouns) are extracted using techniques developed earlier. Candidate non-taxonomical relations are generated as (SVO) triples of the form (subject, verb, object) from domain texts. A statistical method with log-likelihood ratios is used to estimate the significance of relationships between concepts and to select suitable relation labels. Texts from two domains, the Electronic Voting (EV) domain texts and the Tenders and Mergers (TNM) domain texts are used to compare our method with one of the existing approaches. Experiments showed that our method achieved better performance in both domains.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Berland, M., & Charniak, E. (1996). Finding parts in very large corpora. In Proc. 37th annual meeting of association for computational linguistics (pp. 57–64). Berland, M., & Charniak, E. (1996). Finding parts in very large corpora. In Proc. 37th annual meeting of association for computational linguistics (pp. 57–64).
go back to reference Berners-Lee, T., Hendler, J., & Lassila, O. (2001). The semantic web (pp. 30–37). Scientific American. Berners-Lee, T., Hendler, J., & Lassila, O. (2001). The semantic web (pp. 30–37). Scientific American.
go back to reference Brill, E. (1992). A simple rule-based part-of-speech tagger. In Third conference on applied natural language processing. Brill, E. (1992). A simple rule-based part-of-speech tagger. In Third conference on applied natural language processing.
go back to reference Caraballo, S. A. (1999). Automatic construction of a hypernym-labeled noun hierarchy from text. In Proc. 37th annual meeting of association for computational linguistics. Caraballo, S. A. (1999). Automatic construction of a hypernym-labeled noun hierarchy from text. In Proc. 37th annual meeting of association for computational linguistics.
go back to reference Cederberg, S., & Widdows, D. (2003). Using LSA and noun coordination information to improve the precision and recall of automatic hyponymy extraction. In Proc. of conference on natural language learning. Edmonton, Canada. Cederberg, S., & Widdows, D. (2003). Using LSA and noun coordination information to improve the precision and recall of automatic hyponymy extraction. In Proc. of conference on natural language learning. Edmonton, Canada.
go back to reference Ciaramita, M., Gangemi, A., Ratsch, E., Jasmin, S., & Isabel, R. (2005). Unsupervised learning of semantic relations between concepts of molecular biology ontology. In Proc. of 19th international joint conference on artificial intelligence. Ciaramita, M., Gangemi, A., Ratsch, E., Jasmin, S., & Isabel, R. (2005). Unsupervised learning of semantic relations between concepts of molecular biology ontology. In Proc. of 19th international joint conference on artificial intelligence.
go back to reference Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19, 61–74. Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19, 61–74.
go back to reference Faure, D., & Nedellec, C. (1998). A corpus-based conceptual clustering method for verb frames and ontology acquisition. In Proceedings of LREC workshop on adapting lexical and corpus resources to sublangauges and applications (pp. 5–12). Faure, D., & Nedellec, C. (1998). A corpus-based conceptual clustering method for verb frames and ontology acquisition. In Proceedings of LREC workshop on adapting lexical and corpus resources to sublangauges and applications (pp. 5–12).
go back to reference Fu, J., Fan, X., Mao, J., & Liu, X. (2009). Two stage semantic relation extraction. In Proceedings of international conference on hybrid intelligent systems. Shen Yang, China. Fu, J., Fan, X., Mao, J., & Liu, X. (2009). Two stage semantic relation extraction. In Proceedings of international conference on hybrid intelligent systems. Shen Yang, China.
go back to reference Girju, R., Badulescu, A., & Moldovan, D. (2003). Learning semantic constraints for the automatic discovery of part-whole relations. In Proc. HLT/NAACL-03 (pp. 80–87). Edmonton, Canada. Girju, R., Badulescu, A., & Moldovan, D. (2003). Learning semantic constraints for the automatic discovery of part-whole relations. In Proc. HLT/NAACL-03 (pp. 80–87). Edmonton, Canada.
go back to reference Girju, R., & Moldovan, D. (2002). Text mining for causal relations. In Proc. of FLAIRS conference (pp. 360–364). Girju, R., & Moldovan, D. (2002). Text mining for causal relations. In Proc. of FLAIRS conference (pp. 360–364).
go back to reference Hasegawa, T., Sekine, S., & Grishman, R. (2004). Discovering relations among named entities from large corpora. In Proc. of association of computational linguistics. Hasegawa, T., Sekine, S., & Grishman, R. (2004). Discovering relations among named entities from large corpora. In Proc. of association of computational linguistics.
go back to reference Hearst, M. A. (1992). Automatic acquisition of hyponyms from large text corpora. In Proc. 14th international conference computational linguistics. Nantes, France. Hearst, M. A. (1992). Automatic acquisition of hyponyms from large text corpora. In Proc. 14th international conference computational linguistics. Nantes, France.
go back to reference Jacquemin, C. (1995). A symbolic and surgical acquisition of terms through variation. In Symbolic approaches to learning for natural language processing (pp. 425–438). Jacquemin, C. (1995). A symbolic and surgical acquisition of terms through variation. In Symbolic approaches to learning for natural language processing (pp. 425–438).
go back to reference Kavalec, M., Maedche, A., & Svatek, V. (2004). Discovery of lexical entries for non-taxonomic relations in ontology learning. In SOFSEM—theory and practice of computer science. LNCS (Vol. 2932). Springer Kavalec, M., Maedche, A., & Svatek, V. (2004). Discovery of lexical entries for non-taxonomic relations in ontology learning. In SOFSEM—theory and practice of computer science. LNCS (Vol. 2932). Springer
go back to reference Lin, D. (1999). MINIPAR: A minimalist parser. In Maryland linguistics colloquium. University of Maryland, College Park. Lin, D. (1999). MINIPAR: A minimalist parser. In Maryland linguistics colloquium. University of Maryland, College Park.
go back to reference Maedche, A., & Volz, R. (2001). The text-to-ontology extraction and maintenance system. In ICDM-workshop on integrating data mining and knowledge management. San Jose, California. Maedche, A., & Volz, R. (2001). The text-to-ontology extraction and maintenance system. In ICDM-workshop on integrating data mining and knowledge management. San Jose, California.
go back to reference Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. MIT Press. Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. MIT Press.
go back to reference Miller, G., Beckwith, R., Fellbaum, C., Gross, D., & Miller, K. (1990). An introduction to WordNet: An on-line lexical database. International Journal of Lexicography, 3(4), 235–244.CrossRef Miller, G., Beckwith, R., Fellbaum, C., Gross, D., & Miller, K. (1990). An introduction to WordNet: An on-line lexical database. International Journal of Lexicography, 3(4), 235–244.CrossRef
go back to reference Pantel, P., & Lin, D. (2001). A statistical corpus based term extractor. In E. Stroulia, & S. Martwin, (Eds.), AI lecture notes in artificial intelligence (pp. 35–46). Pantel, P., & Lin, D. (2001). A statistical corpus based term extractor. In E. Stroulia, & S. Martwin, (Eds.), AI lecture notes in artificial intelligence (pp. 35–46).
go back to reference Punuru, J., & Chen, J. (2006). Automatic acquisition of concepts from domain texts. In Proc. of IEEE granular computing Atlanta. Punuru, J., & Chen, J. (2006). Automatic acquisition of concepts from domain texts. In Proc. of IEEE granular computing Atlanta.
go back to reference Punuru, J., & Chen, J. (2007). Extraction of non-hierarchical relations from domain texts. In Proc. of IEEE symposium on computational intelligence and data mining. Honolulu. Punuru, J., & Chen, J. (2007). Extraction of non-hierarchical relations from domain texts. In Proc. of IEEE symposium on computational intelligence and data mining. Honolulu.
go back to reference Qian, L., Zhou, G., Kong, F., & Zhu, Q. (2009). Semi-supervised learning for semantic relation classification using stratified sampling strategy. In Proceedings of the 2009 conference on empirical methods in natural language processing (pp. 1437–1445). Singapore. Qian, L., Zhou, G., Kong, F., & Zhu, Q. (2009). Semi-supervised learning for semantic relation classification using stratified sampling strategy. In Proceedings of the 2009 conference on empirical methods in natural language processing (pp. 1437–1445). Singapore.
go back to reference Ramshaw, L., & Marcus, M. (1995). Text chunking using transformation-based learning. In Third association for computational linguistics workshop on very large corpora. Ramshaw, L., & Marcus, M. (1995). Text chunking using transformation-based learning. In Third association for computational linguistics workshop on very large corpora.
go back to reference Riloff, E. (1996). Automatically generating extraction patterns from untagged text. In Proc. of 13th national conference on artificial intelligence (pp. 1044–1049). Riloff, E. (1996). Automatically generating extraction patterns from untagged text. In Proc. of 13th national conference on artificial intelligence (pp. 1044–1049).
go back to reference Schutz, A., & Buitelaar, P. (2005). RelExt: A tool for relation extraction from text in ontology extension. In Proc. of 4h international semantic web conference (ISWC-2005). Galway, Ireland. Schutz, A., & Buitelaar, P. (2005). RelExt: A tool for relation extraction from text in ontology extension. In Proc. of 4h international semantic web conference (ISWC-2005). Galway, Ireland.
go back to reference Stevenson, M. (2004). An unsupervised wordnet-based algorithm for relation extraction. In Fourth international conference on language resources(LREC-04). Lisbon, Portugal. Stevenson, M. (2004). An unsupervised wordnet-based algorithm for relation extraction. In Fourth international conference on language resources(LREC-04). Lisbon, Portugal.
go back to reference Tomokiyo, T., & Hurst, M. (2003). A language model approach for keyphrase extraction. In Proc. of ACL 2003 workshop on multiword expressions: Analysis, acquisition, and treatment (pp. 33–40). Tomokiyo, T., & Hurst, M. (2003). A language model approach for keyphrase extraction. In Proc. of ACL 2003 workshop on multiword expressions: Analysis, acquisition, and treatment (pp. 33–40).
go back to reference Yangarber, R., Grishman, R., Tapanainen P., & Huttunen, S. (2000). Unsupervised discovery of scenario-level patterns for information extraction. In Proc. of applied natural language processing conference. Seattle, WA. Yangarber, R., Grishman, R., Tapanainen P., & Huttunen, S. (2000). Unsupervised discovery of scenario-level patterns for information extraction. In Proc. of applied natural language processing conference. Seattle, WA.
go back to reference Zhou, G., Li, J., Qian, L., & Zhu, J. (2008). Semi-supervised learning for relation extraction. In Proceedings of international joint conference on natural language processing (IJCNLP08) (pp. 32–38). Hyderabad, India. Zhou, G., Li, J., Qian, L., & Zhu, J. (2008). Semi-supervised learning for relation extraction. In Proceedings of international joint conference on natural language processing (IJCNLP08) (pp. 32–38). Hyderabad, India.
Metadata
Title
Learning non-taxonomical semantic relations from domain texts
Authors
Janardhana Punuru
Jianhua Chen
Publication date
01-02-2012
Publisher
Springer US
Published in
Journal of Intelligent Information Systems / Issue 1/2012
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-011-0149-4

Other articles of this Issue 1/2012

Journal of Intelligent Information Systems 1/2012 Go to the issue

Premium Partner