Skip to main content
Erschienen in: Neural Computing and Applications 3-4/2014

01.03.2014 | Original Article

Domain ontology graph model and its application in Chinese text classification

verfasst von: James N. K. Liu, Yu-lin He, Edward H. Y. Lim, Xi-zhao Wang

Erschienen in: Neural Computing and Applications | Ausgabe 3-4/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper proposes an ontology learning method which is used to generate a graphical ontology structure called ontology graph. The ontology graph defines the ontology and knowledge conceptualization model, and the ontology learning process defines the method of semiautomatic learning and generates ontology graphs from Chinese texts of different domains, the so-called domain ontology graph (DOG). Meanwhile, we also define two other ontological operations—document ontology graph generation and ontology graph-based text classification, which can be carried out with the generated DOG. This research focuses on Chinese text data, and furthermore, we conduct two experiments: the DOG generation and ontology graph-based text classification, with Chinese texts as the experimental data. The first experiment generates ten DOGs as the ontology graph instances to represent ten different domains of knowledge. The generated DOGs are then further used for the second experiment to provide performance evaluation. The ontology graph-based approach is able to achieve high text classification accuracy (with 92.3 % in f-measure) over other text classification approaches (such as 86.8 % in f-measure for tf–idf approach). The better performance in the comparative experiments reveals that the proposed ontology graph knowledge model, the ontology learning and generation process, and the ontological operations are feasible and effective.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alani H, Sanghee K, Millard DE, Weal MJ, Hall W, Lewis PH, Shadbolt NR (2003) Automatic ontology-based knowledge extraction from web documents. IEEE Intell Syst 18(1):14–21CrossRef Alani H, Sanghee K, Millard DE, Weal MJ, Hall W, Lewis PH, Shadbolt NR (2003) Automatic ontology-based knowledge extraction from web documents. IEEE Intell Syst 18(1):14–21CrossRef
2.
Zurück zum Zitat Besana P, Robertson D (2008) Probabilistic dialogue models for dynamic ontology mapping. Lect Notes Comput Sci 5327:41–51CrossRef Besana P, Robertson D (2008) Probabilistic dialogue models for dynamic ontology mapping. Lect Notes Comput Sci 5327:41–51CrossRef
3.
Zurück zum Zitat Buitelaar P, Ciomiano P (2008) Ontology learning and population: bridging the gap between text and knowledge. IOS Press, The Netherlands Buitelaar P, Ciomiano P (2008) Ontology learning and population: bridging the gap between text and knowledge. IOS Press, The Netherlands
4.
Zurück zum Zitat Busagala LSP, Ohyama W, Wakabayashi T, Kimura F (2008) Improving automatic text classification by integrated feature analysis. IEICE Trans Inf Syst E91(D4):1101–1109CrossRef Busagala LSP, Ohyama W, Wakabayashi T, Kimura F (2008) Improving automatic text classification by integrated feature analysis. IEICE Trans Inf Syst E91(D4):1101–1109CrossRef
5.
Zurück zum Zitat Chen WQ, Mizoguchi R (1999) Communication content ontology for learner model agent in multi-agent architecture. Adv Res Comput Commun Educ 95–102 Chen WQ, Mizoguchi R (1999) Communication content ontology for learner model agent in multi-agent architecture. Adv Res Comput Commun Educ 95–102
6.
Zurück zum Zitat Cimiano P, Hotho A, Staab S (2005) Learning concept hierarchies from text corpora using formal concept analysis. J Artif Intell Res 24(1):305–339MATH Cimiano P, Hotho A, Staab S (2005) Learning concept hierarchies from text corpora using formal concept analysis. J Artif Intell Res 24(1):305–339MATH
7.
Zurück zum Zitat Dahab MY, Hassan HA, Rafea A (2008) TextOntoEx: automatic ontology construction from natural English text. Expert Syst Appl 34(2):1474–1480CrossRef Dahab MY, Hassan HA, Rafea A (2008) TextOntoEx: automatic ontology construction from natural English text. Expert Syst Appl 34(2):1474–1480CrossRef
8.
Zurück zum Zitat Dicheva D, Dichev C (2007) Authors support in the TM4L environment. Int J Inf Technol Knowl 1(3):215–219 Dicheva D, Dichev C (2007) Authors support in the TM4L environment. Int J Inf Technol Knowl 1(3):215–219
9.
Zurück zum Zitat Dong ZD, Dong Q (2006) HowNet and the computation of meaning. World Scientific Publishing Company, SingaporeCrossRef Dong ZD, Dong Q (2006) HowNet and the computation of meaning. World Scientific Publishing Company, SingaporeCrossRef
10.
Zurück zum Zitat Etzioni O, Cafarella M, Downey D,Popescu AM, Shaked T, Soderland S, Weld DS, Yates A (2005) Unsupervised named-entity extraction from the web: an experimental study. Artif Intell 165(1):91–134CrossRef Etzioni O, Cafarella M, Downey D,Popescu AM, Shaked T, Soderland S, Weld DS, Yates A (2005) Unsupervised named-entity extraction from the web: an experimental study. Artif Intell 165(1):91–134CrossRef
11.
Zurück zum Zitat Fensel D, van Harmelen F, Horrocks I, McGuinness DL, Patel-Schneider PF (2001) OIL: an ontology infrastructure for the semantic web. IEEE Intell Syst 16(2):38–45CrossRef Fensel D, van Harmelen F, Horrocks I, McGuinness DL, Patel-Schneider PF (2001) OIL: an ontology infrastructure for the semantic web. IEEE Intell Syst 16(2):38–45CrossRef
12.
Zurück zum Zitat Forman G (2003) An extensive empirical study of feature selection metrics for text classification. Int J Mach Learn Res 3(7–8):1289–1305MATH Forman G (2003) An extensive empirical study of feature selection metrics for text classification. Int J Mach Learn Res 3(7–8):1289–1305MATH
13.
Zurück zum Zitat Gacitua R, Sawyer P, Rayson P (2008) A flexible framework to experiment with ontology learning techniques. Knowl Based Syst 21(3):192–199CrossRef Gacitua R, Sawyer P, Rayson P (2008) A flexible framework to experiment with ontology learning techniques. Knowl Based Syst 21(3):192–199CrossRef
14.
Zurück zum Zitat Gruber TR (2008) Ontology, encyclopedia of database systems. Springer, Berlin Gruber TR (2008) Ontology, encyclopedia of database systems. Springer, Berlin
15.
Zurück zum Zitat Haase P, Völker J (2008) Ontology learning and reasoning-dealing with uncertainty and inconsistency. Lect Notes Comput Sci 5327:366–384CrossRef Haase P, Völker J (2008) Ontology learning and reasoning-dealing with uncertainty and inconsistency. Lect Notes Comput Sci 5327:366–384CrossRef
16.
Zurück zum Zitat Hazman M, El-Beltagy SR, Rafea A (2009) Ontology learning from domain specific Web documents. Int J Metadata Semant Ontol 4(1/2):24–33CrossRef Hazman M, El-Beltagy SR, Rafea A (2009) Ontology learning from domain specific Web documents. Int J Metadata Semant Ontol 4(1/2):24–33CrossRef
17.
Zurück zum Zitat Koutero A, Fujita S, Sugawara K (2010) Design of an assisting agent using a dynamic ontology. Proc IEEE/ACIS Int Conf Comput Inf Sci 611–616 Koutero A, Fujita S, Sugawara K (2010) Design of an assisting agent using a dynamic ontology. Proc IEEE/ACIS Int Conf Comput Inf Sci 611–616
18.
Zurück zum Zitat Lan M, Tan CL, Su J, Lu Y (2009) Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans Pattern Anal Mach Intell 31(4):721–735CrossRef Lan M, Tan CL, Su J, Lu Y (2009) Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans Pattern Anal Mach Intell 31(4):721–735CrossRef
19.
Zurück zum Zitat Lim E, Liu J, Lee R (2009) Knowledge discovery from text learning for ontology modelling. Proc Int Conf Fuzz Syst Knowl Discov 7:227–231 Lim E, Liu J, Lee R (2009) Knowledge discovery from text learning for ontology modelling. Proc Int Conf Fuzz Syst Knowl Discov 7:227–231
20.
Zurück zum Zitat Lougheed P, Bogyo B, Brokenshire D, Kumar V (2005) Towards formalizing electronic portfolios. In: Proceedings of the international workshop applications of semantic Web techniques E-Learn. pp 9–18 Lougheed P, Bogyo B, Brokenshire D, Kumar V (2005) Towards formalizing electronic portfolios. In: Proceedings of the international workshop applications of semantic Web techniques E-Learn. pp 9–18
21.
Zurück zum Zitat Maedche A (2001) Ontology learning for the semantic web. IEEE Intell Syst 16(2):72–79CrossRef Maedche A (2001) Ontology learning for the semantic web. IEEE Intell Syst 16(2):72–79CrossRef
22.
Zurück zum Zitat Mahinovs A, Tiwari A (2007) Text classification method review. Decis Eng Rep Ser 1–13 Mahinovs A, Tiwari A (2007) Text classification method review. Decis Eng Rep Ser 1–13
23.
Zurück zum Zitat Missikoff M, Velardi P, Fabriani P (2003) Text mining techniques to automatically enrich a domain ontology. Appl Intell 18(3):323–340CrossRefMATH Missikoff M, Velardi P, Fabriani P (2003) Text mining techniques to automatically enrich a domain ontology. Appl Intell 18(3):323–340CrossRefMATH
24.
Zurück zum Zitat Mochol M, Jentzsch A, Euzenat J (2006) Applying an analytic method for matching approach selection. In: Proceedings of the international workshop Ontology Match. pp 37–48 Mochol M, Jentzsch A, Euzenat J (2006) Applying an analytic method for matching approach selection. In: Proceedings of the international workshop Ontology Match. pp 37–48
25.
Zurück zum Zitat Navigli R, Velardi P (2004) Learning domain ontologies from document warehouses and dedicated web sites. Comput Linguist 30(2):151–179CrossRefMATH Navigli R, Velardi P (2004) Learning domain ontologies from document warehouses and dedicated web sites. Comput Linguist 30(2):151–179CrossRefMATH
27.
Zurück zum Zitat Noy NF, Musen MA (2000) PROMPT: algorithm and tool for automated ontology merging and alignment. In: Proceedings of the international conference on articial intelligence and conference on Innovative appliance articial intelligence. pp 450–455 Noy NF, Musen MA (2000) PROMPT: algorithm and tool for automated ontology merging and alignment. In: Proceedings of the international conference on articial intelligence and conference on Innovative appliance articial intelligence. pp 450–455
28.
Zurück zum Zitat Oberle D, Eberhart A, Staab S, Volz R (2004) Developing and managing software components in an ontology-based application server. Lect Notes Comput Sci 3231:459–477CrossRef Oberle D, Eberhart A, Staab S, Volz R (2004) Developing and managing software components in an ontology-based application server. Lect Notes Comput Sci 3231:459–477CrossRef
29.
Zurück zum Zitat Oddy RN (1981) Information retrieval research. Butterworths, London Oddy RN (1981) Information retrieval research. Butterworths, London
30.
Zurück zum Zitat Ottens K, Aussenac-Gilles N, Gleizes MP, Camps V (2007) Dynamic ontology co-evolution from texts: principles and case study. In: Proceedings of the international workshop on the emerging semantics ontology evolving. pp 70–83 Ottens K, Aussenac-Gilles N, Gleizes MP, Camps V (2007) Dynamic ontology co-evolution from texts: principles and case study. In: Proceedings of the international workshop on the emerging semantics ontology evolving. pp 70–83
31.
Zurück zum Zitat Rosse C, Mejino JL Jr (2003) A reference ontology for biomedical informatics: the foundational model of anatomy. J Biomed Inform 36(6):478–500CrossRef Rosse C, Mejino JL Jr (2003) A reference ontology for biomedical informatics: the foundational model of anatomy. J Biomed Inform 36(6):478–500CrossRef
32.
Zurück zum Zitat Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1–47CrossRef Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1–47CrossRef
34.
Zurück zum Zitat Simperl E (2009) Reusing ontologies on the semantic web: a feasibility study. Data Knowl Eng 68(10):905−925CrossRef Simperl E (2009) Reusing ontologies on the semantic web: a feasibility study. Data Knowl Eng 68(10):905−925CrossRef
35.
Zurück zum Zitat Sun AX, Lim EP, Ng WK (2003) Performance measurement framework for hierarchical text classification. J Am Soc Inf Sci Tech 54(11):1014–1028CrossRef Sun AX, Lim EP, Ng WK (2003) Performance measurement framework for hierarchical text classification. J Am Soc Inf Sci Tech 54(11):1014–1028CrossRef
36.
Zurück zum Zitat Vacura M, Svátek V, Smrž P (2008) Pattern-based framework for representation of uncertainty in ontologies.In: Proceedings of the international conference on text speech and dialogue. pp 227–234 Vacura M, Svátek V, Smrž P (2008) Pattern-based framework for representation of uncertainty in ontologies.In: Proceedings of the international conference on text speech and dialogue. pp 227–234
37.
Zurück zum Zitat Vagin V, Fomina M (2011) Problem of knowledge discovery in noisy databases. Int J Mach Learn Cyber 2(3):135–145CrossRef Vagin V, Fomina M (2011) Problem of knowledge discovery in noisy databases. Int J Mach Learn Cyber 2(3):135–145CrossRef
38.
Zurück zum Zitat Wang XZ, Dong LC, Yan JH (2012) Maximum ambiguity based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505CrossRef Wang XZ, Dong LC, Yan JH (2012) Maximum ambiguity based sample selection in fuzzy decision tree induction. IEEE Trans Knowl Data Eng 24(8):1491–1505CrossRef
39.
Zurück zum Zitat Wang XZ, He YL, Dong LC, Zhao HY (2011) Particle swarm optimization for determining fuzzy measures from data. Inf Sci 181(19):4230–4252CrossRefMATH Wang XZ, He YL, Dong LC, Zhao HY (2011) Particle swarm optimization for determining fuzzy measures from data. Inf Sci 181(19):4230–4252CrossRefMATH
40.
Zurück zum Zitat Warren P (2006) Knowledge management and the semantic web: from scenario to technology. IEEE Intell Syst 21(1):53−59CrossRef Warren P (2006) Knowledge management and the semantic web: from scenario to technology. IEEE Intell Syst 21(1):53−59CrossRef
41.
Zurück zum Zitat Yi WG, Lu MY, Liu Z (2011) Multi-valued attribute and multi-labeled data decision tree algorithm. Int J Mach Learn Cyber 2(2):67–74CrossRef Yi WG, Lu MY, Liu Z (2011) Multi-valued attribute and multi-labeled data decision tree algorithm. Int J Mach Learn Cyber 2(2):67–74CrossRef
42.
Zurück zum Zitat Zahiri SH (2012) Classification rule discovery using learning automata. Int J Mach Learn Cyber 3(3):205–213CrossRefMathSciNet Zahiri SH (2012) Classification rule discovery using learning automata. Int J Mach Learn Cyber 3(3):205–213CrossRefMathSciNet
43.
Zurück zum Zitat Zhang Y, Vasconcelos W, Sleeman D (2005) OntoSearch: an ontology search engine. Res Dev Intell Syst XXI 1a:58–69CrossRef Zhang Y, Vasconcelos W, Sleeman D (2005) OntoSearch: an ontology search engine. Res Dev Intell Syst XXI 1a:58–69CrossRef
44.
Zurück zum Zitat Zhang Q, Xing CX, Zhou LZ, Feng JH (2003) An ontology-based method for querying the web data. In: Proceedings of the international conference on advanced information network application 628-631 Zhang Q, Xing CX, Zhou LZ, Feng JH (2003) An ontology-based method for querying the web data. In: Proceedings of the international conference on advanced information network application 628-631
45.
Zurück zum Zitat Zhong ZM, Liu ZT, Li CH, Guan Y (2012) Event ontology reasoning based on event class influence factors. Int J Mach Learn Cyber 3(2):133–139CrossRef Zhong ZM, Liu ZT, Li CH, Guan Y (2012) Event ontology reasoning based on event class influence factors. Int J Mach Learn Cyber 3(2):133–139CrossRef
Metadaten
Titel
Domain ontology graph model and its application in Chinese text classification
verfasst von
James N. K. Liu
Yu-lin He
Edward H. Y. Lim
Xi-zhao Wang
Publikationsdatum
01.03.2014
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 3-4/2014
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-012-1272-z

Weitere Artikel der Ausgabe 3-4/2014

Neural Computing and Applications 3-4/2014 Zur Ausgabe