
2016 | Original Paper | Book Chapter

ProMine: A Text Mining Solution for Concept Extraction and Filtering

Written by: Saira Gillani, Andrea Kő

Published in: Corporate Knowledge Discovery and Organizational Learning

Publisher: Springer International Publishing

Abstract

Due to the ongoing economic crisis, the management of organizational knowledge is becoming increasingly important. This knowledge resides in organizational processes. Extracting this hidden knowledge from business processes and using it for domain ontology development is a major challenge. This chapter presents ProMine, a text mining tool for ontology extraction that extracts deep representations from business processes. ProMine extracts new domain-related concepts and applies a new filtering mechanism, based on a new hybrid similarity measure, to retain the most relevant concepts. The tool is evaluated through a case study in the insurance domain. The results show that ProMine performs well and generates many new concepts for each business process.

Footnotes
1
EUREKA_HU_12-1-2012-0039, supported by the Research and Technology Innovation Fund, New Széchenyi Plan, Hungary.
 
Metadata
Title
ProMine: A Text Mining Solution for Concept Extraction and Filtering
Written by
Saira Gillani
Andrea Kő
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-28917-5_3
