Published in: Neural Computing and Applications 14/2024

15-03-2024 | Original Article

An enrichment multi-layer Arabic text classification model based on siblings patterns extraction

Authors: Amira M. Idrees, Abdul Lateef Marzouq Al-Solami


Abstract

Ontology extraction is the cornerstone of meaningful knowledge representation. Ontologies act as a repository of semantic relations in a readable format, with a clear representation of the domain knowledge. This makes automated ontology construction a promising research objective with a direct impact on many related fields, including knowledge base systems and text classification. In this research, a workflow is set up for ontology learning from Arabic textual data. One of the bottlenecks in text analytics is the continuous need for up-to-date resources such as lexicons. This challenge is one of the main focuses of the current research, which proposes an automated ontology extraction method that uses no pre-defined resources. The research proposes a novel generic ontology learning and document classification model that relies on no prior text analysis resources. Moreover, a self-enrichment approach is proposed to ensure continuous knowledge construction. The research extends the ontology learning process to include the ontologies’ semantic relationships, targeting a higher level of extraction and model enrichment. Two experiments were conducted on two datasets from different fields to test the generality of the proposed model. The results of both experiments confirmed the high accuracy of the proposed model and its positive contribution to the classification task. The ontology learning task reached 95%, while in the classification task the Bagging algorithm outperformed the other machine learning algorithms, with an accuracy of 97.92%.
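
For readers unfamiliar with the Bagging algorithm named in the abstract, the following is a minimal sketch of the general technique (bootstrap aggregating with majority voting). It is not the authors' pipeline: the toy English token lists, the label names, the naive Bayes base learner, and all function names are assumptions made for illustration, and the article's actual features come from its ontology learning stage, which is not described on this page.

```python
import math
import random
from collections import Counter, defaultdict

def train_naive_bayes(docs, labels):
    """Fit a tiny multinomial naive Bayes model (assumed base learner)."""
    token_counts = defaultdict(Counter)  # label -> token frequencies
    label_counts = Counter(labels)       # label -> number of documents
    vocab = set()
    for tokens, label in zip(docs, labels):
        token_counts[label].update(tokens)
        vocab.update(tokens)
    return token_counts, label_counts, vocab

def predict_naive_bayes(model, tokens):
    """Return the label with the highest Laplace-smoothed log-probability."""
    token_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        counts = token_counts[label]
        denom = sum(counts.values()) + len(vocab) + 1
        score = math.log(label_counts[label] / total_docs)
        for tok in tokens:
            score += math.log((counts[tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def train_bagging(docs, labels, n_estimators=15, seed=0):
    """Train one base learner per bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    n = len(docs)
    models = []
    for _ in range(n_estimators):
        idx = [rng.randrange(n) for _ in range(n)]
        sample_docs = [docs[i] for i in idx]
        sample_labels = [labels[i] for i in idx]
        models.append(train_naive_bayes(sample_docs, sample_labels))
    return models

def predict_bagging(models, tokens):
    """Aggregate the base learners' predictions by majority vote."""
    votes = Counter(predict_naive_bayes(m, tokens) for m in models)
    return votes.most_common(1)[0][0]

# Hypothetical toy corpus: pre-tokenized documents with two made-up classes.
DOCS = [
    ["match", "goal", "team"], ["team", "coach", "league"],
    ["goal", "league", "player"], ["player", "coach", "match"],
    ["market", "bank", "stock"], ["stock", "price", "investor"],
    ["bank", "loan", "price"], ["investor", "market", "loan"],
]
LABELS = ["sports"] * 4 + ["economy"] * 4

models = train_bagging(DOCS, LABELS)
sports_guess = predict_bagging(models, ["goal", "team", "coach"])
economy_guess = predict_bagging(models, ["bank", "stock", "investor"])
```

Each base learner sees a different resampled view of the training set, so the vote tends to smooth out the variance of any single model. In practice a library implementation with a stronger base learner would likely replace this hand-rolled version.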


Metadata
Title
An enrichment multi-layer Arabic text classification model based on siblings patterns extraction
Authors
Amira M. Idrees
Abdul Lateef Marzouq Al-Solami
Publication date
15-03-2024
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 14/2024
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-023-09405-z
