Published in: Knowledge and Information Systems 4/2024

22-12-2023 | Regular Paper

Integrating semantic similarity with Dirichlet multinomial mixture model for enhanced web service clustering

Authors: Neha Agarwal, Geeta Sikka, Lalit Kumar Awasthi



Abstract

With the rapid advancement of Web 2.0, developers generally describe the functionality of services in short natural-language text. Keyword-based search is not an efficient way of discovering services in repositories because it suffers from vocabulary problems. Latent Dirichlet allocation (LDA) combined with word embedding techniques is widely adopted for extracting latent features from service descriptions. However, LDA is not effective on short text because of the limited content and sparse word occurrences. The word vectors generated by word embedding techniques are of finer quality than those produced by topic modeling techniques. The Gibbs sampling algorithm for the Dirichlet multinomial mixture (GSDMM) model gives better results on web service description documents because it assigns exactly one topic to each document. In this paper, we evaluate the performance of the GSDMM model with word embeddings and propose the WV+GSDMMK model. The proposed model improves service-to-topic mapping by determining semantic similarity among features. K-means clustering is then applied to the service-to-topic representation. Results are evaluated on five real-time datasets using intrinsic and extrinsic evaluation measures. Experimental results demonstrate that the proposed method outperforms the baseline techniques, increasing the accuracy score by 5%, 18%, 3%, 4%, and 6% on datasets DS1, DS2, DS3, DS4, and DS5, respectively.
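To make the "one topic per document" property of GSDMM concrete, the following is a minimal sketch of collapsed Gibbs sampling for a Dirichlet multinomial mixture, written from the standard formulation of the model rather than from the paper's implementation. The function name `gsdmm`, the hyperparameter defaults, and the toy service descriptions are all assumptions made for illustration. The semantic-similarity refinement and the K-means step of WV+GSDMMK are not reproduced here, since the abstract does not specify them.

```python
import math
import random
from collections import Counter


def gsdmm(docs, K=8, alpha=0.1, beta=0.1, iters=30, seed=0):
    """Illustrative GSDMM sampler (hypothetical helper, not the authors' code).

    docs: list of token lists. Returns exactly one cluster label per document.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    m = [0] * K                          # number of documents in each cluster
    n = [0] * K                          # number of tokens in each cluster
    nw = [Counter() for _ in range(K)]   # per-cluster word counts
    z = []                               # current cluster of each document

    def move(doc, k, sign):
        # Add (sign=+1) or remove (sign=-1) a document from cluster k.
        m[k] += sign
        n[k] += sign * len(doc)
        for w in doc:
            nw[k][w] += sign

    # Random initial assignment.
    for doc in docs:
        k = rng.randrange(K)
        z.append(k)
        move(doc, k, +1)

    for _ in range(iters):
        for i, doc in enumerate(docs):
            move(doc, z[i], -1)          # exclude the document being resampled
            counts = Counter(doc)
            logp = []
            for k in range(K):
                # Cluster popularity term (normalising constant omitted).
                lp = math.log(m[k] + alpha)
                # Word-match term: rewards clusters that already use these words.
                for w, c in counts.items():
                    for j in range(c):
                        lp += math.log(nw[k][w] + beta + j)
                # Length normalisation over the cluster's total token count.
                for j in range(len(doc)):
                    lp -= math.log(n[k] + vocab_size * beta + j)
                logp.append(lp)
            top = max(logp)
            weights = [math.exp(lp - top) for lp in logp]
            z[i] = rng.choices(range(K), weights=weights)[0]
            move(doc, z[i], +1)
    return z


# Made-up, pre-tokenised service descriptions for illustration only.
docs = [
    ["weather", "forecast", "temperature"],
    ["forecast", "rain", "weather"],
    ["payment", "card", "invoice"],
    ["invoice", "billing", "payment"],
]
labels = gsdmm(docs, K=4, seed=0)
print(labels)
```

Because each document receives a single label, the output is a hard service-to-topic assignment. A downstream step such as K-means would need a vector per service, which is where a representation built from topic assignments and word embeddings would come in.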


Metadata

Title: Integrating semantic similarity with Dirichlet multinomial mixture model for enhanced web service clustering
Authors: Neha Agarwal, Geeta Sikka, Lalit Kumar Awasthi
Publication date: 22-12-2023
Publisher: Springer London
Published in: Knowledge and Information Systems | Issue 4/2024
Print ISSN: 0219-1377
Electronic ISSN: 0219-3116
DOI: https://doi.org/10.1007/s10115-023-02034-x
