Skip to main content
Top

2016 | OriginalPaper | Chapter

Implementing and Evaluating Collaborative Filtering (CF) Using Clustering

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A tremendous increase has taken place in the amount of online content. As a result, by using traditional approaches, service-relevant data becomes too big to be effectively processed. In order to solve this problem, an approach called clustering based collaborative filtering (CF) is proposed in this paper. Its objective is to recommend services collaboratively in the same clusters. It is a very successful approach in such settings where interaction can be done between data analysis and querying. However the large systems which have large data and users, the collaboration are many times delayed due to unrealistic runtimes. The proposed approach works in two stages. First, the services which are available are divided into small clusters for processing and then collaborative filtering algorithm is used in second stage on one of the clusters. It is estimated to decrease the online execution time of collaborative filtering algorithm because the number of the services in a cluster is much less than the entire services available on the web.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Wu, X., Wu, G., Zhu, X.: Data mining with big data. IEEE Trans. Knowl. Data Eng. 26(1), 97–107 (January 2014) Wu, X., Wu, G., Zhu, X.: Data mining with big data. IEEE Trans. Knowl. Data Eng. 26(1), 97–107 (January 2014)
2.
go back to reference Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets. University Press of Cambridge Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets. University Press of Cambridge
3.
go back to reference Bellogín, A., Díez, F., Cantador, I.: An empirical comparison of social, collaborative filtering (CF), hybrid recommender. ACM Trans. Intell. Syst. Tech. 4(1), 1–37 (January 2013) Bellogín, A., Díez, F., Cantador, I.: An empirical comparison of social, collaborative filtering (CF), hybrid recommender. ACM Trans. Intell. Syst. Tech. 4(1), 1–37 (January 2013)
4.
go back to reference Zeng, W., Zhang, Q., Shang, M.: Can dissimilar users contribute to accuracy and diversity of personalized recommendation? IJMPC 21(10), 1217–1227 (June 2010) Zeng, W., Zhang, Q., Shang, M.: Can dissimilar users contribute to accuracy and diversity of personalized recommendation? IJMPC 21(10), 1217–1227 (June 2010)
5.
go back to reference Havens, T.C., Hall, L.O., Leckie, C., Palaniswami, M.: Fuzzy c-means algorithm for very large data. IEEE Trans. Fuzzy Syst. 20(6), 1130–1146 (December 2012) Havens, T.C., Hall, L.O., Leckie, C., Palaniswami, M.: Fuzzy c-means algorithm for very large data. IEEE Trans. Fuzzy Syst. 20(6), 1130–1146 (December 2012)
6.
go back to reference Liu, Z., Zheng, Y., Li, P.: Clustering to find exemplar terms for key-phrase extraction. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, pp. 257–266, May 2009 Liu, Z., Zheng, Y., Li, P.: Clustering to find exemplar terms for key-phrase extraction. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, pp. 257–266, May 2009
7.
go back to reference Rodriguez, A., Chaovalitwongse, W., Zhe, L.: Master defect record retrieval using network based feature ass. IEEE Trans. Syst. Man Cybern. App. Rev. 40(3), 319–329 (October 2010) Rodriguez, A., Chaovalitwongse, W., Zhe, L.: Master defect record retrieval using network based feature ass. IEEE Trans. Syst. Man Cybern. App. Rev. 40(3), 319–329 (October 2010)
8.
go back to reference Adomavicis, G., Zhang, J.: Stability of recommendation algorithms. ACM Trans. Inf. Syst. 30(4), 23:1–23:31 (August 2012) Adomavicis, G., Zhang, J.: Stability of recommendation algorithms. ACM Trans. Inf. Syst. 30(4), 23:1–23:31 (August 2012)
9.
go back to reference Liu, X., Mei, H., Huang, G.: Discovering homogeneous web services community in the user centric web environment. IEEE Trans. Serv. Comput. 2(2), 167–181 Liu, X., Mei, H., Huang, G.: Discovering homogeneous web services community in the user centric web environment. IEEE Trans. Serv. Comput. 2(2), 167–181
10.
go back to reference Li, H.H., Tian, X., Du, X.Y.: A review based reputation evaluation approach for web services. Int. J. Comput. Sci. Tech. 249(5), 893–900 (Sep 2009) Li, H.H., Tian, X., Du, X.Y.: A review based reputation evaluation approach for web services. Int. J. Comput. Sci. Tech. 249(5), 893–900 (Sep 2009)
11.
go back to reference Zielinnski, K., Szydlo, T., Szymacha, R.: Adaptive soa solution stack. IEEE Trans. Serv. Comput. 5(2), 149–163 (April-June 2012) Zielinnski, K., Szydlo, T., Szymacha, R.: Adaptive soa solution stack. IEEE Trans. Serv. Comput. 5(2), 149–163 (April-June 2012)
12.
go back to reference Shafer, J., Rixner, S.T., Cox, A.: The hadoop distributed file system (HDFS): balancing portability and performance. IEEE Int. Symp. Perform. Anal. Syst. S\W. doi: 10.1109/ISPASS.2010.5452045. pp. 122–133, 28–30 March 2010 Shafer, J., Rixner, S.T., Cox, A.: The hadoop distributed file system (HDFS): balancing portability and performance. IEEE Int. Symp. Perform. Anal. Syst. S\W. doi: 10.​1109/​ISPASS.​2010.​5452045. pp. 122–133, 28–30 March 2010
13.
go back to reference Kirankumar, R., Vijayakumari, R., Gangadhara, R.K.: Comparative analysis of google file system and hadoop distributed file system. IJAT CSE. 3(1), 24–25 (Feb 2014) Kirankumar, R., Vijayakumari, R., Gangadhara, R.K.: Comparative analysis of google file system and hadoop distributed file system. IJAT CSE. 3(1), 24–25 (Feb 2014)
15.
go back to reference Li, M.J., Cheung, Y., Ng, M.: Agglomerative fuzzy k means clustering algorithm with selection of number of clusters. IEEE Trans. Knowl. Data Eng. 20(11), 1519–1534 (November 2008) Li, M.J., Cheung, Y., Ng, M.: Agglomerative fuzzy k means clustering algorithm with selection of number of clusters. IEEE Trans. Knowl. Data Eng. 20(11), 1519–1534 (November 2008)
16.
go back to reference Zhao, Y., Fayyad, U., Karypis, G.: Hierarchical clustering algorithms for document datasets. Data Min. Knowl. Discov. 10(2), 141–168 (November 2005) Zhao, Y., Fayyad, U., Karypis, G.: Hierarchical clustering algorithms for document datasets. Data Min. Knowl. Discov. 10(2), 141–168 (November 2005)
17.
go back to reference Platzer, C., Dustdar, S., Rosenberg, F.: Web service clustering using multi-dimensional angle as proximity measures. ACM Trans. Internet Tech. 9(3), 11:1–11:26 (July 2009) Platzer, C., Dustdar, S., Rosenberg, F.: Web service clustering using multi-dimensional angle as proximity measures. ACM Trans. Internet Tech. 9(3), 11:1–11:26 (July 2009)
18.
go back to reference Taherian, T.F., Niknam, T., Pourjafarian, N.: An efficient algorithm based on modified imperialist competitive algorithm & K means for data clustering. Eng. App. Artif. Intell. 24(2), 306–317 (March 2011) Taherian, T.F., Niknam, T., Pourjafarian, N.: An efficient algorithm based on modified imperialist competitive algorithm & K means for data clustering. Eng. App. Artif. Intell. 24(2), 306–317 (March 2011)
19.
go back to reference Thilagavathi, G., Aparna, N., Srivaishnavi, D.: A survey on efficient hierarchical algorithm used in clustering. Int. J. Eng. 2(9), 306–317 (Sep 2013) Thilagavathi, G., Aparna, N., Srivaishnavi, D.: A survey on efficient hierarchical algorithm used in clustering. Int. J. Eng. 2(9), 306–317 (Sep 2013)
20.
go back to reference Julie, D., Kumar, K.: Optimal web service selection scheme with dynamic QoS property assignment. IJART. 2(2), 69–75 (May 2012) Julie, D., Kumar, K.: Optimal web service selection scheme with dynamic QoS property assignment. IJART. 2(2), 69–75 (May 2012)
21.
go back to reference Wu, J., Chen, L., Feng, Y.: Predicting quality of service for selection by neighborhood based collaborative filtering (CF). IEEE Trans. Syst. Man Cybern. Syst. 43(2), 428–439 (March 2013) Wu, J., Chen, L., Feng, Y.: Predicting quality of service for selection by neighborhood based collaborative filtering (CF). IEEE Trans. Syst. Man Cybern. Syst. 43(2), 428–439 (March 2013)
Metadata
Title
Implementing and Evaluating Collaborative Filtering (CF) Using Clustering
Authors
Sachin S. Agrawal
Ganjendra R. Bamnote
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-30927-9_16

Premium Partner