Skip to main content
Erschienen in: Discover Computing 1/2008

01.02.2008

Nearest-biclusters collaborative filtering based on constant and coherent values

verfasst von: Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Yannis Manolopoulos

Erschienen in: Discover Computing | Ausgabe 1/2008

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Collaborative Filtering (CF) Systems have been studied extensively for more than a decade to confront the “information overload” problem. Nearest-neighbor CF is based either on similarities between users or between items, to form a neighborhood of users or items, respectively. Recent research has tried to combine the two aforementioned approaches to improve effectiveness. Traditional clustering approaches (k-means or hierarchical clustering) has been also used to speed up the recommendation process. In this paper, we use biclustering to disclose this duality between users and items, by grouping them in both dimensions simultaneously. We propose a novel nearest-biclusters algorithm, which uses a new similarity measure that achieves partial matching of users’ preferences. We apply nearest-biclusters in combination with two different types of biclustering algorithms—Bimax and xMotif—for constant and coherent biclustering, respectively. Extensive performance evaluation results in three real-life data sets are provided, which show that the proposed method improves substantially the performance of the CF process.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Since, in its off-line part, IB learns relationships between items according a model, it could be considered as a model-based algorithm as well.
 
2
For implementation issues, we use the Bimax and xMotif biclustering algorithms, however any other algorithm can be used equally well, as our approach is independent of the specific biclustering algorithm that is used.
 
3
In future work we plan to investigate the role of negatively rated items.
 
Literatur
Zurück zum Zitat Balabanovic, M., & Fab, S. Y. (1997). Content-based, collaborative recommendation. ACM Communications, 40(3), 66–72.CrossRef Balabanovic, M., & Fab, S. Y. (1997). Content-based, collaborative recommendation. ACM Communications, 40(3), 66–72.CrossRef
Zurück zum Zitat Barkow, S., Bleuler, S., Prelic, A., Zimmermann, P., & Zitzler, E. (2006). BicAT: A biclustering analysis toolbox. Bioinformatics, 22(10), 1282–1283. Barkow, S., Bleuler, S., Prelic, A., Zimmermann, P., & Zitzler, E. (2006). BicAT: A biclustering analysis toolbox. Bioinformatics, 22(10), 1282–1283.
Zurück zum Zitat Breese, J., Heckerman, D., & Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Uncertainty in Artificial Intelligence Conference (pp. 43–52). Breese, J., Heckerman, D., & Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the Uncertainty in Artificial Intelligence Conference (pp. 43–52).
Zurück zum Zitat Cheng, Y., & Church, G. (2000). Biclustering of expression data. In Proceedings of the ISMB Conference (pp. 93–103). Cheng, Y., & Church, G. (2000). Biclustering of expression data. In Proceedings of the ISMB Conference (pp. 93–103).
Zurück zum Zitat Deshpande, M., & Karypis, G. (2004). Item-based top-n recommendation algorithms. ACM Transactions on Information Systems, 22(1), 143–177.CrossRef Deshpande, M., & Karypis, G. (2004). Item-based top-n recommendation algorithms. ACM Transactions on Information Systems, 22(1), 143–177.CrossRef
Zurück zum Zitat Dhillon, I. S. (2001). Co-clustering documents and words using bipartite spectral graph partitioning. In Proceedings of the ACM SIGKDD Conference. Dhillon, I. S. (2001). Co-clustering documents and words using bipartite spectral graph partitioning. In Proceedings of the ACM SIGKDD Conference.
Zurück zum Zitat Dhillon, I. S., & Mallela, D. S., & Modha, S. (2003). Information theoretic co-clustering. In Proceedings of the ACM SIGKDD Conference. Dhillon, I. S., & Mallela, D. S., & Modha, S. (2003). Information theoretic co-clustering. In Proceedings of the ACM SIGKDD Conference.
Zurück zum Zitat George, T., & Merugu, S. (2005). A scalable collaborative filtering framework based on co-clustering. In Proceedings of the IEEE ICDM Conference. George, T., & Merugu, S. (2005). A scalable collaborative filtering framework based on co-clustering. In Proceedings of the IEEE ICDM Conference.
Zurück zum Zitat Hartigan, J. A. (1972). Direct clustering of a data matrix. Journal of the American Statistical Association, 67(337), 123–129.CrossRef Hartigan, J. A. (1972). Direct clustering of a data matrix. Journal of the American Statistical Association, 67(337), 123–129.CrossRef
Zurück zum Zitat Herlocker, J., Konstan, J., Borchers, A., & Riedl, J. (1999). An algorithmic framework for performing collaborative filtering. In Proceedings of the ACM SIGIR Conference (pp. 230–237). Herlocker, J., Konstan, J., Borchers, A., & Riedl, J. (1999). An algorithmic framework for performing collaborative filtering. In Proceedings of the ACM SIGIR Conference (pp. 230–237).
Zurück zum Zitat Herlocker, J., Konstan, J., & Riedl, J. (2002). An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms. Information Retrieval, 5(4), 287–310.CrossRef Herlocker, J., Konstan, J., & Riedl, J. (2002). An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms. Information Retrieval, 5(4), 287–310.CrossRef
Zurück zum Zitat Herlocker, J., Konstan, J., Terveen, L., & Riedl, J. (2004). Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 22(1), 5–53.CrossRef Herlocker, J., Konstan, J., Terveen, L., & Riedl, J. (2004). Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 22(1), 5–53.CrossRef
Zurück zum Zitat Hofmann, T. (2004). Latent semantic models for collaborative filtering. ACM Transactions on Information Systems, 22(1), 89–115.CrossRef Hofmann, T. (2004). Latent semantic models for collaborative filtering. ACM Transactions on Information Systems, 22(1), 89–115.CrossRef
Zurück zum Zitat Hofmann, T., & Puzicha, J. (1999). Latent class models for collaborative filtering. In Proceedings of the IJCAI Conference. Hofmann, T., & Puzicha, J. (1999). Latent class models for collaborative filtering. In Proceedings of the IJCAI Conference.
Zurück zum Zitat Jin, R., Si, L., & Zhai, C. (2006). A study of mixture models for collaborative filtering. Information Retrieval, 9(3), 357–382.CrossRef Jin, R., Si, L., & Zhai, C. (2006). A study of mixture models for collaborative filtering. Information Retrieval, 9(3), 357–382.CrossRef
Zurück zum Zitat Karypis, G. (2001). Evaluation of item-based top-n recommendation algorithms. In Proceedings of the ACM CIKM Conference (pp. 247–254). Karypis, G. (2001). Evaluation of item-based top-n recommendation algorithms. In Proceedings of the ACM CIKM Conference (pp. 247–254).
Zurück zum Zitat Kohrs, A., & Merialdo, B. (1998). Clustering for collaborative filtering applications. In Proceedings of the CIMKA Conference. Kohrs, A., & Merialdo, B. (1998). Clustering for collaborative filtering applications. In Proceedings of the CIMKA Conference.
Zurück zum Zitat Lemire, D., & Maclachlan, A. (2005). Slope one predictors for online rating-based collaborative filtering. In Proceedings of SIAM Data Mining Conference. Lemire, D., & Maclachlan, A. (2005). Slope one predictors for online rating-based collaborative filtering. In Proceedings of SIAM Data Mining Conference.
Zurück zum Zitat Long, B., Zhangm, Z., & Yu, P. S. (2005). A formal statistical approach to collaborative filtering. In Proceedings of the ACM SIGKDD Conference. Long, B., Zhangm, Z., & Yu, P. S. (2005). A formal statistical approach to collaborative filtering. In Proceedings of the ACM SIGKDD Conference.
Zurück zum Zitat Madeira, S., & Oliveira, A. (2004). Biclustering algorithms for biological data analysis: A survey. ACM Transactions on Computational Biology and Bioinformatics, 1, 24–45.CrossRef Madeira, S., & Oliveira, A. (2004). Biclustering algorithms for biological data analysis: A survey. ACM Transactions on Computational Biology and Bioinformatics, 1, 24–45.CrossRef
Zurück zum Zitat McLauglin, R., & Herlocher, J. (2004). A collaborative filtering algorithm and evaluation metric that accurately model the user experience. In Proceedings of the ACM SIGIR Conference (pp. 329–336). McLauglin, R., & Herlocher, J. (2004). A collaborative filtering algorithm and evaluation metric that accurately model the user experience. In Proceedings of the ACM SIGIR Conference (pp. 329–336).
Zurück zum Zitat Melville, P., Mooney, R. J., & Nagarajan, R. (2002). Content-boosted collaborative filtering for improved recommendations. In Proc. AAAI conf. (pp. 187–192). Melville, P., Mooney, R. J., & Nagarajan, R. (2002). Content-boosted collaborative filtering for improved recommendations. In Proc. AAAI conf. (pp. 187–192).
Zurück zum Zitat Mirkin, B. (1996). Mathematical classification and clustering. Kluwer Academic Publishers: Dordrecht.MATH Mirkin, B. (1996). Mathematical classification and clustering. Kluwer Academic Publishers: Dordrecht.MATH
Zurück zum Zitat Mobasher, B., Dai, H., Luo, T., & Nakagawa, M. (2001). Improving the effectiveness of collaborative filtering on anonymous web usage data. In Proceedings of the Workshop Intelligent Techniques for Web Personalization (pp. 53–60). Mobasher, B., Dai, H., Luo, T., & Nakagawa, M. (2001). Improving the effectiveness of collaborative filtering on anonymous web usage data. In Proceedings of the Workshop Intelligent Techniques for Web Personalization (pp. 53–60).
Zurück zum Zitat Murali, T., & Kasif, S. (2003). Extracting conserved gene expression motifs from gene expression data. In Proceedings of the Pacific Symposium on Biocompomputing Conference (Vol. 8, pp. 77–88). Murali, T., & Kasif, S. (2003). Extracting conserved gene expression motifs from gene expression data. In Proceedings of the Pacific Symposium on Biocompomputing Conference (Vol. 8, pp. 77–88).
Zurück zum Zitat Prelic, A., et al. (2006). A systematic comparison and evaluation of biclustering methods for gene expression data. Bioinformatics, 22(9), 1122–1129. Prelic, A., et al. (2006). A systematic comparison and evaluation of biclustering methods for gene expression data. Bioinformatics, 22(9), 1122–1129.
Zurück zum Zitat Resnick, P., Iacovou, N., Suchak, M., Bergstrom, P., & Riedl, J. (1994). Grouplens: An open architecture for collaborative filtering on netnews. In Proceedings of the Computer Supported Collaborative Work Conference (pp. 175–186). Resnick, P., Iacovou, N., Suchak, M., Bergstrom, P., & Riedl, J. (1994). Grouplens: An open architecture for collaborative filtering on netnews. In Proceedings of the Computer Supported Collaborative Work Conference (pp. 175–186).
Zurück zum Zitat Salter, J., & Antonopoulos, N. (2006). Cinemascreen recommender agent: Combining collaborative and content-based filtering. Intelligent Systems Magazine, 21(1), 35–41. Salter, J., & Antonopoulos, N. (2006). Cinemascreen recommender agent: Combining collaborative and content-based filtering. Intelligent Systems Magazine, 21(1), 35–41.
Zurück zum Zitat Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2000). Analysis of recommendation algorithms for e-commerce. In Proceedings of the ACM Electronic Commerce Conference (pp. 158–167). Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2000). Analysis of recommendation algorithms for e-commerce. In Proceedings of the ACM Electronic Commerce Conference (pp. 158–167).
Zurück zum Zitat Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In Proceedings of the WWW Conference (pp. 285–295). Sarwar, B., Karypis, G., Konstan, J., & Riedl, J. (2001). Item-based collaborative filtering recommendation algorithms. In Proceedings of the WWW Conference (pp. 285–295).
Zurück zum Zitat Shafiei, M., & Milios, E. (2005). Model-based overlapping co-clustering. In Proceedings of the IEEE SDM Conference. Shafiei, M., & Milios, E. (2005). Model-based overlapping co-clustering. In Proceedings of the IEEE SDM Conference.
Zurück zum Zitat Symeonidis, P., Nanopoulos, A., Papadopoulos, A., & Manolopoulos, Y. (2006). Collaborative filtering process in a whole new light. In Proc. IDEAS conf. (pp. 29–36). Symeonidis, P., Nanopoulos, A., Papadopoulos, A., & Manolopoulos, Y. (2006). Collaborative filtering process in a whole new light. In Proc. IDEAS conf. (pp. 29–36).
Zurück zum Zitat Ungar, L., & Foster, D. (1998). A formal statistical approach to collaborative filtering. In Proceedings of the CONALD Conference. Ungar, L., & Foster, D. (1998). A formal statistical approach to collaborative filtering. In Proceedings of the CONALD Conference.
Zurück zum Zitat Wang, J., Vries, A., & Reinders, M. (2006). Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In Proceedings of the SIGIR Conference (pp. 501–508). Wang, J., Vries, A., & Reinders, M. (2006). Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In Proceedings of the SIGIR Conference (pp. 501–508).
Zurück zum Zitat Xue, G., Lin, C., & Yang, Q., et al. (2005). Scalable collaborative filtering using cluster-based smoothing. In Proceedings of the ACM SIGIR Conference (pp. 114–121). Xue, G., Lin, C., & Yang, Q., et al. (2005). Scalable collaborative filtering using cluster-based smoothing. In Proceedings of the ACM SIGIR Conference (pp. 114–121).
Metadaten
Titel
Nearest-biclusters collaborative filtering based on constant and coherent values
verfasst von
Panagiotis Symeonidis
Alexandros Nanopoulos
Apostolos N. Papadopoulos
Yannis Manolopoulos
Publikationsdatum
01.02.2008
Verlag
Springer Netherlands
Erschienen in
Discover Computing / Ausgabe 1/2008
Print ISSN: 2948-2984
Elektronische ISSN: 2948-2992
DOI
https://doi.org/10.1007/s10791-007-9038-4

Premium Partner