Skip to main content
Top
Published in: Soft Computing 6/2010

01-04-2010 | Original Paper

Multiobjective evolutionary clustering of Web user sessions: a case study in Web page recommendation

Authors: G. Nildem Demir, A. Şima Uyar, Şule Gündüz-Öğüdücü

Published in: Soft Computing | Issue 6/2010

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this study, we experiment with several multiobjective evolutionary algorithms to determine a suitable approach for clustering Web user sessions, which consist of sequences of Web pages visited by the users. Our experimental results show that the multiobjective evolutionary algorithm-based approaches are successful for sequence clustering. We look at a commonly used cluster validity index to verify our findings. The results for this index indicate that the clustering solutions are of high quality. As a case study, the obtained clusters are then used in a Web recommender system for representing usage patterns. As a result of the experiments, we see that these approaches can successfully be applied for generating clustering solutions that lead to a high recommendation accuracy in the recommender model we used in this paper.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
go back to reference Bleuler S, Laumanns M, Thiele L, Zitzler E (2003) PISA—a platform and programming language independent interface for search algorithms. In: Proceeding of evolutionary multi-criterion optimization (EMO 2003). Lecture notes in computer science, vol 2632. Springer, Berlin, pp 494–508 Bleuler S, Laumanns M, Thiele L, Zitzler E (2003) PISA—a platform and programming language independent interface for search algorithms. In: Proceeding of evolutionary multi-criterion optimization (EMO 2003). Lecture notes in computer science, vol 2632. Springer, Berlin, pp 494–508
go back to reference Branke J, Deb K, Dierolf H, Osswald M (2004) Finding knees in multi-objective optimization. In: Proceedings of the parallel problem solving from nature (PPSN 2004), pp 722–731 Branke J, Deb K, Dierolf H, Osswald M (2004) Finding knees in multi-objective optimization. In: Proceedings of the parallel problem solving from nature (PPSN 2004), pp 722–731
go back to reference Charter K, Schaeffer J, Szafron D (2000) Sequence alignment using FastLSA. In: Proceedings of international conference on mathematics and engineering techniques in medicine and biological sciences, pp 48–57 Charter K, Schaeffer J, Szafron D (2000) Sequence alignment using FastLSA. In: Proceedings of international conference on mathematics and engineering techniques in medicine and biological sciences, pp 48–57
go back to reference Cheng C-K, Wei YA (1991) An improved two-way partitioning algorithm with stable performance. IEEE Trans Comput-Aided Design Integr Circuits Syst 10:1502–1511CrossRef Cheng C-K, Wei YA (1991) An improved two-way partitioning algorithm with stable performance. IEEE Trans Comput-Aided Design Integr Circuits Syst 10:1502–1511CrossRef
go back to reference Coello Coello CA, Lamont GB, Van Veldhuizen DA (2007) Evolutionary algorithms for solving multi-objective problems, 2nd edn. Springer, Berlin Coello Coello CA, Lamont GB, Van Veldhuizen DA (2007) Evolutionary algorithms for solving multi-objective problems, 2nd edn. Springer, Berlin
go back to reference Cole RM (1998) Clustering with genetic algorithms. Master’s thesis, University of Western Australia, Nedlands 6907, Australia Cole RM (1998) Clustering with genetic algorithms. Master’s thesis, University of Western Australia, Nedlands 6907, Australia
go back to reference Conover W (1999) Practical nonparametric statistics, 3rd edn. Wiley, New York Conover W (1999) Practical nonparametric statistics, 3rd edn. Wiley, New York
go back to reference Cooley R, Mobasher B, Srivastava J (1999) Data preparation for mining world wide web browsing patterns. J Knowl Inform Syst 1(1):5–32 Cooley R, Mobasher B, Srivastava J (1999) Data preparation for mining world wide web browsing patterns. J Knowl Inform Syst 1(1):5–32
go back to reference Corne DW, Jerram NR, Knowles JD, Oates MJ (2001) PESA-II: region based selection in evolutionary multiobjective optimization. In: Proceedings of the genetic and evolutionary computation conference (GECCO-2001). Morgan Kaufmann, Menlo Park, pp 283–290 Corne DW, Jerram NR, Knowles JD, Oates MJ (2001) PESA-II: region based selection in evolutionary multiobjective optimization. In: Proceedings of the genetic and evolutionary computation conference (GECCO-2001). Morgan Kaufmann, Menlo Park, pp 283–290
go back to reference Davies D, Bouldin D (1979) A cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1(2):224–227CrossRef Davies D, Bouldin D (1979) A cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1(2):224–227CrossRef
go back to reference Deb K, Pratab A, Agrawal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2):182–197CrossRef Deb K, Pratab A, Agrawal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2):182–197CrossRef
go back to reference Demir GN, Uyar AŞ, Gündüz Ögüdücü Ş (2007) Graph-based sequence clustering through multiobjective evolutionary algorithms for web recommender systems. In: Proceedings of the genetic and evolutionary computation conference (GECCO-2007). ACM, New York, pp 1943–1950 Demir GN, Uyar AŞ, Gündüz Ögüdücü Ş (2007) Graph-based sequence clustering through multiobjective evolutionary algorithms for web recommender systems. In: Proceedings of the genetic and evolutionary computation conference (GECCO-2007). ACM, New York, pp 1943–1950
go back to reference Demir GN, Göksedef M, Uyar AŞ (2007) Effects of session representation models on the performance of web recommender systems. In: Proceedings of the workshop on data mining and business intelligence, pp 931–936 Demir GN, Göksedef M, Uyar AŞ (2007) Effects of session representation models on the performance of web recommender systems. In: Proceedings of the workshop on data mining and business intelligence, pp 931–936
go back to reference Ding C, Xiaofeng H, Hongyuan Z, Ming G, Simon H (2001) A min-max cut algorithm for graph partitioning and data clustering. In Proceedings of the IEEE international conference on data mining, pp 107–114 Ding C, Xiaofeng H, Hongyuan Z, Ming G, Simon H (2001) A min-max cut algorithm for graph partitioning and data clustering. In Proceedings of the IEEE international conference on data mining, pp 107–114
go back to reference Du J, Korkmaz E, Alhajj R, Barker K (2004) Novel clustering approach that employs genetic algorithm with new representation scheme and multiple objectives. In: Proceedings of the 6th international conference on data warehousing and knowledge discovery (DAWAK 2004). Lecture notes in computer science, vol 3181. Springer, Berlin, pp 219–233 Du J, Korkmaz E, Alhajj R, Barker K (2004) Novel clustering approach that employs genetic algorithm with new representation scheme and multiple objectives. In: Proceedings of the 6th international conference on data warehousing and knowledge discovery (DAWAK 2004). Lecture notes in computer science, vol 3181. Springer, Berlin, pp 219–233
go back to reference Eiben AE, Smith JE (2003) Introduction to evolutionary computing. Springer, Berlin Eiben AE, Smith JE (2003) Introduction to evolutionary computing. Springer, Berlin
go back to reference Faceli K, de Carvalho ACPLF, de Souto MCP (2007) Multi-objective clustering ensemble. Int J Hybrid Intell Syst 4(3):145–156MATH Faceli K, de Carvalho ACPLF, de Souto MCP (2007) Multi-objective clustering ensemble. Int J Hybrid Intell Syst 4(3):145–156MATH
go back to reference Garcia S, Molina D, Lozano M, Herrera F (2008) A study on the use of nonparametric tests for analyzing the evolutionary algorithms’ behaviour: a case study on the cec 2005 special session on real parameter optimization. J Heuristics. doi:10.1007/s10732-008-9080-4 Garcia S, Molina D, Lozano M, Herrera F (2008) A study on the use of nonparametric tests for analyzing the evolutionary algorithms’ behaviour: a case study on the cec 2005 special session on real parameter optimization. J Heuristics. doi:10.​1007/​s10732-008-9080-4
go back to reference Göksedef M, Gündüz Ögüdücü Ş (2007) A consensus recommender for web users. In: Proceedings of the 3rd international conference on advanced data mining and applications. Lecture notes in artificial intelligence, vol 4632. Springer, Berlin, pp 287–299 Göksedef M, Gündüz Ögüdücü Ş (2007) A consensus recommender for web users. In: Proceedings of the 3rd international conference on advanced data mining and applications. Lecture notes in artificial intelligence, vol 4632. Springer, Berlin, pp 287–299
go back to reference Gündüz Ş, Özsu MT (2003) A web page prediction model based on click-stream tree representation of user behavior. In: Proceedings of ninth ACM international conference on knowledge discovery and data mining (KDD), pp 535–540 Gündüz Ş, Özsu MT (2003) A web page prediction model based on click-stream tree representation of user behavior. In: Proceedings of ninth ACM international conference on knowledge discovery and data mining (KDD), pp 535–540
go back to reference Gündüz Ş, Özsu MT (2006) Incremental click-stream tree model: learning from new users for web page prediction. Distributed Parallel Databases 19(1):5–27CrossRef Gündüz Ş, Özsu MT (2006) Incremental click-stream tree model: learning from new users for web page prediction. Distributed Parallel Databases 19(1):5–27CrossRef
go back to reference Gündüz Öğüdücü Ş, Uyar AŞ (2004) A graph based clustering method using a hybrid evolutionary algorithm. WSEAS Trans Math 3(3):731–736MathSciNet Gündüz Öğüdücü Ş, Uyar AŞ (2004) A graph based clustering method using a hybrid evolutionary algorithm. WSEAS Trans Math 3(3):731–736MathSciNet
go back to reference Günter S, Bunke H (2003) Validation indices for graph clustering. Pattern Recognit Lett 24(8):1107–1113MATHCrossRef Günter S, Bunke H (2003) Validation indices for graph clustering. Pattern Recognit Lett 24(8):1107–1113MATHCrossRef
go back to reference Handl J, Knowles J (2005) Multiobjective clustering around medoids. In: Proceedings of the congress on evolutionary computation (CEC-2005). IEEE, New York, pp 632–639 Handl J, Knowles J (2005) Multiobjective clustering around medoids. In: Proceedings of the congress on evolutionary computation (CEC-2005). IEEE, New York, pp 632–639
go back to reference Handl J, Knowles J (2007) An evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 11(1):56–76CrossRef Handl J, Knowles J (2007) An evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 11(1):56–76CrossRef
go back to reference Horn J, Nafpliotis N, Goldberg DE (1994) A niched pareto genetic algorithm for multiobjective optimization. In: Proceedings of the congress on evolutionary computation (CEC-1994). IEEE, New York, pp 82–87 Horn J, Nafpliotis N, Goldberg DE (1994) A niched pareto genetic algorithm for multiobjective optimization. In: Proceedings of the congress on evolutionary computation (CEC-1994). IEEE, New York, pp 82–87
go back to reference Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323CrossRef Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323CrossRef
go back to reference Kaufman L, Rousseeuw PJ (1987) Clustering by means of medoids. In: Statistical data analysis based on the L1 norm and related methods, pp 405–416 Kaufman L, Rousseeuw PJ (1987) Clustering by means of medoids. In: Statistical data analysis based on the L1 norm and related methods, pp 405–416
go back to reference Kim S (2003) Computational biology and genome informatics. World Scientific, Singapore Kim S (2003) Computational biology and genome informatics. World Scientific, Singapore
go back to reference Kim S, Lee J (2006) BAG: a graph theoretic sequence clustering algorithm. Int J Data Min Bioinform 1(2):178–200CrossRef Kim S, Lee J (2006) BAG: a graph theoretic sequence clustering algorithm. Int J Data Min Bioinform 1(2):178–200CrossRef
go back to reference Knowles J, Thiele L, Zitzler E (2006) A tutorial on the performance assessment of stochastic multiobjective optimizers. TIK Report 214, Computer Engineering and Networks Laboratory (TIK). ETH Zurich Knowles J, Thiele L, Zitzler E (2006) A tutorial on the performance assessment of stochastic multiobjective optimizers. TIK Report 214, Computer Engineering and Networks Laboratory (TIK). ETH Zurich
go back to reference Korkmaz E (2006) A two-level clustering method using linear linkage encoding. In: Proceedings of the parallel problem solving from nature (PPSN 2006). Lecture notes in computer science, vol 4193. Springer, Berlin, pp 681–690 Korkmaz E (2006) A two-level clustering method using linear linkage encoding. In: Proceedings of the parallel problem solving from nature (PPSN 2006). Lecture notes in computer science, vol 4193. Springer, Berlin, pp 681–690
go back to reference Kruskal WH, Wallis WA (1952) Use of ranks in one-criterion variance analysis. J Am Stat Assoc 47(260):583–621MATHCrossRef Kruskal WH, Wallis WA (1952) Use of ranks in one-criterion variance analysis. J Am Stat Assoc 47(260):583–621MATHCrossRef
go back to reference Law MHC, Topchy AP, Jain AK (2004) Multiobjective data clustering. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 424–430 Law MHC, Topchy AP, Jain AK (2004) Multiobjective data clustering. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 424–430
go back to reference Manouselis N, Costopoulou C (2007) Analysis and classification of multi-criteria recommender systems. World Wide Web 10(4):415–441CrossRef Manouselis N, Costopoulou C (2007) Analysis and classification of multi-criteria recommender systems. World Wide Web 10(4):415–441CrossRef
go back to reference Mobasher B, Dai H, Luo T, Nakagawa M (2002) Discovery of aggregate usage profiles for web personalization. Data Min Knowl Discov 6(1):61–82CrossRefMathSciNet Mobasher B, Dai H, Luo T, Nakagawa M (2002) Discovery of aggregate usage profiles for web personalization. Data Min Knowl Discov 6(1):61–82CrossRefMathSciNet
go back to reference Mohr G, Kimpton M, Stack M, Ranitovic I (2004) Introduction to Heritrix : an open source archival quality web crawler. In: Proceedings of the 4th international web archiving workshop Mohr G, Kimpton M, Stack M, Ranitovic I (2004) Introduction to Heritrix : an open source archival quality web crawler. In: Proceedings of the 4th international web archiving workshop
go back to reference Ozyer T, Liu Y, Alhajj R, Barker K (2004) Multi-objective genetic algorithm based clustering approach and its application to gene expression data. In: Proceedings of the advances in information systems (ADVIS 2004). Lecture notes in computer science, vol 3261. Springer, Berlin, pp 451–461 Ozyer T, Liu Y, Alhajj R, Barker K (2004) Multi-objective genetic algorithm based clustering approach and its application to gene expression data. In: Proceedings of the advances in information systems (ADVIS 2004). Lecture notes in computer science, vol 3261. Springer, Berlin, pp 451–461
go back to reference Park YJ, Song MS (1998) A genetic algorithm for clustering problems. In: Proceedings of the 3rd annual conference on genetic programming, pp 568–575 Park YJ, Song MS (1998) A genetic algorithm for clustering problems. In: Proceedings of the 3rd annual conference on genetic programming, pp 568–575
go back to reference Perugini S, Gonçalves MA, Fox EA (2004) Recommender systems research: a connection-centric survey. J Intell Inform Syst 23(2):107–143MATHCrossRef Perugini S, Gonçalves MA, Fox EA (2004) Recommender systems research: a connection-centric survey. J Intell Inform Syst 23(2):107–143MATHCrossRef
go back to reference Rosen KH (1991) Discrete mathematics and its applications, 2nd edn. McGraw-Hill, New York Rosen KH (1991) Discrete mathematics and its applications, 2nd edn. McGraw-Hill, New York
go back to reference Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20(1):53–65MATHCrossRef Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20(1):53–65MATHCrossRef
go back to reference Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Patterns Anal Mach Intell (PAMI) 22(8):888–905CrossRef Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Patterns Anal Mach Intell (PAMI) 22(8):888–905CrossRef
go back to reference Speer N, Spieth C, Zell A (2005) Biological cluster validity indices based on the gene ontology. In: Proceedings of advances in intelligent data anaylsis VI: 6th international symposium on intelligent data analysis (IDA 2005). Lecture notes in computer science, vol 3646. Springer, Berlin, pp 429–439 Speer N, Spieth C, Zell A (2005) Biological cluster validity indices based on the gene ontology. In: Proceedings of advances in intelligent data anaylsis VI: 6th international symposium on intelligent data analysis (IDA 2005). Lecture notes in computer science, vol 3646. Springer, Berlin, pp 429–439
go back to reference Srivastava J, Cooley R, Deshpande M, Tan P-N (2000) Web usage mining: Discovery and applications of usage patterns from web data. ACM SIGKDD Explor Newsl 1(2):12–23CrossRef Srivastava J, Cooley R, Deshpande M, Tan P-N (2000) Web usage mining: Discovery and applications of usage patterns from web data. ACM SIGKDD Explor Newsl 1(2):12–23CrossRef
go back to reference Strehl A, Ghosh J (2003) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3:583–617MATHCrossRefMathSciNet Strehl A, Ghosh J (2003) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3:583–617MATHCrossRefMathSciNet
go back to reference Uyar AŞ, Gündüz Öğüdücü Ş (2005) A new graph-based evolutionary approach to sequence clustering. In: Proceedings of fourth international conference of machine learning and applications, pp 273–278 Uyar AŞ, Gündüz Öğüdücü Ş (2005) A new graph-based evolutionary approach to sequence clustering. In: Proceedings of fourth international conference of machine learning and applications, pp 273–278
go back to reference Yan TW, Jacobsen M, Garcia-Molina H, Dayal U (1996) From user access patterns to dynamic hypertext linking. In: Proceedings of the fifth world wide web conference (WWW5), pp 1007–1014 Yan TW, Jacobsen M, Garcia-Molina H, Dayal U (1996) From user access patterns to dynamic hypertext linking. In: Proceedings of the fifth world wide web conference (WWW5), pp 1007–1014
go back to reference Zitzler E, Laumanns M, Thiele L (2001) SPEA2: improving the strength pareto evolutionary algorithm for multiobjective optimization. In: Proceedings of the EUROGEN 2001—evolutionary methods for design, optimisation and control with applications to industrial problems, pp 95–100 Zitzler E, Laumanns M, Thiele L (2001) SPEA2: improving the strength pareto evolutionary algorithm for multiobjective optimization. In: Proceedings of the EUROGEN 2001—evolutionary methods for design, optimisation and control with applications to industrial problems, pp 95–100
go back to reference Zitzler E, Thiele L (1999) Multiobjective evolutionary algorithms: a comparative case study and the strength pareto evolutionary algorithm. IEEE Trans Evol Comput 3(4):257–271CrossRef Zitzler E, Thiele L (1999) Multiobjective evolutionary algorithms: a comparative case study and the strength pareto evolutionary algorithm. IEEE Trans Evol Comput 3(4):257–271CrossRef
Metadata
Title
Multiobjective evolutionary clustering of Web user sessions: a case study in Web page recommendation
Authors
G. Nildem Demir
A. Şima Uyar
Şule Gündüz-Öğüdücü
Publication date
01-04-2010
Publisher
Springer-Verlag
Published in
Soft Computing / Issue 6/2010
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-009-0428-y

Other articles of this Issue 6/2010

Soft Computing 6/2010 Go to the issue

Premium Partner