ABSTRACT
Online newspaper readership is gradually increasing as users are actively spending more time on the Internet to read news. Understanding how users navigate the site is important so we can improve their Web experience. In this study, we collect Web server logs of an online Malaysian newspaper for 28 days in April 2012. We apply Markov model which is one the techniques in Web usage mining used to model a collection of user sessions. The content pages are categorized to three types; main page, article page and section page. We analyzed the navigation flow between different pages in a session. From the Markov state model, we discovered that users are inclined to continue reading the articles from the main page. Interestingly, majority of users that start navigation with the main page will also end their session with the main page as well. As for the section page, we investigate what section pages are read together in a session. Our findings also found that users tend are likely to read section pages together and we observed that National sections are always the favorite section, while Education is the least section to start their session. The findings from this study can be a basis for recommending pages to the user so they can navigate more pages in a session and in turn to increase traffic to the online newspaper site.
- Michael Barthel, "Newspapers: Fact Sheet," Pew Research Center, 2015. {Online}. Available: http://www.journalism.org/2016/06/15/newspapers-fact-sheet/. {Accessed: 17-Aug-2016}.Google Scholar
- C. Castillo, M. El-haddad, and M. Stempeck, "Characterizing the Life Cycle of Online News Stories Using Social Media Reactions," 2012.Google Scholar
- B. A. Huberman, P. L. T. Pirolli, J. E. Pitkow, and R. M. Lukose, "Strong Regularities in World Wide Web Surfing," Sciences (New. York)., pp. 95--97, 1998.Google ScholarCross Ref
- J. Borges and M. Levene, "Evaluating variable-length markov chain models for analysis of user web navigation sessions," Knowl. Data Eng. IEEE Trans., vol. 19, no. 4, pp. 441--452, 2007. Google ScholarDigital Library
- I. Cadez, C. Meek, and S. White, "Visualization of Navigation Patterns on a Web Site Using Model Based Clustering," in Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000, no. March. Google ScholarDigital Library
- V. N. Padmanabhan and J. C. Mogul, "Using Predictive Prefetching to Improve World Wide Web Latency," ACM SIGCOMM Comput. Commun. Rev., vol. 26, no. 3, pp. 22--36, 1996. Google ScholarDigital Library
- M. Eirinaki, M. Vazirgiannis, and D. Kapogiannis, "Web Path Recommendations based on Page Ranking and Markov Models," in Proceedings of the 7th annual ACM International workshop on Web information and data management, 2005. Google ScholarDigital Library
- J. Hughes, J. Zhu, J. Hong, and J. G. Hughes, "Using Markov Models for Web Site Link Prediction," in Proceedings of the thirteenth ACM conference on Hypertext and hypermedia, 2004, no. July. Google ScholarDigital Library
- R. Cooley, B. Mobasher, and J. Srivastava, "Data preparation for mining world wide web browsing patterns," Knowl. Inf. Syst., vol. 1, no. 1, pp. 5--32, 1999.Google ScholarDigital Library
- D. Nicholas and P. Huntington, "Evaluating the use of newspaper web sites logs," Int. J. Media Manag., vol. 2, no. 2, pp. 78--88, 2000.Google ScholarCross Ref
- H. Wettig, J. Lahtinen, T. Lepola, P. Myllymaki, and H. Tirri, "Bayesian analysis of online newspaper log data," in Applications and the Internet Workshops, 2003, pp. 282--287. Google ScholarDigital Library
- P. Batista and M. J. Silva, "Mining web access logs of an online newspaper," Dep. Informática, Fac. Ciencias-Universidade Lisboa. Lisboa. Port., 2002.Google Scholar
- Y. Gao, "From Editors' Choice to Readers' Favorites: Analyzing Server Logs of China's Biggest Online Newspaper," PhD, Fac. Inf. Media Stud. Univ. West. Ontario, London, 2000.Google Scholar
- L. D. Catledge and J. E. Pitkow, "Characterizing browsing strategies in the World-Wide Web," Comput. Networks ISDN Syst., vol. 27, no. 6, pp. 1065--1073, 1995. Google ScholarDigital Library
- P. Huntington, D. Nicholas, and H. R. Jamali, "Website usage metrics: A re-assessment of session data," Inf. Process. Manag., vol. 44, no. 1, pp. 358--372, 2008. Google ScholarDigital Library
- I. Cadez, D. Hackerman, C. Meeke, P. Smyth, and S. White, "Model-Based Clustering and Visualization," Data Min. Knowl. Discov., vol. 7, no. 4, pp. 399--424, 2003. Google ScholarDigital Library
- R. Sen and M. H. Hansen, "Predicting Web Users' Next Access Based on Log Data," J. Comput. Graph. Stat., vol. 12, no. 1, pp. 143--155, 2003.Google ScholarCross Ref
Index Terms
- Discovering users navigation of online newspaper using Markov model
Recommendations
Analysing User Access To An Online Newspaper
ADCS '14: Proceedings of the 19th Australasian Document Computing SymposiumThere have been several studies of online newspapers that use web server logs to analyze traffic and their user behavior but most of these studies were undertaken requiring a demographic profile of the users. Our study adds to the literature by ...
Process mining approach to analyze user navigation behavior of a news website
ICISS '21: Proceedings of the 4th International Conference on Information Science and SystemsWe apply process mining to investigate the user behaviour of an online newspaper website where we collected web server logs and pre-processed the logs using web usage mining techniques, then used Fluxicon Disco to generate the process map. We found that ...
Generating a New Model for Predicting the Next Accessed Web Page in Web Usage Mining
ICETET '10: Proceedings of the 2010 3rd International Conference on Emerging Trends in Engineering and TechnologyWorld Wide Web is growing rapidly. So it is necessary to study the user web navigation behavior to improve the quality of web services, offered to the web user. Analysis of user web navigation behavior is achieved through modeling web navigation ...
Comments