skip to main content
10.1145/3022227.3022236acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
short-paper

Discovering users navigation of online newspaper using Markov model

Published:05 January 2017Publication History

ABSTRACT

Online newspaper readership is gradually increasing as users are actively spending more time on the Internet to read news. Understanding how users navigate the site is important so we can improve their Web experience. In this study, we collect Web server logs of an online Malaysian newspaper for 28 days in April 2012. We apply Markov model which is one the techniques in Web usage mining used to model a collection of user sessions. The content pages are categorized to three types; main page, article page and section page. We analyzed the navigation flow between different pages in a session. From the Markov state model, we discovered that users are inclined to continue reading the articles from the main page. Interestingly, majority of users that start navigation with the main page will also end their session with the main page as well. As for the section page, we investigate what section pages are read together in a session. Our findings also found that users tend are likely to read section pages together and we observed that National sections are always the favorite section, while Education is the least section to start their session. The findings from this study can be a basis for recommending pages to the user so they can navigate more pages in a session and in turn to increase traffic to the online newspaper site.

References

  1. Michael Barthel, "Newspapers: Fact Sheet," Pew Research Center, 2015. {Online}. Available: http://www.journalism.org/2016/06/15/newspapers-fact-sheet/. {Accessed: 17-Aug-2016}.Google ScholarGoogle Scholar
  2. C. Castillo, M. El-haddad, and M. Stempeck, "Characterizing the Life Cycle of Online News Stories Using Social Media Reactions," 2012.Google ScholarGoogle Scholar
  3. B. A. Huberman, P. L. T. Pirolli, J. E. Pitkow, and R. M. Lukose, "Strong Regularities in World Wide Web Surfing," Sciences (New. York)., pp. 95--97, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  4. J. Borges and M. Levene, "Evaluating variable-length markov chain models for analysis of user web navigation sessions," Knowl. Data Eng. IEEE Trans., vol. 19, no. 4, pp. 441--452, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. I. Cadez, C. Meek, and S. White, "Visualization of Navigation Patterns on a Web Site Using Model Based Clustering," in Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000, no. March. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. V. N. Padmanabhan and J. C. Mogul, "Using Predictive Prefetching to Improve World Wide Web Latency," ACM SIGCOMM Comput. Commun. Rev., vol. 26, no. 3, pp. 22--36, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Eirinaki, M. Vazirgiannis, and D. Kapogiannis, "Web Path Recommendations based on Page Ranking and Markov Models," in Proceedings of the 7th annual ACM International workshop on Web information and data management, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Hughes, J. Zhu, J. Hong, and J. G. Hughes, "Using Markov Models for Web Site Link Prediction," in Proceedings of the thirteenth ACM conference on Hypertext and hypermedia, 2004, no. July. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Cooley, B. Mobasher, and J. Srivastava, "Data preparation for mining world wide web browsing patterns," Knowl. Inf. Syst., vol. 1, no. 1, pp. 5--32, 1999.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Nicholas and P. Huntington, "Evaluating the use of newspaper web sites logs," Int. J. Media Manag., vol. 2, no. 2, pp. 78--88, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  11. H. Wettig, J. Lahtinen, T. Lepola, P. Myllymaki, and H. Tirri, "Bayesian analysis of online newspaper log data," in Applications and the Internet Workshops, 2003, pp. 282--287. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. P. Batista and M. J. Silva, "Mining web access logs of an online newspaper," Dep. Informática, Fac. Ciencias-Universidade Lisboa. Lisboa. Port., 2002.Google ScholarGoogle Scholar
  13. Y. Gao, "From Editors' Choice to Readers' Favorites: Analyzing Server Logs of China's Biggest Online Newspaper," PhD, Fac. Inf. Media Stud. Univ. West. Ontario, London, 2000.Google ScholarGoogle Scholar
  14. L. D. Catledge and J. E. Pitkow, "Characterizing browsing strategies in the World-Wide Web," Comput. Networks ISDN Syst., vol. 27, no. 6, pp. 1065--1073, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. P. Huntington, D. Nicholas, and H. R. Jamali, "Website usage metrics: A re-assessment of session data," Inf. Process. Manag., vol. 44, no. 1, pp. 358--372, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. I. Cadez, D. Hackerman, C. Meeke, P. Smyth, and S. White, "Model-Based Clustering and Visualization," Data Min. Knowl. Discov., vol. 7, no. 4, pp. 399--424, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Sen and M. H. Hansen, "Predicting Web Users' Next Access Based on Log Data," J. Comput. Graph. Stat., vol. 12, no. 1, pp. 143--155, 2003.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Discovering users navigation of online newspaper using Markov model

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication
      January 2017
      746 pages
      ISBN:9781450348881
      DOI:10.1145/3022227

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 5 January 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      IMCOM '17 Paper Acceptance Rate113of366submissions,31%Overall Acceptance Rate213of621submissions,34%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader