skip to main content
10.1145/2872518.2889380acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
poster

SparkTrails: A MapReduce Implementation of HypTrails for Comparing Hypotheses About Human Trails

Published:11 April 2016Publication History

ABSTRACT

HypTrails is a bayesian approach for comparing different hypotheses about human trails on the web. While a standard implementation exists, it exposes performance issues when working with large-scale data. In this paper, we propose a distributed implementation of HypTrails based on Apache Spark taking advantage of several structural properties inherent to HypTrails. The performance improves substantially. Our implementation is publicly available.

References

  1. M. Becker, P. Singer, F. Lemmerich, A. Hotho, D. Helic, and M. Strohmaier. Photowalking the city: Comparing hypotheses about urban photo trails on flickr. In Social Informatics, volume 9471 of Lecture Notes in CS. 2015.Google ScholarGoogle ScholarCross RefCross Ref
  2. P. Singer, D. Helic, A. Hotho, and M. Strohmaier. Hyptrails: A bayesian approach for comparing hypotheses about human trails. In 24th Intl. World Wide Web Conf. (WWW2015), 2015, best paper. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. E. Wulczyn and D. Taraborelli. Wikipedia Clickstream. figshare, 2015.Google ScholarGoogle Scholar

Index Terms

  1. SparkTrails: A MapReduce Implementation of HypTrails for Comparing Hypotheses About Human Trails

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web
      April 2016
      1094 pages
      ISBN:9781450341448

      Copyright © 2016 Copyright is held by the owner/author(s)

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      International World Wide Web Conferences Steering Committee

      Republic and Canton of Geneva, Switzerland

      Publication History

      • Published: 11 April 2016

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      WWW '16 Companion Paper Acceptance Rate115of727submissions,16%Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader