skip to main content
10.1145/2232817.2232886acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
poster

An unsupervised technical difficulty ranking model based on conceptual terrain in the latent space

Authors Info & Claims
Published:10 June 2012Publication History

ABSTRACT

Search results of the existing general-purpose search engines usually do not satisfy domain-specific information retrieval tasks as there is a mis-match between the technical expertise of a user and the results returned by the search engine. In this paper, we investigate the problem of ranking domain-specific documents based on the technical difficulty. We propose an unsupervised conceptual terrain model using Latent Semantic Indexing (LSI) for re-ranking search results obtained from a similarity based search system. We connect the sequences of terms under the latent space by the semantic distance between the terms and compute the traversal cost for a document indicating the technical difficulty. Our experiments on a domain-specific corpus demonstrate the efficacy of our method.

References

  1. J. Bellegarda. A multispan language modeling framework for large vocabulary speech recognition. IEEE Transactions on Speech and Audio Processing, 6(5):456 --467, sep 1998.Google ScholarGoogle ScholarCross RefCross Ref
  2. P. Freebody and R. C. Anderson. Effects of vocabulary difficulty, text cohesion, and schema availability on reading comprehension. Reading Research Quarterly, 18(3):pp. 277--294, 1983.Google ScholarGoogle ScholarCross RefCross Ref
  3. W. Kintsch. The role of knowledge in discourse comprehension: A construction-integration model. Psychological Review, 95:163--182, 1988.Google ScholarGoogle ScholarCross RefCross Ref
  4. L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. World Wide Web Internet And Web Information Systems, (1999--66):1--17, 1998.Google ScholarGoogle Scholar
  5. S. Robertson. Understanding inverse document frequency: on theoretical arguments for idf. Journal of Documentation, 60(5):503--520, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  6. S. E. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. In TREC, 1994.Google ScholarGoogle Scholar
  7. G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Commun. ACM, 18:613--620, November 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. X. Shen, B. Tan, and C. Zhai. Implicit user modeling for personalized search. In Proc. of 14th CIKM, pages 824--831, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An unsupervised technical difficulty ranking model based on conceptual terrain in the latent space

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          JCDL '12: Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
          June 2012
          458 pages
          ISBN:9781450311540
          DOI:10.1145/2232817

          Copyright © 2012 Authors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 10 June 2012

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • poster

          Acceptance Rates

          Overall Acceptance Rate415of1,482submissions,28%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader