- 1.J. Allan, L. Ballesteros, J. P. Callan, W. B. Croft, and Z. Lu. Recent experiments with INQUERY. In Harman {11}, pages 49-64.Google Scholar
- 2.V. N. Anh and A. Moffat. Compressed inverted files with reduced decoding overheads. In W. B. Croft, A. Moffat, C. J. van Rijsbergen, R. Wilkinson, and J. Zobel, editors, Proc. 21st Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, pages 290-297, Melbourne, Australia, August 1998. ACM Press, New York. Google ScholarDigital Library
- 3.N. J. Belkin, A. D. Narasimhalu, and P. Willett, editors. Proc. 20th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, PA, July 1997. ACM Press, New York. Google Scholar
- 4.C. Buckley and A. F. Lewit. Optimization of inverted vector searches. In Proc. 8th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval, pages 97-110, Montreal, Canada, June 1985. ACM Press, New York. Google ScholarDigital Library
- 5.C. Buckley, A. Singhal, and M. Mitra. New retrieval approaches using SMART: TREC 4. In Harman {11}, pages 25-48.Google Scholar
- 6.C. L. A. Clarke, G. V. Cormack, and F. J. Burkowski. Shortest substring ranking (multitext experiments for TREC-4). In Harman {11}, pages 295-304.Google Scholar
- 7.O. de Kretser and A. Moffat. Locality-based information retrieval. In J. Roddick, editor, Proc. 10th Australasian Database Conference, pages 177-188, Auckland, New Zealand, January 1999. Springer-Verlag, Singapore.Google Scholar
- 8.W. B. Frakes and R. Baeza-Yates, editors. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs, New Jersey, 1992. Google ScholarDigital Library
- 9.H.-P. Frei, D. Harman, P. Schguble, and R. Wilkinson, editors. Proc. 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zfirich, Switzerland, August 1996. ACM Press, New York. Google Scholar
- 10.D. K. Harman. Overview of the second text retrieval conference (TREC-2). Information Processing ~4 Management, 31(3):271-289, May 1995. Google ScholarDigital Library
- 11.D. K. Harman, editor. Proc. Fourth Text REtrieval Conference (TREC-$), Gaithersburg, MD, November 1995. National Institute of Standards and Technology Special Publication 500-236.Google ScholarCross Ref
- 12.D. K. Harman and G. Candela. Retrieving records from a gigabyte of text on a minicomputer using statistical ranking. Journal of the American Society for Information Science, 41(8):581-589, August 1990.Google ScholarCross Ref
- 13.D. Hawking and P. Thistlewaite. Proximity operators: So near and yet so far. In Harman {11}, pages 131-144.Google Scholar
- 14.D. Hawking and P. Thistlewaite. Relevance weighting using distance between term occurrences. Technical Report 96-08, Australian National University, 1996.Google Scholar
- 15.M. A. Hearst and J. O. Pedersen. Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In Frei et al. {9}, pages 76-84. Google Scholar
- 16.M. Kaskiel and J. Zobel. Passage retrieval revisited. In Belkin et al. {3}, pages 178-185. Google Scholar
- 17.D. Knaus, E. Mittendorf, P. Schguble, and P. Sheridan. Highlighting relevant passages for users of the interactive SPIDER retrieval system. In Harman {11}, pages 233-244.Google Scholar
- 18.R. R. Korfhage. Information Storage and Retrieval. Wiley, New York, 1997. Google ScholarDigital Library
- 19.MG public domain software for indexing and retrieving text, including tools for compressing text, bilevel images, grayscale images, and textual images, 1999. http ://www. cs. mu. oz. au/mg/.Google Scholar
- 20.A. Moffat and J. Zobel. Self-indexing inverted files for fast text retrieval. A CM Transactions on Information Systems, 14(4):349-379, October 1996. Google ScholarDigital Library
- 21.L. T. Nowell, R. K. France, D. Hix, L. S. Heath, and E. A. Fox. Visualizing search results: Some alternatives to querydocument similarity. In Frei et al. {9}, pages 67-75. Google Scholar
- 22.M. Persin, J. Zobel, and R. Sacks-Davis. Filtered document retrieval with frequency-sorted indexes. Journal of the American Society for Information Science, 47(10):749-764, October 1996. Google ScholarCross Ref
- 23.G. Salton and M. J. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, New York, 1983. Google ScholarDigital Library
- 24.A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In Frei et al. {9}, pages 21-29. Google Scholar
- 25.C. J. van Rijsbergen. Information Retrieval. Butterworths, London, second edition, 1979. Google ScholarDigital Library
- 26.A. Veerasamy and N. J. Belkin. Evaluation of a tool for visualization of information retrieval results. In Frei et al. {9}, pages 85-92. Google Scholar
- 27.A. Veerasamy and R. Heikes. Effectiveness of a graphical display of retrieval results. In Belkin et al. {3}, pages 236- 245. Google Scholar
- 28.I. H. Witten, A. Moffat, and T. C. Bell. Managing Gigabytes: Compressing and Indexing Documents and Images. Morgan Kaufmann, San Francisco, second edition, 1999. Google ScholarDigital Library
- 29.J. Zobel and A. Moffat. Exploring the similarity space. A CM SIGIR Forum, 32(1):18-34, Spring 1998. Google ScholarDigital Library
- 30.J. Zobel, A. Moffat, R. Wilkinson, and R. Sacks-Davis. Efficient retrieval of partial documents. Information Processing ~4 Management, 31(3):361-377, May 1995. Google ScholarDigital Library
Index Terms
- Effective document presentation with a locality-based similarity heuristic
Recommendations
Effective measures for inter-document similarity
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementWhile supervised learning-to-rank algorithms have largely supplanted unsupervised query-document similarity measures for search, the exploration of query-document measures by many researchers over many years produced insights that might be exploited in ...
Effective cache prefetching on bus-based multiprocessors
Compiler-directed cache prefetching has the potential to hide much of the high memory latency seen by current and future high-performance processors. However, prefetching is not without costs, particularly on a shared-memory multiprocessor. Prefetching ...
A document management methodology based on similarity contents
Special issue: Informatics and computer science intelligent systems applicationsThe advent of the WWW and distributed information systems have made it possible to share documents between different users and organisations. However, this has created many problems related to the security, accessibility, right and most importantly the ...
Comments