Published in: Discover Computing 5/2010

1 October 2010 | S.I.: Focused Retrieval and Result Aggr.

Entity ranking in Wikipedia: utilising categories, links and topic difficulty prediction

Authors: Jovan Pehcevski, James A. Thom, Anne-Marie Vercoustre, Vladimir Naumovski

Abstract

Entity ranking has recently emerged as a research field that aims at retrieving entities as answers to a query. Unlike entity extraction where the goal is to tag names of entities in documents, entity ranking is primarily focused on returning a ranked list of relevant entity names for the query. Many approaches to entity ranking have been proposed, and most of them were evaluated on the INEX Wikipedia test collection. In this paper, we describe a system we developed for ranking Wikipedia entities in answer to a query. The entity ranking approach implemented in our system utilises the known categories, the link structure of Wikipedia, as well as the link co-occurrences with the entity examples (when provided) to retrieve relevant entities as answers to the query. We also extend our entity ranking approach by utilising the knowledge of predicted classes of topic difficulty. To predict the topic difficulty, we generate a classifier that uses features extracted from an INEX topic definition to classify the topic into an experimentally pre-determined class. This knowledge is then utilised to dynamically set the optimal values for the retrieval parameters of our entity ranking system. Our experiments demonstrate that the use of categories and the link structure of Wikipedia can significantly improve entity ranking effectiveness, and that topic difficulty prediction is a promising approach that could also be exploited to further improve the entity ranking performance.
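
The idea of setting retrieval parameters from a predicted topic difficulty class can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so everything here is an assumption made for illustration: the linear combination of a link score, a category score and a full-text score, the weight names `alpha` and `beta`, the class labels, and all numeric values are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Weights:
    alpha: float  # hypothetical weight of the link-based score
    beta: float   # hypothetical weight of the category-based score


# Hypothetical per-class parameter settings; the class names and values
# are placeholders, not results reported by the authors.
WEIGHTS_BY_CLASS = {
    "easy": Weights(alpha=0.1, beta=0.8),
    "difficult": Weights(alpha=0.3, beta=0.5),
}
DEFAULT_WEIGHTS = Weights(alpha=0.2, beta=0.6)


def combined_score(link: float, category: float, fulltext: float, w: Weights) -> float:
    """Assumed linear combination of the three evidence sources."""
    return w.alpha * link + w.beta * category + (1.0 - w.alpha - w.beta) * fulltext


def rank_entities(candidates: dict, difficulty_class: str) -> list:
    """Rank candidates, given as name -> (link, category, fulltext) scores.

    The weights are looked up from the predicted difficulty class, falling
    back to a default setting for classes that have no entry.
    """
    w = WEIGHTS_BY_CLASS.get(difficulty_class, DEFAULT_WEIGHTS)
    scored = [
        (name, combined_score(link, cat, text, w))
        for name, (link, cat, text) in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

In this sketch the classifier output is just a string key; a real system would obtain it from a trained classifier over topic features, and would tune the per-class weights on training topics rather than hard-coding them.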

Footnotes

2 These 46 topics in the INEX 2007 XER testing data set can also be used for training when testing on the INEX 2008 XER testing data set.

4 We discarded external links and some internal collection links that do not refer to existing pages in the INEX Wikipedia collection. The number N has been kept to a relatively small value mainly for performance purposes, since Wikipedia pages contain many links that would need to be extracted. We carried out some preliminary experiments with different values of the parameter N, by varying it between 5 and 100 with a step of 5, and found that N = 20 was a good compromise between maintaining satisfactory performance and discovering more potentially good entities.

5 The first two runs do not use any of Zettair’s category indexes and are included for comparison.

6 The value ten was determined experimentally on the INEX 2007 XER training topic set.
Metadata
Title
Entity ranking in Wikipedia: utilising categories, links and topic difficulty prediction
Authors
Jovan Pehcevski
James A. Thom
Anne-Marie Vercoustre
Vladimir Naumovski
Publication date
1 October 2010
Publisher
Springer Netherlands
Published in
Discover Computing / Issue 5/2010
Print ISSN: 2948-2984
Electronic ISSN: 2948-2992
DOI
https://doi.org/10.1007/s10791-009-9125-9