Published in: Discover Computing 1/2012

1 February 2012

Graph-based term weighting for information retrieval

Authors: Roi Blanco, Christina Lioma



Abstract

A standard approach to Information Retrieval (IR) is to model text as a bag of words. Alternatively, text can be modelled as a graph whose vertices represent words and whose edges represent relations between the words, defined on the basis of any meaningful statistical or linguistic relation. Given such a text graph, graph-theoretic computations can be applied to measure various properties of the graph, and hence of the text. This work explores the usefulness of such graph-based text representations for IR. Specifically, we propose a principled graph-theoretic approach to (1) computing term weights and (2) integrating discourse aspects into retrieval. Given a text graph whose vertices denote terms linked by co-occurrence and grammatical modification, we use graph ranking computations (e.g. PageRank; Page et al., The PageRank citation ranking: Bringing order to the Web, Technical report, Stanford Digital Library Technologies Project, 1998) to derive a weight for each vertex, i.e. a term weight, which we use to rank documents against queries. We reason that our graph-based term weights do not necessarily need to be normalised by document length (unlike existing term weights) because they are already scaled by their graph-ranking computation. This is a departure from existing IR ranking functions, and we experimentally show that it performs comparably to a tuned ranking baseline, such as BM25 (Robertson et al., NIST Special Publication 500-236: TREC-4, 1995). In addition, we integrate into ranking graph properties, such as the average path length or the clustering coefficient, which represent different aspects of the topology of the graph, and by extension of the document represented as a graph. Integrating such properties into ranking allows us to consider issues such as discourse coherence, flow and density during retrieval. We experimentally show that this type of ranking performs comparably to BM25, and can even outperform it, across different TREC (Voorhees and Harman, TREC: Experiment and evaluation in information retrieval, MIT Press, 2005) datasets and evaluation measures.
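
The term-weighting idea described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' implementation. It assumes a plain undirected co-occurrence graph (grammatical-modification edges are left out), a window of 3 tokens, a damping factor of 0.85, and a simplified `score` function that just sums the graph-based weights of the query terms. All function names and parameter values are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(tokens, window=3):
    """Link terms that occur within `window` consecutive tokens (undirected)."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        graph[term]  # make sure every term becomes a vertex
        for other in tokens[i + 1 : i + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return dict(graph)


def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank on an undirected graph; returns vertex weights."""
    n = len(graph)
    rank = {v: 1.0 / n for v in graph}
    for _ in range(iterations):
        # Rank held by vertices without neighbours is spread uniformly.
        dangling = sum(rank[v] for v in graph if not graph[v])
        new_rank = {}
        for v in graph:
            incoming = sum(rank[u] / len(graph[u]) for u in graph[v])
            new_rank[v] = (1 - damping) / n + damping * (incoming + dangling / n)
        rank = new_rank
    return rank


def average_clustering(graph):
    """Mean local clustering coefficient (taken as 0 for vertices of degree < 2)."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        links = sum(1 for a, b in combinations(nbrs, 2) if b in graph[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(graph) if graph else 0.0


def score(query_terms, weights):
    """Hypothetical document score: sum of graph-based weights of query terms."""
    return sum(weights.get(t, 0.0) for t in query_terms)


if __name__ == "__main__":
    doc = "graph based term weighting uses graph ranking to weight each term".split()
    g = cooccurrence_graph(doc)
    w = pagerank(g)
    print(sorted(w.items(), key=lambda kv: -kv[1])[:3])
    print(score(["graph", "term"], w), average_clustering(g))
```

Note that the weights are not divided by document length: PageRank already distributes a total mass of 1 over the vertices of each document graph, which is the kind of built-in scaling the abstract refers to. A graph-level property such as `average_clustering` could be combined with the term score as a per-document feature; how to combine them is left open here.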

Footnotes
1
The term ‘connectionist’ has been used to denote most network or graph based approaches, despite the fact that, strictly speaking, classical connectionist systems should consist of weighted, unlabeled links and should exhibit some adaptive learning capabilities.
 
2
The difference between co-occurrences and collocations is that collocations are significant recurrent co-occurrences (see Sinclair 1991 for more). There exist several measures for distinguishing collocations from insignificant, though recurrent, co-occurrences; they are reviewed in Manning and Schutze (1999).
 
3
The illustrations in Figs. 5 and 6 have been generated with CFinder: http://cfinder.org/.
 
5
TF-IDF is used here with pivoted document length normalisation (Singhal et al. 1996).
 
References
Agirre, E., & Soroa, A. (2009). Personalizing pagerank for word sense disambiguation. In EACL (pp. 33–41). The Association for Computer Linguistics.
Albert, R. (2005). Scale-free networks in cell biology. Journal of Cell Science, 118, 4947–4957.
Albert, R., & Barabási, A. L. (2001). Statistical mechanics of complex networks. CoRR cond-mat/0106096.
Albert, R., & Barabási, A. L. (2002). Statistical mechanics of complex networks. Reviews of Modern Physics, 74, 47–97.
Albert, R., Jeong, H., & Barabási, A. L. (1999). The diameter of the world wide web. CoRR cond-mat/9907038.
Allan, J., Aslam, J. A., Sanderson, M., Zhai, C., & Zobel, J. (Eds.). (2009). Proceedings of the 32nd annual international ACM SIGIR conference on research and development in information retrieval, SIGIR 2009. Boston, MA, USA: ACM. July 19–23.
Baeza-Yates, R. A., & Ribeiro-Neto, B. A. (1999). Modern information retrieval. New York: ACM Press/Addison-Wesley.
Baeza-Yates, R. A., Ziviani, N., Marchionini, G., Moffat, A., & Tait, J. (Eds.). (2005). SIGIR 2005: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval. Salvador, Brazil: ACM. August 15–19.
Barrat, A., Barthélemy, M., Pastor-Satorras, R., & Vespignani, A. (2004). The architecture of complex weighted networks. Proceedings of the National Academy of Sciences, 101(11), 3747–3752.
Bekkerman, R., Zilberstein, S., & Allan, J. (2007). Web page clustering using heuristic search in the web graph. In IJCAI (pp. 2280–2285).
Belew, R. K. (2011). Adaptive information retrieval: Using a connectionist representation to retrieve and learn about documents. In Belkin and van Rijsbergen (1989), pp. 11–20.
Belew, R. K. (2005). Scientific impact quantity and quality: Analysis of two sources of bibliographic data. CoRR abs/cs/0504036.
Belkin, N. J., & van Rijsbergen, C. J. (Eds.). (1989). SIGIR’89, 12th international conference on research and development in information retrieval. Cambridge, Massachusetts, USA: ACM. June 25–28 (Proceedings).
Berlow, E. L. (1999). Strong effects of weak interactions in ecological communities. Nature, 398, 330–334.
Blanco, R., & Lioma, C. (2007). Random walk term weighting for information retrieval. In SIGIR (pp. 829–830).
Boccaletti, S., Latora, V., Moreno, Y., Chavez, M., & Hwang, D. U. (2006). Complex networks: structure and dynamics. Physics Reports, 424, 175–308.
Bollobás, B. (1979). Graph theory: An introductory course. New York: Springer.
Bollobás, B. (1985). Random graphs. London: Academic Press.
Bookstein, A., Chiaramella, Y., Salton, G., & Raghavan, V. V. (Eds.). (1991). Proceedings of the 14th annual international ACM SIGIR conference on research and development in information retrieval. Chicago, Illinois, USA: ACM. October 13–16 (Special Issue of the SIGIR Forum).
Caldeira, S. M. G., Lobao, T. C. P., Andrade, R. F. S., Neme, A., & Miranda, J. G. V. (2005). The network of concepts in written texts.
i Cancho, R. F., Capocci, A., & Caldarelli, G. (2007). Spectral methods cluster words of the same class in a syntactic dependency network. International Journal of Bifurcation and Chaos, 17(7), 2453–2463.
Cao, G., Nie, J. Y., & Bai, J. (2005). Integrating word relationships into language models. In R. A. Baeza-Yates, N. Ziviani, G. Marchionini, A. Moffat, & J. Tait (Eds.), SIGIR (pp. 298–305).
Chakrabarti, S., Dom, B., Raghavan, P., Rajagopalan, S., Gibson, D., & Kleinberg, J. M. (1998). Automatic resource compilation by analyzing hyperlink structure and associated text. Computer Networks, 30(1–7), 65–74.
Choudhury, M., Thomas, M., Mukherjee, A., Basu, A., & Ganguly, N. (2007). How difficult is it to develop a perfect spell-checker? A cross-linguistic analysis through complex network approach. In Proceedings of the second workshop on TextGraphs: Graph-based algorithms for natural language processing (pp. 81–88). Rochester, NY, USA: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W07/W07-021.
Christensen, C., & Albert, R. (2007). Using graph concepts to understand the organization of complex systems. International Journal of Bifurcation and Chaos, 17(7), 2201–2214.
Cramer, P. (1968). Word association. New York, USA: Academic Press.
Craswell, N., Robertson, S. E., Zaragoza, H., & Taylor, M. J. (2005). Relevance weighting for query independent evidence. In SIGIR (pp. 416–423).
Craswell, N., & Szummer, M. (2007). Random walks on the click graph. In Kraaij et al. (2007), pp. 239–246.
Crestani, F., & van Rijsbergen, C. J. (1998). A study of probability kinematics in information retrieval. ACM Transactions on Information Systems, 16(3), 225–255.
Deese, J. (1965). The structure of associations in language and thought. Baltimore, USA: The Johns Hopkins Press.
Doszkocs, T. E., Reggia, J., & Lin, X. (1990). Connectionist models and information retrieval. Annual Review of Information Science and Technology (ARIST), 25, 209–260.
Erdos, P., & Renyi, A. (1959). On random graphs I. Publicationes Mathematicae (Debrecen), 6, 290–297.
Erdos, P., & Renyi, A. (1960). On the evolution of random graphs. Publications of the Mathematical Institute of the Hungarian Academy of Sciences, 5, 17–61.
Erdos, P., & Renyi, A. (1961). On the evolution of random graphs. Bulletin of the International Statistical Institute, 38, 343–347.
Erkan, G., & Radev, D. R. (2004). Lexrank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research (JAIR), 22, 457–479.
Esuli, A., & Sebastiani, F. (2007). Pageranking wordnet synsets: An application to opinion mining. In The Association for Computer Linguistics (ACL).
Faloutsos, M., Faloutsos, P., & Faloutsos, C. (1999). On power-law relationships of the internet topology. In SIGCOMM (pp. 251–262).
Feinberg, M. (1980). Chemical oscillations, multiple equilibria, and reaction network structure. In W. Stewart, W. Rey, & C. Conley (Eds.), Dynamics of reactive systems (pp. 59–130). New York: Academic Press.
Ferrer i Cancho, R. (2005). The structure of syntactic dependency networks: Insights from recent advances in network theory. In G. Altmann, V. Levickij, & V. Perebyinis (Eds.), The problems of quantitative linguistics (pp. 60–75). Chernivtsi: Ruta.
Ferrer i Cancho, R., & Solé, R. V. (2001). Two regimes in the frequency of words and the origins of complex lexicons: Zipf’s law revisited. Journal of Quantitative Linguistics, 8(3), 165–173.
Firth, J. R. (1968b). A synopsis of linguistic theory. In F. R. Palmer (Ed.), Selected papers of J.R. Firth 1952–1959 (pp. 168–205). London: Longmans.
Gaume, B. (2008). Mapping the forms of meaning in small worlds. International Journal of Intelligent Systems, 23(7), 848–862.
Girvan, M., & Newman, M. E. J. (2002). Community structure in social and biological networks. Proceedings of the National Academy of Sciences USA, 99(12), 7821–7826.
Goldberg, A., & Zhu, X. (2006). Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization. In Proceedings of TextGraphs: The first workshop on graph based methods for natural language processing (pp. 45–52). New York City: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W06/W06-380.
Guimera, R., Mossa, S., Turtschi, A., & Amaral, L. A. N. (2005). The worldwide air transportation network: Anomalous centrality, community structure, and cities’ global roles. Proceedings of the National Academy of Sciences USA, 102, 7794–7799.
Halliday, M., & Hasan, R. (1976). Cohesion in English. London: Longman.
Hassan, S., & Banea, C. (2006). Random-walk term weighting for improved text classification. In Proceedings of TextGraphs: The first workshop on graph based methods for natural language processing (pp. 53–60). New York City: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W06/W06-380.
Ho, N. D., & Fairon, C. (2004). Lexical similarity based on quantity of information exchanged - synonym extraction. In RIVF (pp. 193–198).
Hoey, M. (1991). Patterns of lexis in text. Oxford, UK: Oxford University Press.
Hopfield, J. J. (1982). Neural networks and physical systems with emergent collective computational abilities. Proceedings of the National Academy of Sciences, 79(8), 2554–2558.
Hopfield, J. J., & Tank, D. W. (1986). Computing with neural circuits: A model. Science, 233, 625–633.
Huang, W. Y., & Lippmann, R. (1987). Neural net and traditional classifiers. In D. Z. Anderson (Ed.), NIPS (pp. 387–396). American Institute of Physics.
Hughes, T., & Ramage, D. (2007). Lexical semantic relatedness with random graph walks. In Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL) (pp. 581–589). Prague, Czech Republic: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/D/D07/D07-106.
Jespersen, O. (1929). The philosophy of grammar. London: Allen and Unwin.
Joyce, T., & Miyake, M. (2008). Capturing the structures in association knowledge: Application of network analyses to large-scale databases of Japanese word associations. In T. Tokunaga & A. Ortega (Eds.), Lecture notes in computer science (LKR) (Vol. 4938, pp. 116–131). Springer.
Jung, J., Makoshi, N., & Akama, H. (2008). Associative language learning support applying graph clustering for vocabulary learning and improving associative ability. In ICALT (pp. 228–232). IEEE.
Kleinberg, J. M. (2006). Social networks, incentives, and search. In E. N. Efthimiadis, S. T. Dumais, D. Hawking, & K. Järvelin (Eds.), SIGIR (pp. 210–211). ACM.
Knospe, W., Santen, L., Schadschneider, A., & Schreckenberg, M. (2002). Single vehicle data of highway traffic: Microscopic description of traffic phases. Physical Review E, 65, 056133.
Konstas, I., Stathopoulos, V., & Jose, J. M. (2009). On social networks and collaborative recommendation. In Allan et al. (2009), pp. 195–202.
Kozima, H. (1993). Similarity between words computed by spreading activation on an English dictionary. In EACL (pp. 232–239).
Kraaij, W., de Vries, A. P., Clarke, C. L. A., Fuhr, N., & Kando, N. (Eds.). (2007). SIGIR 2007: Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval. Amsterdam, The Netherlands: ACM. July 23–27.
Krapivsky, P. L., Redner, S., & Leyvraz, F. (2000). Connectivity of growing random networks. Physical Review Letters, 85, 4629–4632.
Krovetz, R. (2000). Viewing morphology as an inference process. Artificial Intelligence, 118(1–2), 277–294.
Zurück zum Zitat Kurland, O., & Lee, L. (2010). Pagerank without hyperlinks: Structural re-ranking using links induced by language models. In Baeza-Yates et al. (2005), pp. 306–313. Kurland, O., & Lee, L. (2010). Pagerank without hyperlinks: Structural re-ranking using links induced by language models. In Baeza-Yates et al. (2005), pp. 306–313.
Zurück zum Zitat Kwok, K. L. (2011) A neural network for probabilistic information retrieval. In Belkin and van Rijsbergen (1989), pp. 21–30. Kwok, K. L. (2011) A neural network for probabilistic information retrieval. In Belkin and van Rijsbergen (1989), pp. 21–30.
Zurück zum Zitat Latora, V., & Marchiori, M. (2001). Efficient behavior of small-world networks. Physical Review Letters, 87, 198701–198704.CrossRef Latora, V., & Marchiori, M. (2001). Efficient behavior of small-world networks. Physical Review Letters, 87, 198701–198704.CrossRef
Zurück zum Zitat Latora, V., & Marchiori, M. (2003). Economic small-world behaviour in weighted networks. European Physics Journal, B32, 249–263. Latora, V., & Marchiori, M. (2003). Economic small-world behaviour in weighted networks. European Physics Journal, B32, 249–263.
Zurück zum Zitat Leicht, E. A., Holme, P., & Newman, M. E. J. (2006) Vertex similarity in networks. Physical Review E, (73). Leicht, E. A., Holme, P., & Newman, M. E. J. (2006) Vertex similarity in networks. Physical Review E, (73).
Zurück zum Zitat Lemke, N., Herédia, F., Barcellos, C. K., dos Reis, A. N., & Mombach, J. C. M. (2004). Essentiality and damage in metabolic networks. Bioinformatics, 20(1), 115–119.CrossRef Lemke, N., Herédia, F., Barcellos, C. K., dos Reis, A. N., & Mombach, J. C. M. (2004). Essentiality and damage in metabolic networks. Bioinformatics, 20(1), 115–119.CrossRef
Zurück zum Zitat Lempel, R., & Moran, S. (2001). SALSA: The stochastic approach for link-structure analysis. ACM Transaction on Informational System, 19(2), 131–160.CrossRef Lempel, R., & Moran, S. (2001). SALSA: The stochastic approach for link-structure analysis. ACM Transaction on Informational System, 19(2), 131–160.CrossRef
Zurück zum Zitat Li, W., & Cai, X. (2004). Statistical analysis of airport network of china. Physical Review, E69, 046106. Li, W., & Cai, X. (2004). Statistical analysis of airport network of china. Physical Review, E69, 046106.
Lin, X., Soergel, D., & Marchionini, G. (1991). A self-organizing semantic map for information retrieval. In Bookstein et al. (1991), pp. 262–269.
Lioma, C., & Blanco, R. (2009). Part of speech based term weighting for information retrieval. In M. Boughanem, C. Berrut, J. Mothe, & C. Soulé-Dupuy (Eds.), ECIR, Lecture notes in computer science (Vol. 5478, pp. 412–423). Springer.
Lioma, C., & Van Rijsbergen, C. J. K. (2008). Part of speech n-grams and information retrieval. RFLA, 8, 9–22.
Ma’ayan, A., Blitzer, R. D., & Iyengar, R. (2004). Toward predictive models of mammalian cells. Annual Review of Biophysics and Biomolecular Structure, 319–349.
Ma’ayan, A., Jenkins, S. L., Neves, S., Hasseldine, A., Grace, E., Dubin-Thaler, et al. (2005). Formation of regulatory patterns during signal propagation in a mammalian cellular network. Science, 309(5737), 1078–1083.
Macleod, K. J., & Robertson, W. (1991). A neural algorithm for document clustering. Information Processing & Management, 27(4), 337–346.
Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. London: The MIT Press.
McCann, K., Hastings, A., & Huxel, G. R. (1998). Weak trophic interactions and the balance of nature. Nature, 395, 794–798.
Mehler, A. (2007). Large text networks as an object of corpus linguistic studies. In Corpus linguistics. An international handbook of the science of language and society.
Mihalcea, R., & Tarau, P. (2004). TextRank: Bringing order into texts. In EMNLP (pp. 404–411).
Minkov, E., & Cohen, W. W. (2008). Learning graph walk based similarity measures for parsed text. In EMNLP (pp. 907–916). ACL.
Minsky, M. L. (1969). Semantic information processing. Cambridge: The MIT Press.
Mizzaro, S., & Robertson, S. (2007). HITS hits TREC: Exploring IR evaluation results with network analysis. In Kraaij et al. (2007), pp. 479–486.
Moore, C., & Newman, M. E. J. (2000). Epidemics and percolation in small-world networks. Physical Review E, 61, 5678–5682.
Motter, A. E., de Moura, A. P. S., Lai, Y. C., & Dasgupta, P. (2011). Topology of the conceptual network of language. Physical Review E, 65(6).
Muller, P., Hathout, N., & Gaume, B. (2006). Synonym extraction using a semantic distance on a dictionary. In Proceedings of TextGraphs: The first workshop on graph based methods for natural language processing (pp. 65–72). New York City: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W06/W06-3811
Nastase, V., Sayyad-Shirabad, J., Sokolova, M., & Szpakowicz, S. (2006). Learning noun-modifier semantic relations with corpus-based and WordNet-based features. In AAAI. AAAI Press.
Noh, T. G., Park, S. B., Yoon, H. G., Lee, S. J., & Park, S. Y. (2009). An automatic translation of tags for multimedia contents using folksonomy networks. In Allan et al. (2009), pp. 492–499.
Ounis, I., Lioma, C., Macdonald, C., & Plachouras, V. (2007). Research directions in Terrier: A search engine for advanced retrieval on the Web. Novatica/UPGRADE Special Issue on Web Information Access.
Ozmutlu, S., Spink, A., & Ozmutlu, H. C. (2004). A day in the life of Web searching: An exploratory study. Information Processing & Management, 40(2), 319–345.
Pado, S., & Lapata, M. (2007). Dependency-based construction of semantic space models. Computational Linguistics, 33(2), 161–199.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1998). The PageRank citation ranking: Bringing order to the Web. Technical report, Stanford Digital Library Technologies Project. url: citeseer.ist.psu.edu/page98pagerank.html
Pearl, J. (1988). Probabilistic reasoning in intelligent systems: Networks of plausible inference. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
Pedersen, T., Patwardhan, S., & Michelizzi, J. (2004). WordNet::Similarity: Measuring the relatedness of concepts. In D. L. McGuinness, & G. Ferguson (Eds.), AAAI (pp. 1024–1025). AAAI Press/The MIT Press.
Plaza, L., Díaz, A., & Gervás, P. (2008). Concept-graph based biomedical automatic summarization using ontologies. In Coling 2008: Proceedings of the 3rd TextGraphs workshop on graph-based algorithms for natural language processing (pp. 53–56). Manchester, UK: Coling 2008 Organizing Committee. url: http://www.aclweb.org/anthology/W08-200.
Polis, G. A. (1998). Ecology: Stability is woven by complex webs. Nature, 395, 744–745.
Ponte, J. M., & Croft, W. B. (1998). A language modeling approach to information retrieval. In SIGIR (pp. 275–281). ACM.
Popescu, A. M., & Etzioni, O. (2005). Extracting product features and opinions from reviews. In HLT/EMNLP. The Association for Computational Linguistics.
Ramage, D., Rafferty, A. N., & Manning, C. D. (2009). Random walks for text semantic similarity. In Proceedings of the 2009 workshop on graph-based methods for natural language processing (TextGraphs-4) (pp. 23–31). Suntec, Singapore: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W09/W09-3204
Reynal, V. F., & Brainerd, C. J. (2005). Fuzzy processing in transitivity development. Annals of Operations Research, 23(1), 37–63.
Robertson, S., & Sparck Jones, K. (1976). Relevance weighting of search terms. Journal of the American Society of Information Science, 27, 129–146.
Robertson, S., Walker, S., Beaulieu, M., Gatford, M., & Payne, A. (1995). Okapi at TREC-4. In NIST Special Publication 500-236: TREC-4.
Ruge, G. (1995). Human memory models and term association. In E. A. Fox, P. Ingwersen, & R. Fidel (Eds.), SIGIR (pp. 219–227). ACM Press.
Scellato, S., Cardillo, A., Latora, V., & Porta, S. (2005). The backbone of a city. European Physical Journal B, 50(1–2), 221–225. arXiv: physics/0511063 (manuscript not submitted to the proceedings NEXT-SigmaPhi).
Schenkel, R., Crecelius, T., Kacimi, M., Michel, S., Neumann, T., Parreira, J. X., et al. (2008). Efficient top-k querying over social-tagging networks. In S. H. Myaeng, D. W. Oard, F. Sebastiani, T. S. Chua, & M. K. Leong (Eds.), SIGIR (pp. 523–530). ACM.
Schmid, H. (1994). Probabilistic part-of-speech tagging using decision trees. In International conference on new methods in language processing (pp. 44–49).
Schütze, H., & Pedersen, J. O. (1995). Information retrieval based on word senses. In Symposium on document analysis and information retrieval (pp. 161–175).
Sigman, M., & Cecchi, G. A. (2002). Global organization of the WordNet lexicon. Proceedings of the National Academy of Sciences, 99(3), 1742–1747.
Sigurd, B., Eeg-Olofsson, M., & van de Weijer, J. (2004). Word length, sentence length and frequency: Zipf’s law revisited. Studia Linguistica, 58(1), 37–52.
Sinclair, J. (1991). Corpus, concordance, collocation. Oxford: Oxford University Press.
Singhal, A. (2001). Modern information retrieval: A brief overview. IEEE Data Engineering Bulletin, 24(4), 35–43.
Singhal, A., Buckley, C., & Mitra, M. (1996). Pivoted document length normalization. In H. P. Frei, D. Harman, P. Schäuble, & R. Wilkinson (Eds.), SIGIR (pp. 21–29). ACM.
Sinha, S., Pan, R. K., Yadav, N., Vahia, M., & Mahadevan, I. (2009). Network analysis reveals structure indicative of syntax in the corpus of undeciphered Indus civilization inscriptions. In Proceedings of the 2009 workshop on graph-based methods for natural language processing (TextGraphs-4) (pp. 5–13). Suntec, Singapore: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W09/W09-3202
Somasundaran, S., Namata, G., Getoor, L., & Wiebe, J. (2009). Opinion graphs for polarity and discourse classification. In Proceedings of the 2009 workshop on graph-based methods for natural language processing (TextGraphs-4) (pp. 66–74). Suntec, Singapore: Association for Computational Linguistics. url: http://www.aclweb.org/anthology/W/W09/W09-321.
Sparck Jones, K. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28, 11–21.
Sporns, O., Tononi, G., & Edelman, G. M. (2002). Theoretical neuroanatomy and the connectivity of the cerebral cortex. Behavioural Brain Research, 135, 69–74.
Steyvers, M., & Tenenbaum, J. (2005). The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth. Cognitive Science, 29(1), 41–78.
Takamura, H., Inui, T., & Okumura, M. (2007). Extracting semantic orientations of phrases from dictionary. In C. L. Sidner, T. Schultz, M. Stone, & C. Zhai (Eds.), HLT-NAACL (pp. 292–299). The Association for Computational Linguistics.
Turtle, H. R., & Croft, W. B. (1991). Evaluation of an inference network-based retrieval model. ACM Transactions on Information Systems, 9(3), 187–222.
Véronis, J., & Ide, N. (1990). Word sense disambiguation with very large neural networks extracted from machine readable dictionaries. In COLING (pp. 389–394).
Vitevitch, M. S., & Rodríguez, E. (2005). Neighborhood density effects in spoken word recognition in Spanish. Journal of Multilingual Communication Disorders, 3, 64–73.
Wagner, A., & Fell, D. A. (2001). The small world inside large metabolic networks. Proceedings of the Royal Society of London Series B Biological Sciences, 268, 1803–1810.
Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (structural analysis in the social sciences). New York: Cambridge University Press.
Watts, D., & Strogatz, S. H. (1998). Collective dynamics of ‘small-world’ networks. Nature, 393, 440–442.
Widdows, D., & Dorow, B. (2002). A graph model for unsupervised lexical acquisition. In COLING.
Wilkinson, R., & Hingston, P. (1991). Using the cosine measure in a neural network for document. In Bookstein et al. (1991), pp. 202–210.
Zhang, B., Li, H., Liu, Y., Ji, L., Xi, W., Fan, W., et al. (2005). Improving web search results using affinity graph. In Baeza-Yates et al. (2005), pp. 504–511.
Zhou, D., Schölkopf, B., & Hofmann, T. (2004). Semi supervised learning on directed graphs. In NIPS.
Metadata
Title
Graph-based term weighting for information retrieval
Authors
Roi Blanco
Christina Lioma
Publication date
1 February 2012
Publisher
Springer Netherlands
Published in
Discover Computing / Issue 1/2012
Print ISSN: 2948-2984
Electronic ISSN: 2948-2992
DOI
https://doi.org/10.1007/s10791-011-9172-x