Skip to main content
Erschienen in: Discover Computing 6/2011

01.12.2011

Online community search using conversational structures

verfasst von: Jangwon Seo, W. Bruce Croft, David A. Smith

Erschienen in: Discover Computing | Ausgabe 6/2011

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Online communities are valuable information sources where knowledge is accumulated by interactions between people. Search services provided by online community sites such as forums are often, however, quite poor. To address this, we investigate retrieval techniques that exploit the hierarchical thread structures in community sites. Since these structures are sometimes not explicit or accurately annotated, we introduce structure discovery techniques that use a variety of features to model relations between posts. We then make use of thread structures in retrieval experiments with two online forums and one email archive. Our results show that using thread structures that have been accurately annotated can lead to significant improvements in retrieval performance compared to strong baselines.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Arguello, J., Elsas, J., Callan, J., & Carbonell, J. (2008). Document representation and query expansion models for blog recommendation. In Proceedings of the second international conference on weblogs and social media (ICWSM 2008). Arguello, J., Elsas, J., Callan, J., & Carbonell, J. (2008). Document representation and query expansion models for blog recommendation. In Proceedings of the second international conference on weblogs and social media (ICWSM 2008).
Zurück zum Zitat Bishop, C. M. (2006). Mixture models and EM. In Pattern recognition and machine learning (pp. 423–459). Berlin: Springer. Bishop, C. M. (2006). Mixture models and EM. In Pattern recognition and machine learning (pp. 423–459). Berlin: Springer.
Zurück zum Zitat Buckley, C., Allan, J., & Salton, G. (1994). Automatic routing and ad-hoc retrieval using SMART. In The second text REtrieval conference (TREC-2) proceedings. Buckley, C., Allan, J., & Salton, G. (1994). Automatic routing and ad-hoc retrieval using SMART. In The second text REtrieval conference (TREC-2) proceedings.
Zurück zum Zitat Carvalho, V. R., & Cohen, W. W. (2005). On the collective classification of email “speech acts”. In SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval (pp. 345–352). Carvalho, V. R., & Cohen, W. W. (2005). On the collective classification of email “speech acts”. In SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval (pp. 345–352).
Zurück zum Zitat Cong, G., Wang, L., Lin, C. Y., Song, Y. I., & Sun, Y. (2008). Finding question-answer pairs from online forums. In SIGIR ’08: Proceedings of the 31th annual international ACM SIGIR conference on research and development in information retrieval (pp. 467–474). Cong, G., Wang, L., Lin, C. Y., Song, Y. I., & Sun, Y. (2008). Finding question-answer pairs from online forums. In SIGIR ’08: Proceedings of the 31th annual international ACM SIGIR conference on research and development in information retrieval (pp. 467–474).
Zurück zum Zitat Croft, W. B., & Lafferty, J. (2003) Language modeling for information retrieval. Dordrecht: Kluwer.MATH Croft, W. B., & Lafferty, J. (2003) Language modeling for information retrieval. Dordrecht: Kluwer.MATH
Zurück zum Zitat Elsas, J. L., & Carbonell, J. G. (2009). It pays to be picky: An evaluation of thread retrieval in online forums. In SIGIR ’09: Proceeding of the 32rd international ACM SIGIR conference on research and development in information retrieval (pp. 714–715). Elsas, J. L., & Carbonell, J. G. (2009). It pays to be picky: An evaluation of thread retrieval in online forums. In SIGIR ’09: Proceeding of the 32rd international ACM SIGIR conference on research and development in information retrieval (pp. 714–715).
Zurück zum Zitat Elsas, J. L., Arguello, J., Callan, J., & Carbonell, J. G. (2008). Retrieval and feedback models for blog feed search. In SIGIR ’08: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp 347–354). Elsas, J. L., Arguello, J., Callan, J., & Carbonell, J. G. (2008). Retrieval and feedback models for blog feed search. In SIGIR ’08: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp 347–354).
Zurück zum Zitat Elsner, M., & Charniak, E. (2008). You talking to me? A corpus and algorithm for conversation disentanglement. In The 46th annual meeting of the association for computational linguistics: Human language technology conference (ACL-08: HLT) (pp 834–842). Elsner, M., & Charniak, E. (2008). You talking to me? A corpus and algorithm for conversation disentanglement. In The 46th annual meeting of the association for computational linguistics: Human language technology conference (ACL-08: HLT) (pp 834–842).
Zurück zum Zitat Erera, S., & Carmel, D. (2008) Conversation detection in email systems. Lecture Notes in Computer Science, 4956, 498–505.CrossRef Erera, S., & Carmel, D. (2008) Conversation detection in email systems. Lecture Notes in Computer Science, 4956, 498–505.CrossRef
Zurück zum Zitat Joachims, T. (2002). Optimizing search engines using clickthrough data. In The eighth ACM SIGKDD conference on knowledge discovery and data mining (KDD ’02) (pp 133–142). Joachims, T. (2002). Optimizing search engines using clickthrough data. In The eighth ACM SIGKDD conference on knowledge discovery and data mining (KDD ’02) (pp 133–142).
Zurück zum Zitat Krovetz, R. (1993) Viewing morphology as an inference process. In SIGIR ’93: Proceedings of the sixteenth annual international ACM SIGIR conference on research and development in information retrieval (pp 191–202). Krovetz, R. (1993) Viewing morphology as an inference process. In SIGIR ’93: Proceedings of the sixteenth annual international ACM SIGIR conference on research and development in information retrieval (pp 191–202).
Zurück zum Zitat Lewis, D. D., & Knowles, K. A. (1997) Threading electronic mail—A preliminary study. Information Process Management, 33(2), 209–217.CrossRef Lewis, D. D., & Knowles, K. A. (1997) Threading electronic mail—A preliminary study. Information Process Management, 33(2), 209–217.CrossRef
Zurück zum Zitat Lin, C., Yang, J. M., Cai, R., Wang, X. J., Wang, W., & Zhang, L. (2009). Modeling semantics and structure of discussion threads. In The 18th international world wide web conference (WWW ’09) (pp. 1103–1104). Lin, C., Yang, J. M., Cai, R., Wang, X. J., Wang, W., & Zhang, L. (2009). Modeling semantics and structure of discussion threads. In The 18th international world wide web conference (WWW ’09) (pp. 1103–1104).
Zurück zum Zitat Liu, X., & Croft, W. B. (2004). Cluster-based retrieval using language models. In SIGIR ’04: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval (pp. 186–193). Liu, X., & Croft, W. B. (2004). Cluster-based retrieval using language models. In SIGIR ’04: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval (pp. 186–193).
Zurück zum Zitat Liu, X., & Croft, W. B. (2008). Evaluating text representations for retrieval of the best group of documents. In Proceedings of 30th European conference on IR research, (ECIR 2008) (pp. 454–462). Liu, X., & Croft, W. B. (2008). Evaluating text representations for retrieval of the best group of documents. In Proceedings of 30th European conference on IR research, (ECIR 2008) (pp. 454–462).
Zurück zum Zitat Ogilvie, P., & Callan, J. (2004). Hierarchical language models for retrieval of XML components. In Initiative for the evaluation of XML retrieval (INEX) 2004. Ogilvie, P., & Callan, J. (2004). Hierarchical language models for retrieval of XML components. In Initiative for the evaluation of XML retrieval (INEX) 2004.
Zurück zum Zitat Petkova, D., & Croft, W. B. (2007). UMass at TREC 2006: Enterprise track. In The fifteenth text retrieval conference (TREC 2006) proceedings. Petkova, D., & Croft, W. B. (2007). UMass at TREC 2006: Enterprise track. In The fifteenth text retrieval conference (TREC 2006) proceedings.
Zurück zum Zitat Porter, M. (1980) An algorithm for suffix stripping. Program, 14(3), 130–137. Porter, M. (1980) An algorithm for suffix stripping. Program, 14(3), 130–137.
Zurück zum Zitat Seo, J., & Croft, W. B. (2008) Blog site search using resource selection. In: CIKM ’08 Proceedings of the seventeenth ACM international conference on Information and knowledge management (pp. 1053–1062). Seo, J., & Croft, W. B. (2008) Blog site search using resource selection. In: CIKM ’08 Proceedings of the seventeenth ACM international conference on Information and knowledge management (pp. 1053–1062).
Zurück zum Zitat Seo, J., & Croft, W. B. (2010). Geometric representations for multiple documents. In SIGIR ’10: Proceeding of the 33rd international ACM SIGIR conference on research and development in information retrieval (pp. 251–258). Seo, J., & Croft, W. B. (2010). Geometric representations for multiple documents. In SIGIR ’10: Proceeding of the 33rd international ACM SIGIR conference on research and development in information retrieval (pp. 251–258).
Zurück zum Zitat Shrestha, L., & McKeown, K. (2004). Detection of question-answer pairs in email conversations. In COLING ’04: The 20th international conference on computational linguistics. Shrestha, L., & McKeown, K. (2004). Detection of question-answer pairs in email conversations. In COLING ’04: The 20th international conference on computational linguistics.
Zurück zum Zitat Smith, M., Cadiz, J. J., & Burkhalter, B. (2000). Conversation trees and threaded chats. In CSCW ’00: Proceedings of the 2000 ACM conference on Computer supported cooperative work (pp. 97–105). Smith, M., Cadiz, J. J., & Burkhalter, B. (2000). Conversation trees and threaded chats. In CSCW ’00: Proceedings of the 2000 ACM conference on Computer supported cooperative work (pp. 97–105).
Zurück zum Zitat Soboroff, I. de Vries, A. P., & Craswell, N. (2007). Overview of the TREC 2006 enterprise track. In Text retrieval conference (TREC) 2006. Soboroff, I. de Vries, A. P., & Craswell, N. (2007). Overview of the TREC 2006 enterprise track. In Text retrieval conference (TREC) 2006.
Zurück zum Zitat Wang, L., & Oard, D. W. (2009). Context-based message expansion for disentanglement of interleaved text conversations. In Proceedings of human language technologies: The 2009 annual conference of the North American chapter of the association for computational linguistics (pp. 200–208). Wang, L., & Oard, D. W. (2009). Context-based message expansion for disentanglement of interleaved text conversations. In Proceedings of human language technologies: The 2009 annual conference of the North American chapter of the association for computational linguistics (pp. 200–208).
Zurück zum Zitat Wang, Y. C., Joshi, M., Cohen, W. W., & Rose, C. (2008). Recovering implicit thread structure in newsgroup style conversations. In Proceedings of the second international conference on weblogs and social media (ICWSM 2008). Wang, Y. C., Joshi, M., Cohen, W. W., & Rose, C. (2008). Recovering implicit thread structure in newsgroup style conversations. In Proceedings of the second international conference on weblogs and social media (ICWSM 2008).
Zurück zum Zitat Xi, W., Lind, J., & Brill, E. (2004). Learning effective ranking functions for newsgroup search. In SIGIR ’04: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval (pp. 394–401). Xi, W., Lind, J., & Brill, E. (2004). Learning effective ranking functions for newsgroup search. In SIGIR ’04: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval (pp. 394–401).
Zurück zum Zitat Yeh, J. Y., & Harnly, A. (2006). Email thread reassembly using similarity matching. In CEAS 2006—Third conference on email and anti-spam. Yeh, J. Y., & Harnly, A. (2006). Email thread reassembly using similarity matching. In CEAS 2006—Third conference on email and anti-spam.
Zurück zum Zitat Zhai, C., & Lafferty, J. (2001). A study of smoothing methods for language models applied to ad hoc information retrieval. In SIGIR ’01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval (pp. 334–342). Zhai, C., & Lafferty, J. (2001). A study of smoothing methods for language models applied to ad hoc information retrieval. In SIGIR ’01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval (pp. 334–342).
Metadaten
Titel
Online community search using conversational structures
verfasst von
Jangwon Seo
W. Bruce Croft
David A. Smith
Publikationsdatum
01.12.2011
Verlag
Springer Netherlands
Erschienen in
Discover Computing / Ausgabe 6/2011
Print ISSN: 2948-2984
Elektronische ISSN: 2948-2992
DOI
https://doi.org/10.1007/s10791-011-9166-8