Skip to main content
Top
Published in: Discover Computing 5/2016

01-10-2016

Towards a probabilistic model for supporting collaborative information access

Authors: Thilo Böhm, Claus-Peter Klas, Matthias Hemmje

Published in: Discover Computing | Issue 5/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In information retrieval research, models and systems traditionally assume that a single person is querying and reviewing the results. However, several empirical studies of professional practice identified collaboration during IR as everyday work patterns in order to solve a shared information need and to benefit from the diverse expertise and experience of the team members. Moreover, most IR systems that are employed in professional work routines are designed for individual use and prototype collaborative systems are too limited to support use in todays work practice. To bridge this gap, this papers develops and formalizes a decision theoretic approach towards supporting a team of people that explicitly set out together to resolve a shared information need. We develop a formal cost model for collaborative IR that considers the trade-off between estimated relevance of a document as well as estimated document redundancy. From this cost model, we use a decision theoretic approach to derive the notion of activity suggestions, that is, a formal optimum criterion that describes optimum collaboration strategies in IR as the solution of an integer linear program. Those collaboration strategies are suggested to team members with the aim to facilitate the collaborative performance of information retrieval tasks. We demonstrate the application of our model by means of search result division in two collaborative search tasks. In the conducted experiments, we study the effects of different domain knowledge and resulting relevance assessments of team members in four different conditions. The gathered results indicate that our approach can improve the retrieval effectiveness of teams in recall-oriented tasks.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
This assumes that the costs for the whole team, arising from the collaborative search task performance, is the sum of interaction costs of the single team members.
 
Literature
go back to reference Arampatzis, A., & Kamps, J. (2008). A study of query length. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 811–812). ACM. Arampatzis, A., & Kamps, J. (2008). A study of query length. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 811–812). ACM.
go back to reference Attfield, S., Blandford, A., & Makri, S. (2010). Social and interactional practices for disseminating current awareness information in an organisational setting. Information Processing & Management, 46(6), 632–645.CrossRef Attfield, S., Blandford, A., & Makri, S. (2010). Social and interactional practices for disseminating current awareness information in an organisational setting. Information Processing & Management, 46(6), 632–645.CrossRef
go back to reference Azzopardi, L. (2009). Query side evaluation: An empirical analysis of effectiveness and effort. In Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval (pp. 556–563). ACM. Azzopardi, L. (2009). Query side evaluation: An empirical analysis of effectiveness and effort. In Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval (pp. 556–563). ACM.
go back to reference Azzopardi, L., de Rijke, M., & Balog, K. (2007) Building simulated queries for known-item topics. In Proceedings of the 30st annual international ACM SIGIR conference on research and development in information retrieval (pp. 455–462). Azzopardi, L., de Rijke, M., & Balog, K. (2007) Building simulated queries for known-item topics. In Proceedings of the 30st annual international ACM SIGIR conference on research and development in information retrieval (pp. 455–462).
go back to reference Baeza-Yates, R., & Pino, J. A. (1997). A first step to formally evaluate collaborative work. In Proceedings of the international ACM SIGGROUP conference on Supporting group work: the integration challenge (pp. 56–60). ACM. Baeza-Yates, R., & Pino, J. A. (1997). A first step to formally evaluate collaborative work. In Proceedings of the international ACM SIGGROUP conference on Supporting group work: the integration challenge (pp. 56–60). ACM.
go back to reference Bailey, P., Craswell, N., Soboroff, I., Thomas, P., de Vries, A. P., & Yilmaz, E. (2008). Relevance assessment: Are judges exchangeable and does it matter. In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp. 667–674). ACM. Bailey, P., Craswell, N., Soboroff, I., Thomas, P., de Vries, A. P., & Yilmaz, E. (2008). Relevance assessment: Are judges exchangeable and does it matter. In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp. 667–674). ACM.
go back to reference Becks, D., Eibl, M., Jürgens, J., Kürsten, J., Wilhelm, T., & Womser-Hacker, C. (2011). Does patent ir profit from linguistics or maximum query length? In CLEF (notebook papers/labs/workshop). Becks, D., Eibl, M., Jürgens, J., Kürsten, J., Wilhelm, T., & Womser-Hacker, C. (2011). Does patent ir profit from linguistics or maximum query length? In CLEF (notebook papers/labs/workshop).
go back to reference Bennett, P. N., White, R. W., Chu, W., Dumais, S. T., Bailey, P., Borisyuk, F., et al. (2012). Modeling the impact of short-and long-term behavior on search personalization. In Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval (pp. 185–194). ACM. Bennett, P. N., White, R. W., Chu, W., Dumais, S. T., Bailey, P., Borisyuk, F., et al. (2012). Modeling the impact of short-and long-term behavior on search personalization. In Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval (pp. 185–194). ACM.
go back to reference Böhm, T., Klas, C.-P., & Hemmje, M. (2013). Supporting collaborative information seeking and searching in distributed environments. In Proceedings of LWA 2013 conference. Böhm, T., Klas, C.-P., & Hemmje, M. (2013). Supporting collaborative information seeking and searching in distributed environments. In Proceedings of LWA 2013 conference.
go back to reference Cummings, J. N. (2004). Work groups, structural diversity, and knowledge sharing in a global organization. Management Science, 50(3), 352–364.MathSciNetCrossRef Cummings, J. N. (2004). Work groups, structural diversity, and knowledge sharing in a global organization. Management Science, 50(3), 352–364.MathSciNetCrossRef
go back to reference Fidel, R., Bruce, H., Pejtersen, A., Dumais, S., Grudin, J., & Poltrock, S. (2000). Collaborative information retrieval. The New Review of Information Behaviour Research, 1(1), 235–247. Fidel, R., Bruce, H., Pejtersen, A., Dumais, S., Grudin, J., & Poltrock, S. (2000). Collaborative information retrieval. The New Review of Information Behaviour Research, 1(1), 235–247.
go back to reference Foley, C., & Smeaton, A. F. (2010). Division of labour and sharing of knowledge for synchronous collaborative information retrieval. Information Processing & Management, 46(6), 762–772.CrossRef Foley, C., & Smeaton, A. F. (2010). Division of labour and sharing of knowledge for synchronous collaborative information retrieval. Information Processing & Management, 46(6), 762–772.CrossRef
go back to reference Fuhr, N. (1999). A decision-theoretic approach to database selection in networked IR. ACM Transactions on Information Systems (TOIS), 17(3), 229–249.CrossRef Fuhr, N. (1999). A decision-theoretic approach to database selection in networked IR. ACM Transactions on Information Systems (TOIS), 17(3), 229–249.CrossRef
go back to reference Fuhr, N. (2008). A probability ranking principle for interactive information retrieval. Information Retrieval, 11(3), 251–265.CrossRef Fuhr, N. (2008). A probability ranking principle for interactive information retrieval. Information Retrieval, 11(3), 251–265.CrossRef
go back to reference Hansen, P., & Järvelin, K. (2005). Collaborative information retrieval in an information-intensive domain. Information Processing & Management, 41(5), 1101–1119.CrossRef Hansen, P., & Järvelin, K. (2005). Collaborative information retrieval in an information-intensive domain. Information Processing & Management, 41(5), 1101–1119.CrossRef
go back to reference Hersh, W., Buckley, C., Leone, T., & Hickam, D. (1994). Ohsumed: An interactive retrieval evaluation and new large test collection for research. In SIGIR 94 (pp. 192–201). Springer. Hersh, W., Buckley, C., Leone, T., & Hickam, D. (1994). Ohsumed: An interactive retrieval evaluation and new large test collection for research. In SIGIR 94 (pp. 192–201). Springer.
go back to reference Jansen, B. J., Spink, A., & Saracevic, T. (2000). Real life, real users, and real needs: A study and analysis of user queries on the web. Information Processing & Management, 36(2), 207–227.CrossRef Jansen, B. J., Spink, A., & Saracevic, T. (2000). Real life, real users, and real needs: A study and analysis of user queries on the web. Information Processing & Management, 36(2), 207–227.CrossRef
go back to reference Jochim, C., Lioma, C., Schütze, H., Koch, S., & Ertl, T. (2010). Preliminary study into query translation for patent retrieval. In Proceedings of the 3rd international workshop on patent information retrieval (pp. 57–66). ACM. Jochim, C., Lioma, C., Schütze, H., Koch, S., & Ertl, T. (2010). Preliminary study into query translation for patent retrieval. In Proceedings of the 3rd international workshop on patent information retrieval (pp. 57–66). ACM.
go back to reference Joho, H., Azzopardi, L. A., & Vanderbauwhede, W. (2010). A survey of patent users: An analysis of tasks, behavior, search functionality and system requirements. In Proceedings of the third symposium on information interaction in context (pp. 13–24). ACM. Joho, H., Azzopardi, L. A., & Vanderbauwhede, W. (2010). A survey of patent users: An analysis of tasks, behavior, search functionality and system requirements. In Proceedings of the third symposium on information interaction in context (pp. 13–24). ACM.
go back to reference Joho, H., Hannah, D., & Jose, J. M. (2009). Revisiting IR techniques for collaborative search strategies. In Advances in information retrieval (pp. 66–77). Springer. Joho, H., Hannah, D., & Jose, J. M. (2009). Revisiting IR techniques for collaborative search strategies. In Advances in information retrieval (pp. 66–77). Springer.
go back to reference Kim, Y., Seo, J., & Croft, W. B. (2011). Automatic boolean query suggestion for professional search. In Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval (pp. 825–834). ACM. Kim, Y., Seo, J., & Croft, W. B. (2011). Automatic boolean query suggestion for professional search. In Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval (pp. 825–834). ACM.
go back to reference Klas, C.-P., Kriewel, S., & Hemmje, M. (2008). An experimental system for adaptive services in information retrieval. In Proceedings of the 2nd international workshop on adaptive information retrieval (AIR 2008). Citeseer. Klas, C.-P., Kriewel, S., & Hemmje, M. (2008). An experimental system for adaptive services in information retrieval. In Proceedings of the 2nd international workshop on adaptive information retrieval (AIR 2008). Citeseer.
go back to reference Landwich, P., Klas, C. P., & Hemmje, M. (2000). Catching the user-logging the information retrieval dialogue. In SIGIR 2009 workshop: understanding the user, Boston, USA. Landwich, P., Klas, C. P., & Hemmje, M. (2000). Catching the user-logging the information retrieval dialogue. In SIGIR 2009 workshop: understanding the user, Boston, USA.
go back to reference Linden, G., Smith, B., & York, J. (2003). Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Computing, 7(1), 76–80.CrossRef Linden, G., Smith, B., & York, J. (2003). Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Computing, 7(1), 76–80.CrossRef
go back to reference Morris, M. R. (2013). Collaborative search revisited. In Proceedings of the 2013 conference on computer supported cooperative work (pp. 1181–1192). ACM. Morris, M. R. (2013). Collaborative search revisited. In Proceedings of the 2013 conference on computer supported cooperative work (pp. 1181–1192). ACM.
go back to reference Morris, M. R., & Horvitz, E. (2007). Searchtogether: An interface for collaborative web search. In Proceedings of the 20th annual ACM symposium on user interface software and technology (pp. 3–12). ACM. Morris, M. R., & Horvitz, E. (2007). Searchtogether: An interface for collaborative web search. In Proceedings of the 20th annual ACM symposium on user interface software and technology (pp. 3–12). ACM.
go back to reference North, D. W. (1968). A tutorial introduction to decision theory. IEEE Transactions on Systems Science and Cybernetics, 4(3), 200–210.CrossRef North, D. W. (1968). A tutorial introduction to decision theory. IEEE Transactions on Systems Science and Cybernetics, 4(3), 200–210.CrossRef
go back to reference Pickens, J., Golovchinsky, G., Shah, C., Qvarfordt, P., & Back, M. (2008). Algorithmic mediation for collaborative exploratory search. In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp. 315–322). ACM. Pickens, J., Golovchinsky, G., Shah, C., Qvarfordt, P., & Back, M. (2008). Algorithmic mediation for collaborative exploratory search. In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp. 315–322). ACM.
go back to reference Poltrock, S., Grudin, J., Dumais, S., Fidel, R., Bruce, H., & Pejtersen, A. M. (2003). Information seeking and sharing in design teams. In Proceedings of the 2003 international ACM SIGGROUP conference on supporting group work (pp. 239–247). Poltrock, S., Grudin, J., Dumais, S., Fidel, R., Bruce, H., & Pejtersen, A. M. (2003). Information seeking and sharing in design teams. In Proceedings of the 2003 international ACM SIGGROUP conference on supporting group work (pp. 239–247).
go back to reference Reddy, M., & Spence, P. (2008). Collaborative information seeking: A field study of a multidisciplinary patient care team. Information Processing & Management, 44(1), 242–255.CrossRef Reddy, M., & Spence, P. (2008). Collaborative information seeking: A field study of a multidisciplinary patient care team. Information Processing & Management, 44(1), 242–255.CrossRef
go back to reference Robertson, S. E. (1977). The probability ranking principle in IR. Journal of Documentation, 33(4), 294–304.CrossRef Robertson, S. E. (1977). The probability ranking principle in IR. Journal of Documentation, 33(4), 294–304.CrossRef
go back to reference Robertson, S., Zaragoza, H., & Taylor, M. (2004). Simple bm25 extension to multiple weighted fields. In Proceedings of the thirteenth ACM international conference on information and knowledge management (pp. 42–49). ACM. Robertson, S., Zaragoza, H., & Taylor, M. (2004). Simple bm25 extension to multiple weighted fields. In Proceedings of the thirteenth ACM international conference on information and knowledge management (pp. 42–49). ACM.
go back to reference Roda, G., Tait, J., Piroi, F., & Zenz, V. (2010). Clef-ip 2009: Retrieval experiments in the intellectual property domain. In Multilingual information access evaluation I. Text retrieval experiments (pp. 385–409). Springer. Roda, G., Tait, J., Piroi, F., & Zenz, V. (2010). Clef-ip 2009: Retrieval experiments in the intellectual property domain. In Multilingual information access evaluation I. Text retrieval experiments (pp. 385–409). Springer.
go back to reference Saracevic, T. (2007). Relevance: A review of the literature and a framework for thinking on the notion in information science. Part iii: Behavior and effects of relevance. Journal of the American Society for Information Science and Technology, 58(13), 2126–2144.CrossRef Saracevic, T. (2007). Relevance: A review of the literature and a framework for thinking on the notion in information science. Part iii: Behavior and effects of relevance. Journal of the American Society for Information Science and Technology, 58(13), 2126–2144.CrossRef
go back to reference Shah, C., & Marchionini, G. (2010). Awareness in collaborative information seeking. Journal of the American Society for Information Science and Technology, 61(10), 1970–1986.CrossRef Shah, C., & Marchionini, G. (2010). Awareness in collaborative information seeking. Journal of the American Society for Information Science and Technology, 61(10), 1970–1986.CrossRef
go back to reference Shah, C., Pickens, J., & Golovchinsky, G. (2010). Role-based results redistribution for collaborative information retrieval. Information Processing & Management, 46(6), 773–781.CrossRef Shah, C., Pickens, J., & Golovchinsky, G. (2010). Role-based results redistribution for collaborative information retrieval. Information Processing & Management, 46(6), 773–781.CrossRef
go back to reference Soulier, L., Shah, C., & Tamine, L. (2014). User-driven system-mediated collaborative information retrieval. In Proceedings of the 37th international ACM SIGIR conference on research & development in information retrieval (pp. 485–494). ACM. Soulier, L., Shah, C., & Tamine, L. (2014). User-driven system-mediated collaborative information retrieval. In Proceedings of the 37th international ACM SIGIR conference on research & development in information retrieval (pp. 485–494). ACM.
go back to reference Soulier, L., Tamine, L., & Bahsoun, W. (2013). A collaborative document ranking model for a multi-faceted search. In Information retrieval technology (pp. 109–120). Springer. Soulier, L., Tamine, L., & Bahsoun, W. (2013). A collaborative document ranking model for a multi-faceted search. In Information retrieval technology (pp. 109–120). Springer.
go back to reference Tait, J. I. (2014). An introduction to professional search. In Professional search in the modern world (pp. 1–5). Springer. Tait, J. I. (2014). An introduction to professional search. In Professional search in the modern world (pp. 1–5). Springer.
go back to reference Twidale, M. B., Nichols, D. M., & Paice, C. D. (1997). Browsing is a collaborative process. Information Processing & Management, 33(6), 761–783.CrossRef Twidale, M. B., Nichols, D. M., & Paice, C. D. (1997). Browsing is a collaborative process. Information Processing & Management, 33(6), 761–783.CrossRef
go back to reference Villa, R., & Halvey, M. (2013). Is relevance hard work? Evaluating the effort of making relevant assessments. In Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval (pp. 765–768). ACM. Villa, R., & Halvey, M. (2013). Is relevance hard work? Evaluating the effort of making relevant assessments. In Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval (pp. 765–768). ACM.
go back to reference Wang, J., & Zhu, J. (2009). Portfolio theory of information retrieval. In Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval (pp. 115–122). ACM. Wang, J., & Zhu, J. (2009). Portfolio theory of information retrieval. In Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval (pp. 115–122). ACM.
go back to reference White, R. W. , Dumais, S. T., & Teevan, J. (2009). Characterizing the influence of domain expertise on web search behavior. In Proceedings of the second ACM International conference on web search and data mining (pp. 132–141). ACM. White, R. W. , Dumais, S. T., & Teevan, J. (2009). Characterizing the influence of domain expertise on web search behavior. In Proceedings of the second ACM International conference on web search and data mining (pp. 132–141). ACM.
go back to reference Xue, X., & Croft, W. B. (2009). Automatic query generation for patent search. In Proceedings of the 18th ACM conference on information and knowledge management (pp. 2037–2040). ACM. Xue, X., & Croft, W. B. (2009). Automatic query generation for patent search. In Proceedings of the 18th ACM conference on information and knowledge management (pp. 2037–2040). ACM.
go back to reference Yue, Z., Han, S., & He, D. (2014). Modeling search processes using hidden states in collaborative exploratory web search. In Proceedings of the 17th ACM conference on computer supported cooperative work & social computing (pp. 820–830). ACM. Yue, Z., Han, S., & He, D. (2014). Modeling search processes using hidden states in collaborative exploratory web search. In Proceedings of the 17th ACM conference on computer supported cooperative work & social computing (pp. 820–830). ACM.
go back to reference Zuccon, G., & Azzopardi, L. (2010). Using the quantum probability ranking principle to rank interdependent documents. In Advances in information retrieval (pp. 357–369). Springer. Zuccon, G., & Azzopardi, L. (2010). Using the quantum probability ranking principle to rank interdependent documents. In Advances in information retrieval (pp. 357–369). Springer.
go back to reference Zuccon, G., Azzopardi, L., & van Rijsbergen, C. (2010). Has portfolio theory got any principles? In Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval (pp. 755–756). ACM. Zuccon, G., Azzopardi, L., & van Rijsbergen, C. (2010). Has portfolio theory got any principles? In Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval (pp. 755–756). ACM.
Metadata
Title
Towards a probabilistic model for supporting collaborative information access
Authors
Thilo Böhm
Claus-Peter Klas
Matthias Hemmje
Publication date
01-10-2016
Publisher
Springer Netherlands
Published in
Discover Computing / Issue 5/2016
Print ISSN: 2948-2984
Electronic ISSN: 2948-2992
DOI
https://doi.org/10.1007/s10791-016-9285-3

Premium Partner