skip to main content
10.1145/1868914.1868949acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnordichiConference Proceedingsconference-collections
research-article

Evaluating multimodal systems: a comparison of established questionnaires and interaction parameters

Published:16 October 2010Publication History

ABSTRACT

This paper describes the analysis of established and new questionnaires concerning their applicability for the assessment of quality aspects of multimodal systems. To this purpose, an experiment with 27 participants interacting with a a smart-home system via a voice interface, a smartphone-based interface and a multimodal interface, was conducted. Interaction parameters were assessed and related to constructs measured with these questionnaires. The results indicate that some of the questionnaires are suitable for evaluating multimodal interfaces. On the basis of correlations with interaction parameters subscales of these questionnaires can be mapped to quality aspects, such as effectiveness and efficiency. Recommendations are given how to meet two important evaluation requirements, namely which questionnaire to use for comparing two or more systems or system versions and how to identify factors or components in a system that have to be improved. This is another step forward to establish evaluation methods for multimodal systems.

References

  1. N. Beringer, U. Kartal, K. Louka, F. Schiel, and U. Türk. PROMISE: -- A procedure for multimodal interactive system evaluation. In Proc. of the LREC Workshop on Multimodal Resources and Multimodal Systems Evaluation, Las Palmas, pages 77--80, 2002.Google ScholarGoogle Scholar
  2. N. Bevan. What is the difference between the purpose of usability and user experience evaluation methods? In Proc. of the UXEM Workshop at INTERACT, 2009.Google ScholarGoogle Scholar
  3. J. Brooke. SUS: A quick and dirty usability scale. In P. W. Jordan, B. Weerdmeester, A. Thomas, and I. L. McClelland, editors, Usability evaluation in industry, pages 189--194. Taylor and Francis, London, 1996.Google ScholarGoogle Scholar
  4. J. P. Chin, V. A. Diehl, and K. Norman. Development of an instrument measuring user satisfaction of the human-computer interface. In Proc. of ACM CHI, pages 213--218, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Coutaz, L. Nigay, D. Salber, A. Blandford, J. May, and R. Young. Four easy pieces for assessing the usability of multimodal interaction: The CARE properties. In Human-Computer Interaction, Interact, pages 115--120. Chapman & Hall, London, 1995.Google ScholarGoogle Scholar
  6. B. Dumas, D. Lalanne, and S. Oviatt. Multimodal interfaces: A survey of principles, models and frameworks. pages 3--26, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. E. Frøkjær, M. Hertzum, and K. Hornbæk. Measuring usability: are effectiveness, efficiency, and satisfaction really correlated? In Proc. of CHI, pages 345--352. ACM, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Hassenzahl, M. Burmester, and F. Koller. AttrakDiff: Ein Fragebogen zur Messung wahrgenommener hedonischer und pragmatischer Qualität {a questionnaire for measuring perceived hedonic and pracmatic quality}. Mensch & Computer 2003. Interaktion in Bewegung, pages 187--196, 2003.Google ScholarGoogle Scholar
  9. M. Hassenzahl, R. Kekez, and M. Burmester. The importance of a software's pragmatic quality depends on usage modes. In H. Luczak, A. E. Cakir, and G. Cakir, editors, Proc. of the 6th international conference on Work With Display Units WWDU 2002, pages 275--276, Berlin, 2002. ERGONOMIC Institut für Arbeits- und Sozialforschung.Google ScholarGoogle Scholar
  10. M. Hassenzahl and A. Monk. The inference of perceived usability from beauty. Human-Computer Interaction (accepted), 2010.Google ScholarGoogle Scholar
  11. K. S. Hone and R. Graham. Towards a tool for the Subjective Assessment of Speech System Interfaces (SASSI). Nat. Lang. Eng., 6(3--4):287--303, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Hornbæk and E. L.-C. Law. Meta-analysis of correlations among usability measures. In Proc. of CHI, pages 617--626. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Kirakowski and M. Corbett. SUMI: The Software Usability Measurement Inventory. British Journal of Educational Technology, (3):210--212, 1993.Google ScholarGoogle Scholar
  14. C. Kühnel, B. Weiss, and S. Möller. Parameters describing multimodal interaction --- definitions and three usage scenarios. In accepted for Interspeech 2010, 2010.Google ScholarGoogle Scholar
  15. E. L.-C. Law, V. Roto, M. Hassenzahl, A. P. Vermeeren, and J. Kort. Understanding, scoping and defining user experience: a survey approach. In Proc. of CHI, pages 719--728, New York, NY, USA, 2009. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Lewis. Ibm computer usability satisfaction questionnaires: Psychometric evaluation and instructions for use. International Journal of Human-Computer Interaction, 7(1):57--78, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. López-Cózar Delgado and M. Araki. Spoken, multilingual and multimodal dialogue systems: development and assessment. John Wiley & Sons, Chichester, 2005.Google ScholarGoogle Scholar
  18. A. M. Lund. Measuring usability with the USE questionnaire. Usability Interface, 8(2), 2001.Google ScholarGoogle Scholar
  19. S. Möller, K.-P. Engelbrecht, C. Kühnel, I. Wechsung, and B. Weiss. Evaluation of multimodal interfaces for ambient intelligence. In Human-Centric Interfaces for Ambient Intelligence, pages 347--370. Elsevier, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  20. S. Möller, K.-P. Engelbrecht, and R. Schleicher. Predicting the quality and usability of spoken dialogue services. Speech Commun., 50(8--9):730--744, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Möller, J. Krebber, A. Raake, P. Smeele, M. Rajman, M. Melichar, V. Pallotta, G. Tsakou, B. Kladis, A. Vovos, J. Hoonhout, D. Schuchardt, N. Fakotakis, T. Ganchev, and I. Potamitis. INSPIRE: Evaluation of a Smart-Home System for Infotainment Management and Device Control. In Proc. 4th Int. Conf. on Language Resources and Evaluation (LREC), volume 5, pages 1603--1606, May 2004.Google ScholarGoogle Scholar
  22. A. Naumann and I. Wechsung. Developing usability methods for multimodal systems: The use of subjective and objective measures. In E. L.-C. Law, N. Bevan, G. Christou, M. Springett, and M. Larusdottir, editors, Proc. of the International Workshop on Meaningful Measures: Valid Useful User Experience Measurement (VUUM), pages 8--12. Institute of Research in Informatics of Toulouse (IRIT), 2008.Google ScholarGoogle Scholar
  23. A. Naumann, I. Wechsung, and J. Hurtienne. Multimodality, inclusive design, and intuitive use. In Proc. of the 23rd British Computer Society Human Computer Interaction Workshop and Conference (HCI 2009), 2009.Google ScholarGoogle Scholar
  24. J. Nielsen and J. Levy. Measuring usability: Preference vs. performance. Communications of the ACM, 37(4):66--75, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Nielsen, T. Moeslund, M. Störring, and E. Granum. A procedure for developing intuitive and ergonomic gesture interfaces for HCI. In Proc. of Gesture-Based Communication in Human-Computer Interaction, 5th International Gesture Workshop, pages 409--420. Springer, 2003.Google ScholarGoogle Scholar
  26. L. Nigay and J. Coutaz. A design space for multimodal systems: concurrent processing and data fusion. In Proc. of the INTERACT and CHI, pages 172--178, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. S. Oviatt. Multimodal interfaces. pages 286--304, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. T. Paek. Empirical methods for evaluating dialog systems. In Proc. of the Second SIGdial Workshop on Discourse and Dialogue, pages 1--9, Morristown, NJ, USA, 2001. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. L. M. Reeves, J. Lai, J. A. Larson, S. Oviatt, T. S. Balaji, S. Buisine, P. Collings, P. Cohen, B. Kraal, J.-C. Martin, M. McTear, T. Raman, K. M. Stanney, H. Su, and Q. Y. Wang. Guidelines for multimodal user interface design. Communications of ACM, 47:57--59, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. J. Sauro and E. Kindlund. A method to standardize usability metrics into a single score. In Proc. of CHI, pages 401--409. ACM Press, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. N. Tractinsky. Aesthetics and apparent usability: empirically assessing cultural and methodological issues. In CHI '97: Proc. of the SIGCHI conference on Human factors in computing systems, pages 115--122, New York, NY, USA, 1997. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. T. Tullis and J. Stetson. A comparison of questionnaires for assessing website usability. In Proc. of Usability Professionals Association (UPA).Google ScholarGoogle Scholar
  33. M. Turunen, J. Hakulinen, A. Melto, T. Heimonen, T. Laivo, and J. Hella. Suxes -- user experience evaluation method for spoken and multimodal interaction. In Proc. of Interspeech, pages 2567--2570, 2009.Google ScholarGoogle Scholar
  34. M. A. Walker, D. J. Litman, C. A. Kamm, and A. Abella. PARADISE: a framework for evaluating spoken dialogue agents. In Proc. of the ACL/EACL 35th Meeting of the Assoc. for Computational Linguistics, Madrid, pages 271--280, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. I. Wechsung and A. Naumann. Established usability evaluation methods for multimodal systems: A comparison of standardized usability questionnaires. In Proc. of PIT. Heidelberg: Springer, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. J. O. Wobbrock, M. R. Morris, and A. D. Wilson. User-Defined Gestures for Surface Computing. In Proc. of CHI, pages 1083--1092. ACM Press, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Evaluating multimodal systems: a comparison of established questionnaires and interaction parameters

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      NordiCHI '10: Proceedings of the 6th Nordic Conference on Human-Computer Interaction: Extending Boundaries
      October 2010
      889 pages
      ISBN:9781605589343
      DOI:10.1145/1868914

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 October 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate379of1,572submissions,24%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader