skip to main content
10.1145/502348.502372acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
Article

Voice as sound: using non-verbal voice input for interactive control

Published:11 November 2001Publication History

ABSTRACT

We describe the use of non-verbal features in voice for direct control of interactive applications. Traditional speech recognition interfaces are based on an indirect, conversational model. First the user gives a direction and then the system performs certain operation. Our goal is to achieve more direct, immediate interaction like using a button or joystick by using lower-level features of voice such as pitch and volume. We are developing several prototype interaction techniques based on this idea, such as "control by continuous voice", "rate-based parameter control by pitch," and "discrete parameter control by tonguing." We have implemented several prototype systems, and they suggest that voice-as-sound techniques can enhance traditional voice recognition approach.

References

  1. 1.Goto M., Itou,K., Akiba,T., Hayamizu,S. Speech Completion: New Speech Interface with On-demand Completion Assistance, Proc. of HCI International 2001, 2001.(in press)Google ScholarGoogle Scholar
  2. 2.Hirose,Y., Ozeki,K., Takagi,K., Effectiveness of prosodic features in syntactic analysis of read Japanese sentences, Proceedings of ICSLP2000, Vol.3, pp.215-218, 2000.Google ScholarGoogle Scholar
  3. 3.Igarashi,T., Hinckley,K. Speed-dependent automatic zooming for browsing large documents, Proceedings of UIST'00, pp.139-148, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.Iwano,K., Hirose,K., Prosodic Word Boundary Detection Using Statistical Modeling of Moraic Fundamental Frequency Contours and Its Use for Continuous Speech Recognition, Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.1, pp.133-136, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.Lieske,C., Bos,J., Emele,M., Gamback,B., Rupp,CJ., Giving prosody a meaning, Eurospeech97 vo13 pp.1431-1434, 1997.Google ScholarGoogle Scholar
  6. 6.Manaris,B., McCauley,R., MacGyvers,V., An Intelligent Interface for Keyboard and Mouse Control--Providing Full Access to PC Functionality via Speech, Proceedings of 14th International Florida AI Research Symposium (FLAIRS-01), 2001, (to appear). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.Tsukahara,W., Ward,N, Responding to Subtle, Fleeting Changes in the User's Internal State. Proceedings of CHI 2001, pp.77-84, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.Westphal,M., Waibel,A. Towards Spontaneous Speech Recognition For On-Board Car Navigation And Information Systems, Proceedings of the Eurospeech 99, 1999.Google ScholarGoogle Scholar

Index Terms

  1. Voice as sound: using non-verbal voice input for interactive control

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        UIST '01: Proceedings of the 14th annual ACM symposium on User interface software and technology
        November 2001
        242 pages
        ISBN:158113438X
        DOI:10.1145/502348
        • Conference Chair:
        • Joe Marks,
        • Program Chair:
        • Elizabeth Mynatt

        Copyright © 2001 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 November 2001

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate842of3,967submissions,21%

        Upcoming Conference

        UIST '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader