Article

Voice as sound: using non-verbal voice input for interactive control

Authors:
Takeo Igarashi

Brown University, Providence, RI

Brown University, Providence, RI
View Profile

,
John F. Hughes

Brown University, Providence, RI

Brown University, Providence, RI
View Profile

UIST '01: Proceedings of the 14th annual ACM symposium on User interface software and technologyNovember 2001Pages 155–156https://doi.org/10.1145/502348.502372

Published:11 November 2001Publication History

UIST '01: Proceedings of the 14th annual ACM symposium on User interface software and technology

Pages 155–156

ABSTRACT

We describe the use of non-verbal features in voice for direct control of interactive applications. Traditional speech recognition interfaces are based on an indirect, conversational model. First the user gives a direction and then the system performs certain operation. Our goal is to achieve more direct, immediate interaction like using a button or joystick by using lower-level features of voice such as pitch and volume. We are developing several prototype interaction techniques based on this idea, such as "control by continuous voice", "rate-based parameter control by pitch," and "discrete parameter control by tonguing." We have implemented several prototype systems, and they suggest that voice-as-sound techniques can enhance traditional voice recognition approach.

References

1.Goto M., Itou,K., Akiba,T., Hayamizu,S. Speech Completion: New Speech Interface with On-demand Completion Assistance, Proc. of HCI International 2001, 2001.(in press)Google Scholar
2.Hirose,Y., Ozeki,K., Takagi,K., Effectiveness of prosodic features in syntactic analysis of read Japanese sentences, Proceedings of ICSLP2000, Vol.3, pp.215-218, 2000.Google Scholar
3.Igarashi,T., Hinckley,K. Speed-dependent automatic zooming for browsing large documents, Proceedings of UIST'00, pp.139-148, 2000. Google ScholarDigital Library
4.Iwano,K., Hirose,K., Prosodic Word Boundary Detection Using Statistical Modeling of Moraic Fundamental Frequency Contours and Its Use for Continuous Speech Recognition, Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.1, pp.133-136, 1999. Google ScholarDigital Library
5.Lieske,C., Bos,J., Emele,M., Gamback,B., Rupp,CJ., Giving prosody a meaning, Eurospeech97 vo13 pp.1431-1434, 1997.Google Scholar
6.Manaris,B., McCauley,R., MacGyvers,V., An Intelligent Interface for Keyboard and Mouse Control--Providing Full Access to PC Functionality via Speech, Proceedings of 14th International Florida AI Research Symposium (FLAIRS-01), 2001, (to appear). Google ScholarDigital Library
7.Tsukahara,W., Ward,N, Responding to Subtle, Fleeting Changes in the User's Internal State. Proceedings of CHI 2001, pp.77-84, 2001. Google ScholarDigital Library
8.Westphal,M., Waibel,A. Towards Spontaneous Speech Recognition For On-Board Car Navigation And Information Systems, Proceedings of the Eurospeech 99, 1999.Google Scholar

Index Terms

Voice as sound: using non-verbal voice input for interactive control
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction paradigms
      1. Natural language interfaces

Recommendations

The effect of voice cuing on releasing Chinese speech from informational masking

In a cocktail-party environment, human listeners are able to use perceptual-level and cognitive-level cues to segregate the attended target speech from other background conversations. At the cognitive level, priming the listener with part of the target ...
Read More
Voice pathology assessment based on automatic speech recognition using Amazigh digits
ICSDE'18: Proceedings of the 2nd International Conference on Smart Digital Environment

In the past few years, research on automatic systems to assess voice disorders has received appreciable attention due to its objectivity and noninvasive nature. The work presented in this paper aims to build an automatic speech recognition system based ...
Read More
Adding voice to whisper using a simple heuristic algorithm inferred from empirical observation
ICCHP'10: Proceedings of the 12th international conference on Computers helping people with special needs: Part I

The aim of the work described in this paper is to allow people that are enforced to use "whispery voice" to be endowed with "voiced voice". A very simple method and algorithm obtained by empirical observation of corresponding speech signals is presented ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
UIST '01: Proceedings of the 14th annual ACM symposium on User interface software and technology
November 2001
242 pages
ISBN:158113438X
DOI:10.1145/502348
Conference Chair:
Joe Marks
Mitsubishi Electric Research Laboratories
,
Program Chair:
Elizabeth Mynatt
Georgia Institute of Technology
Copyright © 2001 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 November 2001
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Interaction technique
Voice
direct manipulation
entertainment
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate842of3,967submissions,21%
Upcoming Conference
UIST '24

Sponsor:

sigchi

sigchi

UIST '24: The 37th Annual ACM Symposium on User Interface Software and Technology

October 13 - 16, 2024

Pittsburgh , PA , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 124
  Total Citations
  View Citations
- 1,942
  Total Downloads
- Downloads (Last 12 months)56
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Voice as sound: using non-verbal voice input for interactive control

UIST '01: Proceedings of the 14th annual ACM symposium on User interface software and technology

ABSTRACT

References

Cited By

Index Terms

Recommendations

The effect of voice cuing on releasing Chinese speech from informational masking

Voice pathology assessment based on automatic speech recognition using Amazigh digits

Adding voice to whisper using a simple heuristic algorithm inferred from empirical observation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Voice as sound: using non-verbal voice input for interactive control

UIST '01: Proceedings of the 14th annual ACM symposium on User interface software and technology

ABSTRACT

References

Cited By

Index Terms

Recommendations

The effect of voice cuing on releasing Chinese speech from informational masking

Voice pathology assessment based on automatic speech recognition using Amazigh digits

Adding voice to whisper using a simple heuristic algorithm inferred from empirical observation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media