Article

Hearsay: enabling audio browsing on hypertext content

Authors:
I. V. Ramakrishnan, Stony Brook University, Stony Brook, NY
Amanda Stent, Stony Brook University, Stony Brook, NY
Guizhen Yang, University at Buffalo, Buffalo, NY

WWW '04: Proceedings of the 13th international conference on World Wide Web, May 2004, Pages 80–89
https://doi.org/10.1145/988672.988684

Published: 17 May 2004

ABSTRACT

In this paper we present HearSay, a system for browsing hypertext Web documents via audio. The HearSay system is based on our novel approach to automatically creating audio browsable content from hypertext Web documents. It combines two key technologies: (1) automatic partitioning of Web documents through tightly coupled structural and semantic analysis, which transforms raw HTML documents into semantic structures so as to facilitate audio browsing; and (2) VoiceXML, an already standardized technology which we adopt to represent voice dialogs automatically created from the XML output of partitioning. This paper describes the software components of HearSay and presents an initial system evaluation.
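As a rough illustration of the second step, turning partitioned content into a voice dialog, the sketch below builds a small VoiceXML 2.0 document from a list of sections. It is not the HearSay implementation: the function name `sections_to_vxml` and the `(label, text)` input shape are assumptions made for this example, since the abstract does not specify the partitioner's XML schema. Only standard VoiceXML elements (`menu`, `choice`, `prompt`, `enumerate`, `form`, `block`, `goto`) are used.

```python
import xml.etree.ElementTree as ET

VXML_NS = "http://www.w3.org/2001/vxml"  # VoiceXML 2.0 namespace


def sections_to_vxml(sections):
    """Turn (label, text) pairs into a VoiceXML menu plus one form per section.

    `sections` is a hypothetical stand-in for the partitioner's output;
    the actual XML schema is not given in the abstract.
    """
    root = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})

    # Top-level menu: the listener picks a section by saying its label.
    menu = ET.SubElement(root, "menu", {"id": "main"})
    prompt = ET.SubElement(menu, "prompt")
    prompt.text = "Choose a section: "
    ET.SubElement(prompt, "enumerate")  # reads out the choices added below

    for i, (label, text) in enumerate(sections):
        form_id = f"section{i}"
        choice = ET.SubElement(menu, "choice", {"next": f"#{form_id}"})
        choice.text = label

        # One form per section: speak its text, then return to the menu.
        form = ET.SubElement(root, "form", {"id": form_id})
        block = ET.SubElement(form, "block")
        ET.SubElement(block, "prompt").text = text
        ET.SubElement(block, "goto", {"next": "#main"})

    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(sections_to_vxml([("Headlines", "Top story text."),
                            ("Sports", "Latest scores.")]))
```

Placing the menu first means a VoiceXML interpreter starts there by default, and each section form jumps back to it, which gives a simple browse-and-return loop over the page's partitions.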

Index Terms

1. Information systems
  1. World Wide Web
    1. Web interfaces
      1. Browsers
2. Software and its engineering
  1. Software organization and properties
    1. Software system structures
      1. Software architectures


Published in
WWW '04: Proceedings of the 13th international conference on World Wide Web
May 2004
754 pages
ISBN: 158113844X
DOI: 10.1145/988672
Conference Chairs: Stuart Feldman (IBM Research), Mike Uretsky (New York University)
Program Chairs: Marc Najork (Microsoft Research), Craig Wills (Worcester Polytechnic Institute)
Copyright © 2004 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
HTML
VoiceXML
World Wide Web
audio browser
semantic analysis
structural analysis
user interface
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
Article Metrics
- Total Citations: 69
- Total Downloads: 1,173
- Downloads (last 12 months): 3
- Downloads (last 6 weeks): 1