ABSTRACT
In this paper we present HearSay, a system for browsing hypertext Web documents via audio. The HearSay system is based on our novel approach to automatically creating audio browsable content from hypertext Web documents. It combines two key technologies: (1) automatic partitioning of Web documents through tightly coupled structural and semantic analysis, which transforms raw HTML documents into semantic structures so as to facilitate audio browsing; and (2) VoiceXML, an already standardized technology which we adopt to represent voice dialogs automatically created from the XML output of partitioning. This paper describes the software components of HearSay and presents an initial system evaluation.
- http://www.freedomscientific.com/.Google Scholar
- netECHO{tm}. http://www.internetspeech.com.Google Scholar
- Speech Application Language Tags. http://www.saltforum.org/.Google Scholar
- VoiceXML. http://www.voicexml.org/.Google Scholar
- Wireless Markup Language Specification. http://www.wapforum.org/what/technical.htm.Google Scholar
- WordNet.http://www.cogsci.princeton.edu/wn/.Google Scholar
- C. Asakawa and T. Itoh. User interface of a home page reader. In ACM International Conference on Assistive Technologies (ASSETS), 1998. Google ScholarDigital Library
- C. Asakawa and C. Laws. Home Page Reader: IBM's talking Web browser. Technical report, IBM, 1998.Google Scholar
- A. Blum and T. M. Mitchell. Combining labeled and unlabeled data with co-training. In Computational Learning Theory (COLT), 1998. Google ScholarDigital Library
- O. Buyukkoten, H. Garcia-Molina, and A. Paepcke. Focussed Web searching with PDAs. In International World Wide Web Conference (WWW), 2000. Google ScholarDigital Library
- C. Y. Chung, M. Gertz, and N. Sundaresan. Reverse engineering for Web data: From visual to semantic structures. In International Conference on Data Engineering (ICDE), 2002. Google ScholarDigital Library
- S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, A. Jhingran, T. Kanungo, S. Rajagopalan, A. Tomkins, J. Tomlin, and J. Yien. SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation. In International World Wide Web Conference (WWW), 2003. Google ScholarDigital Library
- C. Earl and J. Leventhal. A survey of Windows screen reader users: Recent improvements in accessibility. Journal of Visual Impairment and Blindness, 93(3), 1999.Google ScholarCross Ref
- D. Embley and L. Xu. Record location and reconfiguration in unstructured multiple-record Web documents. In ACM International Workshop on the Web and Databases (WebDB), 2000. Google ScholarDigital Library
- D. W. Embley, D. M. Campbell, R. D. Smith, and S. W. Liddle. Ontology-based extraction and structuring of information from data-rich unstructured documents. In International Conference on Information and Knowledge Management (CIKM), 1998. Google ScholarDigital Library
- D. W. Embley, Y. Jiang, and Y.-K. Ng. Record-boundary discovery in Web documents. In ACM International Conference on Management of Data (SIGMOD), 1999. Google ScholarDigital Library
- J. Franke, G. Nakhaeizadeh, and I. Renz, editors. Text Mining: Theoretical Aspects and Applications. Springer-Verlag, 2003. Google ScholarDigital Library
- J. Goldstein, V. O. Mittal, J. G. Carbonell, and J. P. Callan. Creating and evaluating multi-document sentence extract summaries. In International Conference on Information and Knowledge Management (CIKM), 2000. Google ScholarDigital Library
- J. Gunderson and R. Mendelson. Usability of World Wide Web browsers by persons with visual impairments. In RESNA Annual Conference, 1997.Google Scholar
- D. Gusfield. Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, 1997. Google ScholarDigital Library
- S. Handschuh and S. Staab. Authoring and annotation of Web pages in CREAM. In International World Wide Web Conference (WWW), 2002. Google ScholarDigital Library
- S. Handschuh, S. Staab, and R. Volz. On deep annotation. In International World Wide Web Conference (WWW), 2003. Google ScholarDigital Library
- J. Heflin, J. A. Hendler, and S. Luke. SHOE: A blueprint for the semantic web. In D. Fensel, J. A. Hendler, H. Lieberman, and W. Wahlster, editors, Spinning the Semantic Web. MIT Press, 2003.Google Scholar
- A. Huang and N. Sundaresan. A semantic transcoding system to adapt web services for users with disabilities. In ACM International Conference on Assistive Technologies (ASSETS), 2000. Google ScholarDigital Library
- IBM. IBM special needs systems. http://www.ibm.com/sns, 1998.Google Scholar
- H. Kochocki, S. Townsend, N. Mitchell, and A. Lloyd. W3C launches internation Web accessibility initiative. Technical report, W3C, 1997.Google Scholar
- H. Lieberman. Letizia: An agent that assists Web browsing. In International Joint Conference on Artificial Intelligence (IJCAI), 1995. Google ScholarDigital Library
- C. Lin and E. Hovy. From single to multi-document summarization: A prototype system and its evaluation. In Meeting of the Association for Computational Linguistics (ACL), 2002. Google ScholarDigital Library
- Microsoft Corporation. Microsoft accessibility technology for everyone. http://www.microsoft.com/enable/, 1998.Google Scholar
- T. Oogane and C. Asakawa. An interactive method for accessing tables in HTML. In ACM International Conference on Assistive Technologies (ASSETS), 1998. Google ScholarDigital Library
- C. Schmandt. Audio Hallway: A virtual acoustic environment for browsing. In ACM Symposium on User Interface Software and Technology (UIST), 1998. Google ScholarDigital Library
- SUN Microsystems. Accessibility support for the Java platform. 1998.Google Scholar
- H. Takagi, C. Asakawa, K. Fukuda, and J. Maeda. Side-wide annotation: Reconstructing existing pages to be accessible In ACM International Conference on Assistive Technologies (ASSETS), 2002. Google ScholarDigital Library
- M. A. Walker, R. Passonneau, and J. E. Boland. Quantitative and qualitative evaluation of DARPA Communicator spoken dialogue systems. In Meeting of the Association of Computational Lingustics (ACL), 2001. Google ScholarDigital Library
- G. Yang, S. Mukherjee, and I. V. Ramakrishnan. On precision and recall of multi-attribute data extraction from semistructured sources. In IEEE International Conference on Data Mining (ICDM), 2003. Google ScholarDigital Library
- Y. Yang and H. Zhang. HTML page analysis based on visual cues. In International Conference on Document Analysis and Recognition (ICDAR), 2001. Google ScholarDigital Library
- S. Yu, D. Cai, J.-R. Wen, and W.-Y. Ma. Improving pseudo-relevance feedback in Web information retrieval using Web page segnmentation. In International World Wide Web Conference (WWW), 2003. Google ScholarDigital Library
- M. Zajicek, C. Powell, and C. Reeves. Web search and orientation with BrookesTalk. In Technology and Persons with Disabilities Conference, 1999.Google Scholar
Index Terms
- Hearsay: enabling audio browsing on hypertext content
Recommendations
The HearSay non-visual web browser
W4A '07: Proceedings of the 2007 international cross-disciplinary conference on Web accessibility (W4A)This paper describes HearSay, a non-visual Web browser, featuring context-directed browsing, a unique and innovative Web accessibility feature, and an extensible VoiceXML dialog interface. The browser provides most of the standard browsing ...
Loosely-coupled approach towards multi-modal browsing
Contemplating the concept of universal-access multi-modal browsing comes as one of the emerging "killer" technologies that promises broader and more flexible access to information, faster task completion, and advanced user experience. Inheriting the ...
A Declarative Language for Querying and Restructuring the Web
RIDE '96: Proceedings of the 6th International Workshop on Research Issues in Data Engineering (RIDE '96) Interoperability of Nontraditional Database SystemsWorld Wide Web is a hypertext based, distributed information system that provides access to vast amounts of information in the internet. A fundamental problem with the Web is the difficulty of retrieving specific information of interest to the user, ...
Comments