skip to main content
10.1145/988672.988684acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

Hearsay: enabling audio browsing on hypertext content

Published:17 May 2004Publication History

ABSTRACT

In this paper we present HearSay, a system for browsing hypertext Web documents via audio. The HearSay system is based on our novel approach to automatically creating audio browsable content from hypertext Web documents. It combines two key technologies: (1) automatic partitioning of Web documents through tightly coupled structural and semantic analysis, which transforms raw HTML documents into semantic structures so as to facilitate audio browsing; and (2) VoiceXML, an already standardized technology which we adopt to represent voice dialogs automatically created from the XML output of partitioning. This paper describes the software components of HearSay and presents an initial system evaluation.

References

  1. http://www.freedomscientific.com/.Google ScholarGoogle Scholar
  2. netECHO{tm}. http://www.internetspeech.com.Google ScholarGoogle Scholar
  3. Speech Application Language Tags. http://www.saltforum.org/.Google ScholarGoogle Scholar
  4. VoiceXML. http://www.voicexml.org/.Google ScholarGoogle Scholar
  5. Wireless Markup Language Specification. http://www.wapforum.org/what/technical.htm.Google ScholarGoogle Scholar
  6. WordNet.http://www.cogsci.princeton.edu/wn/.Google ScholarGoogle Scholar
  7. C. Asakawa and T. Itoh. User interface of a home page reader. In ACM International Conference on Assistive Technologies (ASSETS), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Asakawa and C. Laws. Home Page Reader: IBM's talking Web browser. Technical report, IBM, 1998.Google ScholarGoogle Scholar
  9. A. Blum and T. M. Mitchell. Combining labeled and unlabeled data with co-training. In Computational Learning Theory (COLT), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. O. Buyukkoten, H. Garcia-Molina, and A. Paepcke. Focussed Web searching with PDAs. In International World Wide Web Conference (WWW), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Y. Chung, M. Gertz, and N. Sundaresan. Reverse engineering for Web data: From visual to semantic structures. In International Conference on Data Engineering (ICDE), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, A. Jhingran, T. Kanungo, S. Rajagopalan, A. Tomkins, J. Tomlin, and J. Yien. SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation. In International World Wide Web Conference (WWW), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C. Earl and J. Leventhal. A survey of Windows screen reader users: Recent improvements in accessibility. Journal of Visual Impairment and Blindness, 93(3), 1999.Google ScholarGoogle ScholarCross RefCross Ref
  14. D. Embley and L. Xu. Record location and reconfiguration in unstructured multiple-record Web documents. In ACM International Workshop on the Web and Databases (WebDB), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. W. Embley, D. M. Campbell, R. D. Smith, and S. W. Liddle. Ontology-based extraction and structuring of information from data-rich unstructured documents. In International Conference on Information and Knowledge Management (CIKM), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. W. Embley, Y. Jiang, and Y.-K. Ng. Record-boundary discovery in Web documents. In ACM International Conference on Management of Data (SIGMOD), 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Franke, G. Nakhaeizadeh, and I. Renz, editors. Text Mining: Theoretical Aspects and Applications. Springer-Verlag, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Goldstein, V. O. Mittal, J. G. Carbonell, and J. P. Callan. Creating and evaluating multi-document sentence extract summaries. In International Conference on Information and Knowledge Management (CIKM), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Gunderson and R. Mendelson. Usability of World Wide Web browsers by persons with visual impairments. In RESNA Annual Conference, 1997.Google ScholarGoogle Scholar
  20. D. Gusfield. Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Handschuh and S. Staab. Authoring and annotation of Web pages in CREAM. In International World Wide Web Conference (WWW), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. S. Handschuh, S. Staab, and R. Volz. On deep annotation. In International World Wide Web Conference (WWW), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Heflin, J. A. Hendler, and S. Luke. SHOE: A blueprint for the semantic web. In D. Fensel, J. A. Hendler, H. Lieberman, and W. Wahlster, editors, Spinning the Semantic Web. MIT Press, 2003.Google ScholarGoogle Scholar
  24. A. Huang and N. Sundaresan. A semantic transcoding system to adapt web services for users with disabilities. In ACM International Conference on Assistive Technologies (ASSETS), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. IBM. IBM special needs systems. http://www.ibm.com/sns, 1998.Google ScholarGoogle Scholar
  26. H. Kochocki, S. Townsend, N. Mitchell, and A. Lloyd. W3C launches internation Web accessibility initiative. Technical report, W3C, 1997.Google ScholarGoogle Scholar
  27. H. Lieberman. Letizia: An agent that assists Web browsing. In International Joint Conference on Artificial Intelligence (IJCAI), 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. C. Lin and E. Hovy. From single to multi-document summarization: A prototype system and its evaluation. In Meeting of the Association for Computational Linguistics (ACL), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Microsoft Corporation. Microsoft accessibility technology for everyone. http://www.microsoft.com/enable/, 1998.Google ScholarGoogle Scholar
  30. T. Oogane and C. Asakawa. An interactive method for accessing tables in HTML. In ACM International Conference on Assistive Technologies (ASSETS), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. C. Schmandt. Audio Hallway: A virtual acoustic environment for browsing. In ACM Symposium on User Interface Software and Technology (UIST), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. SUN Microsystems. Accessibility support for the Java platform. 1998.Google ScholarGoogle Scholar
  33. H. Takagi, C. Asakawa, K. Fukuda, and J. Maeda. Side-wide annotation: Reconstructing existing pages to be accessible In ACM International Conference on Assistive Technologies (ASSETS), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. M. A. Walker, R. Passonneau, and J. E. Boland. Quantitative and qualitative evaluation of DARPA Communicator spoken dialogue systems. In Meeting of the Association of Computational Lingustics (ACL), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. G. Yang, S. Mukherjee, and I. V. Ramakrishnan. On precision and recall of multi-attribute data extraction from semistructured sources. In IEEE International Conference on Data Mining (ICDM), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Y. Yang and H. Zhang. HTML page analysis based on visual cues. In International Conference on Document Analysis and Recognition (ICDAR), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. S. Yu, D. Cai, J.-R. Wen, and W.-Y. Ma. Improving pseudo-relevance feedback in Web information retrieval using Web page segnmentation. In International World Wide Web Conference (WWW), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. M. Zajicek, C. Powell, and C. Reeves. Web search and orientation with BrookesTalk. In Technology and Persons with Disabilities Conference, 1999.Google ScholarGoogle Scholar

Index Terms

  1. Hearsay: enabling audio browsing on hypertext content

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        WWW '04: Proceedings of the 13th international conference on World Wide Web
        May 2004
        754 pages
        ISBN:158113844X
        DOI:10.1145/988672

        Copyright © 2004 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 May 2004

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate1,899of8,196submissions,23%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader