skip to main content
10.1145/1065385.1065464acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

On assigning place names to geography related web pages

Authors Info & Claims
Published:07 June 2005Publication History

ABSTRACT

In this paper, we attempt to give spatial semantics to web pages by assigning them place names. The entire assignment task is divided into three sub-problems, namely place name extraction, place name disambiguation and place name assignment. We propose our approaches to address these sub-problems. In particular, we have modified GATE, a well-known named entity extraction software, to perform place name extraction using a US Census gazetteer. A rule-based place name disambiguation method and a place name assignment method capable of assigning place names to web page segments have also been proposed. We have evaluated our proposed disambiguation and assignment methods on a web page collection referenced by the DLESE metadata collection. The results returned by our methods are compared with manually disambiguated place names and place name assignment. It is shown that our proposed place name disambiguation method works well for geo/geo ambiguities. The preliminary results of our place name assignment method indicate promising results given the existence of geo/non-geo ambiguities among place names.

References

  1. E. Amitay, N. Har'El, R. Sivan, and A. Soffer. Web-a-where: Geotagging web content. In SIGIR 2004, Sheffield, South Yorkshire, UK, July 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Chinchor. MUC-7 named entity task definition version 3.5. In Seventh Message Understanding Conference (MUC-7), 1998.]]Google ScholarGoogle Scholar
  3. H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In 40th Anniversary Meeting of the Association for Computational Linguistics, 2002.]]Google ScholarGoogle Scholar
  4. Digital Library for Earth System Education. http://www.dlese.org.]]Google ScholarGoogle Scholar
  5. J. Leidner. Towards a reference corpus for automatic toponym resolution evaluation. In SIGIR 2004, Sheffield, South Yorkshire, UK, July 2004.]]Google ScholarGoogle Scholar
  6. H. Li, R. Srihari, C. Niu, and W. Li. Location normalization for information extraction. In 19th Conference on Computational Linguistics (COLING'02), Taipei, Taiwan, August 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Li, R. K. Srihari, C. Niu, and W. Li. Infoxtract location normalization: a hybrid approach to geographic references in information extraction. In Proc. of HLT-NAACL 2003 Workshop on Analysis of Geographic References, Alberta, Canada, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. E.-P. Lim, D. H.-L. Goh, Z. Liu, W.-K. Ng, C. S.-G. Khoo, and S. E. Higgins. G-Portal: A map-based digital library for distributed geospatial and georeferenced resources. In Proceedings of the Second ACM+IEEE Joint Conference on Digital Libraries (JCDL 2002), Portland, Oregon, USA, July 14-18 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Manov, A. Kiryakov, B. Popov, K. Bontcheva, and D. Maynard. Experiments with geographic knowledge for information extraction. In HLT-NAACL 2003 Workshop on Analysis of Geographic References, Edmonton, Canada, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Morimoto, M. Aono, M. E. Houle, and K. McCurley. Extracting spatial knowledge from the web. In Symposium on Applications and the Internet (SAINT'03), 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. E. Rauch, M. Bukatin, and K. Baker. A confidence-based framework for disambiguating geographic terms. In HLT-NAACL 2003 Workshop on Analysis of Geographic References, pages 50--54, Edmonton, Canada, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Smith and G. Crane. Disambiguating geographic names in a historical digital library. In ECDL, pages 127--136, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. US Census Bureau. http://www.census.gov.]]Google ScholarGoogle Scholar

Index Terms

  1. On assigning place names to geography related web pages

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          JCDL '05: Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
          June 2005
          450 pages
          ISBN:1581138768
          DOI:10.1145/1065385
          • General Chair:
          • Mary Marlino,
          • Program Chairs:
          • Tamara Sumner,
          • Frank Shipman

          Copyright © 2005 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 June 2005

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate415of1,482submissions,28%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader