Skip to main content

2017 | OriginalPaper | Buchkapitel

Unlocking Textual Content from Historical Maps - Potentials and Applications, Trends, and Outlooks

verfasst von : Yao-Yi Chiang

Erschienen in: Recent Trends in Image Processing and Pattern Recognition

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Digital map processing has been an interest in the image processing and pattern recognition community since the early 80s. With the exponential growth of available map scans in the archives and on the internet, a variety of disciplines in the natural and social sciences grow interests in using historical maps as a primary source of geographical and political information in their studies. Today, many organizations such as the United States Geological Survey, David Rumsey Map Collection, OldMapsOnline.org, and National Library of Scotland, store numerous historical maps in either paper or scanned format. Only a small portion of these historical maps is georeferenced, and even fewer of them have machine-readable content or comprehensive metadata. The lack of a searchable textual content including the spatial and temporal information prevents researchers from efficiently finding relevant maps for their research and using the map content in their studies. These challenges present a tremendous collaboration opportunity for the image processing and pattern recognition community to build advance map processing technologies for transforming the natural and social science studies that use historical maps. This paper presents the potentials of using historical maps in scientific research, describes the current trends and challenges in extracting and recognizing text content from historical maps, and discusses the future outlook.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
USGS NGMDB (2016) [Website]. Retrieved from http://​ngmdb.​usgs.​gov/​ngmdb/​ngmdb_​home.​html.
 
2
USGS topoView (2016) [Website]. Retrieved from http://​ngmdb.​usgs.​gov/​maps/​TopoView/​.
 
3
David Rumsey. (2016). [Website]. Retrieved from http://​www.​davidrumsey.​com/​.
 
4
OldMapsOnline (2016) [Website]. Retrieved from http://​www.​oldmapsonline.​org/​.
 
5
NLS (2016) [Website]. Retrieved from http://​maps.​nls.​uk/​.
 
6
USGS topoView (2016) [Website]. Retrieved from http://​ngmdb.​usgs.​gov/​maps/​TopoView/​.
 
7
CalFlora (2016) [Data set]. Retrieved from http://​www.​calflora.​org/​.
 
8
CLAVIN (2016) [Computer software]. Retrieved from https://​clavin.​bericotechnologi​es.​com/​.
 
9
U.S. Census Gazetteer (2016) [Data set]. Retrieved from https://​www.​census.​gov/​geo/​maps-data/​data/​gazetteer.​html.
 
10
USGS GNIS (2016) [Data set]. Retrieved from http://​geonames.​usgs.​gov/​.
 
11
GeoNames (2016) [Data set]. Retrieved from http://​www.​geonames.​org/​.
 
12
OpenStreetMap (2016) [Website]. Retrieved from https://​www.​openstreetmap.​org/​.
 
13
Los Angeles Public Library Map Collection (2016) [Website]. Retrieved from https://​www.​lapl.​org/​collections-resources/​visual-collections/​map-collection.
 
14
NHGIS (2016) [Website]. Retrieved from https://​www.​nhgis.​org/​.
 
15
A Vision of Britain through Time (2016) [Website]. Retrieved from http://​www.​visionofbritain.​org.​uk/​.
 
16
Dr. Kurashige’s article published in the Southern California Quarterly won the 2015 Carl I. Wheat Award for the best demonstration of scholarship in that journal from 2012–2014 by a senior historian.
 
18
Spatial technology opens a window into history (2016) [News article]. Retrieved from https://​news.​usc.​edu/​91625/​spatial-technology-opens-a-window-into-history/​.
 
19
Peter Feigl's Journey Through Historical Maps (2016) [Website]. Retrieved from http://​www.​arcgis.​com/​apps/​MapJournal/​index.​html?​appid=​6c3b4136b9304df0​9c9adcf86dd30dd5​.
 
20
Tesseract-OCR (2016) [Computer software]. Retrieved from https://​github.​com/​tesseract-ocr.
 
21
NYPL map-vectorizer (2016) [Computer software]. https://​github.​com/​NYPL/​map-vectorizer.
 
22
Plageois Commons (2016) [Website]. Retrieved from http://​commons.​pelagios.​org/​.
 
Literatur
Zurück zum Zitat Adams, O.G.: Place Names in the North Central Counties of Missouri (Ph. D.). University of Missouri-Columbia (1928) Adams, O.G.: Place Names in the North Central Counties of Missouri (Ph. D.). University of Missouri-Columbia (1928)
Zurück zum Zitat Alex, B., Byrne, K., Grover, C., Tobin, R.: Adapting the Edinburgh geoparser for historical georeferencing. Int. J. Humanit. Comput. 9(1), 15–35 (2015)CrossRef Alex, B., Byrne, K., Grover, C., Tobin, R.: Adapting the Edinburgh geoparser for historical georeferencing. Int. J. Humanit. Comput. 9(1), 15–35 (2015)CrossRef
Zurück zum Zitat Arteaga, M.G.: Historical map polygon and feature extractor. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on MapInteraction, pp. 66–71. ACM (2013) Arteaga, M.G.: Historical map polygon and feature extractor. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on MapInteraction, pp. 66–71. ACM (2013)
Zurück zum Zitat Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Seman. Web Inf. Syst. 5(3), 1–22 (2009)CrossRef Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Seman. Web Inf. Syst. 5(3), 1–22 (2009)CrossRef
Zurück zum Zitat Chiang, Y.-Y., Knoblock, C.A.: Recognizing text in raster maps. GeoInformatica 19(1), 1–27 (2014)CrossRef Chiang, Y.-Y., Knoblock, C.A.: Recognizing text in raster maps. GeoInformatica 19(1), 1–27 (2014)CrossRef
Zurück zum Zitat Chiang, Y.-Y., Leyk, S., Knoblock, C.A.: A survey of digital map processing techniques. ACM Comput. Surv. (CSUR) 47(1), 1 (2014)CrossRef Chiang, Y.-Y., Leyk, S., Knoblock, C.A.: A survey of digital map processing techniques. ACM Comput. Surv. (CSUR) 47(1), 1 (2014)CrossRef
Zurück zum Zitat Chiang, Y.-Y., Leyk, S., Nazari, N.H., Moghaddam, S., Tan, T.X.: Assessing the impact of graphical quality on automatic text recognition in digital maps. Comput. Geosci. 93, 21–35 (2016)CrossRef Chiang, Y.-Y., Leyk, S., Nazari, N.H., Moghaddam, S., Tan, T.X.: Assessing the impact of graphical quality on automatic text recognition in digital maps. Comput. Geosci. 93, 21–35 (2016)CrossRef
Zurück zum Zitat Davis, C.C., Willis, C.G., Connolly, B., Kelly, C., Ellison, A.M.: Herbarium records are reliable sources of phenological change driven by climate and provide novel insights into species’ phenological cueing mechanisms. Am. J. Bot. 102(10), 1599–1609 (2015)CrossRef Davis, C.C., Willis, C.G., Connolly, B., Kelly, C., Ellison, A.M.: Herbarium records are reliable sources of phenological change driven by climate and provide novel insights into species’ phenological cueing mechanisms. Am. J. Bot. 102(10), 1599–1609 (2015)CrossRef
Zurück zum Zitat D’Ignazio, C., Bhargava, R., Zuckerman, E.: Cliff-clavin: determining geographic focus for news. In: NewsKDD: Data Science for News Publishing (2014) D’Ignazio, C., Bhargava, R., Zuckerman, E.: Cliff-clavin: determining geographic focus for news. In: NewsKDD: Data Science for News Publishing (2014)
Zurück zum Zitat Godfrey, B., Eveleth, H.: An adaptable approach for generating vector features from scanned historical thematic maps using image enhancement and remote sensing techniques in a in a geographic information system. J. Map Geogr. Librar. 11(1), 18–36 (2015) Godfrey, B., Eveleth, H.: An adaptable approach for generating vector features from scanned historical thematic maps using image enhancement and remote sensing techniques in a in a geographic information system. J. Map Geogr. Librar. 11(1), 18–36 (2015)
Zurück zum Zitat Gregory, I., Donaldson, C., Murrieta-Flores, P., Rayson, P.: Geoparsing, GIS, and textual analysis: current developments in spatial humanities research. Int. J. Humanit. Comput. 9(1), 1–14 (2015)CrossRef Gregory, I., Donaldson, C., Murrieta-Flores, P., Rayson, P.: Geoparsing, GIS, and textual analysis: current developments in spatial humanities research. Int. J. Humanit. Comput. 9(1), 1–14 (2015)CrossRef
Zurück zum Zitat Gregory, I.N., Ell, P.S.: Historical GIS: Technologies, Methodologies, and Scholarship, vol. 39. Cambridge University Press, Cambridge (2007) Gregory, I.N., Ell, P.S.: Historical GIS: Technologies, Methodologies, and Scholarship, vol. 39. Cambridge University Press, Cambridge (2007)
Zurück zum Zitat Guralnick, R.P., Wieczorek, J., Beaman, R., Hijmans, R.J., Group, B.W., et al.: BioGeomancer: automated georeferencing to map the world’s biodiversity data. PLoS Biol. 4(11), e381 (2006) Guralnick, R.P., Wieczorek, J., Beaman, R., Hijmans, R.J., Group, B.W., et al.: BioGeomancer: automated georeferencing to map the world’s biodiversity data. PLoS Biol. 4(11), e381 (2006)
Zurück zum Zitat Hill, A.W., Guralnick, R., Flemons, P., Beaman, R., Wieczorek, J., Ranipeta, A., Chavan, V., Remsen, D.: Location, location, location: utilizing pipelines and services to more effectively georeference the world’s biodiversity data. BMC Bioinf. 10(Suppl 14), S3 (2009) Hill, A.W., Guralnick, R., Flemons, P., Beaman, R., Wieczorek, J., Ranipeta, A., Chavan, V., Remsen, D.: Location, location, location: utilizing pipelines and services to more effectively georeference the world’s biodiversity data. BMC Bioinf. 10(Suppl 14), S3 (2009)
Zurück zum Zitat Honarvar Nazari, N., Tan, T.X., Chiang, Y.-Y.: Integrating text recognition for overlapping text detection in maps. Electron. Imaging Doc. Recogn. Retrieval XXIII 17, 1–8 (2016) Honarvar Nazari, N., Tan, T.X., Chiang, Y.-Y.: Integrating text recognition for overlapping text detection in maps. Electron. Imaging Doc. Recogn. Retrieval XXIII 17, 1–8 (2016)
Zurück zum Zitat Khotanzad, A., Zink, E.: Contour line and geographic feature extraction from USGS color topographical paper maps. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 18–31 (2003)CrossRef Khotanzad, A., Zink, E.: Contour line and geographic feature extraction from USGS color topographical paper maps. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 18–31 (2003)CrossRef
Zurück zum Zitat Kurashige, L.: Rethinking anti-immigrant racism: lessons from the Los Angeles vote on the 1920 Alien Land Law. Southern Calif. Q. 95(3), 265–283 (2013)CrossRef Kurashige, L.: Rethinking anti-immigrant racism: lessons from the Los Angeles vote on the 1920 Alien Land Law. Southern Calif. Q. 95(3), 265–283 (2013)CrossRef
Zurück zum Zitat Lavoie, C.: Biological collections in an ever changing world: herbaria as tools for biogeographical and environmental studies. Perspect. Plant Ecol. Evol. Syst. 15(1), 68–76 (2013)CrossRef Lavoie, C.: Biological collections in an ever changing world: herbaria as tools for biogeographical and environmental studies. Perspect. Plant Ecol. Evol. Syst. 15(1), 68–76 (2013)CrossRef
Zurück zum Zitat Leidner, J.L., Lieberman, M.D.: Detecting geographical references in the form of place names and associated spatial natural language. Sigspatial Spec. 3(2), 5–11 (2011)CrossRef Leidner, J.L., Lieberman, M.D.: Detecting geographical references in the form of place names and associated spatial natural language. Sigspatial Spec. 3(2), 5–11 (2011)CrossRef
Zurück zum Zitat Leyk, S., Boesch, R.: Colors of the past: color image segmentation in historical topographic maps based on homogeneity. GeoInformatica 14(1), 1–21 (2009)CrossRef Leyk, S., Boesch, R.: Colors of the past: color image segmentation in historical topographic maps based on homogeneity. GeoInformatica 14(1), 1–21 (2009)CrossRef
Zurück zum Zitat Leyk, S., Boesch, R., Weibel, R.: Saliency and semantic processing: extracting forest cover from historical topographic maps. Pattern Recogn. 39(5), 953–968 (2006)CrossRef Leyk, S., Boesch, R., Weibel, R.: Saliency and semantic processing: extracting forest cover from historical topographic maps. Pattern Recogn. 39(5), 953–968 (2006)CrossRef
Zurück zum Zitat Li, L., Nagy, G., Samal, A., Seth, S., Xu, Y.: Integrated text and line-art extraction from a topographic map. Int. J. Doc. Anal. Recogn. 2(4), 177–185 (2000)CrossRef Li, L., Nagy, G., Samal, A., Seth, S., Xu, Y.: Integrated text and line-art extraction from a topographic map. Int. J. Doc. Anal. Recogn. 2(4), 177–185 (2000)CrossRef
Zurück zum Zitat Murphey, P.C., Guralnick, R.P., Glaubitz, R., Neufeld, D., Ryan, J.A.: Georeferencing of museum collections: a review of problems and automated tools, and the methodology developed by the mountain and plains spatio-temporal database-informatics initiative (Mapstedi). Phyloinformatics 1(3), 1–29 (2004) Murphey, P.C., Guralnick, R.P., Glaubitz, R., Neufeld, D., Ryan, J.A.: Georeferencing of museum collections: a review of problems and automated tools, and the methodology developed by the mountain and plains spatio-temporal database-informatics initiative (Mapstedi). Phyloinformatics 1(3), 1–29 (2004)
Zurück zum Zitat Nagy, G., Samal, A., Seth, S., Fisher, T.: Reading street names from maps-technical challenges. In: Proceedings of GIS/LIS (1997) Nagy, G., Samal, A., Seth, S., Fisher, T.: Reading street names from maps-technical challenges. In: Proceedings of GIS/LIS (1997)
Zurück zum Zitat Nanetti, A., Cattaneo, A., Cheong, S.A., Lin, C.-Y.: Maps as knowledge aggregators: from Renaissance Italy Fra mauro to web search engines. Cartographic J. 52(2), 159–167 (2015)CrossRef Nanetti, A., Cattaneo, A., Cheong, S.A., Lin, C.-Y.: Maps as knowledge aggregators: from Renaissance Italy Fra mauro to web search engines. Cartographic J. 52(2), 159–167 (2015)CrossRef
Zurück zum Zitat Newbold, T.: Applications and limitations of museum data for conservation and ecology, with particular attention to species distribution models. Prog. Phys. Geogr. 34(1), 3–22 (2010)CrossRef Newbold, T.: Applications and limitations of museum data for conservation and ecology, with particular attention to species distribution models. Prog. Phys. Geogr. 34(1), 3–22 (2010)CrossRef
Zurück zum Zitat Ngo, V., Swift, J., Chiang, Y.-Y.: Visualizing land reclamation in Hong Kong: a web application. In: International Cartographic Conference (2015) Ngo, V., Swift, J., Chiang, Y.-Y.: Visualizing land reclamation in Hong Kong: a web application. In: International Cartographic Conference (2015)
Zurück zum Zitat Pezeshk, A., Tutwiler, R.L.: Improved multi angled parallelism for separation of text from intersecting linear features in scanned topographic maps. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1078–1081. IEEE (2010) Pezeshk, A., Tutwiler, R.L.: Improved multi angled parallelism for separation of text from intersecting linear features in scanned topographic maps. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1078–1081. IEEE (2010)
Zurück zum Zitat Pezeshk, A., Tutwiler, R.L.: Automatic feature extraction and text recognition from scanned topographic maps. IEEE Trans. Geosci. Remote Sens. 49(12), 5047–5063 (2011). A Publication of the IEEE Geoscience and Remote Sensing SocietyCrossRef Pezeshk, A., Tutwiler, R.L.: Automatic feature extraction and text recognition from scanned topographic maps. IEEE Trans. Geosci. Remote Sens. 49(12), 5047–5063 (2011). A Publication of the IEEE Geoscience and Remote Sensing SocietyCrossRef
Zurück zum Zitat Pyke, G.H., Ehrlich, P.R.: Biological collections and ecological/environmental research: a review, some observations and a look to the future. Biol. Rev. Camb. Philos. Soc. 85(2), 247–266 (2010)CrossRef Pyke, G.H., Ehrlich, P.R.: Biological collections and ecological/environmental research: a review, some observations and a look to the future. Biol. Rev. Camb. Philos. Soc. 85(2), 247–266 (2010)CrossRef
Zurück zum Zitat Raveaux, R., Burie, J.C., Ogier, J.M.: A colour document interpretation: application to ancient cadastral maps. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 1128–1132. IEEE (2007) Raveaux, R., Burie, J.C., Ogier, J.M.: A colour document interpretation: application to ancient cadastral maps. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 1128–1132. IEEE (2007)
Zurück zum Zitat Raveaux, R., Burie, J.C., Ogier, J.M.: Object extraction from colour cadastral maps. In: The Eighth IAPR International Workshop on Document Analysis Systems, DAS 2008, pp. 506–514. IEEE (2008) Raveaux, R., Burie, J.C., Ogier, J.M.: Object extraction from colour cadastral maps. In: The Eighth IAPR International Workshop on Document Analysis Systems, DAS 2008, pp. 506–514. IEEE (2008)
Zurück zum Zitat Rios, N.E., Bart, H.L.: GEOLocate (Version 3.22) [Computer software] (2010) Rios, N.E., Bart, H.L.: GEOLocate (Version 3.22) [Computer software] (2010)
Zurück zum Zitat Samy, G., Chavan, V., Ariño, A.H., Otegui, J., Hobern, D., Sood, R., Robles, E.: Content assessment of the primary biodiversity data published through GBIF network: status, challenges and potentials. Biodivers. Inform. 8(2) (2013). http://doi.org/10.17161/bi.v8i2.4124 Samy, G., Chavan, V., Ariño, A.H., Otegui, J., Hobern, D., Sood, R., Robles, E.: Content assessment of the primary biodiversity data published through GBIF network: status, challenges and potentials. Biodivers. Inform. 8(2) (2013). http://​doi.​org/​10.​17161/​bi.​v8i2.​4124
Zurück zum Zitat Simon, R., Barker, E., Isaksen, L.: Linking early geospatial documents, one place at a time: annotation of geographic documents with Recogito. E-Perimetron 10(2), 49–59 (2015) Simon, R., Barker, E., Isaksen, L.: Linking early geospatial documents, one place at a time: annotation of geographic documents with Recogito. E-Perimetron 10(2), 49–59 (2015)
Zurück zum Zitat Simon, R., Pilgerstorfer, P., Isaksen, L., Barker, E.: Towards semi-automatic annotation of toponyms on old maps. E - Perimetron 9(3), 105–128 (2014) Simon, R., Pilgerstorfer, P., Isaksen, L., Barker, E.: Towards semi-automatic annotation of toponyms on old maps. E - Perimetron 9(3), 105–128 (2014)
Zurück zum Zitat Simon, R., Sadilek, C., Korb, J., Baldauf, M., Haslhofer, B.: Tag clouds and old maps: annotations as linked spatiotemporal data in the cultural heritage domain. In: Workshop on Linked Spatiotemporal Data, Zurich, Switzerland (2010) Simon, R., Sadilek, C., Korb, J., Baldauf, M., Haslhofer, B.: Tag clouds and old maps: annotations as linked spatiotemporal data in the cultural heritage domain. In: Workshop on Linked Spatiotemporal Data, Zurich, Switzerland (2010)
Zurück zum Zitat Torr, P.H.S., Zisserman, A.: MLESAC: a new robust estimator with application to estimating image geometry. Comput. Vis. Image Underst. CVIU 78(1), 138–156 (2000)CrossRef Torr, P.H.S., Zisserman, A.: MLESAC: a new robust estimator with application to estimating image geometry. Comput. Vis. Image Underst. CVIU 78(1), 138–156 (2000)CrossRef
Zurück zum Zitat Vellend, M., Brown, C.D., Kharouba, H.M., McCune, J.L., Myers-Smith, I.H.: Historical ecology: using unconventional data sources to test for effects of global environmental change. Am. J. Bot. 100(7), 1294–1305 (2013)CrossRef Vellend, M., Brown, C.D., Kharouba, H.M., McCune, J.L., Myers-Smith, I.H.: Historical ecology: using unconventional data sources to test for effects of global environmental change. Am. J. Bot. 100(7), 1294–1305 (2013)CrossRef
Zurück zum Zitat Weinman, J.: Toponym recognition in historical maps by Gazetteer alignment. In: Proceedings of the 12th International Conference on Document Analysis and Recognition, pp. 1044–1048 (2013) Weinman, J.: Toponym recognition in historical maps by Gazetteer alignment. In: Proceedings of the 12th International Conference on Document Analysis and Recognition, pp. 1044–1048 (2013)
Zurück zum Zitat Yoshida, K., Burbano, H.A., Krause, J., Thines, M., Weigel, D., Kamoun, S.: Mining herbaria for plant pathogen genomes: back to the future. PLoS Pathog. 10(4), e1004028 (2014)CrossRef Yoshida, K., Burbano, H.A., Krause, J., Thines, M., Weigel, D., Kamoun, S.: Mining herbaria for plant pathogen genomes: back to the future. PLoS Pathog. 10(4), e1004028 (2014)CrossRef
Zurück zum Zitat Yu, R., Luo, Z., Chiang, Y.-Y.: Recognizing text on historical maps using maps from multiple time periods. In: Proceedings of the 23rd International Conference on Pattern Recognition (2016) Yu, R., Luo, Z., Chiang, Y.-Y.: Recognizing text on historical maps using maps from multiple time periods. In: Proceedings of the 23rd International Conference on Pattern Recognition (2016)
Metadaten
Titel
Unlocking Textual Content from Historical Maps - Potentials and Applications, Trends, and Outlooks
verfasst von
Yao-Yi Chiang
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-4859-3_11