2015 | OriginalPaper | Chapter

Spatial Natural Language Generation for Location Description in Photo Captions

Authors: Mark M. Hall, Christopher B. Jones, Philip Smart

Published in: Spatial Information Theory

Publisher: Springer International Publishing

Abstract

We present a spatial natural language generation system that creates captions describing the geographical context of geo-referenced photos. An analysis of existing photo captions was used to design templates representing typical caption language patterns, while the results of human subject experiments were used to create field-based spatial models of the applicability of some commonly used spatial prepositions. The language templates are instantiated with geo-data retrieved from the vicinity of the photo locations. A human subject evaluation was used to validate and improve the spatial language generation procedure; examples of the results are presented in the paper.
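
As a rough illustration of the idea summarised in the abstract, the following Python sketch scores nearby places with a toy applicability field for the preposition "near" and fills a caption template with the best match. Everything in it is an assumption made for illustration: the `Place` class, the exponential decay in `near_applicability`, the 500 m scale, the 0.3 threshold, the planar coordinates, and the template wording are hypothetical and are not taken from the paper, whose models were derived from human subject experiments.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Place:
    """A named place with hypothetical local planar coordinates in metres."""
    name: str
    x: float
    y: float


def near_applicability(distance_m: float, scale_m: float = 500.0) -> float:
    """Toy field value in (0, 1]: 1.0 at the place, decaying with distance.

    The exponential form and the 500 m scale are illustrative assumptions,
    not the field model described in the paper.
    """
    return math.exp(-distance_m / scale_m)


def caption(photo_xy: Tuple[float, float],
            places: List[Place],
            threshold: float = 0.3) -> Optional[str]:
    """Fill a 'near' template with the place of highest applicability.

    Returns None when there are no places or no place reaches the
    (assumed) applicability threshold.
    """
    if not places:
        return None
    px, py = photo_xy
    scored = [
        (near_applicability(math.hypot(p.x - px, p.y - py)), p)
        for p in places
    ]
    score, best = max(scored, key=lambda pair: pair[0])
    if score < threshold:
        return None
    # Hypothetical template; the paper's templates come from caption analysis.
    return "Photo taken near {place}".format(place=best.name)


if __name__ == "__main__":
    candidates = [Place("Old Mill", 100.0, 0.0), Place("Town Park", 2000.0, 0.0)]
    print(caption((0.0, 0.0), candidates))
```

A fuller version would need one field per preposition and a way to choose between competing templates; this sketch only shows how an applicability value can gate the instantiation of a single template.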

Metadata
Title
Spatial Natural Language Generation for Location Description in Photo Captions
Authors
Mark M. Hall
Christopher B. Jones
Philip Smart
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-23374-1_10
