Skip to main content

2020 | OriginalPaper | Buchkapitel

Learning Aerial Image Similarity Using Triplet Networks

verfasst von : Vytautas Valaitis, Virginijus Marcinkevicius, Rokas Jurevicius

Erschienen in: Numerical Computations: Theory and Algorithms

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Unmanned aerial vehicles (UAV) faces localization challenges in satellite navigation systems denied environments. Images taken from on-board cameras can be used to compare against orthophotographical map to support visual localization algorithms. Image similarity estimation can be achieved calculating various similarity metrics. Pearson correlation was found to be the best choice for evaluating areal images similarity in our experiments. Still is not robust against image displacement caused by aircraft frame movement. We propose a new architecture of triplet neural network to learn image similarity measure. The proposed architecture incorporates VGG16 network base layers. Top layer structure, loss function and performance metrics being suggested by authors. Images were matched to the maps from satellite photo. The matching results from proposed neural network architecture were compared and evaluated against Pearson correlation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Arsenault, M.O.: Lossless triplet loss. A more efficient loss function for Siamese NN (2018) Arsenault, M.O.: Lossless triplet loss. A more efficient loss function for Siamese NN (2018)
3.
Zurück zum Zitat Banerjee, P., Bhunia, A.K., Bhattacharyya, A., Roy, P.P., Murala, S.: Local neighborhood intensity pattern: a new texture feature descriptor for image retrieval. CoRR (2017) Banerjee, P., Bhunia, A.K., Bhattacharyya, A., Roy, P.P., Murala, S.: Local neighborhood intensity pattern: a new texture feature descriptor for image retrieval. CoRR (2017)
4.
Zurück zum Zitat Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016) Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. arXiv preprint arXiv:​1607.​04606 (2016)
5.
Zurück zum Zitat Boyd, S.: Introduction to Applied Linear Algebra Vectors, Matrices, and Least Squares. Cambridge University Press, Cambridge (2018)CrossRef Boyd, S.: Introduction to Applied Linear Algebra Vectors, Matrices, and Least Squares. Cambridge University Press, Cambridge (2018)CrossRef
6.
Zurück zum Zitat Chang, Y., Hsieh, C., Chang, K., Ringgaard, M., Lin, C.: Training and testing low-degree polynomial data mappings via linear SVM. J. Mach. Learn. Res. 11, 1471–1490 (2010)MathSciNetMATH Chang, Y., Hsieh, C., Chang, K., Ringgaard, M., Lin, C.: Training and testing low-degree polynomial data mappings via linear SVM. J. Mach. Learn. Res. 11, 1471–1490 (2010)MathSciNetMATH
7.
Zurück zum Zitat Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990) Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990)
8.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: International Conference on Computer Vision & Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893. IEEE Computer Society (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: International Conference on Computer Vision & Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893. IEEE Computer Society (2005)
9.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR 2009 (2009) Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR 2009 (2009)
10.
Zurück zum Zitat Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), vol. 2, pp. 1735–1742. IEEE (2006) Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), vol. 2, pp. 1735–1742. IEEE (2006)
11.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR (2015) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR (2015)
12.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
13.
Zurück zum Zitat Huang, G.B., Mattar, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. In: Workshop on Faces in ‘Real-Life’ Images: Detection, Alignment, and Recognition (2008) Huang, G.B., Mattar, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. In: Workshop on Faces in ‘Real-Life’ Images: Detection, Alignment, and Recognition (2008)
14.
Zurück zum Zitat Jurevičius, R., Marcinkevičus, V.: Application of vision-based particle filter and visual odometry for UAV localization. In: WSCG 2017: Short Communications Proceedings: The 25th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2016 in co-operation with EUROGRAPHICS: University of West Bohemia, Plzen, Czech Republic, 29 May–2 June 2017, pp. 67–71. Václav Skala - UNION Agency (2017) Jurevičius, R., Marcinkevičus, V.: Application of vision-based particle filter and visual odometry for UAV localization. In: WSCG 2017: Short Communications Proceedings: The 25th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2016 in co-operation with EUROGRAPHICS: University of West Bohemia, Plzen, Czech Republic, 29 May–2 June 2017, pp. 67–71. Václav Skala - UNION Agency (2017)
16.
Zurück zum Zitat Koch, G., Zemel, R., Salakhutdinov, R.: Siamese neural networks for one-shot image recognition. In: ICML Deep Learning Workshop, vol. 2 (2015) Koch, G., Zemel, R., Salakhutdinov, R.: Siamese neural networks for one-shot image recognition. In: ICML Deep Learning Workshop, vol. 2 (2015)
17.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105. Curran Associates, Inc., New York (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105. Curran Associates, Inc., New York (2012)
18.
Zurück zum Zitat Lowe, D.G., et al.: Object recognition from local scale-invariant features. In: ICCV, vol. 99, no. 2, pp. 1150–1157 (1999) Lowe, D.G., et al.: Object recognition from local scale-invariant features. In: ICCV, vol. 99, no. 2, pp. 1150–1157 (1999)
19.
Zurück zum Zitat Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 26, pp. 3111–3119. Curran Associates, Inc., New York (2013) Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 26, pp. 3111–3119. Curran Associates, Inc., New York (2013)
20.
Zurück zum Zitat Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: EMNLP (2014) Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: EMNLP (2014)
21.
Zurück zum Zitat Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. CoRR (2015) Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. CoRR (2015)
22.
Zurück zum Zitat Sergeyev, Y.D., Kvasov, D.E., Mukhametzhanov, M.S.: On the efficiency of nature-inspired metaheuristics in expensive global optimization with limited budget. Sci. Rep. 8, article 453 (2018) Sergeyev, Y.D., Kvasov, D.E., Mukhametzhanov, M.S.: On the efficiency of nature-inspired metaheuristics in expensive global optimization with limited budget. Sci. Rep. 8, article 453 (2018)
23.
Zurück zum Zitat Szegedy, C., et al.: Going deeper with convolutions. CoRR (2014) Szegedy, C., et al.: Going deeper with convolutions. CoRR (2014)
24.
Zurück zum Zitat Taylor, G.W., Spiro, I., Bregler, C., Fergus, R.: Learning invariance through imitation. In: CVPR 2011, pp. 2729–2736. IEEE (2011) Taylor, G.W., Spiro, I., Bregler, C., Fergus, R.: Learning invariance through imitation. In: CVPR 2011, pp. 2729–2736. IEEE (2011)
25.
Zurück zum Zitat Ustinova, E., Lempitsky, V.: Learning deep embeddings with histogram loss. In: Lee, D.D., Sugiyama, M., Luxburg, U.V., Guyon, I., Garnett, R. (eds.) Advances in Neural Information Processing Systems 29, pp. 4170–4178. Curran Associates, Inc., New York (2016) Ustinova, E., Lempitsky, V.: Learning deep embeddings with histogram loss. In: Lee, D.D., Sugiyama, M., Luxburg, U.V., Guyon, I., Garnett, R. (eds.) Advances in Neural Information Processing Systems 29, pp. 4170–4178. Curran Associates, Inc., New York (2016)
26.
Zurück zum Zitat Wang, J., et al.: Learning fine-grained image similarity with deep ranking. CoRR (2014) Wang, J., et al.: Learning fine-grained image similarity with deep ranking. CoRR (2014)
27.
Zurück zum Zitat Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. IEEE (2011) Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. IEEE (2011)
Metadaten
Titel
Learning Aerial Image Similarity Using Triplet Networks
verfasst von
Vytautas Valaitis
Virginijus Marcinkevicius
Rokas Jurevicius
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-40616-5_15