Skip to main content
Top

2020 | OriginalPaper | Chapter

Arc Loss: Softmax with Additive Angular Margin for Answer Retrieval

Authors : Rikiya Suzuki, Sumio Fujita, Tetsuya Sakai

Published in: Information Retrieval Technology

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Answer retrieval is a crucial step in question answering. To determine the best Q–A pair in a candidate pool, traditional approaches adopt triplet loss (i.e., pairwise ranking loss) for a meaningful distributed representation. Triplet loss is widely used to push away a negative answer from a certain question in a feature space and leads to a better understanding of the relationship between questions and answers. However, triplet loss is inefficient because it requires two steps: triplet generation and negative sampling. In this study, we propose an alternative loss function, namely, arc loss, for more efficient and effective learning than that by triplet loss. We evaluate the proposed approach on a commonly used QA dataset and demonstrate that it significantly outperforms the triplet loss baseline.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bachrach, Y., et al.: An attention mechanism for neural answer selection using a combined global and local view. In: 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 425–432 (2017) Bachrach, Y., et al.: An attention mechanism for neural answer selection using a combined global and local view. In: 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 425–432 (2017)
2.
go back to reference Deng, J., Guo, J., Xue, N., Zafeiriou, S.: Arcface: Additive angular margin loss for deep face recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019 Deng, J., Guo, J., Xue, N., Zafeiriou, S.: Arcface: Additive angular margin loss for deep face recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019
3.
go back to reference Dos Santos, C., Tan, M., Xiang, B., Zhou, B.: Attentive pooling networks (2016) Dos Santos, C., Tan, M., Xiang, B., Zhou, B.: Attentive pooling networks (2016)
4.
go back to reference Dos Santos, C.N., Zadrozny, B.: Learning character-level representations for part-of-speech tagging. In: Proceedings of the 31st International Conference on International Conference on Machine Learning - Volume 32 ICML 2014, pp. II-1818-II-1826. JMLR.org (2014) Dos Santos, C.N., Zadrozny, B.: Learning character-level representations for part-of-speech tagging. In: Proceedings of the 31st International Conference on International Conference on Machine Learning - Volume 32 ICML 2014, pp. II-1818-II-1826. JMLR.org (2014)
5.
go back to reference Feng, M., Xiang, B., Glass, M.R., Wang, L., Zhou, B.: Applying deep learning to answer selection: a study and an open task. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 813–820. IEEE (2015) Feng, M., Xiang, B., Glass, M.R., Wang, L., Zhou, B.: Applying deep learning to answer selection: a study and an open task. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 813–820. IEEE (2015)
9.
go back to reference Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014) Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
11.
go back to reference Sakai, T., Ishikawa, D., Kando, N., Seki, Y., Kuriyama, K., Lin, C.Y.: Using graded-relevance metrics for evaluating community QA answer selection. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining WSDM 2011, pp. 187–196. ACM, New York, USA (2011). https://doi.org/10.1145/1935826.1935864 Sakai, T., Ishikawa, D., Kando, N., Seki, Y., Kuriyama, K., Lin, C.Y.: Using graded-relevance metrics for evaluating community QA answer selection. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining WSDM 2011, pp. 187–196. ACM, New York, USA (2011). https://​doi.​org/​10.​1145/​1935826.​1935864
12.
13.
go back to reference Tan, M., dos Santos, C., Xiang, B., Zhou, B.: Improved representation learning for question answer matching. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 464–473. Association for Computational Linguistics, Berlin, Germany, Aug 2016. https://doi.org/10.18653/v1/P16-1044 Tan, M., dos Santos, C., Xiang, B., Zhou, B.: Improved representation learning for question answer matching. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 464–473. Association for Computational Linguistics, Berlin, Germany, Aug 2016. https://​doi.​org/​10.​18653/​v1/​P16-1044
14.
go back to reference Tan, M., Santos, C.d., Xiang, B., Zhou, B.: LSTM-based deep learning models for non-factoid answer selection (2015) Tan, M., Santos, C.d., Xiang, B., Zhou, B.: LSTM-based deep learning models for non-factoid answer selection (2015)
15.
go back to reference Tran, N.K., Niederée, C.: Multihop attention networks for question answer matching. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval SIGIR 2018, pp. 325–334. ACM, New York, NY, USA (2018). https://doi.org/10.1145/3209978.3210009 Tran, N.K., Niederée, C.: Multihop attention networks for question answer matching. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval SIGIR 2018, pp. 325–334. ACM, New York, NY, USA (2018). https://​doi.​org/​10.​1145/​3209978.​3210009
16.
go back to reference Tran, N.K., Niederée, C.: A neural network-based framework for non-factoid question answering. In: Companion Proceedings of the The Web Conference 2018, WWW 2018, pp. 1979–1983. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland (2018). https://doi.org/10.1145/3184558.3191830 Tran, N.K., Niederée, C.: A neural network-based framework for non-factoid question answering. In: Companion Proceedings of the The Web Conference 2018, WWW 2018, pp. 1979–1983. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland (2018). https://​doi.​org/​10.​1145/​3184558.​3191830
17.
go back to reference Wang, B., Liu, K., Zhao, J.: Inner attention based recurrent neural networks for answer selection. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1288–1297. Association for Computational Linguistics, Berlin, Germany (Aug 2016). https://doi.org/10.18653/v1/P16-1122 Wang, B., Liu, K., Zhao, J.: Inner attention based recurrent neural networks for answer selection. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1288–1297. Association for Computational Linguistics, Berlin, Germany (Aug 2016). https://​doi.​org/​10.​18653/​v1/​P16-1122
18.
go back to reference Wang, J., et al.: IRGAN: A minimax game for unifying generative and discriminative information retrieval models. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval SIGIR 2017, pp. 515–524. ACM, New York, NY, USA (2017). https://doi.org/10.1145/3077136.3080786 Wang, J., et al.: IRGAN: A minimax game for unifying generative and discriminative information retrieval models. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval SIGIR 2017, pp. 515–524. ACM, New York, NY, USA (2017). https://​doi.​org/​10.​1145/​3077136.​3080786
Metadata
Title
Arc Loss: Softmax with Additive Angular Margin for Answer Retrieval
Authors
Rikiya Suzuki
Sumio Fujita
Tetsuya Sakai
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-42835-8_4