Skip to main content
Top
Published in: Neural Processing Letters 1/2023

30-06-2022

Augmenting Textbooks with cQA Question-Answers and Annotated YouTube Videos to Increase Its Relevance

Authors: Shobhan Kumar, Arun Chauhan

Published in: Neural Processing Letters | Issue 1/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The community question-answering (CQA) websites such as Quora1, and Reddit2, and YouTube provides a significant resource to the students. However, there is a redundancy issue which results in inadequate search results for a given question. On the other hand, e-book is primary source of knowledge for students. The Latent Dirichlet Allocation (LDA) topic model helps us to find the key topic of the e-book. The major flaw of this LDA is that it can’t capture the semantic knowledge in the documents. As a result, it fails to find the semantically cohesive, and meaningful topics. To address this issue, we propose a novel sBERT-LDA model, which augments the e-books with recommended question-answers and videos. We construct SiameseBERT (Bidirectional Encoder Representations from Transformers) network which provides the semantically relevant phrase embeddings. The model identifies the key topics of the e-book, after which sBERT is used to assess the similarity between the question-answers. This effort also provides advanced video indexing methods for each recommended video, allowing videos with “Table of Contents” and “Phrase Cloud” features to make videos more consumable. Experiments were carried out on question-answers datasets (Quora (QQP), TREC QA, and Yahoo Answers) as well as e-books on various subjects and across different grades.The model outperforms previous research by a large margin.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Abishek K, Hariharan BR, Valliyammai C (2019) An enhanced deep learning model for duplicate question pairs recognition. In: Nayak J, Abraham A, Krishna BM, Sekhar GTC, Das AK (eds) Soft computing in data analytics. Springer, Singapore, pp 769–777CrossRef Abishek K, Hariharan BR, Valliyammai C (2019) An enhanced deep learning model for duplicate question pairs recognition. In: Nayak J, Abraham A, Krishna BM, Sekhar GTC, Das AK (eds) Soft computing in data analytics. Springer, Singapore, pp 769–777CrossRef
2.
go back to reference Agrawal R, Gollapudi S, Kenthapadi K, Srivastava N, Velu R (2010) Enriching textbooks through data mining. In: Proceedings of the 1st ACM symposium on computing for development, ACM DEV’10, pp 19:1–19:9 Agrawal R, Gollapudi S, Kenthapadi K, Srivastava N, Velu R (2010) Enriching textbooks through data mining. In: Proceedings of the 1st ACM symposium on computing for development, ACM DEV’10, pp 19:1–19:9
3.
go back to reference Angelov D (2020) Top2vec: distributed representations of topics. CoRR, abs/2008.09470 Angelov D (2020) Top2vec: distributed representations of topics. CoRR, abs/2008.09470
4.
go back to reference Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, BerlinMATH Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, BerlinMATH
5.
go back to reference Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(null):993–1022MATH Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(null):993–1022MATH
6.
go back to reference Bonadiman D, Uva A, Moschitti A (2017) Effective shared representations with multitask learning for community question answering. In: Proceedings of the 15th conference of the European chapter of the Association for Computational Linguistics (ACL), vol 2, Short Papers, Vlencia, Spain, Apr 2017, pp 726–732 Bonadiman D, Uva A, Moschitti A (2017) Effective shared representations with multitask learning for community question answering. In: Proceedings of the 15th conference of the European chapter of the Association for Computational Linguistics (ACL), vol 2, Short Papers, Vlencia, Spain, Apr 2017, pp 726–732
7.
go back to reference Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. In: Pei J, Tseng VS, Cao L, Motoda H, Xu G (eds) Advances in knowledge discovery and data mining. Springer, Berlin, pp 160–172CrossRef Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. In: Pei J, Tseng VS, Cao L, Motoda H, Xu G (eds) Advances in knowledge discovery and data mining. Springer, Berlin, pp 160–172CrossRef
8.
go back to reference Cer D, Diab M, Agirre E, Lopez-Gazpio I, Specia L (2017) SemEval-2017 task 1: semantic textual similarity multilingual and crosslingual focused evaluation. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017), Vancouver, Canada, August 2017. Association for Computational Linguistics, pp 1–14 Cer D, Diab M, Agirre E, Lopez-Gazpio I, Specia L (2017) SemEval-2017 task 1: semantic textual similarity multilingual and crosslingual focused evaluation. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017), Vancouver, Canada, August 2017. Association for Computational Linguistics, pp 1–14
9.
go back to reference Cer D, Yang Y, Kong S, Hua N, Limtiaco N, St. John R, Constant N, Guajardo-Cespedes N, Yuan S, Tar C, Sung Y-H, Strope B, Kurzweil R (2018) Universal sentence encoder. CoRR, abs/1803.11175 Cer D, Yang Y, Kong S, Hua N, Limtiaco N, St. John R, Constant N, Guajardo-Cespedes N, Yuan S, Tar C, Sung Y-H, Strope B, Kurzweil R (2018) Universal sentence encoder. CoRR, abs/1803.11175
10.
go back to reference Chen D, Fisch A, Weston J, Bordes A (2017) Reading Wikipedia to answer open-domain questions. CoRR, abs/1704.00051 Chen D, Fisch A, Weston J, Bordes A (2017) Reading Wikipedia to answer open-domain questions. CoRR, abs/1704.00051
11.
go back to reference Chen Q, Hu Q, Huang JX, He L (2018) Can: enhancing sentence similarity modeling with collaborative and adversarial network. In: The 41st international ACM SIGIR conference on research; development in information retrieval, SIGIR’18, New York, NY, USA, pp 815–824 Chen Q, Hu Q, Huang JX, He L (2018) Can: enhancing sentence similarity modeling with collaborative and adversarial network. In: The 41st international ACM SIGIR conference on research; development in information retrieval, SIGIR’18, New York, NY, USA, pp 815–824
12.
go back to reference Chen Q, Hu Q, Huang X, He L (2018) Ca-rnn: using context-aligned recurrent neural networks for modeling sentence similarity. In: AAAI, 2018. Chen Q, Hu Q, Huang X, He L (2018) Ca-rnn: using context-aligned recurrent neural networks for modeling sentence similarity. In: AAAI, 2018.
13.
go back to reference Chtouki Y, Harroud H, Khalidi M, Bennani S (2012) The impact of youtube videos on the student’s learning. In: 2012 international conference on information technology based higher education and training (ITHET), June 2012, pp 1–4 Chtouki Y, Harroud H, Khalidi M, Bennani S (2012) The impact of youtube videos on the student’s learning. In: 2012 international conference on information technology based higher education and training (ITHET), June 2012, pp 1–4
14.
go back to reference Cirne M, Pedrini H (2017) Viscom: a robust video summarization approach using color co-occurrence matrices. Multimedia Tools Appl 77:01 Cirne M, Pedrini H (2017) Viscom: a robust video summarization approach using color co-occurrence matrices. Multimedia Tools Appl 77:01
15.
go back to reference Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, Denmark, September 2017. Association for Computational Linguistics, pp 670–680 Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, Denmark, September 2017. Association for Computational Linguistics, pp 670–680
16.
go back to reference Covington P, Adams J, Sargin E (2016) Deep neural networks for youtube recommendations. In: Proceedings of the 10th ACM conference on recommender systems, RecSys’16, New York, NY, USA, 2016, pp 191–198 Covington P, Adams J, Sargin E (2016) Deep neural networks for youtube recommendations. In: Proceedings of the 10th ACM conference on recommender systems, RecSys’16, New York, NY, USA, 2016, pp 191–198
17.
go back to reference Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 ACL: human language technologies, vol 1, Minneapolis, Minnesota, June 2019, pp 4171–4186 Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 ACL: human language technologies, vol 1, Minneapolis, Minnesota, June 2019, pp 4171–4186
18.
go back to reference DeWitt D, Alias N, Siraj S, Yaakub MY, Ayob J, Ishak R (2013) The potential of youtube for teaching and learning in the performing arts. Procedia Soc Behav Sci 103:1118–1126CrossRef DeWitt D, Alias N, Siraj S, Yaakub MY, Ayob J, Ishak R (2013) The potential of youtube for teaching and learning in the performing arts. Procedia Soc Behav Sci 103:1118–1126CrossRef
19.
go back to reference Gao H, Hu M, Cheng R, Gao T (2021) Hierarchical ranking for answer selection. CoRR. abs/2102.00677 Gao H, Hu M, Cheng R, Gao T (2021) Hierarchical ranking for answer selection. CoRR. abs/2102.00677
20.
go back to reference Guo J, Yue B, Xu G, Yang Z, Wei J-M (2017) An enhanced convolutional neural network model for answer selection. In: Proceedings of the 26th international conference on world wide web companion, WWW’17 Companion, Republic and Canton of Geneva, CHE, 2017. International World Wide Web Conferences Steering Committee, pp 789–790 Guo J, Yue B, Xu G, Yang Z, Wei J-M (2017) An enhanced convolutional neural network model for answer selection. In: Proceedings of the 26th international conference on world wide web companion, WWW’17 Companion, Republic and Canton of Geneva, CHE, 2017. International World Wide Web Conferences Steering Committee, pp 789–790
21.
go back to reference Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kauf-mann Publishers Inc., San FranciscoMATH Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kauf-mann Publishers Inc., San FranciscoMATH
22.
go back to reference Heriyanto D (2018) The effectiveness of using youtube for vocabulary mastery. ETERNAL (English Teach J ) 06 Heriyanto D (2018) The effectiveness of using youtube for vocabulary mastery. ETERNAL (English Teach J ) 06
23.
go back to reference Hoogeveen D, Bennett A, Li Y, Verspoor KM, Baldwin T (2018) Detecting misflagged duplicate questions in community question-answering archives. In: ICWSM, 2018. Hoogeveen D, Bennett A, Li Y, Verspoor KM, Baldwin T (2018) Detecting misflagged duplicate questions in community question-answering archives. In: ICWSM, 2018.
24.
go back to reference Hua H, Li X, Dou D, Xu C-Z, Luo J (2021) Noise stability regularization for improving BERT fine-tuning. In: Proceedings of the 2021 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 3229–3241 Hua H, Li X, Dou D, Xu C-Z, Luo J (2021) Noise stability regularization for improving BERT fine-tuning. In: Proceedings of the 2021 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 3229–3241
25.
go back to reference Hutto CJ, Gil E (2014) Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: ICWSM. The AAAI Press Hutto CJ, Gil E (2014) Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: ICWSM. The AAAI Press
26.
go back to reference Imtiaz Z, Umer M, Ahmad M, Ullah S, Choi GS, Mehmood A (2020) Duplicate questions pair detection using siamese malstm. IEEE Access 8:21932–21942CrossRef Imtiaz Z, Umer M, Ahmad M, Ullah S, Choi GS, Mehmood A (2020) Duplicate questions pair detection using siamese malstm. IEEE Access 8:21932–21942CrossRef
27.
go back to reference Jelodar H, Wang Y, Vajdi A, Rabbani M, Zhao R, Boukela L, Li H (2020) A hybrid fuzzy system via topic model for recommending highlight topics of CQA in developer communities. J Circuits Syst Comput 29(15):2050248CrossRef Jelodar H, Wang Y, Vajdi A, Rabbani M, Zhao R, Boukela L, Li H (2020) A hybrid fuzzy system via topic model for recommending highlight topics of CQA in developer communities. J Circuits Syst Comput 29(15):2050248CrossRef
28.
go back to reference Kamath S, Grau B, Ma Y (2019) Predicting and integrating expected answer types into a simple recurrent neural network model for answer sentence selection. In: 20th international conference on computational linguistics and intelligent text processing, La Rochelle, France, April 2019. Kamath S, Grau B, Ma Y (2019) Predicting and integrating expected answer types into a simple recurrent neural network model for answer sentence selection. In: 20th international conference on computational linguistics and intelligent text processing, La Rochelle, France, April 2019.
29.
go back to reference Karan M, Šnajder J (2018) Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval. Expert Syst Appl 91:418–433CrossRef Karan M, Šnajder J (2018) Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval. Expert Syst Appl 91:418–433CrossRef
30.
go back to reference Karpukhin V, Oğuz B, Min S, Lewis P, Wu L, Edunov S, Chen D, tau Yih W (2020) Dense passage retrieval for open-domain question answering Karpukhin V, Oğuz B, Min S, Lewis P, Wu L, Edunov S, Chen D, tau Yih W (2020) Dense passage retrieval for open-domain question answering
31.
go back to reference Kumar S, Chauhan A (2019) Enriching textbooks by question-answers using CQA. In: TENCON 2019-2019 IEEE Region 10 conference (TENCON), pp 707–714 Kumar S, Chauhan A (2019) Enriching textbooks by question-answers using CQA. In: TENCON 2019-2019 IEEE Region 10 conference (TENCON), pp 707–714
33.
go back to reference Laskar MTR, Hoque E, Huang JX (2020) Utilizing bidirectional encoder representations from transformers for answer selection Laskar MTR, Hoque E, Huang JX (2020) Utilizing bidirectional encoder representations from transformers for answer selection
34.
go back to reference Laskar MTR, Huang JX, Hoque E (2020) Contextualized embeddings based transformer encoder for sentence similarity modeling in answer selection task. In: Proceedings of the 12th language resources and evaluation conference, Marseille, France, May 2020. European Language Resources Association, pp 5505–5514 Laskar MTR, Huang JX, Hoque E (2020) Contextualized embeddings based transformer encoder for sentence similarity modeling in answer selection task. In: Proceedings of the 12th language resources and evaluation conference, Marseille, France, May 2020. European Language Resources Association, pp 5505–5514
35.
go back to reference Le QV, Mikolov T (2014) Distributed representations of sentences and documents. CoRR. abs/1405.4053 Le QV, Mikolov T (2014) Distributed representations of sentences and documents. CoRR. abs/1405.4053
36.
go back to reference Li B, Zhou H, He J, Wang M, Yang Y, Li L (2020) On the sentence embeddings from pre-trained language models. In: EMNLP Li B, Zhou H, He J, Wang M, Yang Y, Li L (2020) On the sentence embeddings from pre-trained language models. In: EMNLP
37.
go back to reference Liu F, Vulić I, Korhonen A, Collier N (2021) Fast, effective, and self-supervised: transforming masked language models into universal lexical and sentence encoders. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Online and Punta Cana, Dominican Republic, November 2021. Association for Computational Linguistics, pp 1442–1459 Liu F, Vulić I, Korhonen A, Collier N (2021) Fast, effective, and self-supervised: transforming masked language models into universal lexical and sentence encoders. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Online and Punta Cana, Dominican Republic, November 2021. Association for Computational Linguistics, pp 1442–1459
38.
go back to reference Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized BERT pretraining approach. CoRR. abs/1907.11692 Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized BERT pretraining approach. CoRR. abs/1907.11692
40.
go back to reference MacKay DJC (2002) Information theory, inference and learning algorithms. Cambridge University Press, Cambridge MacKay DJC (2002) Information theory, inference and learning algorithms. Cambridge University Press, Cambridge
41.
go back to reference Marc Jackman W, Roberts P (2014) Students’ perspectives on youtube video usage as an e-resource in the university classroom. J Educ Technol Syst 42(3):273–296CrossRef Marc Jackman W, Roberts P (2014) Students’ perspectives on youtube video usage as an e-resource in the university classroom. J Educ Technol Syst 42(3):273–296CrossRef
42.
go back to reference McInnes L, Healy J (2018) Umap: uniform manifold approximation and projection for dimension reduction. J Open Source Softw McInnes L, Healy J (2018) Umap: uniform manifold approximation and projection for dimension reduction. J Open Source Softw
43.
go back to reference Mikolov T, Chen K, Corrado GS, Dean J (2013) Efficient estimation of word representations in vector space Mikolov T, Chen K, Corrado GS, Dean J (2013) Efficient estimation of word representations in vector space
44.
go back to reference Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems, vol 26. Curran Associates, Inc., Red Hook Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems, vol 26. Curran Associates, Inc., Red Hook
45.
go back to reference Mirakyan M, Hambardzumyan K, Khachatrian H (2019) Natural language inference over interaction Mirakyan M, Hambardzumyan K, Khachatrian H (2019) Natural language inference over interaction
46.
go back to reference Narang K, Yang C,Krishnan A, Wang J, Sundaram H, Sutter C (2019) An induced multi-relational framework for answer selection in community question answer platforms Narang K, Yang C,Krishnan A, Wang J, Sundaram H, Sutter C (2019) An induced multi-relational framework for answer selection in community question answer platforms
47.
go back to reference Nie L, Wei X, Zhang D, Wang X, Gao Z, Yang Yi (2017) Data-driven answer selection in community QA systems. IEEE Trans Knowl Data Eng 29(6):1186–1198CrossRef Nie L, Wei X, Zhang D, Wang X, Gao Z, Yang Yi (2017) Data-driven answer selection in community QA systems. IEEE Trans Knowl Data Eng 29(6):1186–1198CrossRef
48.
go back to reference Ostendorff M, Ruas T, Schubotz M, Rehm G, Gipp B (2020) Pairwise multi-class document classification for semantic relations between wikipedia articles. CoRR. abs/2003.09881 Ostendorff M, Ruas T, Schubotz M, Rehm G, Gipp B (2020) Pairwise multi-class document classification for semantic relations between wikipedia articles. CoRR. abs/2003.09881
49.
go back to reference Peinelt N, Nguyen D, Liakata M (2020) tBERT: topic models and BERT joining forces for semantic similarity detection. In: Proceedings of the 58th annual meeting of the association for computational linguistics, Online, July 2020. Association for Computational Linguistics, pp 7047–7055 Peinelt N, Nguyen D, Liakata M (2020) tBERT: topic models and BERT joining forces for semantic similarity detection. In: Proceedings of the 58th annual meeting of the association for computational linguistics, Online, July 2020. Association for Computational Linguistics, pp 7047–7055
50.
go back to reference Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on EMNLP, October 2014, pp 1532–1543 Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on EMNLP, October 2014, pp 1532–1543
51.
go back to reference Qinghe Z, Jiang N, Yang M, Wang D (2020) A full stage data augmentation method in deep convolutional neural network for natural image classification. Discret Dyn Nat Soc 31:07MATH Qinghe Z, Jiang N, Yang M, Wang D (2020) A full stage data augmentation method in deep convolutional neural network for natural image classification. Discret Dyn Nat Soc 31:07MATH
52.
go back to reference Qinghe Z, Tian X, Yang M, Yulin Wu, Huake Su (2020) Pac-bayesian framework-based drop-path method for 2d discriminative convolutional network pruning. Multidimension Syst Signal Process 31:07MathSciNetMATH Qinghe Z, Tian X, Yang M, Yulin Wu, Huake Su (2020) Pac-bayesian framework-based drop-path method for 2d discriminative convolutional network pruning. Multidimension Syst Signal Process 31:07MathSciNetMATH
53.
go back to reference Rangaswamy S, Ghosh S, Jha S, Ramalingam S (2016) Metadata extraction and classification of youtube videos using sentiment analysis. In: 2016 IEEE-ICCST, Oct 2016, pp 1–2 Rangaswamy S, Ghosh S, Jha S, Ramalingam S (2016) Metadata extraction and classification of youtube videos using sentiment analysis. In: 2016 IEEE-ICCST, Oct 2016, pp 1–2
54.
go back to reference Rao J, Liu L, Tay Y, Yang W, Shi P, Lin J (2019) Bridging the gap between relevance matching and semantic matching for short text similarity modeling. In: Proceedings of EMNLP-IJCNLP, November 2019, pp 5370–5381 Rao J, Liu L, Tay Y, Yang W, Shi P, Lin J (2019) Bridging the gap between relevance matching and semantic matching for short text similarity modeling. In: Proceedings of EMNLP-IJCNLP, November 2019, pp 5370–5381
55.
go back to reference Reimers N, Beyer P, Gurevych I (2016) Task-oriented intrinsic evaluation of semantic textual similarity. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers, Osaka, Japan, December 2016. The COLING 2016 Organizing Committee, pp 87–96 Reimers N, Beyer P, Gurevych I (2016) Task-oriented intrinsic evaluation of semantic textual similarity. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers, Osaka, Japan, December 2016. The COLING 2016 Organizing Committee, pp 87–96
56.
go back to reference Roy PK, Jain A, Ahmad Z, Singh JP (2021) Identifying expert users on question answering sites. In: Goyal D, Bălaş VE, Mukherjee A, de Albuquerque VHC, Gupta AK (eds) Information management and machine intelligence. Springer, Singapore, pp 285–291CrossRef Roy PK, Jain A, Ahmad Z, Singh JP (2021) Identifying expert users on question answering sites. In: Goyal D, Bălaş VE, Mukherjee A, de Albuquerque VHC, Gupta AK (eds) Information management and machine intelligence. Springer, Singapore, pp 285–291CrossRef
57.
go back to reference Shao B, Yan J (2017) Recommending answerers for stack overflow with lda model. In: Proceedings of the 12th Chinese conference on computer supported cooperative work and social computing, ChineseCSCW’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 80–86 Shao B, Yan J (2017) Recommending answerers for stack overflow with lda model. In: Proceedings of the 12th Chinese conference on computer supported cooperative work and social computing, ChineseCSCW’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 80–86
58.
go back to reference Singh GK, Kumar V, Bhat S, Pedanekar N (2015) Automatically augmenting learning material with practical questions to increase its relevance. In: 2015 IEEE frontiers in education conference (FIE), pp 1–7 Singh GK, Kumar V, Bhat S, Pedanekar N (2015) Automatically augmenting learning material with practical questions to increase its relevance. In: 2015 IEEE frontiers in education conference (FIE), pp 1–7
59.
go back to reference Su J, Cao J, Liu W, Ou Y (2021) Whitening sentence representations for better semantics and faster retrieval. CoRR. abs/2103.15316 Su J, Cao J, Liu W, Ou Y (2021) Whitening sentence representations for better semantics and faster retrieval. CoRR. abs/2103.15316
60.
go back to reference Suneera CM, Prakash J (2021) A bert-based question representation for improved question retrieval in community question answering systems. In: Patnaik S, Yang X-S, Sethi IK (eds) Advances in machine learning and computational intelligence. Springer, Singapore, pp 341–348CrossRef Suneera CM, Prakash J (2021) A bert-based question representation for improved question retrieval in community question answering systems. In: Patnaik S, Yang X-S, Sethi IK (eds) Advances in machine learning and computational intelligence. Springer, Singapore, pp 341–348CrossRef
61.
go back to reference Syed S, Spruit M (2017) Full-text or abstract? Examining topic coherence scores using latent dirichlet allocation. In: 2017 IEEE international conference on data science and advanced analytics (DSAA), pp 165–174 Syed S, Spruit M (2017) Full-text or abstract? Examining topic coherence scores using latent dirichlet allocation. In: 2017 IEEE international conference on data science and advanced analytics (DSAA), pp 165–174
62.
go back to reference Tay Y, Phan MC, Tuan LA, Hui SC (2017) Learning to rank question answer pairs with holographic dual lstm architecture. In: SIGIR’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 695–704 Tay Y, Phan MC, Tuan LA, Hui SC (2017) Learning to rank question answer pairs with holographic dual lstm architecture. In: SIGIR’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 695–704
63.
go back to reference van der Maaten L, Hinton G (2008) Viualizing data using t-sne. J Mach Learn Res 9:2579–2605MATH van der Maaten L, Hinton G (2008) Viualizing data using t-sne. J Mach Learn Res 9:2579–2605MATH
64.
go back to reference Wakchaure M, Kulkarni P (2019) A scheme of answer selection in community question answering using machine learning techniques. In: 2019 international conference on intelligent computing and control systems (ICCS), pp 879–883 Wakchaure M, Kulkarni P (2019) A scheme of answer selection in community question answering using machine learning techniques. In: 2019 international conference on intelligent computing and control systems (ICCS), pp 879–883
65.
go back to reference Wang M, Smith NA, Mitamura T (2007) What is the jeopardy model? A quasi-synchronous grammar for QA Wang M, Smith NA, Mitamura T (2007) What is the jeopardy model? A quasi-synchronous grammar for QA
66.
go back to reference Wang L, Zhang Li, Jiang J (2020) Duplicate question detection with deep learning in stack overflow. IEEE Access 8:25964–25975CrossRef Wang L, Zhang Li, Jiang J (2020) Duplicate question detection with deep learning in stack overflow. IEEE Access 8:25964–25975CrossRef
67.
go back to reference Wang L, Zhao W, Liu J (2021) Aligning cross-lingual sentence representations with dual momentum contrast. CoRR. abs/2109.00253 Wang L, Zhao W, Liu J (2021) Aligning cross-lingual sentence representations with dual momentum contrast. CoRR. abs/2109.00253
68.
go back to reference Williams A, Nangia N, Bowman S (2018) A broad-coverage challenge corpus for sentence understanding through inference. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers). Association for Computational Linguistics, pp 1112–1122 Williams A, Nangia N, Bowman S (2018) A broad-coverage challenge corpus for sentence understanding through inference. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers). Association for Computational Linguistics, pp 1112–1122
69.
go back to reference Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Huggingface’s transformers: state-of-the-art natural language processing. CoRR. abs/1910.03771 Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Huggingface’s transformers: state-of-the-art natural language processing. CoRR. abs/1910.03771
70.
go back to reference Wu W, Sun X, Wang H (2018) Question condensing networks for answer selection in community question answering. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1: Long Papers, Melbourne, Australia, July 2018. Association for Computational Linguistics, pp 1746–1755 Wu W, Sun X, Wang H (2018) Question condensing networks for answer selection in community question answering. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1: Long Papers, Melbourne, Australia, July 2018. Association for Computational Linguistics, pp 1746–1755
71.
go back to reference Xu S, Campagna G, Li J, Lam MS (2020) Schema2qa: answering complex queries on the structured web with a neural model. CoRR. abs/2001.05609 Xu S, Campagna G, Li J, Lam MS (2020) Schema2qa: answering complex queries on the structured web with a neural model. CoRR. abs/2001.05609
72.
go back to reference Yang H, Meinel C (2014) Content based lecture video retrieval using speech and video text information. IEEE Trans Learn Technol 7(02):142–154CrossRef Yang H, Meinel C (2014) Content based lecture video retrieval using speech and video text information. IEEE Trans Learn Technol 7(02):142–154CrossRef
73.
go back to reference Yang M, Chen L, Lyu Z, Liu J, Shen Y, Qingyao Wu (2020) Hierarchical fusion of commonsense knowledge and classifier decisions for answer selection in community question answering. Neural Netw 132:53–65CrossRef Yang M, Chen L, Lyu Z, Liu J, Shen Y, Qingyao Wu (2020) Hierarchical fusion of commonsense knowledge and classifier decisions for answer selection in community question answering. Neural Netw 132:53–65CrossRef
74.
go back to reference Yang R, Zhang J, Gao X, Ji F, Chen H (2019) Simple and effective text matching with richer alignment features. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy, July 2019. Association for Computational Linguistics, pp 4699–4709 Yang R, Zhang J, Gao X, Ji F, Chen H (2019) Simple and effective text matching with richer alignment features. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy, July 2019. Association for Computational Linguistics, pp 4699–4709
75.
go back to reference Yang Y, Yih W-t, Meek C (2015) WikiQA: a challenge dataset for open-domain question answering. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, September 2015, pp 2013–2018 Yang Y, Yih W-t, Meek C (2015) WikiQA: a challenge dataset for open-domain question answering. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, September 2015, pp 2013–2018
76.
go back to reference Yang Z, Liu Q, Sun B, Zhao X (2019) Expert recommendation in community question answering: a review and future direction. Int J Crowd Sci 3:348–372CrossRef Yang Z, Liu Q, Sun B, Zhao X (2019) Expert recommendation in community question answering: a review and future direction. Int J Crowd Sci 3:348–372CrossRef
77.
go back to reference Yao Y, Tong H, Xie T, Akoglu L, Xu F, Lu J (2015) Detecting high-quality posts in community question answering sites. Inf Sci 302(C):70–82CrossRef Yao Y, Tong H, Xie T, Akoglu L, Xu F, Lu J (2015) Detecting high-quality posts in community question answering sites. Inf Sci 302(C):70–82CrossRef
78.
go back to reference Yianilos P (2000) Locally lifting the curse of dimensionality for nearest neighbor search. In: 11TH ACM-SIAM symposium on discrete algorithms (SODA’00) Yianilos P (2000) Locally lifting the curse of dimensionality for nearest neighbor search. In: 11TH ACM-SIAM symposium on discrete algorithms (SODA’00)
79.
go back to reference Zhang WE, Sheng QZ, Lau JH, Abebe E, Ruan W (2018) Duplicate detection in programming question answering communities. ACM Trans Internet Technol 18(3). Zhang WE, Sheng QZ, Lau JH, Abebe E, Ruan W (2018) Duplicate detection in programming question answering communities. ACM Trans Internet Technol 18(3).
80.
go back to reference Zheng Q, Yang M, Yang J, Zhang Q, Zhang X (2018) Improvement of generalization ability of deep CNN via implicit regularization in two-stage training process. IEEE Access 6:15844–15869CrossRef Zheng Q, Yang M, Yang J, Zhang Q, Zhang X (2018) Improvement of generalization ability of deep CNN via implicit regularization in two-stage training process. IEEE Access 6:15844–15869CrossRef
81.
go back to reference Zhou X, Hu B, Chen Q, Wang X (2018) Recurrent convolutional neural network for answer selection in community question answering. Neurocomputing 274:8–18CrossRef Zhou X, Hu B, Chen Q, Wang X (2018) Recurrent convolutional neural network for answer selection in community question answering. Neurocomputing 274:8–18CrossRef
Metadata
Title
Augmenting Textbooks with cQA Question-Answers and Annotated YouTube Videos to Increase Its Relevance
Authors
Shobhan Kumar
Arun Chauhan
Publication date
30-06-2022
Publisher
Springer US
Published in
Neural Processing Letters / Issue 1/2023
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-022-10897-4

Other articles of this Issue 1/2023

Neural Processing Letters 1/2023 Go to the issue