Skip to main content
Erschienen in: Neural Processing Letters 1/2023

30.06.2022

Augmenting Textbooks with cQA Question-Answers and Annotated YouTube Videos to Increase Its Relevance

verfasst von: Shobhan Kumar, Arun Chauhan

Erschienen in: Neural Processing Letters | Ausgabe 1/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The community question-answering (CQA) websites such as Quora1, and Reddit2, and YouTube provides a significant resource to the students. However, there is a redundancy issue which results in inadequate search results for a given question. On the other hand, e-book is primary source of knowledge for students. The Latent Dirichlet Allocation (LDA) topic model helps us to find the key topic of the e-book. The major flaw of this LDA is that it can’t capture the semantic knowledge in the documents. As a result, it fails to find the semantically cohesive, and meaningful topics. To address this issue, we propose a novel sBERT-LDA model, which augments the e-books with recommended question-answers and videos. We construct SiameseBERT (Bidirectional Encoder Representations from Transformers) network which provides the semantically relevant phrase embeddings. The model identifies the key topics of the e-book, after which sBERT is used to assess the similarity between the question-answers. This effort also provides advanced video indexing methods for each recommended video, allowing videos with “Table of Contents” and “Phrase Cloud” features to make videos more consumable. Experiments were carried out on question-answers datasets (Quora (QQP), TREC QA, and Yahoo Answers) as well as e-books on various subjects and across different grades.The model outperforms previous research by a large margin.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Abishek K, Hariharan BR, Valliyammai C (2019) An enhanced deep learning model for duplicate question pairs recognition. In: Nayak J, Abraham A, Krishna BM, Sekhar GTC, Das AK (eds) Soft computing in data analytics. Springer, Singapore, pp 769–777CrossRef Abishek K, Hariharan BR, Valliyammai C (2019) An enhanced deep learning model for duplicate question pairs recognition. In: Nayak J, Abraham A, Krishna BM, Sekhar GTC, Das AK (eds) Soft computing in data analytics. Springer, Singapore, pp 769–777CrossRef
2.
Zurück zum Zitat Agrawal R, Gollapudi S, Kenthapadi K, Srivastava N, Velu R (2010) Enriching textbooks through data mining. In: Proceedings of the 1st ACM symposium on computing for development, ACM DEV’10, pp 19:1–19:9 Agrawal R, Gollapudi S, Kenthapadi K, Srivastava N, Velu R (2010) Enriching textbooks through data mining. In: Proceedings of the 1st ACM symposium on computing for development, ACM DEV’10, pp 19:1–19:9
3.
Zurück zum Zitat Angelov D (2020) Top2vec: distributed representations of topics. CoRR, abs/2008.09470 Angelov D (2020) Top2vec: distributed representations of topics. CoRR, abs/2008.09470
4.
Zurück zum Zitat Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, BerlinMATH Bishop CM (2006) Pattern recognition and machine learning (information science and statistics). Springer, BerlinMATH
5.
Zurück zum Zitat Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(null):993–1022MATH Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(null):993–1022MATH
6.
Zurück zum Zitat Bonadiman D, Uva A, Moschitti A (2017) Effective shared representations with multitask learning for community question answering. In: Proceedings of the 15th conference of the European chapter of the Association for Computational Linguistics (ACL), vol 2, Short Papers, Vlencia, Spain, Apr 2017, pp 726–732 Bonadiman D, Uva A, Moschitti A (2017) Effective shared representations with multitask learning for community question answering. In: Proceedings of the 15th conference of the European chapter of the Association for Computational Linguistics (ACL), vol 2, Short Papers, Vlencia, Spain, Apr 2017, pp 726–732
7.
Zurück zum Zitat Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. In: Pei J, Tseng VS, Cao L, Motoda H, Xu G (eds) Advances in knowledge discovery and data mining. Springer, Berlin, pp 160–172CrossRef Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. In: Pei J, Tseng VS, Cao L, Motoda H, Xu G (eds) Advances in knowledge discovery and data mining. Springer, Berlin, pp 160–172CrossRef
8.
Zurück zum Zitat Cer D, Diab M, Agirre E, Lopez-Gazpio I, Specia L (2017) SemEval-2017 task 1: semantic textual similarity multilingual and crosslingual focused evaluation. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017), Vancouver, Canada, August 2017. Association for Computational Linguistics, pp 1–14 Cer D, Diab M, Agirre E, Lopez-Gazpio I, Specia L (2017) SemEval-2017 task 1: semantic textual similarity multilingual and crosslingual focused evaluation. In: Proceedings of the 11th international workshop on semantic evaluation (SemEval-2017), Vancouver, Canada, August 2017. Association for Computational Linguistics, pp 1–14
9.
Zurück zum Zitat Cer D, Yang Y, Kong S, Hua N, Limtiaco N, St. John R, Constant N, Guajardo-Cespedes N, Yuan S, Tar C, Sung Y-H, Strope B, Kurzweil R (2018) Universal sentence encoder. CoRR, abs/1803.11175 Cer D, Yang Y, Kong S, Hua N, Limtiaco N, St. John R, Constant N, Guajardo-Cespedes N, Yuan S, Tar C, Sung Y-H, Strope B, Kurzweil R (2018) Universal sentence encoder. CoRR, abs/1803.11175
10.
Zurück zum Zitat Chen D, Fisch A, Weston J, Bordes A (2017) Reading Wikipedia to answer open-domain questions. CoRR, abs/1704.00051 Chen D, Fisch A, Weston J, Bordes A (2017) Reading Wikipedia to answer open-domain questions. CoRR, abs/1704.00051
11.
Zurück zum Zitat Chen Q, Hu Q, Huang JX, He L (2018) Can: enhancing sentence similarity modeling with collaborative and adversarial network. In: The 41st international ACM SIGIR conference on research; development in information retrieval, SIGIR’18, New York, NY, USA, pp 815–824 Chen Q, Hu Q, Huang JX, He L (2018) Can: enhancing sentence similarity modeling with collaborative and adversarial network. In: The 41st international ACM SIGIR conference on research; development in information retrieval, SIGIR’18, New York, NY, USA, pp 815–824
12.
Zurück zum Zitat Chen Q, Hu Q, Huang X, He L (2018) Ca-rnn: using context-aligned recurrent neural networks for modeling sentence similarity. In: AAAI, 2018. Chen Q, Hu Q, Huang X, He L (2018) Ca-rnn: using context-aligned recurrent neural networks for modeling sentence similarity. In: AAAI, 2018.
13.
Zurück zum Zitat Chtouki Y, Harroud H, Khalidi M, Bennani S (2012) The impact of youtube videos on the student’s learning. In: 2012 international conference on information technology based higher education and training (ITHET), June 2012, pp 1–4 Chtouki Y, Harroud H, Khalidi M, Bennani S (2012) The impact of youtube videos on the student’s learning. In: 2012 international conference on information technology based higher education and training (ITHET), June 2012, pp 1–4
14.
Zurück zum Zitat Cirne M, Pedrini H (2017) Viscom: a robust video summarization approach using color co-occurrence matrices. Multimedia Tools Appl 77:01 Cirne M, Pedrini H (2017) Viscom: a robust video summarization approach using color co-occurrence matrices. Multimedia Tools Appl 77:01
15.
Zurück zum Zitat Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, Denmark, September 2017. Association for Computational Linguistics, pp 670–680 Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, Denmark, September 2017. Association for Computational Linguistics, pp 670–680
16.
Zurück zum Zitat Covington P, Adams J, Sargin E (2016) Deep neural networks for youtube recommendations. In: Proceedings of the 10th ACM conference on recommender systems, RecSys’16, New York, NY, USA, 2016, pp 191–198 Covington P, Adams J, Sargin E (2016) Deep neural networks for youtube recommendations. In: Proceedings of the 10th ACM conference on recommender systems, RecSys’16, New York, NY, USA, 2016, pp 191–198
17.
Zurück zum Zitat Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 ACL: human language technologies, vol 1, Minneapolis, Minnesota, June 2019, pp 4171–4186 Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 ACL: human language technologies, vol 1, Minneapolis, Minnesota, June 2019, pp 4171–4186
18.
Zurück zum Zitat DeWitt D, Alias N, Siraj S, Yaakub MY, Ayob J, Ishak R (2013) The potential of youtube for teaching and learning in the performing arts. Procedia Soc Behav Sci 103:1118–1126CrossRef DeWitt D, Alias N, Siraj S, Yaakub MY, Ayob J, Ishak R (2013) The potential of youtube for teaching and learning in the performing arts. Procedia Soc Behav Sci 103:1118–1126CrossRef
19.
Zurück zum Zitat Gao H, Hu M, Cheng R, Gao T (2021) Hierarchical ranking for answer selection. CoRR. abs/2102.00677 Gao H, Hu M, Cheng R, Gao T (2021) Hierarchical ranking for answer selection. CoRR. abs/2102.00677
20.
Zurück zum Zitat Guo J, Yue B, Xu G, Yang Z, Wei J-M (2017) An enhanced convolutional neural network model for answer selection. In: Proceedings of the 26th international conference on world wide web companion, WWW’17 Companion, Republic and Canton of Geneva, CHE, 2017. International World Wide Web Conferences Steering Committee, pp 789–790 Guo J, Yue B, Xu G, Yang Z, Wei J-M (2017) An enhanced convolutional neural network model for answer selection. In: Proceedings of the 26th international conference on world wide web companion, WWW’17 Companion, Republic and Canton of Geneva, CHE, 2017. International World Wide Web Conferences Steering Committee, pp 789–790
21.
Zurück zum Zitat Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kauf-mann Publishers Inc., San FranciscoMATH Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques, 3rd edn. Morgan Kauf-mann Publishers Inc., San FranciscoMATH
22.
Zurück zum Zitat Heriyanto D (2018) The effectiveness of using youtube for vocabulary mastery. ETERNAL (English Teach J ) 06 Heriyanto D (2018) The effectiveness of using youtube for vocabulary mastery. ETERNAL (English Teach J ) 06
23.
Zurück zum Zitat Hoogeveen D, Bennett A, Li Y, Verspoor KM, Baldwin T (2018) Detecting misflagged duplicate questions in community question-answering archives. In: ICWSM, 2018. Hoogeveen D, Bennett A, Li Y, Verspoor KM, Baldwin T (2018) Detecting misflagged duplicate questions in community question-answering archives. In: ICWSM, 2018.
24.
Zurück zum Zitat Hua H, Li X, Dou D, Xu C-Z, Luo J (2021) Noise stability regularization for improving BERT fine-tuning. In: Proceedings of the 2021 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 3229–3241 Hua H, Li X, Dou D, Xu C-Z, Luo J (2021) Noise stability regularization for improving BERT fine-tuning. In: Proceedings of the 2021 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 3229–3241
25.
Zurück zum Zitat Hutto CJ, Gil E (2014) Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: ICWSM. The AAAI Press Hutto CJ, Gil E (2014) Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: ICWSM. The AAAI Press
26.
Zurück zum Zitat Imtiaz Z, Umer M, Ahmad M, Ullah S, Choi GS, Mehmood A (2020) Duplicate questions pair detection using siamese malstm. IEEE Access 8:21932–21942CrossRef Imtiaz Z, Umer M, Ahmad M, Ullah S, Choi GS, Mehmood A (2020) Duplicate questions pair detection using siamese malstm. IEEE Access 8:21932–21942CrossRef
27.
Zurück zum Zitat Jelodar H, Wang Y, Vajdi A, Rabbani M, Zhao R, Boukela L, Li H (2020) A hybrid fuzzy system via topic model for recommending highlight topics of CQA in developer communities. J Circuits Syst Comput 29(15):2050248CrossRef Jelodar H, Wang Y, Vajdi A, Rabbani M, Zhao R, Boukela L, Li H (2020) A hybrid fuzzy system via topic model for recommending highlight topics of CQA in developer communities. J Circuits Syst Comput 29(15):2050248CrossRef
28.
Zurück zum Zitat Kamath S, Grau B, Ma Y (2019) Predicting and integrating expected answer types into a simple recurrent neural network model for answer sentence selection. In: 20th international conference on computational linguistics and intelligent text processing, La Rochelle, France, April 2019. Kamath S, Grau B, Ma Y (2019) Predicting and integrating expected answer types into a simple recurrent neural network model for answer sentence selection. In: 20th international conference on computational linguistics and intelligent text processing, La Rochelle, France, April 2019.
29.
Zurück zum Zitat Karan M, Šnajder J (2018) Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval. Expert Syst Appl 91:418–433CrossRef Karan M, Šnajder J (2018) Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval. Expert Syst Appl 91:418–433CrossRef
30.
Zurück zum Zitat Karpukhin V, Oğuz B, Min S, Lewis P, Wu L, Edunov S, Chen D, tau Yih W (2020) Dense passage retrieval for open-domain question answering Karpukhin V, Oğuz B, Min S, Lewis P, Wu L, Edunov S, Chen D, tau Yih W (2020) Dense passage retrieval for open-domain question answering
31.
Zurück zum Zitat Kumar S, Chauhan A (2019) Enriching textbooks by question-answers using CQA. In: TENCON 2019-2019 IEEE Region 10 conference (TENCON), pp 707–714 Kumar S, Chauhan A (2019) Enriching textbooks by question-answers using CQA. In: TENCON 2019-2019 IEEE Region 10 conference (TENCON), pp 707–714
33.
Zurück zum Zitat Laskar MTR, Hoque E, Huang JX (2020) Utilizing bidirectional encoder representations from transformers for answer selection Laskar MTR, Hoque E, Huang JX (2020) Utilizing bidirectional encoder representations from transformers for answer selection
34.
Zurück zum Zitat Laskar MTR, Huang JX, Hoque E (2020) Contextualized embeddings based transformer encoder for sentence similarity modeling in answer selection task. In: Proceedings of the 12th language resources and evaluation conference, Marseille, France, May 2020. European Language Resources Association, pp 5505–5514 Laskar MTR, Huang JX, Hoque E (2020) Contextualized embeddings based transformer encoder for sentence similarity modeling in answer selection task. In: Proceedings of the 12th language resources and evaluation conference, Marseille, France, May 2020. European Language Resources Association, pp 5505–5514
35.
Zurück zum Zitat Le QV, Mikolov T (2014) Distributed representations of sentences and documents. CoRR. abs/1405.4053 Le QV, Mikolov T (2014) Distributed representations of sentences and documents. CoRR. abs/1405.4053
36.
Zurück zum Zitat Li B, Zhou H, He J, Wang M, Yang Y, Li L (2020) On the sentence embeddings from pre-trained language models. In: EMNLP Li B, Zhou H, He J, Wang M, Yang Y, Li L (2020) On the sentence embeddings from pre-trained language models. In: EMNLP
37.
Zurück zum Zitat Liu F, Vulić I, Korhonen A, Collier N (2021) Fast, effective, and self-supervised: transforming masked language models into universal lexical and sentence encoders. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Online and Punta Cana, Dominican Republic, November 2021. Association for Computational Linguistics, pp 1442–1459 Liu F, Vulić I, Korhonen A, Collier N (2021) Fast, effective, and self-supervised: transforming masked language models into universal lexical and sentence encoders. In: Proceedings of the 2021 conference on empirical methods in natural language processing, Online and Punta Cana, Dominican Republic, November 2021. Association for Computational Linguistics, pp 1442–1459
38.
Zurück zum Zitat Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized BERT pretraining approach. CoRR. abs/1907.11692 Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized BERT pretraining approach. CoRR. abs/1907.11692
40.
Zurück zum Zitat MacKay DJC (2002) Information theory, inference and learning algorithms. Cambridge University Press, Cambridge MacKay DJC (2002) Information theory, inference and learning algorithms. Cambridge University Press, Cambridge
41.
Zurück zum Zitat Marc Jackman W, Roberts P (2014) Students’ perspectives on youtube video usage as an e-resource in the university classroom. J Educ Technol Syst 42(3):273–296CrossRef Marc Jackman W, Roberts P (2014) Students’ perspectives on youtube video usage as an e-resource in the university classroom. J Educ Technol Syst 42(3):273–296CrossRef
42.
Zurück zum Zitat McInnes L, Healy J (2018) Umap: uniform manifold approximation and projection for dimension reduction. J Open Source Softw McInnes L, Healy J (2018) Umap: uniform manifold approximation and projection for dimension reduction. J Open Source Softw
43.
Zurück zum Zitat Mikolov T, Chen K, Corrado GS, Dean J (2013) Efficient estimation of word representations in vector space Mikolov T, Chen K, Corrado GS, Dean J (2013) Efficient estimation of word representations in vector space
44.
Zurück zum Zitat Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems, vol 26. Curran Associates, Inc., Red Hook Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems, vol 26. Curran Associates, Inc., Red Hook
45.
Zurück zum Zitat Mirakyan M, Hambardzumyan K, Khachatrian H (2019) Natural language inference over interaction Mirakyan M, Hambardzumyan K, Khachatrian H (2019) Natural language inference over interaction
46.
Zurück zum Zitat Narang K, Yang C,Krishnan A, Wang J, Sundaram H, Sutter C (2019) An induced multi-relational framework for answer selection in community question answer platforms Narang K, Yang C,Krishnan A, Wang J, Sundaram H, Sutter C (2019) An induced multi-relational framework for answer selection in community question answer platforms
47.
Zurück zum Zitat Nie L, Wei X, Zhang D, Wang X, Gao Z, Yang Yi (2017) Data-driven answer selection in community QA systems. IEEE Trans Knowl Data Eng 29(6):1186–1198CrossRef Nie L, Wei X, Zhang D, Wang X, Gao Z, Yang Yi (2017) Data-driven answer selection in community QA systems. IEEE Trans Knowl Data Eng 29(6):1186–1198CrossRef
48.
Zurück zum Zitat Ostendorff M, Ruas T, Schubotz M, Rehm G, Gipp B (2020) Pairwise multi-class document classification for semantic relations between wikipedia articles. CoRR. abs/2003.09881 Ostendorff M, Ruas T, Schubotz M, Rehm G, Gipp B (2020) Pairwise multi-class document classification for semantic relations between wikipedia articles. CoRR. abs/2003.09881
49.
Zurück zum Zitat Peinelt N, Nguyen D, Liakata M (2020) tBERT: topic models and BERT joining forces for semantic similarity detection. In: Proceedings of the 58th annual meeting of the association for computational linguistics, Online, July 2020. Association for Computational Linguistics, pp 7047–7055 Peinelt N, Nguyen D, Liakata M (2020) tBERT: topic models and BERT joining forces for semantic similarity detection. In: Proceedings of the 58th annual meeting of the association for computational linguistics, Online, July 2020. Association for Computational Linguistics, pp 7047–7055
50.
Zurück zum Zitat Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on EMNLP, October 2014, pp 1532–1543 Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on EMNLP, October 2014, pp 1532–1543
51.
Zurück zum Zitat Qinghe Z, Jiang N, Yang M, Wang D (2020) A full stage data augmentation method in deep convolutional neural network for natural image classification. Discret Dyn Nat Soc 31:07MATH Qinghe Z, Jiang N, Yang M, Wang D (2020) A full stage data augmentation method in deep convolutional neural network for natural image classification. Discret Dyn Nat Soc 31:07MATH
52.
Zurück zum Zitat Qinghe Z, Tian X, Yang M, Yulin Wu, Huake Su (2020) Pac-bayesian framework-based drop-path method for 2d discriminative convolutional network pruning. Multidimension Syst Signal Process 31:07MathSciNetMATH Qinghe Z, Tian X, Yang M, Yulin Wu, Huake Su (2020) Pac-bayesian framework-based drop-path method for 2d discriminative convolutional network pruning. Multidimension Syst Signal Process 31:07MathSciNetMATH
53.
Zurück zum Zitat Rangaswamy S, Ghosh S, Jha S, Ramalingam S (2016) Metadata extraction and classification of youtube videos using sentiment analysis. In: 2016 IEEE-ICCST, Oct 2016, pp 1–2 Rangaswamy S, Ghosh S, Jha S, Ramalingam S (2016) Metadata extraction and classification of youtube videos using sentiment analysis. In: 2016 IEEE-ICCST, Oct 2016, pp 1–2
54.
Zurück zum Zitat Rao J, Liu L, Tay Y, Yang W, Shi P, Lin J (2019) Bridging the gap between relevance matching and semantic matching for short text similarity modeling. In: Proceedings of EMNLP-IJCNLP, November 2019, pp 5370–5381 Rao J, Liu L, Tay Y, Yang W, Shi P, Lin J (2019) Bridging the gap between relevance matching and semantic matching for short text similarity modeling. In: Proceedings of EMNLP-IJCNLP, November 2019, pp 5370–5381
55.
Zurück zum Zitat Reimers N, Beyer P, Gurevych I (2016) Task-oriented intrinsic evaluation of semantic textual similarity. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers, Osaka, Japan, December 2016. The COLING 2016 Organizing Committee, pp 87–96 Reimers N, Beyer P, Gurevych I (2016) Task-oriented intrinsic evaluation of semantic textual similarity. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers, Osaka, Japan, December 2016. The COLING 2016 Organizing Committee, pp 87–96
56.
Zurück zum Zitat Roy PK, Jain A, Ahmad Z, Singh JP (2021) Identifying expert users on question answering sites. In: Goyal D, Bălaş VE, Mukherjee A, de Albuquerque VHC, Gupta AK (eds) Information management and machine intelligence. Springer, Singapore, pp 285–291CrossRef Roy PK, Jain A, Ahmad Z, Singh JP (2021) Identifying expert users on question answering sites. In: Goyal D, Bălaş VE, Mukherjee A, de Albuquerque VHC, Gupta AK (eds) Information management and machine intelligence. Springer, Singapore, pp 285–291CrossRef
57.
Zurück zum Zitat Shao B, Yan J (2017) Recommending answerers for stack overflow with lda model. In: Proceedings of the 12th Chinese conference on computer supported cooperative work and social computing, ChineseCSCW’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 80–86 Shao B, Yan J (2017) Recommending answerers for stack overflow with lda model. In: Proceedings of the 12th Chinese conference on computer supported cooperative work and social computing, ChineseCSCW’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 80–86
58.
Zurück zum Zitat Singh GK, Kumar V, Bhat S, Pedanekar N (2015) Automatically augmenting learning material with practical questions to increase its relevance. In: 2015 IEEE frontiers in education conference (FIE), pp 1–7 Singh GK, Kumar V, Bhat S, Pedanekar N (2015) Automatically augmenting learning material with practical questions to increase its relevance. In: 2015 IEEE frontiers in education conference (FIE), pp 1–7
59.
Zurück zum Zitat Su J, Cao J, Liu W, Ou Y (2021) Whitening sentence representations for better semantics and faster retrieval. CoRR. abs/2103.15316 Su J, Cao J, Liu W, Ou Y (2021) Whitening sentence representations for better semantics and faster retrieval. CoRR. abs/2103.15316
60.
Zurück zum Zitat Suneera CM, Prakash J (2021) A bert-based question representation for improved question retrieval in community question answering systems. In: Patnaik S, Yang X-S, Sethi IK (eds) Advances in machine learning and computational intelligence. Springer, Singapore, pp 341–348CrossRef Suneera CM, Prakash J (2021) A bert-based question representation for improved question retrieval in community question answering systems. In: Patnaik S, Yang X-S, Sethi IK (eds) Advances in machine learning and computational intelligence. Springer, Singapore, pp 341–348CrossRef
61.
Zurück zum Zitat Syed S, Spruit M (2017) Full-text or abstract? Examining topic coherence scores using latent dirichlet allocation. In: 2017 IEEE international conference on data science and advanced analytics (DSAA), pp 165–174 Syed S, Spruit M (2017) Full-text or abstract? Examining topic coherence scores using latent dirichlet allocation. In: 2017 IEEE international conference on data science and advanced analytics (DSAA), pp 165–174
62.
Zurück zum Zitat Tay Y, Phan MC, Tuan LA, Hui SC (2017) Learning to rank question answer pairs with holographic dual lstm architecture. In: SIGIR’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 695–704 Tay Y, Phan MC, Tuan LA, Hui SC (2017) Learning to rank question answer pairs with holographic dual lstm architecture. In: SIGIR’17, New York, NY, USA, 2017. Association for Computing Machinery, pp 695–704
63.
Zurück zum Zitat van der Maaten L, Hinton G (2008) Viualizing data using t-sne. J Mach Learn Res 9:2579–2605MATH van der Maaten L, Hinton G (2008) Viualizing data using t-sne. J Mach Learn Res 9:2579–2605MATH
64.
Zurück zum Zitat Wakchaure M, Kulkarni P (2019) A scheme of answer selection in community question answering using machine learning techniques. In: 2019 international conference on intelligent computing and control systems (ICCS), pp 879–883 Wakchaure M, Kulkarni P (2019) A scheme of answer selection in community question answering using machine learning techniques. In: 2019 international conference on intelligent computing and control systems (ICCS), pp 879–883
65.
Zurück zum Zitat Wang M, Smith NA, Mitamura T (2007) What is the jeopardy model? A quasi-synchronous grammar for QA Wang M, Smith NA, Mitamura T (2007) What is the jeopardy model? A quasi-synchronous grammar for QA
66.
Zurück zum Zitat Wang L, Zhang Li, Jiang J (2020) Duplicate question detection with deep learning in stack overflow. IEEE Access 8:25964–25975CrossRef Wang L, Zhang Li, Jiang J (2020) Duplicate question detection with deep learning in stack overflow. IEEE Access 8:25964–25975CrossRef
67.
Zurück zum Zitat Wang L, Zhao W, Liu J (2021) Aligning cross-lingual sentence representations with dual momentum contrast. CoRR. abs/2109.00253 Wang L, Zhao W, Liu J (2021) Aligning cross-lingual sentence representations with dual momentum contrast. CoRR. abs/2109.00253
68.
Zurück zum Zitat Williams A, Nangia N, Bowman S (2018) A broad-coverage challenge corpus for sentence understanding through inference. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers). Association for Computational Linguistics, pp 1112–1122 Williams A, Nangia N, Bowman S (2018) A broad-coverage challenge corpus for sentence understanding through inference. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers). Association for Computational Linguistics, pp 1112–1122
69.
Zurück zum Zitat Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Huggingface’s transformers: state-of-the-art natural language processing. CoRR. abs/1910.03771 Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Huggingface’s transformers: state-of-the-art natural language processing. CoRR. abs/1910.03771
70.
Zurück zum Zitat Wu W, Sun X, Wang H (2018) Question condensing networks for answer selection in community question answering. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1: Long Papers, Melbourne, Australia, July 2018. Association for Computational Linguistics, pp 1746–1755 Wu W, Sun X, Wang H (2018) Question condensing networks for answer selection in community question answering. In: Proceedings of the 56th annual meeting of the association for computational linguistics, vol 1: Long Papers, Melbourne, Australia, July 2018. Association for Computational Linguistics, pp 1746–1755
71.
Zurück zum Zitat Xu S, Campagna G, Li J, Lam MS (2020) Schema2qa: answering complex queries on the structured web with a neural model. CoRR. abs/2001.05609 Xu S, Campagna G, Li J, Lam MS (2020) Schema2qa: answering complex queries on the structured web with a neural model. CoRR. abs/2001.05609
72.
Zurück zum Zitat Yang H, Meinel C (2014) Content based lecture video retrieval using speech and video text information. IEEE Trans Learn Technol 7(02):142–154CrossRef Yang H, Meinel C (2014) Content based lecture video retrieval using speech and video text information. IEEE Trans Learn Technol 7(02):142–154CrossRef
73.
Zurück zum Zitat Yang M, Chen L, Lyu Z, Liu J, Shen Y, Qingyao Wu (2020) Hierarchical fusion of commonsense knowledge and classifier decisions for answer selection in community question answering. Neural Netw 132:53–65CrossRef Yang M, Chen L, Lyu Z, Liu J, Shen Y, Qingyao Wu (2020) Hierarchical fusion of commonsense knowledge and classifier decisions for answer selection in community question answering. Neural Netw 132:53–65CrossRef
74.
Zurück zum Zitat Yang R, Zhang J, Gao X, Ji F, Chen H (2019) Simple and effective text matching with richer alignment features. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy, July 2019. Association for Computational Linguistics, pp 4699–4709 Yang R, Zhang J, Gao X, Ji F, Chen H (2019) Simple and effective text matching with richer alignment features. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy, July 2019. Association for Computational Linguistics, pp 4699–4709
75.
Zurück zum Zitat Yang Y, Yih W-t, Meek C (2015) WikiQA: a challenge dataset for open-domain question answering. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, September 2015, pp 2013–2018 Yang Y, Yih W-t, Meek C (2015) WikiQA: a challenge dataset for open-domain question answering. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, September 2015, pp 2013–2018
76.
Zurück zum Zitat Yang Z, Liu Q, Sun B, Zhao X (2019) Expert recommendation in community question answering: a review and future direction. Int J Crowd Sci 3:348–372CrossRef Yang Z, Liu Q, Sun B, Zhao X (2019) Expert recommendation in community question answering: a review and future direction. Int J Crowd Sci 3:348–372CrossRef
77.
Zurück zum Zitat Yao Y, Tong H, Xie T, Akoglu L, Xu F, Lu J (2015) Detecting high-quality posts in community question answering sites. Inf Sci 302(C):70–82CrossRef Yao Y, Tong H, Xie T, Akoglu L, Xu F, Lu J (2015) Detecting high-quality posts in community question answering sites. Inf Sci 302(C):70–82CrossRef
78.
Zurück zum Zitat Yianilos P (2000) Locally lifting the curse of dimensionality for nearest neighbor search. In: 11TH ACM-SIAM symposium on discrete algorithms (SODA’00) Yianilos P (2000) Locally lifting the curse of dimensionality for nearest neighbor search. In: 11TH ACM-SIAM symposium on discrete algorithms (SODA’00)
79.
Zurück zum Zitat Zhang WE, Sheng QZ, Lau JH, Abebe E, Ruan W (2018) Duplicate detection in programming question answering communities. ACM Trans Internet Technol 18(3). Zhang WE, Sheng QZ, Lau JH, Abebe E, Ruan W (2018) Duplicate detection in programming question answering communities. ACM Trans Internet Technol 18(3).
80.
Zurück zum Zitat Zheng Q, Yang M, Yang J, Zhang Q, Zhang X (2018) Improvement of generalization ability of deep CNN via implicit regularization in two-stage training process. IEEE Access 6:15844–15869CrossRef Zheng Q, Yang M, Yang J, Zhang Q, Zhang X (2018) Improvement of generalization ability of deep CNN via implicit regularization in two-stage training process. IEEE Access 6:15844–15869CrossRef
81.
Zurück zum Zitat Zhou X, Hu B, Chen Q, Wang X (2018) Recurrent convolutional neural network for answer selection in community question answering. Neurocomputing 274:8–18CrossRef Zhou X, Hu B, Chen Q, Wang X (2018) Recurrent convolutional neural network for answer selection in community question answering. Neurocomputing 274:8–18CrossRef
Metadaten
Titel
Augmenting Textbooks with cQA Question-Answers and Annotated YouTube Videos to Increase Its Relevance
verfasst von
Shobhan Kumar
Arun Chauhan
Publikationsdatum
30.06.2022
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 1/2023
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-022-10897-4

Weitere Artikel der Ausgabe 1/2023

Neural Processing Letters 1/2023 Zur Ausgabe

Neuer Inhalt