Skip to main content
Top
Published in: Empirical Software Engineering 1/2023

01-01-2023

Leveraging Stack Overflow to detect relevant tutorial fragments of APIs

Authors: Di Wu, Xiao-Yuan Jing, Hongyu Zhang, Yuming Zhou, Baowen Xu

Published in: Empirical Software Engineering | Issue 1/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Developers often use learning resources such as API tutorials and Stack Overflow (SO) to learn how to use an unfamiliar API. An API tutorial can be divided into a number of consecutive units that describe the same topic, denoted as tutorial fragments. We consider a tutorial fragment explaining the API usage knowledge as a relevant fragment of the API. Discovering relevant tutorial fragments of APIs can facilitate API understanding, learning, and application. However, existing approaches, based on supervised or unsupervised approaches, often suffer from either high manual efforts or lack of consideration of the relevance information. In this paper, we propose a novel approach, called SO2RT, to detect relevant tutorial fragments of APIs based on SO posts. SO2RT first automatically extracts relevant and irrelevant \(\left \langle API, QA \right \rangle \) pairs (QA stands for question and answer) and \(\left \langle API, FRA \right \rangle \) pairs (FRA stands for tutorial fragment). It then trains a semi-supervised transfer learning based detection model, which can transfer the API usage knowledge in SO Q&A pairs to tutorial fragments by utilizing the easy-to-extract \(\left \langle API, QA \right \rangle \) pairs. Finally, relevant fragments of APIs can be discovered by consulting the trained model. In this way, the effort for labeling the relevance between tutorial fragments and APIs can be reduced. We evaluate SO2RT on Java and Android datasets containing 21,008 \(\left \langle API, QA \right \rangle \) pairs. Experimental results show that SO2RT improves the state-of-the-art approaches in terms of F-Measure on both datasets. Our user study further confirms the effectiveness of SO2RT in practice. We also show a successful application of the relevant fragments to API recommendation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Footnotes
10
https://​www.​joda.​org/​joda-time/​userguide.​html#Intervals
Table 3
Precision, recall and F-measure of SO2RT and the baseline approaches on McGill dataset
RQ
Approaches
JodaTime
Math Library
Col.Official
Col.Jenkov
Smack
  
P
R
F
P
R
F
P
R
F
P
R
F
P
R
F
RQ1
GMR
58.82
50.50
54.05
52.00
49.06
50.49
62.69
31.79
42.18
82.14
58.97
68.66
70.00
93.33
80.00
 
FITSEA
69.00
74.17
70.24
67.89
72.70
61.53
55.74
48.62
48.10
90.44
85.17
85.17
83.38
88.33
83.90
 
FRAPT
85.19
76.67
80.70
84.78
73.58
78.79
62.03
87.50
72.59
61.19
97.62
75.23
77.94
94.64
85.48
 
SO2RT
91.43
86.49
88.89
93.75
80.36
86.54
67.44
87.88
76.32
69.79
82.65
75.68
87.50
85.96
86.73
RQ2
ModelTL
77.78
73.68
75.68
77.27
68.00
72.34
44.12
78.95
56.60
46.15
68.42
55.12
77.78
70.00
73.68
 
ModelSS
45.16
46.67
45.90
65.31
59.26
62.14
32.73
64.29
43.37
33.33
59.52
42.74
66.67
67.86
67.26
 
ModelTU
60.00
75.00
66.67
50.00
66.67
57.14
35.00
77.78
48.28
33.33
57.14
42.11
71.43
83.33
76.92
 
SO2RT
91.43
86.49
88.89
93.75
80.36
86.54
67.44
87.88
76.32
69.79
82.65
75.68
87.50
85.96
86.73
Table 4
Precision, recall and F-measure of SO2RT and the baseline approaches on android dataset
RQ
Approach
Graphics
Resources
Data
Text
  
P
R
F
P
R
F
P
R
F
P
R
F
RQ1
GMR
45.60
44.50
45.04
55.00
21.11
30.51
19.29
14.76
16.72
66.67
22.22
33.33
 
FITSEA
50.42
42.52
43.73
75.83
66.17
66.80
56.52
52.00
54.17
36.19
48.33
39.56
 
FRAPT
49.21
75.61
59.62
65.22
66.67
65.93
71.43
55.56
62.50
57.58
76.00
65.52
 
SO2RT
68.57
75.00
71.64
68.97
64.52
66.67
71.43
68.97
70.18
67.74
72.41
70.00
RQ2
ModelTL
59.09
72.22
65.00
66.67
50.00
57.14
53.33
72.73
61.54
62.50
58.82
60.61
 
ModelSS
38.71
55.81
45.71
30.70
77.78
44.03
35.00
75.00
47.73
31.58
72.00
43.90
 
ModelTU
37.50
50.00
42.86
25.00
60.00
35.29
40.00
66.67
50.00
50.00
33.00
40.00
 
SO2RT
68.57
75.00
71.64
68.97
64.52
66.67
71.43
68.97
70.18
67.74
72.41
70.00
 
Literature
go back to reference Ahasanuzzaman M, Asaduzzaman M, Roy CK, Schneider KA (2020) CAPS: a supervised technique for classifying stack overflow posts concerning API issues. Empir Softw Eng 25(2):1493–1532CrossRef Ahasanuzzaman M, Asaduzzaman M, Roy CK, Schneider KA (2020) CAPS: a supervised technique for classifying stack overflow posts concerning API issues. Empir Softw Eng 25(2):1493–1532CrossRef
go back to reference Asaduzzaman M, Mashiyat AS, Roy CK, Schneider KA (2013) Answering questions about unanswered questions of stack overflow. In: Working conference on mining software repositories, pp 97–100 Asaduzzaman M, Mashiyat AS, Roy CK, Schneider KA (2013) Answering questions about unanswered questions of stack overflow. In: Working conference on mining software repositories, pp 97–100
go back to reference Bao L, Xing Z, Xia X, Lo D, Wu M, Yang X (2020) psc2code: Denoising code extraction from programming screencasts. ACM Trans Softw Eng Methodol 29(3):1–38CrossRef Bao L, Xing Z, Xia X, Lo D, Wu M, Yang X (2020) psc2code: Denoising code extraction from programming screencasts. ACM Trans Softw Eng Methodol 29(3):1–38CrossRef
go back to reference Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH
go back to reference Bulmer MG (1979) Principles of statistics. Courier Corporation, MassachusettsMATH Bulmer MG (1979) Principles of statistics. Courier Corporation, MassachusettsMATH
go back to reference Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357CrossRefMATH Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357CrossRefMATH
go back to reference Chowdhury SA, Hindle A (2015) Mining stackoverflow to filter out off-topic IRC discussion. In: Working conference on mining software repositories, pp 422–425 Chowdhury SA, Hindle A (2015) Mining stackoverflow to filter out off-topic IRC discussion. In: Working conference on mining software repositories, pp 422–425
go back to reference Cliff N (2014) Ordinal methods for behavioral data analysis. Psychology Press, United KingdomCrossRef Cliff N (2014) Ordinal methods for behavioral data analysis. Psychology Press, United KingdomCrossRef
go back to reference Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) Liblinear: a library for large linear classification. J Mach Learn Res 9(Aug):1871–1874MATH Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) Liblinear: a library for large linear classification. J Mach Learn Res 9(Aug):1871–1874MATH
go back to reference Gao Z, Xia X, Lo D, Grundy J (2020) Technical q8a site answer recommendation via question boosting. ACM Trans Softw Eng Methodol 30(1):1–34CrossRef Gao Z, Xia X, Lo D, Grundy J (2020) Technical q8a site answer recommendation via question boosting. ACM Trans Softw Eng Methodol 30(1):1–34CrossRef
go back to reference Gretton A, Bousquet O, Smola AJ, Schölkopf B (2005) Measuring statistical dependence with hilbert-schmidt norms. In: Algorithmic learning theory, pp 63–77 Gretton A, Bousquet O, Smola AJ, Schölkopf B (2005) Measuring statistical dependence with hilbert-schmidt norms. In: Algorithmic learning theory, pp 63–77
go back to reference Gu X, Zhang H, Zhang D, Kim S (2016) Deep API learning. In: International symposium on foundations of software engineering, pp 631–642 Gu X, Zhang H, Zhang D, Kim S (2016) Deep API learning. In: International symposium on foundations of software engineering, pp 631–642
go back to reference Gu X, Zhang H, Kim S (2018) Deep code search. In: International conference on software engineering, pp 933–944 Gu X, Zhang H, Kim S (2018) Deep code search. In: International conference on software engineering, pp 933–944
go back to reference Huang Q, Xia X, Lo D (2017) Supervised vs unsupervised models: a holistic look at effort-aware just-in-time defect prediction. In: International conference on software maintenance and evolution, pp 159–170 Huang Q, Xia X, Lo D (2017) Supervised vs unsupervised models: a holistic look at effort-aware just-in-time defect prediction. In: International conference on software maintenance and evolution, pp 159–170
go back to reference Huang Q, Xia X, Xing Z, Lo D, Wang X (2018) API method recommendation without worrying about the task-api knowledge gap. In: International conference on automated software engineering, pp 293–304 Huang Q, Xia X, Xing Z, Lo D, Wang X (2018) API method recommendation without worrying about the task-api knowledge gap. In: International conference on automated software engineering, pp 293–304
go back to reference Jiang H, Zhang J, Li X, Ren Z, Lo D (2016) A more accurate model for finding tutorial segments explaining APIs. In: International conference on software analysis, evolution, and reengineering, pp 157–167 Jiang H, Zhang J, Li X, Ren Z, Lo D (2016) A more accurate model for finding tutorial segments explaining APIs. In: International conference on software analysis, evolution, and reengineering, pp 157–167
go back to reference Jiang H, Zhang J, Ren Z, Zhang T (2017) An unsupervised approach for discovering relevant tutorial fragments for apis. In: International conference on software engineering, pp 38–48 Jiang H, Zhang J, Ren Z, Zhang T (2017) An unsupervised approach for discovering relevant tutorial fragments for apis. In: International conference on software engineering, pp 38–48
go back to reference Jiang M, Huang W, Huang Z, Yen GG (2017) Integration of global and local metrics for domain adaptation learning via dimensionality reduction. IEEE Trans Cybern 47(1):38–51CrossRef Jiang M, Huang W, Huang Z, Yen GG (2017) Integration of global and local metrics for domain adaptation learning via dimensionality reduction. IEEE Trans Cybern 47(1):38–51CrossRef
go back to reference Jing X, Wu F, Dong X, Xu B (2017) An improved SDA based defect prediction framework for both within-project and cross-project class-imbalance problems. IEEE Trans Softw Eng 43(4):321–339CrossRef Jing X, Wu F, Dong X, Xu B (2017) An improved SDA based defect prediction framework for both within-project and cross-project class-imbalance problems. IEEE Trans Softw Eng 43(4):321–339CrossRef
go back to reference Joachims T (1999) Transductive inference for text classification using support vector machines. Icml 99:200–209 Joachims T (1999) Transductive inference for text classification using support vector machines. Icml 99:200–209
go back to reference Kittler J, Hater M, Duin RP (1996) Combining classifiers. In: International conference on pattern recognition, vol 2, pp 897–901 Kittler J, Hater M, Duin RP (1996) Combining classifiers. In: International conference on pattern recognition, vol 2, pp 897–901
go back to reference Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33(1):159–174CrossRefMATH Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33(1):159–174CrossRefMATH
go back to reference Le QV, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196 Le QV, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196
go back to reference Li J, Xing Z, Kabir M A (2018) Leveraging official content and social context to recommend software documentation. IEEE Trans Serv Comput 14(2):472–486CrossRef Li J, Xing Z, Kabir M A (2018) Leveraging official content and social context to recommend software documentation. IEEE Trans Serv Comput 14(2):472–486CrossRef
go back to reference Li X, Fang M, Zhang J J, Wu J (2017) Domain adaptation from rgb-d to rgb images. Signal Process 131:27–35CrossRef Li X, Fang M, Zhang J J, Wu J (2017) Domain adaptation from rgb-d to rgb images. Signal Process 131:27–35CrossRef
go back to reference Li X, Jiang H, Kamei Y, Chen X (2020a) Bridging semantic gaps between natural languages and apis with word embedding. IEEE Trans Softw Eng 46(10):1081–1097CrossRef Li X, Jiang H, Kamei Y, Chen X (2020a) Bridging semantic gaps between natural languages and apis with word embedding. IEEE Trans Softw Eng 46(10):1081–1097CrossRef
go back to reference Li Y, Sheng H, Cheng Y, Stroe D I, Teodorescu R (2020b) State-of-health estimation of lithium-ion batteries based on semi-supervised transfer component analysis. Appl Energy 115504:277 Li Y, Sheng H, Cheng Y, Stroe D I, Teodorescu R (2020b) State-of-health estimation of lithium-ion batteries based on semi-supervised transfer component analysis. Appl Energy 115504:277
go back to reference Lin B, Zampetti F, Bavota G, Penta MD, Lanza M (2019) Pattern-based mining of opinions in q&a websites. In: International conference on software engineering, pp 548–559 Lin B, Zampetti F, Bavota G, Penta MD, Lanza M (2019) Pattern-based mining of opinions in q&a websites. In: International conference on software engineering, pp 548–559
go back to reference Lin Z, Zou Y, Zhao J, Xie B (2017) Improving software text retrieval using conceptual knowledge in source code. In: International conference on automated software engineering, pp 123–134 Lin Z, Zou Y, Zhao J, Xie B (2017) Improving software text retrieval using conceptual knowledge in source code. In: International conference on automated software engineering, pp 123–134
go back to reference Ma S, Xing Z, Chen C, Chen C, Qu L, Li G (2021) Easy-to-deploy api extraction by multi-level feature embedding and transfer learning. IEEE Trans Softw Eng 47(10):2296–2311CrossRef Ma S, Xing Z, Chen C, Chen C, Qu L, Li G (2021) Easy-to-deploy api extraction by multi-level feature embedding and transfer learning. IEEE Trans Softw Eng 47(10):2296–2311CrossRef
go back to reference Maalej W, Robillard MP (2013) Patterns of knowledge in API reference documentation. IEEE Trans Softw Eng 39(9):1264–1282CrossRef Maalej W, Robillard MP (2013) Patterns of knowledge in API reference documentation. IEEE Trans Softw Eng 39(9):1264–1282CrossRef
go back to reference Manning CD (2008) Introduction to information retrieval. Cambridge University Press, United Kingdom Manning CD (2008) Introduction to information retrieval. Cambridge University Press, United Kingdom
go back to reference Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Annual conference on neural information processing systems, pp 3111–3119 Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Annual conference on neural information processing systems, pp 3111–3119
go back to reference Nguyen TD, Nguyen AT, Phan HD, Nguyen TN (2017) Exploring API embedding for API usages and applications. In: International conference on software engineering, pp 438–449 Nguyen TD, Nguyen AT, Phan HD, Nguyen TN (2017) Exploring API embedding for API usages and applications. In: International conference on software engineering, pp 438–449
go back to reference Nguyen TV, Tran NM, Phan H, Nguyen TD, Truong LH, Nguyen AT, Nguyen HA, Nguyen TN (2018) Complementing global and local contexts in representing API descriptions to improve API retrieval tasks. In: Joint meeting on european software engineering conference and symposium on the foundations of software engineering, pp 551–562 Nguyen TV, Tran NM, Phan H, Nguyen TD, Truong LH, Nguyen AT, Nguyen HA, Nguyen TN (2018) Complementing global and local contexts in representing API descriptions to improve API retrieval tasks. In: Joint meeting on european software engineering conference and symposium on the foundations of software engineering, pp 551–562
go back to reference Nigam K, Lafferty J, McCallum A (1999) Using maximum entropy for text classification. In: IJCAI-99 workshop on machine learning for information filtering, pp 61–67 Nigam K, Lafferty J, McCallum A (1999) Using maximum entropy for text classification. In: IJCAI-99 workshop on machine learning for information filtering, pp 61–67
go back to reference Pan SJ, Tsang IW, Kwok JT, Yang Q (2009) Domain adaptation via transfer component analysis. In: International joint conference on artificial intelligence, pp 1187–1192 Pan SJ, Tsang IW, Kwok JT, Yang Q (2009) Domain adaptation via transfer component analysis. In: International joint conference on artificial intelligence, pp 1187–1192
go back to reference Pan SJ, Tsang IW, Kwok JT, Yang Q (2011) Domain adaptation via transfer component analysis. IEEE Trans Neural Netw 22(2):199–210CrossRef Pan SJ, Tsang IW, Kwok JT, Yang Q (2011) Domain adaptation via transfer component analysis. IEEE Trans Neural Netw 22(2):199–210CrossRef
go back to reference Parnin C, Treude C, Grammel L, Storey MA (2012) Crowd documentation: Exploring the coverage and the dynamics of api discussions on stack overflow. Georgia Institute of Technology, Tech Rep Parnin C, Treude C, Grammel L, Storey MA (2012) Crowd documentation: Exploring the coverage and the dynamics of api discussions on stack overflow. Georgia Institute of Technology, Tech Rep
go back to reference Petrosyan G, Robillard MP, De Mori R (2015) Discovering information explaining api types using text classification. In: International conference on software engineering, pp 869–879 Petrosyan G, Robillard MP, De Mori R (2015) Discovering information explaining api types using text classification. In: International conference on software engineering, pp 869–879
go back to reference Ponzanelli L, Bavota G, Mocci A, Oliveto R, Penta M D, Haiduc S, Russo B, Lanza M (2019) Automatic identification and classification of software development video tutorial fragments. IEEE Trans Softw Eng 45(5):464–488CrossRef Ponzanelli L, Bavota G, Mocci A, Oliveto R, Penta M D, Haiduc S, Russo B, Lanza M (2019) Automatic identification and classification of software development video tutorial fragments. IEEE Trans Softw Eng 45(5):464–488CrossRef
go back to reference Raghothaman M, Wei Y, Hamadi Y (2016) SWIM: synthesizing what i mean: code search and idiomatic snippet synthesis. In: International conference on software engineering, pp 357–367 Raghothaman M, Wei Y, Hamadi Y (2016) SWIM: synthesizing what i mean: code search and idiomatic snippet synthesis. In: International conference on software engineering, pp 357–367
go back to reference Rahman MM, Roy CK (2015) An insight into the unresolved questions at stack overflow. In: Working conference on mining software repositories, pp 426–429 Rahman MM, Roy CK (2015) An insight into the unresolved questions at stack overflow. In: Working conference on mining software repositories, pp 426–429
go back to reference Rahman MM, Roy CK, Lo D (2016) RACK: automatic API recommendation using crowdsourced knowledge. In: International conference on software analysis, evolution, and reengineering, pp 349–359 Rahman MM, Roy CK, Lo D (2016) RACK: automatic API recommendation using crowdsourced knowledge. In: International conference on software analysis, evolution, and reengineering, pp 349–359
go back to reference Rahman MM, Roy CK, Lo D (2017) RACK: code search in the IDE using crowdsourced knowledge. In: International conference on software engineering, pp 51–54 Rahman MM, Roy CK, Lo D (2017) RACK: code search in the IDE using crowdsourced knowledge. In: International conference on software engineering, pp 51–54
go back to reference Rehurek R, Sojka P (2010) Software framework for topic modelling with large corpora. In: Proceedings of the LREC. Workshop on New Challenges for NLP Frameworks, Citeseer, p 2010 Rehurek R, Sojka P (2010) Software framework for topic modelling with large corpora. In: Proceedings of the LREC. Workshop on New Challenges for NLP Frameworks, Citeseer, p 2010
go back to reference Robillard M P (2009) What makes apis hard to learn? answers from developers. IEEE Softw 26(6):27–34CrossRef Robillard M P (2009) What makes apis hard to learn? answers from developers. IEEE Softw 26(6):27–34CrossRef
go back to reference Robillard MP, Chhetri YB (2015) Recommending reference API documentation. Empir Softw Eng 20(6):1558–1586CrossRef Robillard MP, Chhetri YB (2015) Recommending reference API documentation. Empir Softw Eng 20(6):1558–1586CrossRef
go back to reference Robillard MP, DeLine R (2011) A field study of API learning obstacles. Empir Softw Eng 16(6):703–732CrossRef Robillard MP, DeLine R (2011) A field study of API learning obstacles. Empir Softw Eng 16(6):703–732CrossRef
go back to reference Rubei R, Di Sipio C, Nguyen PT, Di Rocco J, Di Ruscio D (2020) Postfinder: mining stack overflow posts to support software developers. Inf Softw Technol 106367:127 Rubei R, Di Sipio C, Nguyen PT, Di Rocco J, Di Ruscio D (2020) Postfinder: mining stack overflow posts to support software developers. Inf Softw Technol 106367:127
go back to reference Smola AJ, Gretton A, Song L, Schölkopf B (2007) A hilbert space embedding for distributions. In: Algorithmic learning theory, 18th international conference, pp 13–31 Smola AJ, Gretton A, Song L, Schölkopf B (2007) A hilbert space embedding for distributions. In: Algorithmic learning theory, 18th international conference, pp 13–31
go back to reference Subramanian S, Inozemtseva L, Holmes R (2014) Live API documentation. In: International conference on software engineering, pp 643–652 Subramanian S, Inozemtseva L, Holmes R (2014) Live API documentation. In: International conference on software engineering, pp 643–652
go back to reference Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) A survey on deep transfer learning. CoRR:1808.01974 Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) A survey on deep transfer learning. CoRR:1808.​01974
go back to reference Thung F, Wang S, Lo D, Lawall J (2013) Automatic recommendation of api methods from feature requests. In: International conference on automated software engineering, pp 290–300 Thung F, Wang S, Lo D, Lawall J (2013) Automatic recommendation of api methods from feature requests. In: International conference on automated software engineering, pp 290–300
go back to reference Ting KM (2002) An instance-weighting method to induce cost-sensitive trees. IEEE Trans Knowl Data Eng 14(3):659–665MathSciNetCrossRef Ting KM (2002) An instance-weighting method to induce cost-sensitive trees. IEEE Trans Knowl Data Eng 14(3):659–665MathSciNetCrossRef
go back to reference Treude C, Robillard MP (2016) Augmenting API documentation with insights from stack overflow. In: International conference on software engineering, pp 392–403 Treude C, Robillard MP (2016) Augmenting API documentation with insights from stack overflow. In: International conference on software engineering, pp 392–403
go back to reference Treude C, Barzilay O, Storey MD (2011) How do programmers ask and answer questions on the web?. In: International conference on software engineering, pp 804–807 Treude C, Barzilay O, Storey MD (2011) How do programmers ask and answer questions on the web?. In: International conference on software engineering, pp 804–807
go back to reference Treude C, Robillard M P, Dagenais B (2015) Extracting development tasks to navigate software documentation. IEEE Trans Softw Eng 41(6):565–581CrossRef Treude C, Robillard M P, Dagenais B (2015) Extracting development tasks to navigate software documentation. IEEE Trans Softw Eng 41(6):565–581CrossRef
go back to reference Uddin G, Khomh F, Roy CK (2020) Mining api usage scenarios from stack overflow. Inf Softw Technol 122:106277CrossRef Uddin G, Khomh F, Roy CK (2020) Mining api usage scenarios from stack overflow. Inf Softw Technol 122:106277CrossRef
go back to reference Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics bulletin 1(6):80–83CrossRef Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics bulletin 1(6):80–83CrossRef
go back to reference Wu D, Jing XY, Chen H, Zhu X, Zhang H, Zuo M, Zi L, Zhu C (2018) Automatically answering api-related questions. In: International conference on software engineering: companion proceeedings, pp 270–271 Wu D, Jing XY, Chen H, Zhu X, Zhang H, Zuo M, Zi L, Zhu C (2018) Automatically answering api-related questions. In: International conference on software engineering: companion proceeedings, pp 270–271
go back to reference Wu D, Jing XY, Zhang H, Kong X, Xie Y, Huang Z (2020) Data-driven approach to application programming interface documentation mining: a review. Wiley Interdiscip Rev Data Min Knowl Disc 10(5):e1369 Wu D, Jing XY, Zhang H, Kong X, Xie Y, Huang Z (2020) Data-driven approach to application programming interface documentation mining: a review. Wiley Interdiscip Rev Data Min Knowl Disc 10(5):e1369
go back to reference Wu D, Jing X, Zhang H, Li B, Xie Y, Xu B (2021a) Generating API tags for tutorial fragments from stack overflow. Empir Softw Eng 26(4):66CrossRef Wu D, Jing X, Zhang H, Li B, Xie Y, Xu B (2021a) Generating API tags for tutorial fragments from stack overflow. Empir Softw Eng 26(4):66CrossRef
go back to reference Wu D, Jing XY, Zhang H, Zhou Y, Xu B (2021b) Leveraging Stack Overflow to detect relevant tutorial fragments of apis. In: International conference on software analysis, evolution and reengineering, pp 35–46 Wu D, Jing XY, Zhang H, Zhou Y, Xu B (2021b) Leveraging Stack Overflow to detect relevant tutorial fragments of apis. In: International conference on software analysis, evolution and reengineering, pp 35–46
go back to reference Xie W, Peng X, Liu M, Treude C, Xing Z, Zhang X, Zhao W, Zimmermann T (2020) API method recommendation via explicit matching of functionality verb phrases. In: Devanbu P, Cohen MB (eds) Joint european software engineering conference and symposium on the foundations of software engineering, pp 1015–1026 Xie W, Peng X, Liu M, Treude C, Xing Z, Zhang X, Zhao W, Zimmermann T (2020) API method recommendation via explicit matching of functionality verb phrases. In: Devanbu P, Cohen MB (eds) Joint european software engineering conference and symposium on the foundations of software engineering, pp 1015–1026
go back to reference Xu B, Ye D, Xing Z, Xia X, Chen G, Li S (2016) Predicting semantically linkable knowledge in developer online forums via convolutional neural network. In: International conference on automated software engineering, pp 51–62 Xu B, Ye D, Xing Z, Xia X, Chen G, Li S (2016) Predicting semantically linkable knowledge in developer online forums via convolutional neural network. In: International conference on automated software engineering, pp 51–62
go back to reference Xu B, Xing Z, Xia X, Lo D (2017) Answerbot: automated generation of answer summary to developers’ technical questions. In: International conference on automated software engineering, pp 706–716 Xu B, Xing Z, Xia X, Lo D (2017) Answerbot: automated generation of answer summary to developers’ technical questions. In: International conference on automated software engineering, pp 706–716
go back to reference Ye X, Shen H, Ma X, Bunescu RC, Liu C (2016) From word embeddings to document similarities for improved information retrieval in software engineering. In: International conference on software engineering, pp 404–415 Ye X, Shen H, Ma X, Bunescu RC, Liu C (2016) From word embeddings to document similarities for improved information retrieval in software engineering. In: International conference on software engineering, pp 404–415
go back to reference Zhang F, Niu H, Keivanloo I, Zou Y (2018) Expanding queries for code search using semantically related API class-names. IEEE Trans Softw Eng 44(11):1070–1082CrossRef Zhang F, Niu H, Keivanloo I, Zou Y (2018) Expanding queries for code search using semantically related API class-names. IEEE Trans Softw Eng 44(11):1070–1082CrossRef
go back to reference Zhang J, Jiang H, Ren Z, Zhang T, Huang Z (2021) Enriching api documentation with code samples and usage scenarios from crowd knowledge. IEEE Trans Softw Eng 47(6):1299–1314CrossRef Zhang J, Jiang H, Ren Z, Zhang T, Huang Z (2021) Enriching api documentation with code samples and usage scenarios from crowd knowledge. IEEE Trans Softw Eng 47(6):1299–1314CrossRef
go back to reference Zhang N, Huang Q, Xia X, Zou Y, Lo D, Xing Z (2020) Chatbot4qr: interactive query refinement for technical question retrieval. IEEE Trans Softw Eng 48(4):1185–1211CrossRef Zhang N, Huang Q, Xia X, Zou Y, Lo D, Xing Z (2020) Chatbot4qr: interactive query refinement for technical question retrieval. IEEE Trans Softw Eng 48(4):1185–1211CrossRef
go back to reference Zhao D, Xing Z, Chen C, Xia X, Li G (2019) Actionnet: vision-based workflow action recognition from programming screencasts. In: International conference on software engineering, pp 350–361 Zhao D, Xing Z, Chen C, Xia X, Li G (2019) Actionnet: vision-based workflow action recognition from programming screencasts. In: International conference on software engineering, pp 350–361
Metadata
Title
Leveraging Stack Overflow to detect relevant tutorial fragments of APIs
Authors
Di Wu
Xiao-Yuan Jing
Hongyu Zhang
Yuming Zhou
Baowen Xu
Publication date
01-01-2023
Publisher
Springer US
Published in
Empirical Software Engineering / Issue 1/2023
Print ISSN: 1382-3256
Electronic ISSN: 1573-7616
DOI
https://doi.org/10.1007/s10664-022-10235-1

Other articles of this Issue 1/2023

Empirical Software Engineering 1/2023 Go to the issue

Premium Partner