Top

Empirical Software Engineering

Published in:

01-01-2023

Leveraging Stack Overflow to detect relevant tutorial fragments of APIs

Authors: Di Wu, Xiao-Yuan Jing, Hongyu Zhang, Yuming Zhou, Baowen Xu

Published in: Empirical Software Engineering | Issue 1/2023

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Developers often use learning resources such as API tutorials and Stack Overflow (SO) to learn how to use an unfamiliar API. An API tutorial can be divided into a number of consecutive units that describe the same topic, denoted as tutorial fragments. We consider a tutorial fragment explaining the API usage knowledge as a relevant fragment of the API. Discovering relevant tutorial fragments of APIs can facilitate API understanding, learning, and application. However, existing approaches, based on supervised or unsupervised approaches, often suffer from either high manual efforts or lack of consideration of the relevance information. In this paper, we propose a novel approach, called SO2RT, to detect relevant tutorial fragments of APIs based on SO posts. SO2RT first automatically extracts relevant and irrelevant \(\left \langle API, QA \right \rangle \) pairs (QA stands for question and answer) and \(\left \langle API, FRA \right \rangle \) pairs (FRA stands for tutorial fragment). It then trains a semi-supervised transfer learning based detection model, which can transfer the API usage knowledge in SO Q&A pairs to tutorial fragments by utilizing the easy-to-extract \(\left \langle API, QA \right \rangle \) pairs. Finally, relevant fragments of APIs can be discovered by consulting the trained model. In this way, the effort for labeling the relevance between tutorial fragments and APIs can be reduced. We evaluate SO2RT on Java and Android datasets containing 21,008 \(\left \langle API, QA \right \rangle \) pairs. Experimental results show that SO2RT improves the state-of-the-art approaches in terms of F-Measure on both datasets. Our user study further confirms the effectiveness of SO2RT in practice. We also show a successful application of the relevant fragments to API recommendation.

previous article An empirical study on a single company’s cost estimations of 338 software projects

next article Towards effective assessment of steady state performance in Java software: are we there yet?

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

https://www.joda.org/joda-time/userguide.html

https://www.joda.org/joda-time/apidocs/index.html

https://archive.org/download/stackexchange

https://www.joda.org/joda-time/apidocs/index.html

http://commons.apache.org/proper/commons-math/javadocs/

https://www.oracle.com/technetwork/java/javase/documentation/index.html

http://download.igniterealtime.org/smack/docs/

https://developer.android.com/reference/packages

https://www.joda.org/joda-time/userguide.html#Intervals

Table 3

Precision, recall and F-measure of SO2RT and the baseline approaches on McGill dataset

RQ	Approaches	JodaTime			Math Library			Col.Official			Col.Jenkov			Smack
		P	R	F	P	R	F	P	R	F	P	R	F	P	R	F
RQ1	GMR	58.82	50.50	54.05	52.00	49.06	50.49	62.69	31.79	42.18	82.14	58.97	68.66	70.00	93.33	80.00
	FITSEA	69.00	74.17	70.24	67.89	72.70	61.53	55.74	48.62	48.10	90.44	85.17	85.17	83.38	88.33	83.90
	FRAPT	85.19	76.67	80.70	84.78	73.58	78.79	62.03	87.50	72.59	61.19	97.62	75.23	77.94	94.64	85.48
	SO2RT	91.43	86.49	88.89	93.75	80.36	86.54	67.44	87.88	76.32	69.79	82.65	75.68	87.50	85.96	86.73
RQ2	Model_TL	77.78	73.68	75.68	77.27	68.00	72.34	44.12	78.95	56.60	46.15	68.42	55.12	77.78	70.00	73.68
	Model_SS	45.16	46.67	45.90	65.31	59.26	62.14	32.73	64.29	43.37	33.33	59.52	42.74	66.67	67.86	67.26
	Model_TU	60.00	75.00	66.67	50.00	66.67	57.14	35.00	77.78	48.28	33.33	57.14	42.11	71.43	83.33	76.92
	SO2RT	91.43	86.49	88.89	93.75	80.36	86.54	67.44	87.88	76.32	69.79	82.65	75.68	87.50	85.96	86.73

Table 4

Precision, recall and F-measure of SO2RT and the baseline approaches on android dataset

RQ	Approach	Graphics			Resources			Data			Text
		P	R	F	P	R	F	P	R	F	P	R	F
RQ1	GMR	45.60	44.50	45.04	55.00	21.11	30.51	19.29	14.76	16.72	66.67	22.22	33.33
	FITSEA	50.42	42.52	43.73	75.83	66.17	66.80	56.52	52.00	54.17	36.19	48.33	39.56
	FRAPT	49.21	75.61	59.62	65.22	66.67	65.93	71.43	55.56	62.50	57.58	76.00	65.52
	SO2RT	68.57	75.00	71.64	68.97	64.52	66.67	71.43	68.97	70.18	67.74	72.41	70.00
RQ2	Model_TL	59.09	72.22	65.00	66.67	50.00	57.14	53.33	72.73	61.54	62.50	58.82	60.61
	Model_SS	38.71	55.81	45.71	30.70	77.78	44.03	35.00	75.00	47.73	31.58	72.00	43.90
	Model_TU	37.50	50.00	42.86	25.00	60.00	35.29	40.00	66.67	50.00	50.00	33.00	40.00
	SO2RT	68.57	75.00	71.64	68.97	64.52	66.67	71.43	68.97	70.18	67.74	72.41	70.00

https://stackoverflow.com/questions/15358409

https://www.mathworks.com/matlabcentral/fileexchange/69718-semi-supervised-learning-functions

https://sites.google.com/site/stcaso2rt/user-study

https://www.joda.org/joda-time/userguide.html#Custom_Formatters

https://stackoverflow.com/questions/6252678/

https://www.joda.org/joda-time/userguide.html#Time_fields

How to add one day to a date? (2018a). https://stackoverflow.com/questions/1005523/

Joda time - add weekdays to date (2018b). https://stackoverflow.com/questions/12728527/

JodaTime tutorial Construction (2018c). https://www.joda.org/joda-time/userguide.html#Construction

Ahasanuzzaman M, Asaduzzaman M, Roy CK, Schneider KA (2020) CAPS: a supervised technique for classifying stack overflow posts concerning API issues. Empir Softw Eng 25(2):1493–1532CrossRef

Asaduzzaman M, Mashiyat AS, Roy CK, Schneider KA (2013) Answering questions about unanswered questions of stack overflow. In: Working conference on mining software repositories, pp 97–100

Bao L, Xing Z, Xia X, Lo D, Wu M, Yang X (2020) psc2code: Denoising code extraction from programming screencasts. ACM Trans Softw Eng Methodol 29(3):1–38CrossRef

Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7:2399–2434MathSciNetMATH

Bulmer MG (1979) Principles of statistics. Courier Corporation, MassachusettsMATH

Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357CrossRefMATH

Chowdhury SA, Hindle A (2015) Mining stackoverflow to filter out off-topic IRC discussion. In: Working conference on mining software repositories, pp 422–425

Cliff N (2014) Ordinal methods for behavioral data analysis. Psychology Press, United KingdomCrossRef

Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) Liblinear: a library for large linear classification. J Mach Learn Res 9(Aug):1871–1874MATH

Gao Z, Xia X, Lo D, Grundy J (2020) Technical q8a site answer recommendation via question boosting. ACM Trans Softw Eng Methodol 30(1):1–34CrossRef

Gretton A, Bousquet O, Smola AJ, Schölkopf B (2005) Measuring statistical dependence with hilbert-schmidt norms. In: Algorithmic learning theory, pp 63–77

Gu X, Zhang H, Zhang D, Kim S (2016) Deep API learning. In: International symposium on foundations of software engineering, pp 631–642

Gu X, Zhang H, Kim S (2018) Deep code search. In: International conference on software engineering, pp 933–944

Huang Q, Xia X, Lo D (2017) Supervised vs unsupervised models: a holistic look at effort-aware just-in-time defect prediction. In: International conference on software maintenance and evolution, pp 159–170

Huang Q, Xia X, Xing Z, Lo D, Wang X (2018) API method recommendation without worrying about the task-api knowledge gap. In: International conference on automated software engineering, pp 293–304

Jiang H, Zhang J, Li X, Ren Z, Lo D (2016) A more accurate model for finding tutorial segments explaining APIs. In: International conference on software analysis, evolution, and reengineering, pp 157–167

Jiang H, Zhang J, Ren Z, Zhang T (2017) An unsupervised approach for discovering relevant tutorial fragments for apis. In: International conference on software engineering, pp 38–48

Jiang M, Huang W, Huang Z, Yen GG (2017) Integration of global and local metrics for domain adaptation learning via dimensionality reduction. IEEE Trans Cybern 47(1):38–51CrossRef

Jing X, Wu F, Dong X, Xu B (2017) An improved SDA based defect prediction framework for both within-project and cross-project class-imbalance problems. IEEE Trans Softw Eng 43(4):321–339CrossRef

Joachims T (1999) Transductive inference for text classification using support vector machines. Icml 99:200–209

Kittler J, Hater M, Duin RP (1996) Combining classifiers. In: International conference on pattern recognition, vol 2, pp 897–901

Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33(1):159–174CrossRefMATH

Le QV, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196

Li J, Xing Z, Kabir M A (2018) Leveraging official content and social context to recommend software documentation. IEEE Trans Serv Comput 14(2):472–486CrossRef

Li X, Fang M, Zhang J J, Wu J (2017) Domain adaptation from rgb-d to rgb images. Signal Process 131:27–35CrossRef

Li X, Jiang H, Kamei Y, Chen X (2020a) Bridging semantic gaps between natural languages and apis with word embedding. IEEE Trans Softw Eng 46(10):1081–1097CrossRef

Li Y, Sheng H, Cheng Y, Stroe D I, Teodorescu R (2020b) State-of-health estimation of lithium-ion batteries based on semi-supervised transfer component analysis. Appl Energy 115504:277

Lin B, Zampetti F, Bavota G, Penta MD, Lanza M (2019) Pattern-based mining of opinions in q&a websites. In: International conference on software engineering, pp 548–559

Lin Z, Zou Y, Zhao J, Xie B (2017) Improving software text retrieval using conceptual knowledge in source code. In: International conference on automated software engineering, pp 123–134

Ma S, Xing Z, Chen C, Chen C, Qu L, Li G (2021) Easy-to-deploy api extraction by multi-level feature embedding and transfer learning. IEEE Trans Softw Eng 47(10):2296–2311CrossRef

Maalej W, Robillard MP (2013) Patterns of knowledge in API reference documentation. IEEE Trans Softw Eng 39(9):1264–1282CrossRef

Manning CD (2008) Introduction to information retrieval. Cambridge University Press, United Kingdom

Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Annual conference on neural information processing systems, pp 3111–3119

Nguyen TD, Nguyen AT, Phan HD, Nguyen TN (2017) Exploring API embedding for API usages and applications. In: International conference on software engineering, pp 438–449

Nguyen TV, Tran NM, Phan H, Nguyen TD, Truong LH, Nguyen AT, Nguyen HA, Nguyen TN (2018) Complementing global and local contexts in representing API descriptions to improve API retrieval tasks. In: Joint meeting on european software engineering conference and symposium on the foundations of software engineering, pp 551–562

Nigam K, Lafferty J, McCallum A (1999) Using maximum entropy for text classification. In: IJCAI-99 workshop on machine learning for information filtering, pp 61–67

Pan SJ, Tsang IW, Kwok JT, Yang Q (2009) Domain adaptation via transfer component analysis. In: International joint conference on artificial intelligence, pp 1187–1192

Pan SJ, Tsang IW, Kwok JT, Yang Q (2011) Domain adaptation via transfer component analysis. IEEE Trans Neural Netw 22(2):199–210CrossRef

Parnin C, Treude C, Grammel L, Storey MA (2012) Crowd documentation: Exploring the coverage and the dynamics of api discussions on stack overflow. Georgia Institute of Technology, Tech Rep

Petrosyan G, Robillard MP, De Mori R (2015) Discovering information explaining api types using text classification. In: International conference on software engineering, pp 869–879

Ponzanelli L, Bavota G, Mocci A, Oliveto R, Penta M D, Haiduc S, Russo B, Lanza M (2019) Automatic identification and classification of software development video tutorial fragments. IEEE Trans Softw Eng 45(5):464–488CrossRef

Raghothaman M, Wei Y, Hamadi Y (2016) SWIM: synthesizing what i mean: code search and idiomatic snippet synthesis. In: International conference on software engineering, pp 357–367

Rahman MM, Roy CK (2015) An insight into the unresolved questions at stack overflow. In: Working conference on mining software repositories, pp 426–429

Rahman MM, Roy CK, Lo D (2016) RACK: automatic API recommendation using crowdsourced knowledge. In: International conference on software analysis, evolution, and reengineering, pp 349–359

Rahman MM, Roy CK, Lo D (2017) RACK: code search in the IDE using crowdsourced knowledge. In: International conference on software engineering, pp 51–54

Rehurek R, Sojka P (2010) Software framework for topic modelling with large corpora. In: Proceedings of the LREC. Workshop on New Challenges for NLP Frameworks, Citeseer, p 2010

Robillard M P (2009) What makes apis hard to learn? answers from developers. IEEE Softw 26(6):27–34CrossRef

Robillard MP, Chhetri YB (2015) Recommending reference API documentation. Empir Softw Eng 20(6):1558–1586CrossRef

Robillard MP, DeLine R (2011) A field study of API learning obstacles. Empir Softw Eng 16(6):703–732CrossRef

Rubei R, Di Sipio C, Nguyen PT, Di Rocco J, Di Ruscio D (2020) Postfinder: mining stack overflow posts to support software developers. Inf Softw Technol 106367:127

Smola AJ, Gretton A, Song L, Schölkopf B (2007) A hilbert space embedding for distributions. In: Algorithmic learning theory, 18th international conference, pp 13–31

Subramanian S, Inozemtseva L, Holmes R (2014) Live API documentation. In: International conference on software engineering, pp 643–652

Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) A survey on deep transfer learning. CoRR:1808.01974

Thung F, Wang S, Lo D, Lawall J (2013) Automatic recommendation of api methods from feature requests. In: International conference on automated software engineering, pp 290–300

Ting KM (2002) An instance-weighting method to induce cost-sensitive trees. IEEE Trans Knowl Data Eng 14(3):659–665MathSciNetCrossRef

Treude C, Robillard MP (2016) Augmenting API documentation with insights from stack overflow. In: International conference on software engineering, pp 392–403

Treude C, Barzilay O, Storey MD (2011) How do programmers ask and answer questions on the web?. In: International conference on software engineering, pp 804–807

Treude C, Robillard M P, Dagenais B (2015) Extracting development tasks to navigate software documentation. IEEE Trans Softw Eng 41(6):565–581CrossRef

Uddin G, Khomh F, Roy CK (2020) Mining api usage scenarios from stack overflow. Inf Softw Technol 122:106277CrossRef

Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics bulletin 1(6):80–83CrossRef

Wu D, Jing XY, Chen H, Zhu X, Zhang H, Zuo M, Zi L, Zhu C (2018) Automatically answering api-related questions. In: International conference on software engineering: companion proceeedings, pp 270–271

Wu D, Jing XY, Zhang H, Kong X, Xie Y, Huang Z (2020) Data-driven approach to application programming interface documentation mining: a review. Wiley Interdiscip Rev Data Min Knowl Disc 10(5):e1369

Wu D, Jing X, Zhang H, Li B, Xie Y, Xu B (2021a) Generating API tags for tutorial fragments from stack overflow. Empir Softw Eng 26(4):66CrossRef

Wu D, Jing XY, Zhang H, Zhou Y, Xu B (2021b) Leveraging Stack Overflow to detect relevant tutorial fragments of apis. In: International conference on software analysis, evolution and reengineering, pp 35–46

Xie W, Peng X, Liu M, Treude C, Xing Z, Zhang X, Zhao W, Zimmermann T (2020) API method recommendation via explicit matching of functionality verb phrases. In: Devanbu P, Cohen MB (eds) Joint european software engineering conference and symposium on the foundations of software engineering, pp 1015–1026

Xu B, Ye D, Xing Z, Xia X, Chen G, Li S (2016) Predicting semantically linkable knowledge in developer online forums via convolutional neural network. In: International conference on automated software engineering, pp 51–62

Xu B, Xing Z, Xia X, Lo D (2017) Answerbot: automated generation of answer summary to developers’ technical questions. In: International conference on automated software engineering, pp 706–716

Ye X, Shen H, Ma X, Bunescu RC, Liu C (2016) From word embeddings to document similarities for improved information retrieval in software engineering. In: International conference on software engineering, pp 404–415

Zhang F, Niu H, Keivanloo I, Zou Y (2018) Expanding queries for code search using semantically related API class-names. IEEE Trans Softw Eng 44(11):1070–1082CrossRef

Zhang J, Jiang H, Ren Z, Zhang T, Huang Z (2021) Enriching api documentation with code samples and usage scenarios from crowd knowledge. IEEE Trans Softw Eng 47(6):1299–1314CrossRef

Zhang N, Huang Q, Xia X, Zou Y, Lo D, Xing Z (2020) Chatbot4qr: interactive query refinement for technical question retrieval. IEEE Trans Softw Eng 48(4):1185–1211CrossRef

Zhao D, Xing Z, Chen C, Xia X, Li G (2019) Actionnet: vision-based workflow action recognition from programming screencasts. In: International conference on software engineering, pp 350–361

Title: Leveraging Stack Overflow to detect relevant tutorial fragments of APIs
Authors: Di Wu
Xiao-Yuan Jing
Hongyu Zhang
Yuming Zhou
Baowen Xu
Publication date: 01-01-2023
Publisher: Springer US
Published in: Empirical Software Engineering / Issue 1/2023
Print ISSN: 1382-3256
Electronic ISSN: 1573-7616
DOI: https://doi.org/10.1007/s10664-022-10235-1

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Other articles of this Issue 1/2023

Automatic prediction of rejected edits in Stack Overflow

An empirical study of the impact of log parsers on the performance of log-based anomaly detection

CyberSAGE: The cyber security argument graph evaluation tool

Developer discussion topics on the adoption and barriers of low code software development platforms

Selecting third-party libraries: the data scientist’s perspective

Can static analysis tools find more defects?

Premium Partner