Abstract
Predictive models are one of the most important techniques that are widely applied in many areas of software engineering. There have been a large number of primary studies that apply predictive models and that present well-performed studies in various research domains, including software requirements, software design and development, testing and debugging, and software maintenance. This article is a first attempt to systematically organize knowledge in this area by surveying a body of 421 papers on predictive models published between 2009 and 2020. We describe the key models and approaches used, classify the different models, summarize the range of key application areas, and analyze research results. Based on our findings, we also propose a set of current challenges that still need to be addressed in future work and provide a proposed research road map for these opportunities.
- [1] . 2019. Supporting analysts by dynamic extraction and classification of requirements-related knowledge. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 442–453.Google ScholarDigital Library
- [2] . 2015. Learning dependency-based change impact predictors using independent change histories. Inf. Softw. Technol. 67 (2015), 220–235.Google ScholarDigital Library
- [3] . 2018. Testing vision-based control systems using learnable evolutionary algorithms. In Proceedings of the IEEE/ACM 40th International Conference on Software Engineering (ICSE). IEEE, 1016–1026.Google ScholarDigital Library
- [4] . 2016. An empirical study of software release notes. Empir. Softw. Eng. 21, 3 (2016), 1107–1142.Google ScholarDigital Library
- [5] . 2020. Automated demarcation of requirements in textual specifications: A machine learning-based approach. Empir. Softw. Eng. 25, 6 (2020), 5454–5497.Google ScholarDigital Library
- [6] . 2020. REPD: Source code defect prediction as anomaly detection. J. Syst. Softw. 168 (2020), 110641.Google ScholarCross Ref
- [7] . 2018. Is “better data” better than “better data miners”? In Proceedings of the IEEE/ACM 40th International Conference on Software Engineering (ICSE). IEEE, 1050–1061.Google ScholarDigital Library
- [8] . 2020. CAPS: A supervised technique for classifying Stack Overflow posts concerning API issues. Empir. Softw. Eng. 25, 2 (2020), 1493–1532.Google ScholarCross Ref
- [9] . 2012. A precise method-method interaction-based cohesion metric for object-oriented classes. ACM Trans. Softw. Eng. Methodol. 21, 2 (2012), 1–34.Google ScholarDigital Library
- [10] . 2020. UI screens identification and extraction from mobile programming screencasts. In Proceedings of the 28th International Conference on Program Comprehension. 319–330.Google ScholarDigital Library
- [11] . 2014. Learning natural coding conventions. In Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering. 281–293.Google ScholarDigital Library
- [12] . 2016. Empirical assessment of machine learning-based malware detectors for Android. Empir. Softw. Eng. 21, 1 (2016), 183–211.Google ScholarDigital Library
- [13] . 2020. SCC++: Predicting the programming language of questions and snippets of stack overflow. J. Syst. Softw. 162 (2020), 110505.Google ScholarDigital Library
- [14] . 2019. Mining historical test logs to predict bugs and localize faults in the test logs. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 140–151.Google ScholarDigital Library
- [15] . 2016. Probing for requirements knowledge to stimulate architectural thinking. In Proceedings of the 38th International Conference on Software Engineering. 843–854.Google ScholarDigital Library
- [16] . 2011. Reducing the effort of bug report triage: Recommenders for development-oriented decisions. ACM Trans. Softw. Eng. Methodol. 20, 3 (2011), 1–35.Google ScholarDigital Library
- [17] . 2010. A systematic and comprehensive investigation of methods to build and evaluate fault prediction models. J. Syst. Softw. 83, 1 (2010), 2–17.Google ScholarDigital Library
- [18] . 2020. A comprehensive review on malware detection approaches. IEEE Access 8 (2020), 6249–6271.Google ScholarCross Ref
- [19] . 2012. Content classification of development emails. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 375–385.Google ScholarDigital Library
- [20] . 2020. Unsuccessful story about few shot malware family classification and siamese network to the rescue. In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering (ICSE). IEEE, 1560–1571.Google ScholarDigital Library
- [21] . 2018. Execution anomaly detection in large-scale systems through console log analysis. J. Syst. Softw. 143 (2018), 172–186.Google ScholarCross Ref
- [22] . 2018. AutoConfig: Automatic configuration tuning for distributed message systems. In Proceedings of the 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 29–40.Google ScholarDigital Library
- [23] Lingfeng Bao, Xin Xia, David Lo, and Gail C. Murphy. 2019. A large scale study of long-time contributor prediction for github projects. IEEE Transactions on Software Engineering 47, 6 (2019), 1277–1298.Google Scholar
- [24] . 2017. Who will leave the company?: A large-scale industry study of developer turnover by mining monthly work report. In Proceedings of the IEEE/ACM 14th International Conference on Mining Software Repositories (MSR). IEEE, 170–181.Google ScholarDigital Library
- [25] . 2020. psc2code: Denoising code extraction from programming screencasts. ACM Trans. Softw. Eng. Methodol. 29, 3 (2020), 1–38.Google ScholarDigital Library
- [26] . 2013. The limited impact of individual developer data on software defect prediction. Empir. Softw. Eng. 18, 3 (2013), 478–505.Google ScholarCross Ref
- [27] . 2019. On the relative value of data resampling approaches for software defect prediction. Empir. Softw. Eng. 24, 2 (2019), 602–636.Google ScholarDigital Library
- [28] . 2021. Binary level toolchain provenance identification with graph neural networks. In Proceedings of the IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 131–141.Google ScholarCross Ref
- [29] . 2020. Translating video recordings of mobile app usages into replayable scenarios. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 309–321.Google ScholarDigital Library
- [30] . 2020. Using machine learning to assist with the selection of security controls during security assessment. Empir. Softw. Eng. 25, 4 (2020), 2550–2582.Google ScholarCross Ref
- [31] . 2020. What kind of questions do developers ask on stack overflow? A comparison of automated approaches to classify posts into question categories. Empir. Softw. Eng. 25, 3 (2020), 2258–2301.Google ScholarDigital Library
- [32] . 2012. Automated, highly-accurate, bug assignment using machine learning and tossing graphs. J. Syst. Softw. 85, 10 (2012), 2275–2292.Google ScholarDigital Library
- [33] . 2020. SinkFinder: Harvesting hundreds of unknown interesting function pairs with just one seed. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 1101–1113.Google ScholarDigital Library
- [34] . 2019. High-level software requirements and iteration changes: A predictive model. Empir. Softw. Eng. 24, 3 (2019), 1610–1648.Google ScholarDigital Library
- [35] . 2013. Do all task dependencies require coordination? the role of task properties in identifying critical coordination needs in software projects. In Proceedings of the 9th Joint Meeting on Foundations of Software Engineering. 213–223.Google ScholarDigital Library
- [36] . 2014. Guide to the Software Engineering Body of Knowledge (SWEBOK (R)): Version 3.0. IEEE Computer Society Press.Google ScholarDigital Library
- [37] . 2016. Mutation-aware fault prediction. In Proceedings of the 25th International Symposium on Software Testing and Analysis. 330–341.Google ScholarDigital Library
- [38] . 2020. Planning for untangling: Predicting the difficulty of merge conflicts. In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering (ICSE). IEEE, 801–811.Google ScholarDigital Library
- [39] . 2019. Bilateral dependency neural networks for cross-language algorithm classification. In Proceedings of the IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 422–433.Google ScholarCross Ref
- [40] . 2009. The road not taken: Estimating path execution frequency statically. In Proceedings of the IEEE 31st International Conference on Software Engineering. IEEE, 144–154.Google ScholarDigital Library
- [41] . 2009. Learning a metric for code readability. IEEE Trans. Softw. Eng. 36, 4 (2009), 546–558.Google ScholarDigital Library
- [42] . 2019. When deep learning met code search. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 964–974.Google ScholarDigital Library
- [43] . 2015. Defect prediction as a multiobjective optimization problem. Softw. Test., Verif. Reliab. 25, 4 (2015), 426–459.Google ScholarDigital Library
- [44] . 2015. Assert use in GitHub projects. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 755–766.Google ScholarCross Ref
- [45] . 2018. Enhancing change prediction models using developer-related factors. J. Syst. Softw. 143 (2018), 14–28.Google ScholarCross Ref
- [46] . 2020. Improving change prediction models with code smell-related information. Empir. Softw. Eng. 25, 1 (2020), 49–95.Google ScholarDigital Library
- [47] . 2019. Not all bugs are the same: Understanding, characterizing, and classifying bug types. J. Syst. Softw. 152 (2019), 165–181.Google ScholarDigital Library
- [48] . 2016. Evaluating different families of prediction methods for estimating software project outcomes. J. Syst. Softw. 112 (2016), 48–64.Google ScholarDigital Library
- [49] . 2013. A scalable approach for malware detection through bounded feature space behavior modeling. In Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 312–322.Google ScholarDigital Library
- [50] . 2020. BUILDFAST: History-aware build outcome prediction for fast feedback and reduced cost in continuous integration. In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 42–53.Google ScholarDigital Library
- [51] . 2016. Learning a dual-language vector space for domain-specific cross-lingual question retrieval. In Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 744–755.Google ScholarDigital Library
- [52] . 2019. Automatically detecting the scopes of source code comments. J. Syst. Softw. 153 (2019), 45–63.Google ScholarDigital Library
- [53] . 2020. Software visualization and deep transfer learning for effective software defect prediction. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 578–589.Google ScholarDigital Library
- [54] . 2020. An automatic software vulnerability classification framework using term frequency-inverse gravity moment and feature selection. J. Syst. Softw. 167 (2020), 110616.Google ScholarCross Ref
- [55] . 2021. Why my code summarization model does not work: Code comment improvement with category prediction. ACM Trans. Softw. Eng. Methodol. 30, 2 (2021), 1–29.Google ScholarDigital Library
- [56] . 2017. An empirical study on the effect of testing on code quality using topic models: A case study on software development systems. IEEE Trans. Reliab. 66, 3 (2017), 806–824.Google ScholarCross Ref
- [57] Xin Chen, He Jiang, Xiaochen Li, Liming Nie, Dongjin Yu, Tieke He, and Zhenyu Chen. 2020. A systemic framework for crowdsourced test report quality assessment. Empirical Software Engineering 25, 2 (2020), 1382–1418.Google Scholar
- [58] . 2020. A machine learning approach for vulnerability curation. In Proceedings of the 17th International Conference on Mining Software Repositories. 32–42.Google ScholarDigital Library
- [59] . 2019. SEntiMoji: An emoji-powered learning approach for sentiment analysis in software engineering. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 841–852.Google ScholarDigital Library
- [60] . 2021. Emoji-powered sentiment and emotion detection from software developers’ communication data. ACM Trans. Softw. Eng. Methodol. 30, 2 (2021), 1–48.Google ScholarDigital Library
- [61] . 2015. Characterization and prediction of issue-related risks in software projects. In Proceedings of the IEEE/ACM 12th Working Conference on Mining Software Repositories. IEEE, 280–291.Google ScholarDigital Library
- [62] . 2010. RESISTing reliability degradation through proactive reconfiguration. In Proceedings of the IEEE/ACM International Conference on Automated Software Engineering. 83–92.Google ScholarDigital Library
- [63] . 2013. A learning-based method for combining testing techniques. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 142–151.Google ScholarCross Ref
- [64] . 2019. Lessons learned from using a deep tree-based model for software defect prediction in practice. In Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 46–57.Google ScholarDigital Library
- [65] Hoa Khanh Dam, Truyen Tran, Trang Pham, Shien Wee Ng, John Grundy, and Aditya Ghose. 2018. Automatic feature learning for predicting vulnerable software components. IEEE Transactions on Software Engineering 47, 1 (2018), 67–85.Google Scholar
- [66] . 2010. A symbolic fault-prediction model based on multiobjective particle swarm optimization. J. Syst. Softw. 83, 5 (2010), 868–882.Google ScholarDigital Library
- [67] . 2017. Predicting likelihood of requirement implementation within the planned iteration: An empirical study at IBM. In Proceedings of the IEEE/ACM 14th International Conference on Mining Software Repositories (MSR). IEEE, 124–134.Google ScholarDigital Library
- [68] . 2011. Data mining techniques for software effort estimation: A comparative study. IEEE Trans. Softw. Eng. 38, 2 (2011), 375–397.Google ScholarDigital Library
- [69] . 2012. Toward comprehensible software fault prediction models using Bayesian network classifiers. IEEE Trans. Softw. Eng. 39, 2 (2012), 237–257.Google ScholarDigital Library
- [70] . 2020. Detecting and characterizing bots that commit code. In Proceedings of the 17th International Conference on Mining Software Repositories. 209–219.Google ScholarDigital Library
- [71] . 2015. On the role of developer’s scattered changes in bug prediction. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 241–250.Google ScholarDigital Library
- [72] . 2018. Detecting code smells using machine learning techniques: Are we there yet? In Proceedings of the IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 612–621.Google ScholarCross Ref
- [73] . 2015. Development emails content analyzer: Intention mining in developer discussions (T). In Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 12–23.Google ScholarDigital Library
- [74] . 2018. Identifying implementation bugs in machine learning based image classifiers using metamorphic testing. In Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis. 118–128.Google ScholarCross Ref
- [75] . 2020. Predicting developers’ negative feelings about code review. In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering (ICSE). IEEE, 174–185.Google ScholarDigital Library
- [76] . 2019. Predicting defects with latent and semantic features from commit logs in an industrial setting. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering Workshop (ASEW). IEEE, 98–105.Google ScholarCross Ref
- [77] . 2021. Investigating the performance of personalized models for software defect prediction. J. Syst. Softw. 181 (2021), 111038.Google ScholarDigital Library
- [78] . 2018. Keep it simple: Is deep learning good for linguistic smell detection? In Proceedings of the IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 602–611.Google ScholarCross Ref
- [79] . 2011. Empirical principles and an industrial case study in retrieving equivalent requirements via natural language processing techniques. IEEE Trans. Softw. Eng. 39, 1 (2011), 18–44.Google ScholarDigital Library
- [80] . 2017. Estimating the number of remaining links in traceability recovery. Empir. Softw. Eng. 22, 3 (2017), 996–1027.Google ScholarDigital Library
- [81] Davide Falessi, Justin Roll, Jin L. C. Guo, and Jane Cleland-Huang. 2018. Leveraging historical associations between requirements and source code to identify impacted classes. IEEE Transactions on Software Engineering 46, 4 (2018), 420–441.Google Scholar
- [82] Yuanrui Fan, Xin Xia, Daniel Alencar da Costa, David Lo, Ahmed E. Hassan, and Shanping Li. 2019. The impact of mislabeled changes by szz on just-in-time defect prediction. IEEE Transactions on Software Engineering 47, 8 (2019), 1559–1586.Google Scholar
- [83] Yuanrui Fan, Xin Xia, David Lo, and Ahmed E. Hassan. 2018. Chaff from the wheat: Characterizing and determining valid bug reports. IEEE Transactions on Software Engineering 46, 5 (2018), 495–525.Google Scholar
- [84] . 2018. Early prediction of merged code changes to prioritize reviewing tasks. Empir. Softw. Eng. 23, 6 (2018), 3346–3393.Google ScholarDigital Library
- [85] . 2020. Functional code clone detection with syntax and semantics fusion learning. In Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis. 516–527.Google ScholarDigital Library
- [86] . 2013. Detecting inconsistencies in wrappers: A case study. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 1022–1031.Google ScholarCross Ref
- [87] . 2015. Supporting self-adaptation via quantitative verification and sensitivity analysis at run time. IEEE Trans. Softw. Eng. 42, 1 (2015), 75–99.Google ScholarDigital Library
- [88] . 2016. Comparing and experimenting machine learning techniques for code smell detection. Empir. Softw. Eng. 21, 3 (2016), 1143–1191.Google ScholarCross Ref
- [89] . 2014. Using psycho-physiological measures to assess task difficulty in software development. In Proceedings of the 36th International Conference on Software Engineering. 402–413.Google ScholarDigital Library
- [90] . 2017. Easy over hard: A case study on deep learning. In Proceedings of the 11th Joint Meeting on Foundations of Software Engineering. 49–60.Google ScholarDigital Library
- [91] . 2017. Revisiting unsupervised learning for defect prediction. In Proceedings of the 11th Joint Meeting on Foundations of Software Engineering. 72–83.Google ScholarDigital Library
- [92] . 2015. Revisiting the impact of classification techniques on the performance of defect prediction models. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 789–800.Google ScholarCross Ref
- [93] . 2012. Method-level bug prediction. In Proceedings of the ACM-IEEE International Symposium on Empirical Software Engineering and Measurement. IEEE, 171–180.Google ScholarDigital Library
- [94] . 2011. Comparing fine-grained source code changes and code churn for bug prediction. In Proceedings of the 8th Working Conference on Mining Software Repositories. 83–92.Google ScholarDigital Library
- [95] . 2020. Recognizing developers’ emotions while programming. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 666–677.Google ScholarDigital Library
- [96] . 2019. Empirical evaluation of the impact of class overlap on software defect prediction. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 698–709.Google ScholarDigital Library
- [97] . 2017. Can latent topics in source code predict missing architectural tactics? In Proceedings of the IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 15–26.Google ScholarDigital Library
- [98] . 2014. Data-guided repair of selection statements. In Proceedings of the 36th International Conference on Software Engineering. 243–253.Google ScholarDigital Library
- [99] . 2011. Finding relevant answers in software forums. In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 323–332.Google ScholarDigital Library
- [100] Giovanni Grano, Fabio Palomba, and Harald C. Gall. 2019. Lightweight assessment of test-case effectiveness using source-code-quality indicators. IEEE Transactions on Software Engineering 47, 4 (2019), 758–774.Google Scholar
- [101] . 2013. You are the only possible oracle: Effective test selection for end users of interactive machine learning systems. IEEE Trans. Softw. Eng. 40, 3 (2013), 307–323.Google ScholarDigital Library
- [102] . 2019. Does the fault reside in a stack trace? Assisting crash localization by predicting crashing fault residence. J. Syst. Softw. 148 (2019), 88–104.Google ScholarCross Ref
- [103] . 2013. Variability-aware performance prediction: A statistical learning approach. In Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 301–311.Google ScholarDigital Library
- [104] . 2010. Characterizing and predicting which bugs get fixed: An empirical study of Microsoft Windows. In Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering. 495–504.Google ScholarDigital Library
- [105] . 2019. DeepPerf: Performance prediction for configurable software with deep sparse neural network. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 1095–1106.Google ScholarDigital Library
- [106] . 2018. Is this class thread-safe? Inferring documentation using graph-based learning. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. 41–52.Google ScholarDigital Library
- [107] . 2013. Automatic query reformulations for text retrieval in software engineering. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 842–851.Google ScholarCross Ref
- [108] . 2012. Automatic query performance assessment during the retrieval of software artifacts. In Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering. 90–99.Google ScholarDigital Library
- [109] . 2012. Evaluating the specificity of text retrieval queries to support software engineering tasks. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 1273–1276.Google ScholarDigital Library
- [110] . 2020. Automating system test case classification and prioritization for use case-driven testing in product lines. Empir. Softw. Eng. 25, 5 (2020), 3711–3769.Google ScholarDigital Library
- [111] . 2017. Learning to predict severity of software vulnerability using only vulnerability description. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 125–136.Google ScholarCross Ref
- [112] . 2014. Finding patterns in static analysis alerts: Improving actionable alert ranking. In Proceedings of the 11th Working Conference on Mining Software Repositories. 152–161.Google ScholarDigital Library
- [113] . 2010. Fault-prone module detection using large-scale text features based on spam filtering. Empir. Softw. Eng. 15, 2 (2010), 147–165.Google ScholarDigital Library
- [114] . 2012. Bug prediction based on fine-grained module histories. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 200–210.Google ScholarDigital Library
- [115] . 2020. Duplicate bug report detection using dual-channel convolutional neural networks. In Proceedings of the 28th International Conference on Program Comprehension. 117–127.Google ScholarDigital Library
- [116] . 2015. An empirical study on software defect prediction with a simplified metric set. Inf. Softw. Technol. 59 (2015), 170–190.Google ScholarDigital Library
- [117] . 2018. Deep learning type inference. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 152–162.Google ScholarDigital Library
- [118] . 2016. The impact of tangled code changes on defect prediction models. Empir. Softw. Eng. 21, 2 (2016), 303–336.Google ScholarDigital Library
- [119] . 2014. The impact of test ownership and team structure on the reliability and effectiveness of quality test runs. In Proceedings of the 8th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. 1–10.Google ScholarDigital Library
- [120] . 2019. The review linkage graph for code review analytics: A recovery approach and empirical study. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 578–589.Google ScholarDigital Library
- [121] . 2019. DeepJIT: An end-to-end deep learning framework for just-in-time defect prediction. In Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 34–45.Google ScholarDigital Library
- [122] . 2016. When should internal interfaces be promoted to public? In Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering. 278–289.Google ScholarDigital Library
- [123] . 2017. A systematic literature review and meta-analysis on cross project defect prediction. IEEE Trans. Softw. Eng. 45, 2 (2017), 111–147.Google ScholarCross Ref
- [124] . 2018. A benchmark study on the effectiveness of search-based data selection and feature selection for cross project defect prediction. Inf. Softw. Technol. 95 (2018), 296–312.Google ScholarCross Ref
- [125] . 2013. Using citation influence to predict software defects. In Proceedings of the 10th Working Conference on Mining Software Repositories (MSR). IEEE, 419–428.Google ScholarDigital Library
- [126] . 2015. GPredict: Generic predictive concurrency analysis. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 847–857.Google ScholarCross Ref
- [127] . 2018. Identifying self-admitted technical debt in open source projects using text mining. Empir. Softw. Eng. 23, 1 (2018), 418–451.Google ScholarDigital Library
- [128] . 2017. Supervised vs unsupervised models: A holistic look at effort-aware just-in-time defect prediction. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 159–170.Google ScholarCross Ref
- [129] . 2018. Automating intention mining. IEEE Trans. Softw. Eng. 46, 10 (2018), 1098–1119.Google ScholarCross Ref
- [130] Rubing Huang, Weifeng Sun, Yinyin Xu, Haibo Chen, Dave Towey, and Xin Xia. 2019. A survey on adaptive random testing. IEEE Transactions on Software Engineering 47, 10 (2019), 2052–2083.Google Scholar
- [131] . 2018. Tell them apart: Distilling technology differences from crowd-scale comparison discussions. In Proceedings of the 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 214–224.Google ScholarDigital Library
- [132] . 2020. CommtPst: Deep learning source code for commenting positions prediction. J. Syst. Softw. 170 (2020), 110754.Google ScholarCross Ref
- [133] . 2021. Predicting community smells’ occurrence on individual developers by sentiments. arXiv preprint arXiv:2103.07090 (2021).Google Scholar
- [134] . 2021. EnHMM: On the use of ensemble HMMs and stack traces to predict the reassignment of bug report fields. In Proceedings of the IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 411–421.Google ScholarCross Ref
- [135] . 2009. Improving bug triage with bug tossing graphs. In Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering. 111–120.Google ScholarDigital Library
- [136] . 2018. Using frame semantics for classifying and summarizing application store reviews. Empir. Softw. Eng. 23, 6 (2018), 3734–3767.Google ScholarDigital Library
- [137] . 2019. Automatically detecting missing cleanup for ungraceful exits. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 751–762.Google ScholarDigital Library
- [138] . 2017. What causes my test alarm? Automatic cause analysis for test alarms in system and integration testing. In Proceedings of the IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 712–723.Google ScholarDigital Library
- [139] . 2017. An unsupervised approach for discovering relevant tutorial fragments for APIs. In Proceedings of the IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 38–48.Google ScholarDigital Library
- [140] . 2020. How to mitigate the incident? An effective troubleshooting guide recommendation technique for online service systems. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 1410–1420.Google ScholarDigital Library
- [141] . 2013. Personalized defect prediction. In Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 279–289.Google ScholarDigital Library
- [142] . 2015. Heterogeneous cross-company defect prediction by unified metric representation and CCA-based transfer learning. In Proceedings of the 10th Joint Meeting on Foundations of Software Engineering. 496–507.Google ScholarDigital Library
- [143] . 2016. An improved SDA based defect prediction framework for both within-project and cross-project class-imbalance problems. IEEE Trans. Softw. Eng. 43, 4 (2016), 321–339.Google ScholarDigital Library
- [144] . 2014. Dictionary learning based software defect prediction. In Proceedings of the 36th International Conference on Software Engineering. 414–423.Google ScholarDigital Library
- [145] . 2016. Automated bug assignment: Ensemble-based machine learning in large scale industrial contexts. Empir. Softw. Eng. 21, 4 (2016), 1533–1578.Google ScholarDigital Library
- [146] . 2018. Examining the stability of logging statements. Empir. Softw. Eng. 23, 1 (2018), 290–333.Google ScholarDigital Library
- [147] . 2019. Distance-based sampling of software configuration spaces. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 1084–1094.Google ScholarDigital Library
- [148] . 2016. Studying just-in-time defect prediction using cross-project models. Empir. Softw. Eng. 21, 5 (2016), 2072–2106.Google ScholarDigital Library
- [149] . 2012. A large-scale empirical study of just-in-time quality assurance. IEEE Trans. Softw. Eng. 39, 6 (2012), 757–773.Google ScholarDigital Library
- [150] . 2012. EClass: An execution classification approach to improving the energy-efficiency of software via machine learning. J. Syst. Softw. 85, 4 (2012), 960–973.Google ScholarDigital Library
- [151] . 2020. A defect estimator for source code: Linking defect reports with programming constructs usage metrics. ACM Trans. Softw. Eng. Methodol. 29, 2 (2020), 1–35.Google ScholarDigital Library
- [152] . 2007. Guidelines for Performing Systematic Literature Reviews in Software Engineering. Technical report, Ver. 2.3 EBSE Technical Report. EBSE.Google Scholar
- [153] . 2011. Predicting post-release defects using pre-release field testing results. In Proceedings of the 27th IEEE International Conference on Software Maintenance (ICSM). IEEE, 253–262.Google ScholarDigital Library
- [154] . 2011. BDTEX: A GQM-based Bayesian approach for the detection of antipatterns. J. Syst. Softw. 84, 4 (2011), 559–572.Google ScholarDigital Library
- [155] . 2016. Using dynamic and contextual features to predict issue lifetime in GitHub projects. In Proceedings of the IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR). IEEE, 291–302.Google ScholarDigital Library
- [156] . 2013. Where should we fix this bug? A two-phase recommendation model. IEEE Trans. Softw. Eng. 39, 11 (2013), 1597–1610.Google ScholarDigital Library
- [157] . 2011. Which crashes should I fix first?: Predicting top crashes at an early stage to prioritize debugging efforts. IEEE Trans. Softw. Eng. 37, 3 (2011), 430–447.Google ScholarDigital Library
- [158] . 2020. Finding client-side business flow tampering vulnerabilities. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 222–233.Google ScholarDigital Library
- [159] . 2008. Classifying software changes: Clean or buggy?IEEE Trans. Softw. Eng. 34, 2 (2008), 181–196.Google ScholarDigital Library
- [160] . 2011. Dealing with noise in defect prediction. In Proceedings of the 33rd International Conference on Software Engineering (ICSE). IEEE, 481–490.Google ScholarDigital Library
- [161] . 2004. Evidence-based software engineering. In Proceedings of the 26th International Conference on Software Engineering. IEEE, 273–281.Google ScholarDigital Library
- [162] . 2011. ImpactScale: Quantifying change impact to predict faults in large software systems. In Proceedings of the 27th IEEE International Conference on Software Maintenance (ICSM). IEEE, 43–52.Google ScholarDigital Library
- [163] . 2019. The impact of feature reduction techniques on defect prediction models. Empir. Softw. Eng. 24, 4 (2019), 1925–1963.Google ScholarDigital Library
- [164] . 2020. The impact of context metrics on just-in-time defect prediction. Empir. Softw. Eng. 25, 1 (2020), 890–939.Google ScholarDigital Library
- [165] . 2006. Data preprocessing for supervised leaning. Int. J. Comput. Sci. 1, 2 (2006), 111–117.Google Scholar
- [166] . 2011. Design evolution metrics for defect prediction in object oriented systems. Empir. Softw. Eng. 16, 1 (2011), 141–175.Google ScholarDigital Library
- [167] . 2016. Too much automation? The bellwether effect and its implications for transfer learning. In Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering. 122–131.Google ScholarDigital Library
- [168] . 2019. A cost-effective strategy for software vulnerability prediction based on bellwether analysis. In Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis. 424–427.Google ScholarDigital Library
- [169] . 2018. Effective fault prediction model developed using least square support vector machine (LSSVM). J. Syst. Softw. 137 (2018), 686–712.Google ScholarCross Ref
- [170] . 2010. Predicting the severity of a reported bug. In Proceedings of the 7th IEEE Working Conference on Mining Software Repositories (MSR). IEEE, 1–10.Google ScholarCross Ref
- [171] . 2015. Rclinker: Automated linking of issue reports and commits leveraging rich contextual information. In Proceedings of the IEEE 23rd International Conference on Program Comprehension. IEEE, 36–47.Google ScholarDigital Library
- [172] . 2013. Will fault localization work for these failures? An automated approach to predict effectiveness of fault localization tools. In Proceedings of the IEEE International Conference on Software Maintenance. IEEE, 310–319.Google ScholarDigital Library
- [173] . 2017. Will this localization tool be effective for this bug? Mitigating the impact of unreliability of information retrieval based bug localization tools. Empir. Softw. Eng. 22, 4 (2017), 2237–2279.Google ScholarDigital Library
- [174] . 2011. Measuring code quality to improve specification mining. IEEE Trans. Softw. Eng. 38, 1 (2011), 175–190.Google ScholarDigital Library
- [175] . 2014. Detecting memory leaks through introspective dynamic behavior modelling using machine learning. In Proceedings of the 36th International Conference on Software Engineering. 814–824.Google ScholarDigital Library
- [176] . 2011. Micro interaction metrics for defect prediction. In Proceedings of the 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering. 311–321.Google ScholarDigital Library
- [177] . 2016. Developer micro interaction metrics for software defect prediction. IEEE Trans. Softw. Eng. 42, 11 (2016), 1015–1035.Google ScholarDigital Library
- [178] . 2008. Benchmarking classification models for software defect prediction: A proposed framework and novel findings. IEEE Trans. Softw. Eng. 34, 4 (2008), 485–496.Google ScholarDigital Library
- [179] . 2017. Towards just-in-time suggestions for log changes. Empir. Softw. Eng. 22, 4 (2017), 1831–1865.Google ScholarDigital Library
- [180] . 2020. BiLO-CPDP: Bi-level programming for automated model discovery in cross-project defect prediction. In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 573–584.Google ScholarDigital Library
- [181] . 2010. Detecting user-visible failures in AJAX web applications by analyzing users’ interaction behaviors. In Proceedings of the IEEE/ACM International Conference on Automated Software Engineering. 155–158.Google ScholarDigital Library
- [182] . 2016. An automated software reliability prediction system for safety critical software. Empir. Softw. Eng. 21, 6 (2016), 2413–2455.Google ScholarDigital Library
- [183] . 2020. Detecting and diagnosing energy issues for mobile applications. In Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis. 115–127.Google ScholarDigital Library
- [184] . 2020. Predicting node failures in an ultra-large-scale cloud computing platform: An AIOps solution. ACM Trans. Softw. Eng. Methodol. 29, 2 (2020), 1–24.Google ScholarDigital Library
- [185] . 2017. Heterogeneous defect prediction through multiple kernel learning and ensemble learning. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 91–102.Google ScholarCross Ref
- [186] . 2010. StakeNet: Using social networks to analyse the stakeholders of large-scale software projects. In Proceedings of the ACM/IEEE 32nd International Conference on Software Engineering. IEEE, 295–304.Google ScholarDigital Library
- [187] . 2019. Pattern-based mining of opinions in Q&A websites. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 548–559.Google ScholarDigital Library
- [188] . 2018. Sentiment analysis for software engineering: How far can we go? In Proceedings of the 40th International Conference on Software Engineering. 94–104.Google Scholar
- [189] . 2019. Identifying gameplay videos that exhibit bugs in computer games. Empir. Softw. Eng. 24, 6 (2019), 4006–4033.Google ScholarCross Ref
- [190] . 2018. Predicting node failure in cloud service systems. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 480–490.Google ScholarDigital Library
- [191] . 2014. On using machine learning to automatically classify software applications into domain categories. Empir. Softw. Eng. 19, 3 (2014), 582–618.Google ScholarDigital Library
- [192] Hui Liu, Jiahao Jin, Zhifeng Xu, Yifan Bu, YanzhenZou, and Lu Zhang. 2019. Deep learning based code smell detection. IEEE Transactions on Software Engineering 47, 9 (2019), 1811–1837.Google Scholar
- [193] . 2020. Automated classification of actions in bug reports of mobile apps. In Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis. 128–140.Google ScholarDigital Library
- [194] . 2017. Stochastic optimization of program obfuscation. In Proceedings of the IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 221–231.Google ScholarDigital Library
- [195] . 2018. Deep learning based feature envy detection. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. 385–396.Google ScholarDigital Library
- [196] . 2018. AndroZooOpen: Collecting large-scale open source Android apps for the research community. Star 1, 800 (2018), 1300.Google Scholar
- [197] . 2019. DaPanda: Detecting aggressive push notifications in Android apps. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 66–78.Google ScholarDigital Library
- [198] . 2019. Predicting licenses for changed source code. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 686–697.Google ScholarDigital Library
- [199] . 2018. Connecting software metrics across versions to predict defects. In Proceedings of the IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 232–243.Google ScholarCross Ref
- [200] . 2012. Active refinement of clone anomaly reports. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 397–407.Google ScholarDigital Library
- [201] . 2021. Boosting symbolic execution via constraint solving time prediction (experience paper). In Proceedings of the 30th ACM SIGSOFT International Symposium on Software Testing and Analysis. 336–347.Google ScholarDigital Library
- [202] . 2020. Can this fault be detected: A study on fault detection via automated test generation. J. Syst. Softw. 170 (2020), 110769.Google ScholarCross Ref
- [203] . 2016. Empirical analysis of network measures for effort-aware fault-proneness prediction. Inf. Softw. Technol. 69 (2016), 50–70.Google ScholarDigital Library
- [204] . 2012. Transfer learning for cross-company software defect prediction. Inf. Softw. Technol. 54, 3 (2012), 248–256.Google ScholarDigital Library
- [205] . 2019. Predictive test selection. In Proceedings of the 41st International Conference on Software Engineering: Software Engineering in Practice, ICSE (SEIP). IEEE/ACM, 91–100.
DOI: .Google ScholarDigital Library - [206] . 2019. Predictive test selection. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP). IEEE, 91–100.Google ScholarDigital Library
- [207] . 2016. Predicting build co-changes with source code change and commit categories. In Proceedings of the IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER). IEEE, 541–551.Google ScholarCross Ref
- [208] . 2019. Predicting pull request completion time: A case study on large scale cloud services. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 874–882.Google ScholarDigital Library
- [209] . 2017. An empirical study for software change prediction using imbalanced data. Empir. Softw. Eng. 22, 6 (2017), 2806–2851.Google ScholarDigital Library
- [210] . 2019. Dynamic selection of fitness function for software change prediction using particle swarm optimization. Inf. Softw. Technol. 112 (2019), 51–67.Google ScholarDigital Library
- [211] . 2013. Automatic detection of performance deviations in the load testing of large scale systems. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 1012–1021.Google ScholarCross Ref
- [212] . 2019. NL2Type: Inferring JavaScript function types from natural language information. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 304–315.Google ScholarDigital Library
- [213] . 2019. Towards understanding and detecting fake reviews in app stores. Empir. Softw. Eng. 24, 6 (2019), 3316–3355.Google ScholarCross Ref
- [214] . 2020. Hybrid deep neural networks to infer state models of black-box systems. In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 299–311.Google ScholarDigital Library
- [215] . 2019. What can Android mobile app developers do about the energy consumption of machine learning?Empir. Softw. Eng. 24, 2 (2019), 562–601.Google ScholarDigital Library
- [216] . 2011. Categorizing software applications for maintenance. In Proceedings of the 27th IEEE International Conference on Software Maintenance (ICSM). IEEE, 343–352.Google ScholarDigital Library
- [217] . 2015. Detecting and removing web application vulnerabilities with static analysis and data mining. IEEE Trans. Reliab. 65, 1 (2015), 54–69.Google ScholarCross Ref
- [218] . 2011. Does adding manpower also affect quality? An empirical, longitudinal analysis. In Proceedings of the 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering. 81–90.Google ScholarDigital Library
- [219] . 2018. 500+ times faster than deep learning: (A case study exploring faster methods for text mining stackoverflow). In Proceedings of the IEEE/ACM 15th International Conference on Mining Software Repositories (MSR). IEEE, 554–563.Google Scholar
- [220] . 2017. Predicting query quality for applications of text retrieval to software engineering tasks. ACM Trans. Softw. Eng. Methodol. 26, 1 (2017), 1–45.Google ScholarDigital Library
- [221] . 2019. Tracing with less data: Active learning for classification-based traceability link recovery. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 103–113.Google ScholarCross Ref
- [222] . 2015. Detecting, tracing, and monitoring architectural tactics in code. IEEE Trans. Softw. Eng. 42, 3 (2015), 205–220.Google ScholarDigital Library
- [223] . 2012. A tactic-centric approach for automating traceability of quality concerns. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 639–649.Google ScholarDigital Library
- [224] . 2014. Bayesian networks for evidence-based decision-making in software engineering. IEEE Trans. Softw. Eng. 40, 6 (2014), 533–554.Google ScholarCross Ref
- [225] . 2016. Studying high impact fix-inducing changes. Empir. Softw. Eng. 21, 2 (2016), 605–641.Google ScholarDigital Library
- [226] . 2012. Ranking and clustering software cost estimation models through a multiple comparisons algorithm. IEEE Trans. Softw. Eng. 39, 4 (2012), 537–551.Google ScholarDigital Library
- [227] . 2009. Amassing and indexing a large sample of version control systems: Towards the census of public source code history. In Proceedings of the 6th IEEE International Working Conference on Mining Software Repositories. IEEE, 11–20.Google ScholarDigital Library
- [228] . 2019. Training binary classifiers as data structure invariants. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE). IEEE, 759–770.Google ScholarDigital Library
- [229] . 2013. Assessing the cost effectiveness of fault prediction in acceptance testing. IEEE Trans. Softw. Eng. 39, 10 (2013), 1345–1357.Google ScholarDigital Library
- [230] . 2019. Identifying experts in software libraries and frameworks among GitHub users. In Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 276–287.Google ScholarDigital Library
- [231] . 2018. Machine learning-based prototyping of graphical user interfaces for mobile apps. IEEE Trans. Softw. Eng. 46, 2 (2018), 196–221.Google ScholarCross Ref
- [232] . 2020. Improving the effectiveness of traceability link recovery using hierarchical Bayesian networks. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 873–885.Google ScholarDigital Library
- [233] . 2015. Query-based configuration of text retrieval solutions for software engineering tasks. In Proceedings of the 10th Joint Meeting on Foundations of Software Engineering. 567–578.Google ScholarDigital Library
- [234] . 2019. Balancing the trade-off between accuracy and interpretability in software defect prediction. Empir. Softw. Eng. 24, 2 (2019), 779–825.Google ScholarDigital Library
- [235] . 2019. CloneCognition: Machine learning based code clone validation tool. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 1105–1109.Google ScholarDigital Library
- [236] . 2017. A PSO-GA approach targeting fault-prone software modules. J. Syst. Softw. 132 (2017), 41–49.Google ScholarDigital Library
- [237] . 2013. Which work-item updates need your response? In Proceedings of the 10th Working Conference on Mining Software Repositories (MSR). IEEE, 12–21.Google ScholarDigital Library
- [238] . 2015. Stuck and frustrated or in flow and happy: Sensing developers’ emotions and progress. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 688–699.Google ScholarCross Ref
- [239] . 2016. Using (bio) metrics to predict code quality online. In Proceedings of the IEEE/ACM 38th International Conference on Software Engineering (ICSE). IEEE, 452–463.Google ScholarDigital Library
- [240] . 2017. Curating GitHub for engineered software projects. Empir. Softw. Eng. 22, 6 (2017), 3219–3253.Google ScholarDigital Library
- [241] . 2018. An exploratory qualitative and quantitative analysis of emotions in issue report comments of open source systems. Empir. Softw. Eng. 23, 1 (2018), 521–564.Google ScholarDigital Library
- [242] . 2011. Diagnosing new faults using mutants and prior faults (NIER track). In Proceedings of the 33rd International Conference on Software Engineering. 960–963.Google ScholarDigital Library
- [243] . 2015. Platys: An active learning framework for place-aware application development and its evaluation. ACM Trans. Softw. Eng. Methodol. 24, 3 (2015), 1–32.Google ScholarDigital Library
- [244] . 2019. CLCDSA: Cross language code clone detection using syntactical features and API documentation. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 1026–1037.Google ScholarDigital Library
- [245] . 2005. Use of relative code churn measures to predict system defect density. In Proceedings of the 27th International Conference on Software Engineering. 284–292.Google ScholarDigital Library
- [246] . 2017. Using bad learners to find good configurations. In Proceedings of the 11th Joint Meeting on Foundations of Software Engineering. 257–267.Google ScholarDigital Library
- [247] . 2019. Marble: Mining for boilerplate code to identify API usability problems. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 615–627.Google ScholarDigital Library
- [248] . 2017. Heterogeneous defect prediction. IEEE Trans. Softw. Eng. 44, 9 (2017), 874–896.Google ScholarCross Ref
- [249] . 2015. CLAMI: Defect prediction on unlabeled datasets (T). In Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 452–463.Google ScholarDigital Library
- [250] . 2013. Transfer defect learning. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 382–391.Google ScholarDigital Library
- [251] . 2018. A multi-view context-aware approach to Android malware detection and malicious code localization. Empir. Softw. Eng. 23, 3 (2018), 1222–1274.Google ScholarDigital Library
- [252] . 2021. Automated identification of security discussions in microservices systems: Industrial surveys and experiments. J. Syst. Softw. 181 (2021), 111046.Google ScholarDigital Library
- [253] . 2012. On fault representativeness of software fault injection. IEEE Trans. Softw. Eng. 39, 1 (2012), 80–96.Google ScholarCross Ref
- [254] . 2018. App store mining is not enough for app improvement. Empir. Softw. Eng. 23, 5 (2018), 2764–2794.Google ScholarDigital Library
- [255] . 2017. Which version should be released to app store? In Proceedings of the ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM). IEEE, 324–333.Google ScholarDigital Library
- [256] . 2016. Automated change impact analysis between SysML models of requirements and design. In Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering. 242–253.Google ScholarDigital Library
- [257] . 2011. A topic-based approach for narrowing the search space of buggy files from a bug report. In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 263–272.Google ScholarDigital Library
- [258] . 2012. Duplicate bug report detection with a combination of information retrieval and topic modeling. In Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering. IEEE, 70–79.Google ScholarDigital Library
- [259] . 2020. Suggesting natural method names to check name consistencies. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 1372–1384.Google ScholarDigital Library
- [260] . 2020. Analyzing bug fix for automatic bug cause classification. J. Syst. Softw. 163 (2020), 110538.Google ScholarCross Ref
- [261] . 2017. Learning to rank code examples for code search engines. Empir. Softw. Eng. 22, 1 (2017), 259–291.Google ScholarDigital Library
- [262] . 2013. Termination proofs from tests. In Proceedings of the 9th Joint Meeting on Foundations of Software Engineering. 246–256.Google ScholarDigital Library
- [263] . 2020. Recognizing lines of code violating company-specific coding guidelines using machine learning. Empir. Softw. Eng. 25, 1 (2020), 220–265.Google ScholarDigital Library
- [264] . 2014. Software defect prediction using Bayesian networks. Empir. Softw. Eng. 19, 1 (2014), 154–181.Google ScholarDigital Library
- [265] . 2016. A novel kernel to predict software defectiveness. J. Syst. Softw. 119 (2016), 109–121.Google ScholarDigital Library
- [266] . 2019. Predicting merge conflicts in collaborative software development. In Proceedings of the ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM). IEEE, 1–11.Google ScholarCross Ref
- [267] . 2015. Seer: A lightweight online failure prediction approach. IEEE Trans. Softw. Eng. 42, 1 (2015), 26–46.Google ScholarDigital Library
- [268] . 2014. NeedFeed: Taming change notifications by modeling code relevance. In Proceedings of the 29th ACM/IEEE International Conference on Automated Software Engineering. 665–676.Google ScholarDigital Library
- [269] . 2021. Predicting the emergence of community smells using socio-technical metrics: A machine-learning approach. J. Syst. Softw. 171 (2021), 110847.Google ScholarCross Ref
- [270] Fabio Palomba, Damian Andrew Tamburri, Francesca Arcelli Fontana, Rocco Oliveto, Andy Zaidman, and Alexander Serebrenik. 2018. Beyond technical aspects: How do community smells influence the intensity of code smells? IEEE Transactions on Software Engineering 47, 1 (2018), 108–129.Google Scholar
- [271] . 2016. Smells like teen spirit: Improving bug prediction performance using the intensity of code smells. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 244–255.Google ScholarCross Ref
- [272] . 2021. Continuous test suite failure prediction. In Proceedings of the 30th ACM SIGSOFT International Symposium on Software Testing and Analysis. 553–565.Google ScholarDigital Library
- [273] . 2013. How to effectively use topic models for software engineering tasks? An approach based on genetic algorithms. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 522–531.Google ScholarCross Ref
- [274] . 2018. Are mutation scores correlated with real fault detection? A large scale empirical study on the relationship between mutants and real faults. In Proceedings of the IEEE/ACM 40th International Conference on Software Engineering (ICSE). IEEE, 537–548.Google ScholarDigital Library
- [275] . 2019. Classifying code comments in Java software systems. Empir. Softw. Eng. 24, 3 (2019), 1499–1537.Google ScholarDigital Library
- [276] . 2018. Re-evaluating method-level bug prediction. In Proceedings of the IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 592–601.Google ScholarCross Ref
- [277] . 2019. Fine-grained just-in-time defect prediction. J. Syst. Softw. 150 (2019), 22–36.Google ScholarCross Ref
- [278] . 2020. On the performance of method-level bug prediction: A negative result. J. Syst. Softw. 161 (2020), 110493.Google ScholarDigital Library
- [279] . 2020. A large empirical assessment of the role of data balancing in machine-learning-based code smell detection. J. Syst. Softw. 169 (2020), 110693.Google ScholarCross Ref
- [280] . 2019. Comparing heuristic and machine learning approaches for metric-based code smell detection. In Proceedings of the IEEE/ACM 27th International Conference on Program Comprehension (ICPC). IEEE, 93–104.Google ScholarDigital Library
- [281] . 2021. Attention-based model for predicting question relatedness on Stack Overflow. arXiv preprint arXiv:2103.10763 (2021).Google Scholar
- [282] . 2019. Cross-language clone detection by learning over abstract syntax trees. In Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 518–528.Google ScholarDigital Library
- [283] . 2012. A machine learning approach to software requirements prioritization. IEEE Trans. Softw. Eng. 39, 4 (2012), 445–461.Google ScholarDigital Library
- [284] . 2012. Privacy and utility for defect prediction: Experiments with MORPH. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 189–199.Google ScholarDigital Library
- [285] . 2013. Balancing privacy and utility in cross-company defect prediction. IEEE Trans. Softw. Eng. 39, 8 (2013), 1054–1068.Google ScholarDigital Library
- [286] . 2015. LACE2: Better privacy-preserving data sharing for cross project defect prediction. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 801–811.Google ScholarCross Ref
- [287] . 2013. Better cross company defect prediction. In Proceedings of the 10th Working Conference on Mining Software Repositories (MSR). IEEE, 409–418.Google ScholarDigital Library
- [288] Fayola Peters, Thein Than Tun, Yijun Yu, and Bashar Nuseibeh. 2017. Text filtering and ranking for security bug report prediction. IEEE Transactions on Software Engineering 45, 6 (2017), 615–631.Google Scholar
- [289] . 2015. Guidelines for conducting systematic mapping studies in software engineering: An update. Inf. Softw. Technol. 64 (2015), 1–18.Google ScholarDigital Library
- [290] . 2018. Hora: Architecture-aware online failure prediction. J. Syst. Softw. 137 (2018), 669–685.Google ScholarCross Ref
- [291] . 2021. JITLine: A simpler, better, faster, finer-grained just-in-time defect prediction. arXiv preprint arXiv:2103.07068 (2021).Google Scholar
- [292] . 2011. Ecological inference in empirical software engineering. In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 362–371.Google ScholarDigital Library
- [293] . 2020. Typewriter: Neural type prediction with search-based validation. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 209–220.Google ScholarDigital Library
- [294] . 2020. Scaffle: Bug localization on millions of files. In Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis. 225–236.Google ScholarDigital Library
- [295] . 2019. Categorizing the content of GitHub README files. Empir. Softw. Eng. 24, 3 (2019), 1296–1327.Google ScholarDigital Library
- [296] . 2011. Network versus code metrics to predict defects: A replication study. In Proceedings of the International Symposium on Empirical Software Engineering and Measurement. IEEE, 215–224.Google ScholarDigital Library
- [297] Yu Qu, Qinghua Zheng, Jianlei Chi, Yangxu Jin, Ancheng He, Di Cui, Hengshan Zhang, and Ting Liu. 2019. Using K-core decomposition on class dependency networks to improve bug prediction Model’s practical performance. IEEE Transactions on Software Engineering 47, 2 (2019), 348–3.Google Scholar
- [298] . 2013. How, and why, process metrics are better. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 432–441.Google ScholarCross Ref
- [299] . 2014. Comparing static bug finders and statistical prediction. In Proceedings of the 36th International Conference on Software Engineering. 424–434.Google ScholarDigital Library
- [300] . 2012. Recalling the “imprecision” of cross-project defect prediction. In Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering. 1–11.Google ScholarDigital Library
- [301] . 2013. Sample size vs. bias in defect prediction. In Proceedings of the 9th Joint Meeting on Foundations of Software Engineering. 147–157.Google ScholarDigital Library
- [302] . 2017. Predicting usefulness of code review comments using textual features and developer experience. In Proceedings of the IEEE/ACM 14th International Conference on Mining Software Repositories (MSR). IEEE, 215–226.Google ScholarDigital Library
- [303] . 2017. The impact of using regression models to build defect classifiers. In Proceedings of the IEEE/ACM 14th International Conference on Mining Software Repositories (MSR). IEEE, 135–145.Google ScholarDigital Library
- [304] . 2020. Descriptions of issues and comments for predicting issue success in software projects. J. Syst. Softw. 168 (2020), 110663.Google ScholarCross Ref
- [305] . 2018. Traceability in the wild: Automatically augmenting incomplete trace links. In Proceedings of the 40th International Conference on Software Engineering. 834–845.Google ScholarDigital Library
- [306] . 2018. Roles and impacts of hands-on software architects in five industrial case studies. In Proceedings of the 40th International Conference on Software Engineering. 117–127.Google ScholarDigital Library
- [307] . 2019. Identifying redundancies in fork-based development. In Proceedings of the IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 230–241.Google ScholarCross Ref
- [308] . 2013. Discovering essential code elements in informal documentation. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 832–841.Google ScholarCross Ref
- [309] . 2017. Detecting user story information in developer-client conversations to generate extractive summaries. In Proceedings of the IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 49–59.Google ScholarDigital Library
- [310] . 2011. Using source code metrics to predict change-prone Java interfaces. In Proceedings of the 27th IEEE International Conference on Software Maintenance (ICSM). IEEE, 303–312.Google ScholarDigital Library
- [311] . 2018. Toward the development of a conventional time series based web error forecasting framework. Empir. Softw. Eng. 23, 2 (2018), 570–644.Google ScholarDigital Library
- [312] . 2020. A model to detect readability improvements in incremental changes. In Proceedings of the 28th International Conference on Program Comprehension. 25–36.Google ScholarDigital Library
- [313] . 2015. Mining system logs to learn error predictors: A case study of a telemetry system. Empir. Softw. Eng. 20, 4 (2015), 879–927.Google ScholarDigital Library
- [314] . 2016. Value-cognitive boosting with a support vector machine for cross-project defect prediction. Empir. Softw. Eng. 21, 1 (2016), 43–71.Google ScholarDigital Library
- [315] . 2020. FCCI: A fuzzy expert system for identifying coincidental correct test cases. J. Syst. Softw. 168 (2020), 110635.Google ScholarCross Ref
- [316] . 2019. Project Achilles: A prototype tool for static method-level vulnerability detection of java source code using a recurrent neural network. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering Workshop (ASEW). IEEE, 114–121.Google ScholarCross Ref
- [317] . 2018. Oreo: Detection of clones in the twilight zone. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 354–365.Google ScholarDigital Library
- [318] . 2021. Can I solve it? Identifying APIs required to complete OSS task. arXiv preprint arXiv:2103.12653 (2021).Google Scholar
- [319] . 2019. Improving bug triaging with high confidence predictions at ericsson. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 81–91.Google ScholarCross Ref
- [320] . 2014. Predicting vulnerable software components via text mining. IEEE Trans. Softw. Eng. 40, 10 (2014), 993–1006.Google ScholarCross Ref
- [321] . 2015. Deep learning in neural networks: An overview. Neural Netw. 61 (2015), 85–117.Google ScholarDigital Library
- [322] John Dooley and John Zukowski. 2011. Software development and professional practice. Springer.Google Scholar
- [323] . 2013. Mining SQL injection and cross site scripting vulnerabilities using hybrid program analysis. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 642–651.Google ScholarCross Ref
- [324] . 2020. Detection of hidden feature requests from massive chat messages via deep siamese network. In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering (ICSE). IEEE, 641–653.Google ScholarDigital Library
- [325] . 2017. Metric-based software reliability prediction approach and its application. Empir. Softw. Eng. 22, 4 (2017), 1579–1633.Google ScholarDigital Library
- [326] . 2013. Studying re-opened bugs in open source software. Empir. Softw. Eng. 18, 5 (2013), 1005–1042.Google ScholarCross Ref
- [327] . 2018. Automatic topic classification of test cases using text mining at an Android smartphone vendor. In Proceedings of the 12th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. 1–10.Google ScholarDigital Library
- [328] . 2012. On the use of calling structure information to improve fault prediction. Empir. Softw. Eng. 17, 4 (2012), 390–423.Google ScholarDigital Library
- [329] . 2010. Evaluating complexity, code churn, and developer activity metrics as indicators of software vulnerabilities. IEEE Trans. Softw. Eng. 37, 6 (2010), 772–787.Google ScholarDigital Library
- [330] . 2013. Can traditional fault prediction models be used for vulnerability prediction?Empir. Softw. Eng. 18, 1 (2013), 25–59.Google ScholarCross Ref
- [331] . 2019. Automatically identifying code features for software defect prediction: Using AST N-grams. Inf. Softw. Technol. 106 (2019), 142–160.Google ScholarCross Ref
- [332] . 2012. Reducing features to improve code change-based bug prediction. IEEE Trans. Softw. Eng. 39, 4 (2012), 552–569.Google ScholarDigital Library
- [333] . 2015. Performance-influence models for highly configurable systems. In Proceedings of the 10th Joint Meeting on Foundations of Software Engineering. 284–294.Google ScholarDigital Library
- [334] . 2013. Prediction of atomic web services reliability based on k-means clustering. In Proceedings of the 9th Joint Meeting on Foundations of Software Engineering. 70–80.Google ScholarDigital Library
- [335] Joengju Sohn and Shin Yoo. 2019. Empirical evaluation of fault localisation using code and change metrics. IEEE Transactions on Software Engineering 47, 8 (2019), 1605–1625.Google Scholar
- [336] . 2016. Predicting defectiveness of software patches. In Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. 1–10.Google ScholarDigital Library
- [337] . 2013. iTree: Efficiently discovering high-coverage configurations using interaction trees. IEEE Trans. Softw. Eng. 40, 3 (2013), 251–265.Google ScholarDigital Library
- [338] . 2010. A general software defect-proneness prediction framework. IEEE Trans. Softw. Eng. 37, 3 (2010), 356–370.Google ScholarDigital Library
- [339] . 2021. Data balancing improves self-admitted technical debt detection. arXiv preprint arXiv:2103.13165 (2021).Google Scholar
- [340] . 2020. Learning features that predict developer responses for iOS app store reviews. In Proceedings of the 14th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM). 1–11.Google ScholarDigital Library
- [341] . 2020. Misbehaviour prediction for autonomous driving systems. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 359–371.Google ScholarDigital Library
- [342] . 2012. Accounting for defect characteristics in evaluations of testing techniques. ACM Trans. Softw. Eng. Methodol. 21, 3 (2012), 1–43.Google ScholarDigital Library
- [343] . 2017. Creating an invalid defect classification model using text mining on server development. J. Syst. Softw. 125 (2017), 197–206.Google ScholarDigital Library
- [344] . 2020. Adapting bug prediction models to predict reverted commits at Wayfair. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 1251–1262.Google ScholarDigital Library
- [345] . 2011. Towards more accurate retrieval of duplicate bug reports. In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 253–262.Google ScholarDigital Library
- [346] . 2010. A discriminative model approach for accurate duplicate bug report retrieval. In Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering. 45–54.Google ScholarDigital Library
- [347] . 2014. Bug characteristics in open source software. Empir. Softw. Eng. 19, 6 (2014), 1665–1705.Google ScholarDigital Library
- [348] . 2020. Bug severity prediction using question-and-answer pairs from Stack Overflow. J. Syst. Softw. 165 (2020), 110567.Google ScholarCross Ref
- [349] Chakkrit Tantithamthavorn, Ahmed E. Hassan, and Kenichi Matsumoto. 2018. The impact of class rebalancing techniques on the performance and interpretation of defect prediction models. IEEE Transactions on Software Engineering 46, 11 (2018), 1200–1219.Google Scholar
- [350] . 2015. The impact of mislabelling on the performance and interpretation of defect prediction models. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 812–823.Google ScholarCross Ref
- [351] . 2016. Automated parameter optimization of classification techniques for defect prediction models. In Proceedings of the 38th International Conference on Software Engineering. 321–332.Google ScholarDigital Library
- [352] . 2016. An empirical comparison of model validation techniques for defect prediction models. IEEE Trans. Softw. Eng. 43, 1 (2016), 1–18.Google ScholarDigital Library
- [353] . 2015. Approximating attack surfaces with stack traces. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 199–208.Google ScholarCross Ref
- [354] . 2013. The impact of classifier configuration and classifier combination on bug localization. IEEE Trans. Softw. Eng. 39, 10 (2013), 1427–1443.Google ScholarDigital Library
- [355] . 2019. Will this clone be short-lived? Towards a better understanding of the characteristics of short-lived clones. Empir. Softw. Eng. 24, 2 (2019), 937–972.Google ScholarDigital Library
- [356] . 2015. Active semi-supervised defect categorization. In Proceedings of the IEEE 23rd International Conference on Program Comprehension. IEEE, 60–70.Google ScholarDigital Library
- [357] . 2012. Identifying Linux bug fixing patches. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 386–396.Google ScholarDigital Library
- [358] . 2018. DeepTest: Automated testing of deep-neural-network-driven autonomous cars. In Proceedings of the 40th International Conference on Software Engineering. 303–314.Google ScholarDigital Library
- [359] . 2018. Software defect prediction using stacked denoising autoencoders and two-stage ensemble learning. Inf. Softw. Technol. 96 (2018), 94–111.Google ScholarDigital Library
- [360] . 2012. Software reliability modeling with software metrics data via Gaussian processes. IEEE Trans. Softw. Eng. 39, 8 (2012), 1179–1186.Google ScholarDigital Library
- [361] . 2010. Practical considerations in deploying statistical methods for defect prediction: A case study within the Turkish telecommunications industry. Inf. Softw. Technol. 52, 11 (2010), 1242–1257.Google ScholarDigital Library
- [362] . 2016. The impact of human discussions on just-in-time quality assurance: An empirical study on OpenStack and Eclipse. In Proceedings of the IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER). IEEE, 189–200.Google ScholarCross Ref
- [363] . 2019. Predicting good configurations for GitHub and stack overflow topic models. In Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 84–95.Google ScholarDigital Library
- [364] . 2018. Be careful of when: An empirical study on time-related misuse of issue tracking data. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 307–318.Google ScholarDigital Library
- [365] . 2009. On the relative value of cross-company and within-company data for defect prediction. Empir. Softw. Eng. 14, 5 (2009), 540–578.Google ScholarDigital Library
- [366] . 2021. Predicting design impactful changes in modern code review: A large-scale empirical study. In Proceedings of the IEEE/ACM 18th International Conference on Mining Software Repositories (MSR). IEEE, 471–482.Google ScholarCross Ref
- [367] . 2019. CNN-based automatic prioritization of bug reports. IEEE Trans. Reliab. 69, 4 (2019), 1341–1354.Google ScholarCross Ref
- [368] . 2014. Characterizing and predicting blocking bugs in open source projects. In Proceedings of the 11th Working Conference on Mining Software Repositories. 72–81.Google ScholarDigital Library
- [369] . 2017. Machine learning-based detection of open source license exceptions. In Proceedings of the IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 118–129.Google ScholarDigital Library
- [370] . 2018. A conceptual replication study on bugs that get fixed in open source software. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 299–310.Google ScholarCross Ref
- [371] . 2016. Online reliability prediction via motifs-based dynamic Bayesian networks for service-oriented systems. IEEE Trans. Softw. Eng. 43, 6 (2016), 556–579.Google ScholarDigital Library
- [372] . 2016. Local-based active classification of test report to assist crowdsourced testing. In Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering. 190–201.Google ScholarDigital Library
- [373] . 2021. Large-scale intent analysis for identifying large-review-effort code changes. Inf. Softw. Technol. 130 (2021), 106408.Google ScholarCross Ref
- [374] . 2018. Deep semantic feature learning for software defect prediction. IEEE Trans. Softw. Eng. 46, 12 (2018), 1267–1293.Google ScholarCross Ref
- [375] . 2016. Automatically learning semantic features for defect prediction. In Proceedings of the IEEE/ACM 38th International Conference on Software Engineering (ICSE). IEEE, 297–308.Google ScholarDigital Library
- [376] . 2012. Can I clone this piece of code here? In Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering. 170–179.Google ScholarDigital Library
- [377] . 2014. Predicting consistency-maintenance requirement of code clonesat copy-and-paste time. IEEE Trans. Softw. Eng. 40, 8 (2014), 773–794.Google ScholarCross Ref
- [378] . 2020. Detecting and explaining self-admitted technical debts with attention-based neural networks. In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering. 871–882.Google ScholarDigital Library
- [379] . 2018. How well do change sequences predict defects? Sequence learning from software changes. IEEE Trans. Softw. Eng. 46, 11 (2018), 1155–1175.Google ScholarCross Ref
- [380] . 2012. Automated inference of goal-oriented performance prediction functions. In Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering. 190–199.Google ScholarDigital Library
- [381] . 2010. Comparing the effectiveness of several modeling methods for fault prediction. Empir. Softw. Eng. 15, 3 (2010), 277–295.Google ScholarDigital Library
- [382] . 2019. Improving random GUI testing with image-based widget detection. In Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis. 307–317.Google ScholarDigital Library
- [383] . 2017. Using contextual information to predict co-changes. J. Syst. Softw. 128 (2017), 220–235.Google ScholarDigital Library
- [384] . 2009. Predicting build failures using social network analysis on developer communication. In Proceedings of the IEEE 31st International Conference on Software Engineering. IEEE, 1–11.Google ScholarDigital Library
- [385] . 2018. Detecting speech act types in developer question/answer conversations during bug repair. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 491–502.Google ScholarDigital Library
- [386] . 2011. Iterative mining of resource-releasing specifications. In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 233–242.Google ScholarDigital Library
- [387] . 2011. ReLink: Recovering links between bugs and changes. In Proceedings of the 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering. 15–25.Google ScholarDigital Library
- [388] . 2021. Improving high-impact bug report prediction with combination of interactive machine learning and active learning. Inf. Softw. Technol. 133 (2021), 106530.Google ScholarCross Ref
- [389] . 2020. SCDetector: Software functional clone detection based on semantic tokens analysis. In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 821–833.Google ScholarDigital Library
- [390] . 2015. Cross-project build co-change prediction. In Proceedings of the IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER). IEEE, 311–320.Google ScholarCross Ref
- [391] . 2016. Hydra: Massively compositional model for cross-project defect prediction. IEEE Trans. Softw. Eng. 42, 10 (2016), 977–998.Google ScholarDigital Library
- [392] . 2015. Automated bug report field reassignment and refinement prediction. IEEE Trans. Reliab. 65, 3 (2015), 1094–1113.Google ScholarCross Ref
- [393] . 2015. Automatic, high accuracy prediction of reopened bugs. Autom. Softw. Eng. 22, 1 (2015), 75–109.Google ScholarDigital Library
- [394] . 2016. Collective personalized change classification with multiobjective search. IEEE Trans. Reliab. 65, 4 (2016), 1810–1829.Google ScholarCross Ref
- [395] . 2016. Predicting crashing releases of mobile applications. In Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. 1–10.Google ScholarDigital Library
- [396] . 2018. Feedback-based integrated prediction: Defect prediction based on feedback from software testing process. J. Syst. Softw. 143 (2018), 159–171.Google ScholarCross Ref
- [397] . 2016. Predicting semantically linkable knowledge in developer online forums via convolutional neural network. In Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 51–62.Google ScholarDigital Library
- [398] . 2021. A comprehensive comparative study of clustering-based unsupervised defect prediction models. J. Syst. Softw. 172 (2021), 110862.Google ScholarCross Ref
- [399] . 2018. Cross version defect prediction with representative data via sparse subset selection. In Proceedings of the IEEE/ACM 26th International Conference on Program Comprehension (ICPC). IEEE, 132–13211.Google ScholarDigital Library
- [400] . 2019. LDFR: Learning deep feature representation for software defect prediction. J. Syst. Softw. 158 (2019), 110402.Google ScholarDigital Library
- [401] . 2018. Cross-version defect prediction via hybrid active learning with kernel principal component analysis. In Proceedings of the IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 209–220.Google ScholarCross Ref
- [402] . 2016. MICHAC: Defect prediction via feature selection based on maximal information coefficient with hierarchical agglomerative clustering. In Proceedings of the IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER). IEEE, 370–381.Google ScholarCross Ref
- [403] . 2017. File-level defect prediction: Unsupervised vs. supervised models. In Proceedings of the ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM). IEEE, 344–353.Google ScholarDigital Library
- [404] . 2018. Automating change-level self-admitted technical debt determination. IEEE Trans. Softw. Eng. 45, 12 (2018), 1211–1229.Google ScholarCross Ref
- [405] . 2018. New deep learning method to detect code injection attacks on hybrid applications. J. Syst. Softw. 137 (2018), 67–77.Google ScholarCross Ref
- [406] . 2019. Predicting co-changes between functionality specifications and source code in behavior driven development. In Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 534–544.Google ScholarDigital Library
- [407] . 2010. Automatic detection of nocuous coordination ambiguities in natural language requirements. In Proceedings of the IEEE/ACM International Conference on Automated Software Engineering. 53–62.Google ScholarDigital Library
- [408] . 2015. Classification model for code clones based on machine learning. Empir. Softw. Eng. 20, 4 (2015), 1095–1125.Google ScholarDigital Library
- [409] . 2014. A learning-to-rank approach to software defect prediction. IEEE Trans. Reliab. 64, 1 (2014), 234–246.Google ScholarCross Ref
- [410] . 2018. Ridge and lasso regression models for cross-version defect prediction. IEEE Trans. Reliab. 67, 3 (2018), 885–896.Google ScholarCross Ref
- [411] . 2016. Who should take this task? Dynamic decision support for crowd workers. In Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. 1–10.Google ScholarDigital Library
- [412] . 2016. Effort-aware just-in-time defect prediction: Simple unsupervised models could be better than supervised models. In Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering. 157–168.Google ScholarDigital Library
- [413] . 2014. Are slice-based cohesion metrics actually useful in effort-aware post-release fault-proneness prediction? An empirical study. IEEE Trans. Softw. Eng. 41, 4 (2014), 331–357.Google ScholarDigital Library
- [414] . 2014. An application of oversampling, undersampling, bagging and boosting in handling imbalanced datasets. In Proceedings of the 1st International Conference on Advanced Data and Information Engineering (DaEng’13). Springer, 13–22.Google ScholarCross Ref
- [415] . 2021. Characterization and prediction of questions without accepted answers on stack overflow. arXiv preprint arXiv:2103.11386 (2021).Google Scholar
- [416] . 2010. Combining hardware and software instrumentation to classify program executions. In Proceedings of the 18th ACM SIGSOFT International Symposium on Foundations of Software Engineering. 67–76.Google ScholarDigital Library
- [417] . 2017. A feature matching and transfer approach for cross-company defect prediction. J. Syst. Softw. 132 (2017), 366–378.Google ScholarDigital Library
- [418] . 2018. ConPredictor: Concurrency defect prediction in real-world applications. IEEE Trans. Softw. Eng. 45, 6 (2018), 558–575.Google ScholarCross Ref
- [419] . 2015. Does the failing test execute a single or multiple faults? An approach to classifying failing tests. In Proceedings of the IEEE/ACM 37th IEEE International Conference on Software Engineering. IEEE, 924–935.Google ScholarCross Ref
- [420] Zhe Yu, Christopher Theisen, Laurie Williams, and Tim Menzies. 2019. Improving vulnerability inspection efficiency using active learning. IEEE Transactions on Software Engineering 47, 11 (2019), 2401–2420.Google Scholar
- [421] . 2020. Automatically learning patterns for self-admitted technical debt removal. In Proceedings of the IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 355–366.Google ScholarCross Ref
- [422] . 2013. Categorizing bugs with social networks: A case study on four open source software communities. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 1032–1041.Google ScholarCross Ref
- [423] . 2021. Deep just-in-time defect prediction: How far are we? In Proceedings of the 30th ACM SIGSOFT International Symposium on Software Testing and Analysis. 427–438.Google ScholarDigital Library
- [424] . 2016. The use of summation to aggregate software metrics hinders the performance of defect prediction models. IEEE Trans. Softw. Eng. 43, 5 (2016), 476–491.Google ScholarDigital Library
- [425] . 2017. Data transformation in cross-project defect prediction. Empir. Softw. Eng. 22, 6 (2017), 3186–3218.Google ScholarDigital Library
- [426] . 2017. Predicting change consistency in a clone group. J. Syst. Softw. 134 (2017), 105–119.Google ScholarDigital Library
- [427] . 2016. Towards building a universal defect prediction model with rank transformed predictors. Empir. Softw. Eng. 21, 5 (2016), 2107–2145.Google ScholarDigital Library
- [428] . 2016. Cross-project defect prediction using a connectivity-based unsupervised classifier. In Proceedings of the IEEE/ACM 38th International Conference on Software Engineering (ICSE). IEEE, 309–320.Google ScholarDigital Library
- [429] . 2013. Predicting bug-fixing time: An empirical study of commercial software projects. In Proceedings of the 35th International Conference on Software Engineering (ICSE). IEEE, 1042–1051.Google ScholarCross Ref
- [430] . 2021. Are comments on Stack Overflow well organized for easy retrieval by developers?ACM Trans. Softw. Eng. Methodol. 30, 2 (2021), 1–31.Google ScholarDigital Library
- [431] . 2018. Predictive mutation testing. IEEE Trans. Softw. Eng. 45, 9 (2018), 898–918.Google ScholarDigital Library
- [432] . 2019. A machine learning based approach to identify SQL injection vulnerabilities. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 1286–1288.Google ScholarDigital Library
- [433] . 2019. Robust log-based anomaly detection on unstable log data. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 807–817.Google ScholarDigital Library
- [434] . 2020. Automated defect identification via path analysis-based features with transfer learning. J. Syst. Softw. 166 (2020), 110585.Google ScholarCross Ref
- [435] . 2021. Predicting crash fault residence via simplified deep forest based on a reduced feature set. arXiv preprint arXiv:2104.01768 (2021).Google Scholar
- [436] . 2020. Real-time incident prediction for online service systems. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 315–326.Google ScholarDigital Library
- [437] . 2019. iFeedback: Exploiting user feedback for real-time issue detection in large-scale online service systems. In Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 352–363.Google ScholarDigital Library
- [438] . 2009. Inferring resource specifications from natural language API documentation. In Proceedings of the IEEE/ACM International Conference on Automated Software Engineering. IEEE, 307–318.Google Scholar
- [439] . 2014. Who will stay in the FLOSS community? Modeling participant’s initial behavior. IEEE Trans. Softw. Eng. 41, 1 (2014), 82–99.Google ScholarDigital Library
- [440] . 2019. Improving defect prediction with deep forest. Inf. Softw. Technol. 114 (2019), 204–216.Google ScholarDigital Library
- [441] . 2019. Latent error prediction and fault localization for microservice applications by learning from system trace logs. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 683–694.Google ScholarDigital Library
- [442] . 2016. Combining text mining and data mining for bug report classification. J. Softw.: Evol. Process 28, 3 (2016), 150–176.Google ScholarDigital Library
- [443] . 2010. On the ability of complexity metrics to predict fault-prone classes in object-oriented systems. J. Syst. Softw. 83, 4 (2010), 660–674.Google ScholarDigital Library
- [444] . 2014. An in-depth study of the potentially confounding effect of class size in fault prediction. ACM Trans. Softw. Eng. Methodol. 23, 1 (2014), 1–51.Google ScholarDigital Library
- [445] . 2018. How far we have progressed in the journey? An examination of cross-project defect prediction. ACM Trans. Softw. Eng. Methodol. 27, 1 (2018), 1–51.Google ScholarDigital Library
- [446] . 2009. Ensemble Learning.Encyc. Biomet. 1 (2009), 270–273.Google ScholarCross Ref
- [447] . 2009. Cross-project defect prediction: A large scale experiment on data vs. domain vs. process. In Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering. 91–100.Google ScholarDigital Library
- [448] . 2012. Characterizing and predicting which bugs get reopened. In Proceedings of the 34th International Conference on Software Engineering (ICSE). IEEE, 1074–1083.Google ScholarDigital Library
Index Terms
- Predictive Models in Software Engineering: Challenges and Opportunities
Recommendations
A Survey of Software Engineering Practice: Tools, Methods, and Results
The results of a survey of software development practice are reported and analyzed. The problems encountered in various phases of the software life cycle are measured and correlated with characteristics of the responding installations. The use and ...
Where's the Theory for Software Engineering?
Darwin's theory of natural selection, Maxwell's equations, the theory of demand and supply; almost all established academic disciplines place great emphasis on what their core theory is. This is not, however, the case in software engineering. What is ...
Search-based software engineering: Trends, techniques and applications
In the past five years there has been a dramatic increase in work on Search-Based Software Engineering (SBSE), an approach to Software Engineering (SE) in which Search-Based Optimization (SBO) algorithms are used to address problems in SE. SBSE has been ...
Comments