ABSTRACT
On Stack Overflow, more than 38,000 diverse tags are used to classify posts. The Stack Overflow community provides tag synonyms to reduce the number of tags that have the same or similar meaning. In our previous research, we used those synonym pairs to derive a number of strategies to create tag synonyms automatically.
In this work, we continue this line of research and present an approach to group tag synonyms to meaningful topics. We represent our synonyms as directed, weighted graphs, and investigate several graph community detection algorithms to build meaningful groups of tags, also called tag communities.
We apply our approach to the tags obtained from Android-related Stack Overflow posts and evaluate the resulting tag communities quantitatively with various community metrics. In addition, we evaluate our approach qualitatively through a manual inspection and comparison of a random sample of tag communities. Our results show that we can cluster the Android tags to 2,481 meaningful tag communities. We also show how these tag communities can be used to derive trends of topics of Android-related questions on Stack Overflow.
- A. Arenas, L. Danon, A. Diaz-Guilera, P. M. Gleiser, and R. Guimera. Community analysis in social networks. The European Physical Journal B-Condensed Matter and Complex Systems, 38(2):373--380, 2004.Google ScholarCross Ref
- E. Bagheri and F. Ensan. Semantic tagging and linking of software engineering social content. Automated Software Engineering, 23(2):147--190, 2014. Google ScholarDigital Library
- A. Barua, S. Thomas, and A. Hassan. What are developers talking about? an analysis of topics and trends in stack overflow. Empirical Software Engineering, pages 1--36, 2012. Google ScholarDigital Library
- S. Beyer and M. Pinzger. A manual categorization of android app development issues on stack overflow. In Proceedings of the International Conference on Software Maintenance and Evolution, pages 531--535. IEEE, 2014. Google ScholarDigital Library
- S. Beyer and M. Pinzger. Synonym suggestion for tags on stack overflow. In Proceedings of the International Conference on Program Comprehension, pages 94--103. IEEE, 2015. Google ScholarDigital Library
- D. Correa and A. Sureka. Fit or unfit: analysis and prediction of 'closed questions' on stack overflow. In Proceedings of the Conference on Online Social Networks, pages 201--212. ACM, 2013. Google ScholarDigital Library
- G. Csardi and T. Nepusz. The igraph software package for complex network research. Inter Journal, Complex Systems:1695, 2006.Google Scholar
- E. Cuvelier and M.-A. Aufaure. Business Intelligence: First European Summer School, chapter Graph Mining and Communities Detection, pages 117--138. Springer, 2012.Google Scholar
- S. Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.Google ScholarCross Ref
- E. L. Galina and A. M. Kuznetsov. Predict closed questions on stackoverflow. Proceedings of the Researchers Colloquium on Databases and Information Systems, pages 10--14, 2013.Google Scholar
- A. Joorabchi, M. English, and A. E. Mahdi. Automatic mapping of user tags to wikipedia concepts: The case of a Q&A website--stackoverflow. Journal of Information Science, pages 570--583, 2015. Google ScholarDigital Library
- D. Kavaler, D. Posnett, C. Gibler, H. Chen, P. Devanbu, and V. Filkov. Using and asking: Apis used in the android market and asked about in stackoverflow. In Social Informatics, pages 405--418. Springer, 2013. Google ScholarDigital Library
- D. Lai, H. Lu, and C. Nardini. Extracting weights from edge directions to find communities in directed networks. Journal of Statistical Mechanics: Theory and Experiment, 2010(06):P06003, 2010.Google ScholarCross Ref
- J. R. Landis and G. G. Koch. The measurement of observer agreement for categorical data. biometrics, pages 159--174, 1977.Google Scholar
- M. Linares-Vásquez, G. Bavota, M. Di Penta, R. Oliveto, and D. Poshyvanyk. How do api changes trigger stack overflow discussions? a study on the android sdk. In Proceedings of the International Conference on Program Comprehension, pages 83--94. ACM, 2014. Google ScholarDigital Library
- M. Linares-Vásquez, B. Dit, and D. Poshyvanyk. An exploratory analysis of mobile development issues using stack overflow. In Proceedings of the International Workshop on Mining Software Repositories, pages 93--96. IEEE, 2013. Google ScholarDigital Library
- P. Morrison and E. Murphy-Hill. Is programming knowledge related to age? an exploration of stack overflow. In Proceedings of the Working Conference on Mining Software Repositories, pages 69--72. IEEE, 2013. Google ScholarDigital Library
- C. Parnin, C. Treude, and L. Grammel. Crowd documentation: Exploring the coverage and the dynamics of api discussions on stack overflow., Technical report, Georgia Institute of Technology, 2012.Google Scholar
- P. Pons and M. Latapy. Computing communities in large networks using random walks. In Computer and Information Sciences, volume 3373 of Lecture Notes in Computer Science, pages 284--293. Springer, 2005. Google ScholarDigital Library
- C. Rosen and E. Shihab. What are mobile developers asking about? a large scale study using stack overflow. Empirical Software Engineering, pages 1--32, 2015. Google ScholarDigital Library
- J. Scott. Social network analysis. SAGE Publications Ltd., 2012.Google Scholar
- C. Treude, O. Barzilay, and M.-A. Storey. How do programmers ask and answer questions on the web?: Nier track. In Proceedings of the International Conference on Software Engineering, pages 804--807. IEEE, 2011. Google ScholarDigital Library
- C. Treude and M.-A. Storey. How tagging helps bridge the gap between social and technical aspects in software development. In Proceedings of the International Conference on Software Engineering, pages 12--22. IEEE, 2009. Google ScholarDigital Library
- E. Tsui, W. M. Wang, C. F. Cheung, and A. S. Lau. A concept-relationship acquisition and inference approach for hierarchical taxonomy construction from tags. Information processing & management, 46(1):44--57, Elsevier, 2010. Google ScholarDigital Library
- J. Wang and B. D. Davison. Explorations in tag suggestion and query expansion. In Proceedings of the Workshop on Search in Social Media, pages 43--50. ACM, 2008. Google ScholarDigital Library
- S. Wang, D. Lo, and L. Jiang. Inferring semantically related software terms and their taxonomy by leveraging collaborative tagging. In Proceedings of the International Conference on Software Maintenance, pages 604--607. IEEE, 2012. Google ScholarDigital Library
- J. Yao, B. Cui, G. Cong, and Y. Huang. Evolutionary taxonomy construction from dynamic tag space. World Wide Web, 15(5-6):581--602, 2012.Google ScholarCross Ref
Index Terms
- Grouping android tag synonyms on stack overflow
Recommendations
Synonym Suggestion for Tags on Stack Overflow
ICPC '15: Proceedings of the 2015 IEEE 23rd International Conference on Program ComprehensionThe amount of diverse tags used to classify posts on Stack Overflow increased in the last years to more than 38,000 tags. Many of these tags have the same or similar meaning. Stack Overflow provides an approach to reduce the amount of tags by allowing ...
Predicting Stack Overflow Question Tags: A Multi-Class, Multi-Label Classification
ICSEW'20: Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering WorkshopsThis work proposes to predict the tags assigned for the posts on Stack Overflow platform. The raw data was obtained from the stackexchange.com including more than 50K posts and their associated tags given by the users. The posts' questions and titles ...
Modeling stack overflow tags and topics as a hierarchy of concepts
Highlights- Applied a topic model to organize Stack Overflow data set into a concept hierarchy.
AbstractDevelopers rely on online Q&A forums to look up technical solutions, to pose questions on implementation problems, and to enhance their community profile by contributing answers. Many popular developer communication platforms, such as ...
Comments