skip to main content
10.1145/2901739.2901750acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

Grouping android tag synonyms on stack overflow

Published:14 May 2016Publication History

ABSTRACT

On Stack Overflow, more than 38,000 diverse tags are used to classify posts. The Stack Overflow community provides tag synonyms to reduce the number of tags that have the same or similar meaning. In our previous research, we used those synonym pairs to derive a number of strategies to create tag synonyms automatically.

In this work, we continue this line of research and present an approach to group tag synonyms to meaningful topics. We represent our synonyms as directed, weighted graphs, and investigate several graph community detection algorithms to build meaningful groups of tags, also called tag communities.

We apply our approach to the tags obtained from Android-related Stack Overflow posts and evaluate the resulting tag communities quantitatively with various community metrics. In addition, we evaluate our approach qualitatively through a manual inspection and comparison of a random sample of tag communities. Our results show that we can cluster the Android tags to 2,481 meaningful tag communities. We also show how these tag communities can be used to derive trends of topics of Android-related questions on Stack Overflow.

References

  1. A. Arenas, L. Danon, A. Diaz-Guilera, P. M. Gleiser, and R. Guimera. Community analysis in social networks. The European Physical Journal B-Condensed Matter and Complex Systems, 38(2):373--380, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  2. E. Bagheri and F. Ensan. Semantic tagging and linking of software engineering social content. Automated Software Engineering, 23(2):147--190, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Barua, S. Thomas, and A. Hassan. What are developers talking about? an analysis of topics and trends in stack overflow. Empirical Software Engineering, pages 1--36, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Beyer and M. Pinzger. A manual categorization of android app development issues on stack overflow. In Proceedings of the International Conference on Software Maintenance and Evolution, pages 531--535. IEEE, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Beyer and M. Pinzger. Synonym suggestion for tags on stack overflow. In Proceedings of the International Conference on Program Comprehension, pages 94--103. IEEE, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Correa and A. Sureka. Fit or unfit: analysis and prediction of 'closed questions' on stack overflow. In Proceedings of the Conference on Online Social Networks, pages 201--212. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. Csardi and T. Nepusz. The igraph software package for complex network research. Inter Journal, Complex Systems:1695, 2006.Google ScholarGoogle Scholar
  8. E. Cuvelier and M.-A. Aufaure. Business Intelligence: First European Summer School, chapter Graph Mining and Communities Detection, pages 117--138. Springer, 2012.Google ScholarGoogle Scholar
  9. S. Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  10. E. L. Galina and A. M. Kuznetsov. Predict closed questions on stackoverflow. Proceedings of the Researchers Colloquium on Databases and Information Systems, pages 10--14, 2013.Google ScholarGoogle Scholar
  11. A. Joorabchi, M. English, and A. E. Mahdi. Automatic mapping of user tags to wikipedia concepts: The case of a Q&A website--stackoverflow. Journal of Information Science, pages 570--583, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Kavaler, D. Posnett, C. Gibler, H. Chen, P. Devanbu, and V. Filkov. Using and asking: Apis used in the android market and asked about in stackoverflow. In Social Informatics, pages 405--418. Springer, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Lai, H. Lu, and C. Nardini. Extracting weights from edge directions to find communities in directed networks. Journal of Statistical Mechanics: Theory and Experiment, 2010(06):P06003, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  14. J. R. Landis and G. G. Koch. The measurement of observer agreement for categorical data. biometrics, pages 159--174, 1977.Google ScholarGoogle Scholar
  15. M. Linares-Vásquez, G. Bavota, M. Di Penta, R. Oliveto, and D. Poshyvanyk. How do api changes trigger stack overflow discussions? a study on the android sdk. In Proceedings of the International Conference on Program Comprehension, pages 83--94. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Linares-Vásquez, B. Dit, and D. Poshyvanyk. An exploratory analysis of mobile development issues using stack overflow. In Proceedings of the International Workshop on Mining Software Repositories, pages 93--96. IEEE, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Morrison and E. Murphy-Hill. Is programming knowledge related to age? an exploration of stack overflow. In Proceedings of the Working Conference on Mining Software Repositories, pages 69--72. IEEE, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. Parnin, C. Treude, and L. Grammel. Crowd documentation: Exploring the coverage and the dynamics of api discussions on stack overflow., Technical report, Georgia Institute of Technology, 2012.Google ScholarGoogle Scholar
  19. P. Pons and M. Latapy. Computing communities in large networks using random walks. In Computer and Information Sciences, volume 3373 of Lecture Notes in Computer Science, pages 284--293. Springer, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Rosen and E. Shihab. What are mobile developers asking about? a large scale study using stack overflow. Empirical Software Engineering, pages 1--32, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Scott. Social network analysis. SAGE Publications Ltd., 2012.Google ScholarGoogle Scholar
  22. C. Treude, O. Barzilay, and M.-A. Storey. How do programmers ask and answer questions on the web?: Nier track. In Proceedings of the International Conference on Software Engineering, pages 804--807. IEEE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. C. Treude and M.-A. Storey. How tagging helps bridge the gap between social and technical aspects in software development. In Proceedings of the International Conference on Software Engineering, pages 12--22. IEEE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. E. Tsui, W. M. Wang, C. F. Cheung, and A. S. Lau. A concept-relationship acquisition and inference approach for hierarchical taxonomy construction from tags. Information processing & management, 46(1):44--57, Elsevier, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Wang and B. D. Davison. Explorations in tag suggestion and query expansion. In Proceedings of the Workshop on Search in Social Media, pages 43--50. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. S. Wang, D. Lo, and L. Jiang. Inferring semantically related software terms and their taxonomy by leveraging collaborative tagging. In Proceedings of the International Conference on Software Maintenance, pages 604--607. IEEE, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. Yao, B. Cui, G. Cong, and Y. Huang. Evolutionary taxonomy construction from dynamic tag space. World Wide Web, 15(5-6):581--602, 2012.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Grouping android tag synonyms on stack overflow

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MSR '16: Proceedings of the 13th International Conference on Mining Software Repositories
      May 2016
      544 pages
      ISBN:9781450341868
      DOI:10.1145/2901739

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 14 May 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Upcoming Conference

      ICSE 2025

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader