skip to main content
10.1145/3468264.3468562acmconferencesArticle/Chapter ViewAbstractPublication PagesfseConference Proceedingsconference-collections
research-article
Open Access
Artifacts Available / v1.1

A first look at developers’ live chat on Gitter

Authors Info & Claims
Published:18 August 2021Publication History

ABSTRACT

Modern communication platforms such as Gitter and Slack play an increasingly critical role in supporting software teamwork, especially in open source development.Conversations on such platforms often contain intensive, valuable information that may be used for better understanding OSS developer communication and collaboration. However, little work has been done in this regard. To bridge the gap, this paper reports a first comprehensive empirical study on developers' live chat, investigating when they interact, what community structures look like, which topics are discussed, and how they interact. We manually analyze 749 dialogs in the first phase, followed by an automated analysis of over 173K dialogs in the second phase. We find that developers tend to converse more often on weekdays, especially on Wednesdays and Thursdays (UTC), that there are three common community structures observed, that developers tend to discuss topics such as API usages and errors, and that six dialog interaction patterns are identified in the live chat communities. Based on the findings, we provide recommendations for individual developers and OSS communities, highlight desired features for platform vendors, and shed light on future research directions. We believe that the findings and insights will enable a better understanding of developers' live chat, pave the way for other researchers, as well as a better utilization and mining of knowledge embedded in the massive chat history.

References

  1. A Tool to Prototype and Experiment Angular Codes. 2020. Plnkr. http://plnkr.co/Google ScholarGoogle Scholar
  2. Explosion AI. 2020. Spacy. https://spacy.io/Google ScholarGoogle Scholar
  3. Miltiadis Allamanis and Charles Sutton. 2013. Why, When, and What: Analyzing Stack Overflow Questions by Topic, Type, and Code. In Proceedings of the 10th Working Conference on Mining Software Repositories, MSR ’13. IEEE Computer Society, 53–56. https://doi.org/10.1109/MSR.2013.6624004 Google ScholarGoogle ScholarCross RefCross Ref
  4. Deeksha Arya, Wenting Wang, Jin L. C. Guo, and Jinghui Cheng. 2019. Analysis and Detection of Information Types of Open Source Software Issue Discussions. In Proceedings of the 41st International Conference on Software Engineering, ICSE 2019. 454–464. https://doi.org/10.1109/ICSE.2019.00058 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mehdi Bagherzadeh and Raffi Khatchadourian. 2019. Going Big: A Large-scale Study on What Big Data Developers Ask. In Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019. ACM, 432–442. https://doi.org/10.1145/3338906.3338939 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Mathieu Bastian, Sebastien Heymann, and Mathieu Jacomy. 2009. Gephi: An Open Source Software for Exploring and Manipulating Networks. In Proceedings of the International AAAI Conference on Web and Social Media. 3.Google ScholarGoogle Scholar
  7. Alex Bavelas. 1950. Communication Patterns in Task-oriented Groups. The journal of the acoustical society of America, 22, 6 (1950), 725–730.Google ScholarGoogle Scholar
  8. S. Beyer, C. Macho, M. Di Penta, and M. Pinzger. 2017. Analyzing the Relationships between Android API Classes and Their References on Stack Overflow.. University of Klagenfurt, University of Sannio.Google ScholarGoogle Scholar
  9. Stefanie Beyer, Christian Macho, Martin Pinzger, and Massimiliano Di Penta. 2018. Automatically Classifying Posts into Question Categories on Stack Overflow. In Proceedings of the 26th Conference on Program Comprehension, ICPC 2018. ACM, 211–221. https://doi.org/10.1145/3196321.3196333 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Stefanie Beyer and Martin Pinzger. 2014. A Manual Categorization of Android App Development Issues on Stack Overflow. In 30th IEEE International Conference on Software Maintenance and Evolution. IEEE Computer Society, 531–535. https://doi.org/10.1109/ICSME.2014.88 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Christian Bird, Alex Gourley, Prem Devanbu, Michael Gertz, and Anand Swaminathan. 2006. Mining Email Social Networks. In Proceedings of the 2006 international workshop on Mining software repositories. 137–143. https://doi.org/10.1145/1137983.1138016 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Bosu, C. S. Corley, D. Heaton, D. Chatterji, J. C. Carver, and N. A. Kraft. 2013. Building Reputation in StackOverflow: An Empirical Investigation. In 2013 10th Working Conference on Mining Software Repositories (MSR). 89–92. https://doi.org/10.1109/MSR.2013.6624013 Google ScholarGoogle ScholarCross RefCross Ref
  13. Jérémie Bouttier, Philippe Di Francesco, and Emmanuel Guitter. 2003. Geodesic Distance in Planar Graphs. Nuclear physics B, 663, 3 (2003), 535–567.Google ScholarGoogle Scholar
  14. Fabio Calefato, Filippo Lanubile, and Nicole Novielli. 2018. How to Ask for Technical Help? Evidence-based Guidelines for Writing Questions on Stack Overflow. Inf. Softw. Technol., 94 (2018), 186–207. https://doi.org/10.1016/j.infsof.2017.10.009 Google ScholarGoogle ScholarCross RefCross Ref
  15. Preetha Chatterjee, Kostadin Damevski, Nicholas A. Kraft, and Lori L. Pollock. 2020. Software-related Slack Chats with Disentangled Conversations. In MSR ’20: 17th International Conference on Mining Software Repositories. ACM, 588–592. https://doi.org/10.1145/3379597.3387493 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Microsoft Corporation. 2020. Nodejs. https://nodejs.org/en/Google ScholarGoogle Scholar
  17. Microsoft Corporation. 2020. Typescript. https://www.typescriptlang.org/Google ScholarGoogle Scholar
  18. Fabio Crestani and Mounia Lalmas. 2001. Logic and Uncertainty in Information Retrieval. In Lectures in Information Retrieval, Lecture Notes in Computer Science. Springer Verlag. https://doi.org/10.1007/3-540-45368-7_9 Google ScholarGoogle ScholarCross RefCross Ref
  19. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Volume 1 (Long and Short Papers). Association for Computational Linguistics, 4171–4186. https://doi.org/10.18653/v1/n19-1423 Google ScholarGoogle ScholarCross RefCross Ref
  20. Reinhard Diestel. 2005. Graph theory. 2005. Grad. Texts in Math, 101 (2005).Google ScholarGoogle Scholar
  21. Inc Docker. 2020. Docker. https://www.docker.com/Google ScholarGoogle Scholar
  22. Sally Fincher and Josh D. Tenenberg. 2005. Making Sense of Card Sorting Data. Expert Syst. J. Knowl. Eng., 22, 3 (2005), 89–93. https://doi.org/10.1111/j.1468-0394.2005.00299.x Google ScholarGoogle ScholarCross RefCross Ref
  23. Brian Fitzgerald. 2006. The Transformation of Open Source Software. MIS Q., 30, 3 (2006), 587–598.Google ScholarGoogle ScholarCross RefCross Ref
  24. Ethereum Foundation. 2020. Ethereum. https://ethereum.org/en/Google ScholarGoogle Scholar
  25. JS Foundation. 2020. Appium. http://appium.io/Google ScholarGoogle Scholar
  26. Linton C Freeman. 1977. A Set of Measures of Centrality Based on Betweenness. Sociometry, 35–41.Google ScholarGoogle Scholar
  27. Adam Gibson. 2020. Deeplearning4j. https://deeplearning4j.org/Google ScholarGoogle Scholar
  28. Gitter. 2020. REST API. https://developer.gitter.im/docs/rest-apiGoogle ScholarGoogle Scholar
  29. Jesús M. González-Barahona and Gregorio Robles. 2013. Trends in Free, Libre, Open Source Software Communities: From Volunteers to Companies / Aktuelle Trends in Free-, Libre-, und Open-Source-Software-Gemeinschaften: Von Freiwilligen zu Unternehmen. it Inf. Technol., 55, 5 (2013), 173–180. https://doi.org/10.1524/itit.2013.1012 Google ScholarGoogle ScholarCross RefCross Ref
  30. Google. 2020. Angular. https://angular.io/Google ScholarGoogle Scholar
  31. Google. 2020. Gitter. https://gitter.im/Google ScholarGoogle Scholar
  32. Google. 2020. Slack. https://slack.com/Google ScholarGoogle Scholar
  33. Gaoyang Guo, Chaokun Wang, Jun Chen, and Pengcheng Ge. 2018. Who Is Answering to Whom? Finding “Reply-To” Relations in Group Chats with Long Short-Term Memory Networks. In Proceedings of the 7th International Conference on Emerging Databases. https://doi.org/10.1007/s10586-018-2031-4 Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Anja Guzzi, Alberto Bacchelli, Michele Lanza, Martin Pinzger, and Arie van Deursen. 2013. Communication in Open Source Software Development Mailing Lists. In Proceedings of the 10th Working Conference on Mining Software Repositories, MSR ’13. IEEE Computer Society, 277–286. https://doi.org/10.1109/MSR.2013.6624039 Google ScholarGoogle ScholarCross RefCross Ref
  35. Junxiao Han, Emad Shihab, Zhiyuan Wan, Shuiguang Deng, and Xin Xia. 2020. What Do Programmers Discuss About Deep Learning Frameworks. Empir. Softw. Eng., 25, 4 (2020), 2694–2747. https://doi.org/10.1007/s10664-020-09819-6 Google ScholarGoogle ScholarCross RefCross Ref
  36. Qiao Huang, Xin Xia, David Lo, and Gail C. Murphy. 2018. Automating Intention Mining. IEEE Transactions on Software Engineering, PP, 99 (2018), 1–1. https://doi.org/10.1109/ICSM.2015.7332474 Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Verena Käfer, Ivan Bogicevic, Stefan Wagner, and Jasmin Ramadani. 2018. Communication in Open-source Projects-end of the E-mail Era? In Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, ICSE 2018. ACM, 242–243. https://doi.org/10.1145/3183440.3194951 Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Valdis Krebs and June Holley. 2004. Building Sustainable Communities through Social Network Development. The Nonprofit Quarterly, 11 (2004), 01, 46–53.Google ScholarGoogle Scholar
  39. Jonathan K. Kummerfeld, Sai R. Gouravajhala, Joseph Peper, Vignesh Athreya, Chulaka Gunasekara, Jatin Ganhotra, Siva Sankalp Patel, Lazaros Polymenakos, and Walter S. Lasecki. 2019. A Large-scale Corpus for Conversation Disentanglement. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). https://doi.org/10.18653/v1/p19-1374 Google ScholarGoogle ScholarCross RefCross Ref
  40. Bin Lin, Alexey Zagalsky, Margaret-Anne D. Storey, and Alexander Serebrenik. 2016. Why Developers Are Slacking Off: Understanding How Software Teams Use Slack. In Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing. 333–336. https://doi.org/10.1145/2818052.2869117 Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Hui Liu, Zhan Shi, Jia-Chen Gu, Quan Liu, Si Wei, and Xiaodan Zhu. 2020. End-to-End Transition-based Online Dialogue Disentanglement. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020. ijcai.org, 3868–3874. https://doi.org/10.24963/ijcai.2020/535 Google ScholarGoogle ScholarCross RefCross Ref
  42. Troupe Technology Ltd.. 2020. Gitter. https://gitter.im/Google ScholarGoogle Scholar
  43. Yao Lu, Xinjun Mao, Minghui Zhou, Yang Zhang, Tao Wang, and Zude Li. 2020. Haste Makes Waste: An Empirical Study of Fast Answers in Stack Overflow. In IEEE International Conference on Software Maintenance and Evolution, ICSME 2020. IEEE, 23–34. https://doi.org/10.1109/ICSME46990.2020.00013 Google ScholarGoogle ScholarCross RefCross Ref
  44. Manju, K., Ahuja, Kathleen, M., and Carley. 1998. Network Structure in Virtual Organizations. Journal of Computer Mediated Communication, https://doi.org/10.1111/j.1083-6101.1998.tb00079.x Google ScholarGoogle ScholarCross RefCross Ref
  45. Andrew Meneely, Laurie A. Williams, Will Snipes, and Jason A. Osborne. 2008. Predicting Failures with Developer Networks and Social Network Analysis. In Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2008. ACM, 13–23. https://doi.org/10.1145/1453101.1453106 Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Courtney Miller, Paige Rodeghero, Margaret-Anne D. Storey, Denae Ford, and Thomas Zimmermann. 2021. "How Was Your Weekend?" Software Development Teams Working From Home During COVID-19. CoRR, abs/2101.05877 (2021).Google ScholarGoogle Scholar
  47. Ana Moreira, Awais Rashid, and João Araújo. 2005. Multi-Dimensional Separation of Concerns in Requirements Engineering. In 13th IEEE International Conference on Requirements Engineering (RE 2005). IEEE Computer Society, 285–296. https://doi.org/10.1109/RE.2005.46 Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Sebastiano Panichella, Andrea Di Sorbo, Emitza Guzman, Corrado Aaron Visaggio, Gerardo Canfora, and Harald C Gall. 2015. How Can I Improve My App? Classifying User Reviews for Software Maintenance and Evolution. 281–290.Google ScholarGoogle Scholar
  49. Esteban Parra, Ashley Ellis, and Sonia Haiduc. 2020. GitterCom: A Dataset of Open Source Developer Communications in Gitter. In MSR ’20: 17th International Conference on Mining Software Repositories. ACM, 563–567. https://doi.org/10.1145/3379597.3387494 Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Chen Qu, Liu Yang, W. Bruce Croft, Yongfeng Zhang, Johanne R. Trippas, and Minghui Qiu. 2019. User Intent Prediction in Information-seeking Conversations. In Proceedings of the 2019 Conference on Human Information Interaction and Retrieval, CHIIR 2019. ACM, 25–33. https://doi.org/10.1145/3295750.3298924 Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Christoffer Rosen and Emad Shihab. 2016. What Are Mobile Developers Asking About? A Large Scale Study Using Stack Overflow. Empir. Softw. Eng., 21, 3 (2016), 1192–1223. https://doi.org/10.1007/s10664-015-9379-3 Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Gordon Rugg and Peter McGeorge. 2005. The Sorting Techniques: A Tutorial Paper on Card Sorts, Picture Sorts and Item Sorts. Expert Syst. J. Knowl. Eng., 22, 3 (2005), 94–107. https://doi.org/10.1111/j.1468-0394.2005.00300.x Google ScholarGoogle ScholarCross RefCross Ref
  53. Khadijah Al Safwan and Francisco Servant. 2019. Decomposing the Rationale of Code Commits: The Software Developer’s Perspective. In Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019. ACM, 397–408. https://doi.org/10.1145/3338906.3338979 Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Jorge M. Santos and Mark J. Embrechts. 2009. On the Use of the Adjusted Rand Index as a Metric for Evaluating Supervised Classification. In Artificial Neural Networks - ICANN 2009, 19th International Conference, Limassol (Lecture Notes in Computer Science, Vol. 5769). Springer, 175–184. https://doi.org/10.1007/978-3-642-04277-5_18 Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Roland Robert Schreiber and Matthäus Paul Zylka. 2020. Social Network Analysis in Software Development Projects: A Systematic Literature Review. Int. J. Softw. Eng. Knowl. Eng., 30, 3 (2020), 321–362. https://doi.org/10.1142/s021819402050014x Google ScholarGoogle ScholarCross RefCross Ref
  56. Dou Shen, Qiang Yang, Jian-Tao Sun, and Zheng Chen. 2006. Thread Detection in Dynamic Text Message Streams. In SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 35–42. https://doi.org/10.1145/1148170.1148180 Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Lin Shi, Mingzhe Xing, Mingyang Li, Yawen Wang, Shoubin Li, and Qing Wang. 2020. Detection of Hidden Feature Requests from Massive Chat Messages via Deep Siamese Network. In ICSE ’20: 42nd International Conference on Software Engineering. ACM, 641–653. https://doi.org/10.1145/3377811.3380356 Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Emad Shihab, Zhen Ming Jiang, and Ahmed E. Hassan. 2009. On the Use of Internet Relay Chat (IRC) Meetings by Developers of the GNOME GTK+ Project. In Proceedings of the 6th International Working Conference on Mining Software Repositories, MSR 2009 (Co-located with ICSE). 107–110. https://doi.org/10.1109/MSR.2009.5069488 Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Emad Shihab, Zhen Ming Jiang, and Ahmed E. Hassan. 2009. Studying the Use of Developer IRC Meetings in Open Source Projects. In 25th IEEE International Conference on Software Maintenance (ICSM 2009). 147–156. https://doi.org/10.1109/ICSM.2009.5306333 Google ScholarGoogle ScholarCross RefCross Ref
  60. Andrea Di Sorbo, Sebastiano Panichella, Corrado Aaron Visaggio, Massimiliano Di Penta, Gerardo Canfora, and Harald C. Gall. 2015. Development Emails Content Analyzer: Intention Mining in Developer Discussions (T). In 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015. 12–23. https://doi.org/10.1109/ASE.2015.12 Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Alexander Strehl and Joydeep Ghosh. 2002. Cluster Ensembles — A Knowledge Reuse Framework for Combining Multiple Partitions. J. Mach. Learn. Res., 3 (2002), 583–617.Google ScholarGoogle Scholar
  62. Neny Sulistianingsih and Edi Winarko. [n.d.]. Performance Analysis of Molecular Complex Detection in Social Network Datasets. International Journal of Computer Applications, 975 ([n. d.]), 8887.Google ScholarGoogle Scholar
  63. Xin Tan, Minghui Zhou, and Zeyu Sun. 2020. A first look at good first issues on GitHub. In ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (Eds.). ACM, 398–409. https://doi.org/10.1145/3368089.3409746 Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Christoph Treude, Ohad Barzilay, and Margaret-Anne D. Storey. 2011. How Do Programmers Ask and Answer Questions on the Web? In Proceedings of the 33rd International Conference on Software Engineering, ICSE 2011. ACM, 804–807. https://doi.org/10.1145/1985793.1985907 Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Stanley Wasserman and Katherine Faust. 1994. Social Network Analysis: Methods and Applications. Cambridge University Press. https://doi.org/10.1017/CBO9780511815478 Google ScholarGoogle ScholarCross RefCross Ref
  66. Duncan J Watts and Steven H Strogatz. 1998. Collective Dynamics of ‘Small-world’ Networks. nature, 393, 6684 (1998), 440–442.Google ScholarGoogle Scholar
  67. Liguo Yu, Srini Ramaswamy, Alok Mishra, and Deepti Mishra. 2011. Communications in Global Software Development: An Empirical Study Using GTK+ OSS Repository. In Proceedings of the 2011th Confederated International Conference on the Move to Meaningful Interest Systems, OTM’11. 218–227. https://doi.org/10.1007/978-3-642-25126-9_32 Google ScholarGoogle ScholarCross RefCross Ref
  68. Yuxia Zhang, Minghui Zhou, Klaas-Jan Stol, Jianyu Wu, and Zhi Jin. 2020. How Do Companies Collaborate in Open Source Ecosystems?: An Empirical Study of OpenStack. In ICSE ’20: 42nd International Conference on Software Engineering. ACM, 1196–1208. https://doi.org/10.1145/3377811.3380376 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A first look at developers’ live chat on Gitter

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
        August 2021
        1690 pages
        ISBN:9781450385626
        DOI:10.1145/3468264

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 18 August 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate112of543submissions,21%

        Upcoming Conference

        FSE '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader