DOI: 10.1145/1137983.1138016
Article

Mining email social networks

Published: 22 May 2006

ABSTRACT

Communication and coordination activities are central to large software projects, but are difficult to observe and study in traditional (closed-source, commercial) settings because of the prevalence of informal, direct communication modes. OSS projects, on the other hand, use the internet as the communication medium, and typically conduct discussions in an open, public manner. As a result, the email archives of OSS projects provide a useful trace of the communication and coordination activities of the participants. However, there are various challenges that must be addressed before this data can be effectively mined. Once this is done, we can construct social networks of email correspondents and begin to address some interesting questions. These include questions relating to participation in the email discussions; the social status of different types of OSS participants; the relationship between email activity and commit activity (in the CVS repositories); and the relationship between social status and commit activity. In this paper, we begin with a discussion of our infrastructure (including a novel use of Scientific Workflow software), then discuss our approach to mining the email archives, and finally present some preliminary results from our data analysis.


Published in

MSR '06: Proceedings of the 2006 international workshop on Mining software repositories
May 2006
191 pages
ISBN: 1595933972
DOI: 10.1145/1137983

Copyright © 2006 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery
New York, NY, United States
