A Quantitative Study of Social Organisation in Open Source Software Communities

The success of open source projects crucially depends on the voluntary contributions of a sufficiently large community of users. Apart from the mere size of the community, interesting questions arises when looking at the evolution of structural features of collaborations between community members. In this article, we discuss several network analytic proxies that can be used to quantify different aspects of the social organisation in social collaboration networks. We particularly focus on measures that can be related to the cohesiveness of the communities, the distribution of responsibilities and the resilience against turnover of community members. We present a comparative analysis on a large-scale data set that covers the full history of collaborations between users of 14 major open source software communities. 

Results at a glance

Our analysis is based on a comprehensive data set collected from the bug tracking communities of 14 major OSS projects. We particularly look at the social organisation from the perspective of time-evolving networks and highlight how projects - although very similar in terms of size, problem domain and age - a) largely differ in terms of clustering coefficient, assortativity and closeness centralisation and b) that some of the projects show an interesting dynamics with respect to these measures that cannot be attributed to mere size change effects.

Evolution of structural measures for Bugzilla collaboration networks

Figure 1 shows the evolution of the number of nodes in the largest connected component, its assortativity, clustering coefficient and closeness centralisation for the four OSS communities Gentoo, KDE, Eclipse and NetBeans.
For all projects, the fraction of nodes in the LCC is rather stable with values between 0.7 and 1 consistent with the aggregate values given in Table 1. The same is true for the evolution of the average degree. We thus omit these plots. The four projects show significant differences in the evolution of the clustering coefficient that cannot be explained by mere size effects. In the particular time frame between 2006 and 2008, the clustering coefficient of the Eclipse community (≈ 0.15) was roughly ten times higher than that of the Gentoo community (≈ 0.01), although the LCCs of both communities were of comparable size (≈ 500 nodes). In addition, the clustering coefficient of the Gentoo community shows an interesting dynamics, dropping to a very small value between 2006 and 2008 and increasing thereafter. A different perspective of the structural change the Gentoo community was undergoing is given in Figure 1(d) which displays a visible plateau in the closeness centralisation of the network within the same period. In fact, as can be seen in the network depicted in Figure 1(i), in the period between 2006 and 2008 most of the collaborations were mediated by a single central community member, while the social organisation of the Eclipse community depicted in 1(j) was structured in a much more homogeneous way.

Conclusion

The long term goal of the project into which this study is embedded is the provision of multi-dimensional quantitative indicators for the dynamics of the social and technical organisation of OSS projects that are correlated with project performance and that can thus be reasonably included both in the management and evaluation of OSS projects [11, 14, 9]. Such indicators can be particularly useful when wanting to make an informed decision about which OSS project to invest in or rely on. Furthermore, due to the distributed nature of the collaborations, individuals often lack a macroscopic perspective on evolving communication and coordination structures, even though these can critically influence long-term success. An inclusion of suitable indicators in community management platforms like e.g. Bugzilla can assist the community in determining risks and allow project managers to timely react by shifting responsibilities, fostering information flow or changing organisational procedures.

 

Selected Publications

A quantitative study of social organisation in open source software communities, 2012

Zanetti, Marcelo Serrano; Sarigol, Emre; Scholtes, Ingo; Tessone, Claudio Juan; Schweitzer, Frank