ABSTRACT
Global software development has long being recognised as a paradigm shift in modern software development. As an immediate effect, co-location of workers in the same building or office is not seen as necessary any longer. Coordination in distributed socio-technical systems is mostly achieved by means of the artifacts that are produced by the developers part of a project's team.
Geographic distance profoundly affects the ability to collaborate. With communication becoming less frequent, the challenge is for it to become more effective. This is especially complex when different nationalities, languages and cultures are part of the same development effort. Open source software is an example of a distributed, multi-lingual development effort. As such, the main resulting artefacts are discussions, and source code. Diverse backgrounds can produce a different semantic corpus if the authors come from the same ethnic and language groups or from different ones.
The purpose of this paper is to evaluate the artifacts in the context of their semantics, and how semantic corpora are affected by development and languages. By using a selection of Open Source projects developed within national boundaries, we compare their semantic richness, and how their class content is reflected in their identifiers. We also compare these national projects to a successful, international project. The aim is to discover how national boundaries influence the semantics of the developed code.
- Nemitari Ajienka and Andrea Capiluppi. 2016. Semantic Coupling Between Classes: Corpora or Identifiers?. In Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. ACM, 40. Google ScholarDigital Library
- Gabriele Bavota, Andrea De Lucia, Andrian Marcus, and Rocco Oliveto. 2010. A two-step technique for extract class refactoring. In Proceedings of the IEEE/ACM international conference on Automated software engineering. ACM, 151--154. Google ScholarDigital Library
- Gabriele Bavota, Bogdan Dit, Rocco Oliveto, Massimiliano Di Penta, Denys Poshyvanyk, and Andrea De Lucia. 2013. An empirical study on the developers' perception of software coupling. In Proceedings of the 2013 International Conference on Software Engineering. IEEE Press, 692--701. Google ScholarDigital Library
- Gabriele Bavota, Malcom Gethers, Rocco Oliveto, Denys Poshyvanyk, and Andrea de Lucia. 2014. Improving software modularization via automated analysis of latent topics and dependencies. ACM Transactions on Software Engineering and Methodology (TOSEM) 23, 1 (2014), 4. Google ScholarDigital Library
- Gabriele Bavota, Rocco Oliveto, Malcom Gethers, Denys Poshyvanyk, and Andrea De Lucia. 2014. Methodbook: Recommending move method refactorings via relational topic models. Software Engineering, IEEE Transactions on 40, 7 (2014), 671--694. Google ScholarDigital Library
- Eoin ó Conchúir, Pär Ågerfalk, Helena H Olsson, and Brian Fitzgerald. 2009. Global software development: where are the benefits? Commun. ACM 52, 8 (2009), 127--131. Google ScholarDigital Library
- Dimoklis Despotakis, Dhavalkumar Thakker, Lydia Lau, and Vania Dimitrova. 2011. Capturing the semantics of individual viewpoints on social signals in interpersonal communication. Semantic Web Journal, Special Issue on Personal and Social Semantic Web ((Under review)) (2011).Google Scholar
- Robert Feldt and Ana Magazinius. 2010. Validity Threats in Empirical Software Engineering Research-An Initial Survey.. In SEKE. 374--379.Google Scholar
- Malcom Gethers, Amir Aryani, and Denys Poshyvanyk. 2012. Combining conceptual and domain-based couplings to detect database and code dependencies. In Source Code Analysis and Manipulation (SCAM), 2012 IEEE 12th International Working Conference on. IEEE, 144--153. Google ScholarDigital Library
- Jesus M Gonzalez-Barahona, Gregorio Robles, and Daniel Izquierdo-Cortazar. 2016. Determining the Geographical distribution of a Community by means of a Time-zone Analysis. In Proceedings of the 12th International Symposium on Open Collaboration. ACM, 3. Google ScholarDigital Library
- Huzefa Kagdi, Malcom Gethers, and Denys Poshyvanyk. 2013. Integrating conceptual and logical couplings for change impact analysis in software. Empirical Software Engineering 18, 5 (2013), 933--969.Google ScholarCross Ref
- Huzefa Kagdi, Malcom Gethers, Denys Poshyvanyk, and Michael L Collard. 2010. Blending conceptual and evolutionary couplings to support change impact analysis in source code. In Reverse Engineering (WCRE), 2010 17th Working Conference on. IEEE, 119--128. Google ScholarDigital Library
- Vlado Kešelj, Fuchun Peng, Nick Cercone, and Calvin Thomas. 2003. N-gram-based author profiles for authorship attribution. In Proceedings of the conference pacific association for computational linguistics, PACLING, Vol. 3. 255--264.Google Scholar
- Do Ba Khang and Tun Lin Moe. 2008. Success criteria and factors for international development projects: A life-cycle-based framework. Project Management Journal 39, 1 (2008), 72--84.Google ScholarCross Ref
- Kari Laitinen. 1996. Estimating understandability of software documents. ACM SIGSOFT Software Engineering Notes 21, 4 (1996), 81--92. Google ScholarDigital Library
- Paul Mcnamee and James Mayfield. 2004. Character n-gram tokenization for European language text retrieval. Information retrieval 7, 1--2 (2004), 73--97. Google ScholarDigital Library
- John Noll, Sarah Beecham, and Ita Richardson. 2010. Global software development and collaboration: barriers and solutions. ACM inroads 1, 3 (2010), 66--78. Google ScholarDigital Library
- David Lorge Parnas. 2011. Precise documentation: The key to better software. In The Future of Software Engineering. Springer, 125--148.Google Scholar
- Denys Poshyvanyk and Andrian Marcus. 2006. The conceptual coupling metrics for object-oriented systems. In Software Maintenance, 2006. ICSM'06. 22nd IEEE International Conference on. IEEE, 469--478. Google ScholarDigital Library
- Denys Poshyvanyk, Andrian Marcus, Rudolf Ferenc, and Tibor Gyimóthy. 2009. Using information retrieval based coupling measures for impact analysis. Empirical software engineering 14, 1 (2009), 5--32. Google ScholarDigital Library
- Abdallah Qusef, Gabriele Bavota, Rocco Oliveto, Andrea De Lucia, and David Binkley. 2011. Scotch: Test-to-code traceability using slicing and conceptual coupling. In Software Maintenance (ICSM), 2011 27th IEEE International Conference on. IEEE, 63--72. Google ScholarDigital Library
- Jim Whitehead. 2007. Collaboration in software engineering: A roadmap. In 2007 Future of Software Engineering. IEEE Computer Society, 214--225. Google ScholarDigital Library
Recommendations
Modeling and simulation of open source development using an agile practice
Special issue: AGILE methodologies for software productionThe goal of this work is to study the effects of the adoption of agile practices on open source development. In particular, we started to evaluate the effects of TDD (Test Driven Development) since it is easer to apply in a distributed environment than ...
Open Source Ecosystems
The open source community can deliver high-quality, very popular software. This software is developed in a culturally and geographically diverse environment. The developers do it for little or no money, with little or no extrinsic management. And many ...
Impact of license choice on Open Source Software development activity
The Open Source Software (OSS) development model has emerged as an important competing paradigm to proprietary alternatives; however, insufficient research exists to understand the influence of some OSS project characteristics on the level of activity ...
Comments