MINING EFFORT DATA FROM THE OSS REPOSITORY OF DEVELOPER’S BUG FIX ACTIVITY

  • Syed Nadeem Ahsan Institute for Software Technology (IST) Graz University of Technology
  • Muhammad Tanvir Afzal Centre for Distributed and Semantic Computing (CDSC) Mohammad Ali Jinnah University
  • Safdar Zaman Institute for Software Technology (IST) Graz University of Technology
  • Christian Gütel School of Information Systems (SIS) Curtin University of Technology
  • Franz Wotawa Institute for Software Technology (IST) Graz University of Technology
Keywords: Software repository, mining effort data, estimation models, developer expertise and open source software.

Abstract

During the evolution of any software, efforts are made to fix bugs or to add new features in software. In software engineering, previous history of effort data is required to build an effort estimation model, which estimates the cost and complexity of any software. Therefore, the role of effort data is indispensable to build state-of-the-art effort estimation models. Most of the Open Source Software does not maintain any effort related information. Consequently there is no state-of-the-art effort estimation model for Open Source Software, whereas most of the existing effort models are for commercial software. In this paper we present an approach to build an effort estimation model for Open Source Software. For this purpose we suggest to mine effort data from the history of the developer’s bug fix activities. Our approach determines the actual time spend to fix a bug, and considers it as an estimated effort. Initially, we use the developer’s bug-fix-activity data to construct the developer’s activity log-book. The log-book is used to store the actual time elapsed to fix a bug. Subsequently, the log-book information is used to mine the bug fix effort data. Furthermore, the developer’s bug fix activity data is used to define three different measures for the developer’s contribution or expertise level. Finally, we used the bug-fix-activity data to visualize the developer’s collaborations and the involved source files. In order to perform an experiment we selected the Mozilla open source project and downloaded 93,607 bug reports from the Mozilla project bug tracking system i.e., Bugzilla. We also downloaded the available CVS-log data from the Mozilla project repository. In this study we reveal that in case of Mozilla only 4.9% developers have been involved in fixing 71.5% of the reported bugs.

References

Ahsan,S.N., Ferzund, J., Wotawa, F. (2009). Program File Bug Fix Effort Estimation Using Machine Learning Methods for OSS, In proceddings of 21st Software Engineering and Knowledge Engineering (SEKE), pp.129-134, Boston, USA.

Albrecht, A.J., Gaffney, J.E., Jr. (1983). Software Function, Source Lines of Code, and Development Effort Prediction: A Software Science Validation, IEEE Transactions on Software Engineer, vol. SE-9, no.6, pp. 639-648, Nov. 1983

https://doi.org/10.1109/TSE.1983.235271

Alonso, O., Devanbu, P.T., Gertz M. (2008). Expertise identification and visualization from CVS. In Proceedings of the 2008 international Working Conference on Mining Software Repositories (Leipzig, Germany, May 10 - 11, 2008). MSR '08. ACM, New York, NY, 125-128. DOI= http://doi.acm.org/10.1145/1370750.1370780

https://doi.org/10.1145/1370750.1370780

Asundi. J. (2005). The need for effort estimation models for open source software projects. SIGSOFT Software Engineering Notes 30, 4 (Jul. 2005), 1-3. DOI= http://doi.acm.org/10.1145/1082983.1083260.

https://doi.org/10.1145/1083258.1083260

Ball, T., Kim, J.-M.., Porter, A. A. Siy, H. P(1997). If your version control system could talk ... In ICSE "97 Workshop on Process Modeling and Empirical Studies of Software Engineering, May 1997.

Boehm, B. Clark, B Horowitz, E., Westland, C., Madachy, R., Selby, R. (1995). Cost Models for Future Software Life Cycle Process: COCOMO 2, Annals of Software Engineering, 1995.

https://doi.org/10.1007/BF02249046

Boetticher, G. (2001). An Assessment of Metric Contribution in the Construction of a Neural Network-Based Effort Estimator, Second Int. Workshop on Soft Computing Applied to Software Engineering, 2001.

Fischer, M.. Pinzger, M., Gall. H. (2003). Analysing and relating bug report data for feature tracking. In Proceeding of 10th Working Conference on Reverse Engineering (WCRE 2003), Victoria British Columbia, Canada, 2003 IEEE

Gousios, G Kalliamvakou, E, and ,Spinellis, D.(2008). Measuring developer contribution from software repository data. In Proceedings of the 2008 international Working Conference on Mining Software Repositories (Leipzig, Germany, May 10 - 11, 2008). MSR '08. ACM, New York, NY, 129-132. DOI= http://doi.acm.org/10.1145/1370750.1370781.

https://doi.org/10.1145/1370750.1370781

Hahn, J., Moon, J.Y. and Zhang, C. (2006) Impact of Social Ties on Open Source Project Team Formation. In The Second International Conference on Open Source Systems - OSS 2006, (Como, Italy, 2006).

Herraiz, I., Daniel M. G., Jesus M. G., Gregorio R (2008) Towards a simplification of the bug report form in eclipse. Proceedings of the 2008 international working conference on Mining software repositories, 2008, Leipzig, Germany. http://doi.acm.org/10.1145/1370750.1370786.

https://doi.org/10.1145/1370750.1370786

Joseph, F., Brian, F. (2001). Understanding Open Source Software Development. Addison Wesley, Pearson Education Book, Dec 2001

Kawin N., Dongsong Z., A Gunes K., Lina Z., Rober N. (2008). An Exploratory Study on the Evolution of OSS Developer Communities. Proceedings of the 41st Hawaii International Conference on System Sciences 2008

Koch, S. (2008). Effort Modeling and Programmer Participation in Open Source Software. Projects, Information. Economics and Policy, 20(4): 345-355.

https://doi.org/10.1016/j.infoecopol.2008.06.004

Lanza, M., Ducasse. S. (2002). Understanding Software Evolution using a Combination of Software Visualization and Software Metrics. In Proceedings of LMO 2002.

https://doi.org/10.3166/objet.8.1-2.135-149

Panjer, L.D. (2007) Predicting Eclipse Bug Lifetimes, Mining Software Repositories, ICSE Workshops MSR apos;07. 20-26 May 2007 Page(s):29 - 29.

https://doi.org/10.1109/MSR.2007.25

Pressman. R.S. (2000) Software Engineering: A Practitioner's Approach. McGraw Hill Higher Education

th edition. December 1, 2000. ISBN-13 978-0071181822.

Raymond, E.S. (1999) The Cathedral & the Bazaar. O'Reilly Retrieved, 1999.

https://doi.org/10.1007/s12130-999-1026-0

Scacchi, W. (2005) Socio-Technical Interaction Networks in Free/Open Source Software Development Processes S.T. Acuña and N. Juristo (eds.), Software Process Modeling, pp. 1-27, Springer Science and Business Media Inc., New York, 2005.

https://doi.org/10.1007/0-387-24262-7_1

Schröter, A., Zimmermann, T., Premraj, R., Zeller, A (2006).. If your bug database could talk. In Proceedings of the 5th International Symposium on Empirical Software Engineering, Volume II: Short Papers and Posters. 2006.

Schuler, D. Zimmermann, T. (2008). Mining usage expertise from version archives. In Proceedings of the 2008 international Working Conference on Mining Software Repositories (Leipzig, Germany, May 10 - 11, 2008). MSR '08. ACM, New York, NY, 121-124. DOI= http://doi.acm.org/10.1145/1370750.1370779.

https://doi.org/10.1145/1370750.1370779

Weiss, C., Premraj, R., Zimmermann, T., and Zeller, A. (2007). How Long Will It Take to Fix This Bug?, In Proceedings of the Fourth international Workshop on MSR (20-26 May, 2007). International Conference on Software Engineering. IEEE Computer Society, Washington, DC, 1.

https://doi.org/10.1109/MSR.2007.13

Yu, L. (2006). Indirectly predicting the maintenance effort of open-source software. Research. Articles, Journal of Software Maintenance and Evolution, (5 Sep, 2006), pages: 311-332.

https://doi.org/10.1002/smr.335

Published
2016-04-20
How to Cite
Ahsan, S. N., Afzal, M. T., Zaman, S., Gütel, C., & Wotawa, F. (2016). MINING EFFORT DATA FROM THE OSS REPOSITORY OF DEVELOPER’S BUG FIX ACTIVITY. Journal of IT in Asia, 3(1), 107-128. https://doi.org/10.33736/jita.38.2010
Section
Articles