Accuracy Improvement in Software Cost Estimation based on Selection of Relevant Features of Homogeneous Clusters | ||
Journal of AI and Data Mining | ||
دوره 11، شماره 3، مهر 2023، صفحه 453-476 اصل مقاله (2.01 M) | ||
نوع مقاله: Technical Paper | ||
شناسه دیجیتال (DOI): 10.22044/jadm.2023.12750.2429 | ||
نویسندگان | ||
Saba Beiranvand* 1؛ Mohammad Ali Zare Chahooki2 | ||
1Department of Computer Engineering, Technical and Vocational University (TVU), Tehran, Iran. | ||
2Department of Computer Engineering, Faculty of Engineering, Yazd University, Yazd, Iran. | ||
چکیده | ||
Software Cost Estimation (SCE) is one of the most widely used and effective activities in project management. In machine learning methods, some features have adverse effects on accuracy. Thus, preprocessing methods based on reducing non-effective features can improve accuracy in these methods. In clustering techniques, samples are categorized into different clusters according to their semantic similarity. Accordingly, in the proposed study, to improve SCE accuracy, first samples are clustered based on original features. Then, a feature selection (FS) technique is separately done for each cluster. The proposed FS method is based on a combination of filter and wrapper FS methods. The proposed method uses both filter and wrapper advantages in selecting effective features of each cluster, with less computational complexity and more accuracy. Furthermore, as the assessment criteria have significant impacts on wrapper methods, a fused criterion has also been used. The proposed method was applied to Desharnais, COCOMO81, COCONASA93, Kemerer, and Albrecht datasets, and the obtained Mean Magnitude of Relative Error (MMRE) for these datasets were 0.2173, 0.6489, 0.3129, 0.4898 and 0.4245, respectively. These results were compared with previous studies and showed improvement in the error rate of SCE. | ||
کلیدواژهها | ||
Software Cost Estimation (SCE)؛ Software Effort Estimation (SEE)؛ Machine Learning methods؛ Clustering؛ Feature Selection | ||
مراجع | ||
[1] S. K. Sehra, Y. S. Brar, N. Kaur and S. S. Sehra, "Research patterns and trends in software effort estimation," Information and Software Technology, vol. 91, pp. 1-21, 2017.
[2] S. Sarwar and M. Gupta, "Proposing effort estimation of cocomo ii through perceptron learning rule," International Journal of Computer Applications, vol. 70, no. 1, 2013.
[3] P. Pandey, "Analysis of the techniques for software cost estimation," in 2013 Third International Conference on Advanced Computing and Communication Technologies (ACCT), 2013.
[4] M. O. Elish, T. Helmy and M. I. Hussain, "Empirical study of homogeneous and heterogeneous ensemble models for software development effort estimation," Mathematical Problems in Engineering, vol. 2013, 2013.
[5] E. Papatheocharous, H. Papadopoulos and A. S. Andreou, "Feature subset selection for software cost modelling and estimation," arXiv preprint arXiv:1210.1161, 2012.
[6] C. Kirsopp, M. J. Shepperd and J. Hart, "Search heuristics, case-based reasoning and software project effort prediction," in the 4th Annual Conference on Genetic and Evolutionary Computation, Morgan Kaufmann Publishers Inc, 2002.
[7] T. Menzies, K. Ammar, A. Nikora and J. DiStefano, "How simple is software defect detection," Submitted to the Emprical Software Engineering Journal, 2003.
[8] M. F. Bosu, "Data quality in empirical software engineering: An investigation of time-aware models in software effort estimation (Doctoral dissertation, University of Otago)," Doctoral dissertation, University of Otago, 2016.
[9] S. Beiranvand and Z. Chahooki, "Bridging the semantic gap for software effort estimation by hierarchical feature selection techniques," Journal of AI and Data Mining, vol. 4, no. 2, pp. 157-168, 2016.
[10] Uc-Cetina and V´ıctor, "Recent Advances in Software Effort Estimation using Machine Learning," arXiv preprint arXiv:2303.03482 (2023).
[11] C. A. P. Rodríguez, L. M. S. Martínez, D. H. P. Ordóñez and J. A. T. Peña, "Effort Estimation in Agile Software Development: A Systematic Map Study," INGE CUC, vol. 19, no. 1, 2023.
[12] J. Antil and R. Rishi, "SOFTWARE COST ESTIMATION USING TEMPORAL DATA MINING TECHNIQUES: AN OVERVIEW," Journal of Data Acquisition and Processing, vol. 38, no. 2, pp. 2718-2728, 2023.
[13] Usman, Muhammad, J. Börstler and K. Petersen, "An effort estimation taxonomy for agile software development," International Journal of Software Engineering and Knowledge Engineering, vol. 27, no. 4, pp. 641-674, 2017.
[14] A. Zaid, M. H. Selamat, A. Ghani, R. Atan and K. Wei, "Issues in Software Cost Estimation," International Journal of Computer Science and Network Security, vol. 8, no. 11, pp. 350-356, 2008.
[15] T. R. Benala and R. Mall, "DABE: Differential evolution in analogy-based software development effort estimation.," Swarm and Evolutionary Computation, vol. 38, pp. 158-172, 2018.
[16] J. Wen, S. Li, Z. Lin, Y. Hu and C. Huang, "Systematic literature review of machine learning based software development effort estimation models," Information and Software Technology, vol. 54, no. 1, pp. 41-59, 2012.
[17] S. M. R. Chirra and H. Reza, "A survey on software cost estimation techniques," Journal of Software Engineering and Applications, vol. 12, no. 6, pp. 226-248, 2019.
[18] A. Moradbeiky, "FEEM: A Flexible Model based on Artificial Intelligence for Software," Journal of Artificial Intelligence and Data Mining (JAIDM), vol. 11, no. 1, pp. 39-51, 2023.
[19] B. Baskeles, B. Turhan and A. Bener, "Software effort estimation using machine learning methods," in 2007 22nd international symposium on computer and information sciences, IEEE, 2007.
[20] L. Radlinski, "A survey of bayesian net models for software development effort prediction," International Journal of Software Engineering and Computing, vol. 2, no. 2, pp. 95-109, 2010.
[21] A. Chavoya, C. Lopez-Martin, I. R. Andalon-Garcia and M. Meda-Campaña, "Genetic programming as alternative for predicting development effort of individual software projects," PloS one, vol. 7, no. 11, p. e50531, 2012.
| ||
آمار تعداد مشاهده مقاله: 874 تعداد دریافت فایل اصل مقاله: 563 |