Deep Learning-Based Software Defect Detection: A Comparative Study of Neural Network Architectures

Authors

  • Linda Marlinda Department of Informatic Engineering, Universitas Nusa Mandiri Author
  • Gilang Mahendra Department of Computer System, Universitas Sultan Fatah Author
  • Ade Kurniawan Department of Informatic Engineering, Universitas Dian Nuswantoro Author
  • Doni Ramdhani Department of Informatic Engineering, Universitas Dian Nuswantoro Author
  • Irma Eryanti Putri Department of Computer Science, Universitas Muhammadiyah Bima Author
  • Miftahul Jannah Department of Computer Science, Universitas Muhammadiyah Bima Author

DOI:

https://doi.org/10.64479/iarci.v1i2.65

Keywords:

Software defect prediction, deep learning, neural networks, comparative analysis, software quality assurance

Abstract

Software defect prediction plays a crucial role in software quality assurance by enabling early identification of defect-prone modules, thereby reducing testing effort and improving software reliability. This study presents a comprehensive comparative analysis of three widely used deep learning architectures Multi-Layer Perceptron (MLP), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) for software defect prediction under identical experimental conditions. A systematic seven-phase framework was employed, covering data collection, preprocessing, feature engineering, model implementation, training, validation, and comparative evaluation using twelve datasets from the NASA Metrics Data Program. Experimental results indicate that the LSTM architecture consistently outperforms CNN and MLP, achieving an average accuracy of 93.5%, precision of 94.2%, recall of 93.1%, F1-score of 93.6%, and ROC-AUC of 0.947 across all datasets. Statistical significance analysis using Friedman and Wilcoxon signed-rank tests confirms that the performance improvements of LSTM are statistically significant (p < 0.001) with large effect sizes. Furthermore, cross-dataset evaluation demonstrates that LSTM exhibits superior generalization capability, with a smaller average accuracy degradation compared to CNN and MLP. The study also highlights important trade-offs between predictive performance and computational efficiency, providing practical guidance for architecture selection in real-world software defect prediction systems. These findings contribute empirical insights and deployment-oriented recommendations for advancing automated software quality assurance.

Downloads

Download data is not yet available.

References

[1] E. Iannone, R. Guadagni, F. Ferrucci, A. De Lucia, and F. Palomba, “The Secret Life of Software Vulnerabilities: A {Large-Scale} Empirical Study,” IEEE Trans. Softw. Eng., 2023.

[2] M. J. Hernández-Molinos, Á. Sánchez-Garc’ia, R. Barrientos-Mart’inez, J. C. Pérez-Arriaga, and J. O. Ocharán-Hernández, “Software Defect Prediction with Bayesian Approaches,” Mathematics, vol. 11, 2023.

[3] N. Alnor and A. Khleel, “Software defect prediction using a bidirectional LSTM network combined with oversampling techniques,” vol. 0123456789, pp. 3615–3638, 2024, doi: 10.1007/s10586-023-04170-z.

[4] A. Taskeen, S.-U.-R. Khan, and E. A. Felix, “A research landscape on software defect prediction,” J. Softw. Evol. Process, 2023.

[5] L. Gong, H. Zhang, J. Zhang, M. Wei, and Z. Huang, “A Comprehensive Investigation of the Impact of Class Overlap on Software Defect Prediction,” IEEE Trans. Softw. Eng., 2023.

[6] N. Grattan, D. A. da Costa, and N. Stanger, “The need for more informative defect prediction: A systematic literature review,” Inf. Softw. Technol., 2024.

[7] Z. Li, H. Zhang, X.-Y. Jing, J. Xie, M. Guo, and J. Ren, “DSSDPP: Data Selection and Sampling Based Domain Programming Predictor for Cross-Project Defect Prediction,” IEEE Trans. Softw. Eng., vol. 49, no. 4, pp. 1941–1963, 2023, doi: 10.1109/TSE.2022.3204589.

[8] N. A. A. Khleel and K. Nehéz, “Software defect prediction using a bidirectional {LSTM} network combined with oversampling techniques,” Cluster Comput., 2023.

[9] A. Abdu, Z. Zhai, H. A. Abdo, and R. Algabri, “Software Defect Prediction Based on Deep Representation Learning of Source Code From Contextual Syntax and Semantic Graph,” IEEE Trans. Reliab., 2024.

[10] Y. Al-Smadi, M. Eshtay, A. Al-qerem, S. Nashwan, O. Ouda, and A. A. A. El-Aziz, “Reliable prediction of software defects using Shapley interpretable machine learning models,” Egypt. Informatics J., 2023, [Online]. Available: https://api.semanticscholar.org/CorpusID:260391021

[11] A. Abdu et al., “Semantic and traditional feature fusion for software defect prediction using hybrid deep learning model,” Sci. Rep., vol. 14, 2024.

[12] A. Alazba and H. Aljamaan, “Software Defect Prediction Using Stacking Generalization of Optimized {Tree-Based} Ensembles,” Appl. Sci., vol. 12, 2022.

[13] X. Dong, Y. Liang, S. Miyamoto, and S. Yamaguchi, “Ensemble learning based software defect prediction,” J. Eng. Res., 2023, [Online]. Available: https://api.semanticscholar.org/CorpusID:265011535

[14] B. Arasteh, K. Arasteh, A. Ghaffari, and R. Ghanbarzadeh, “A new binary chaos-based metaheuristic algorithm for software defect prediction,” Cluster Comput., 2024.

[15] H. Das, S. Prajapati, M. Gourisaria, R. M. Pattanayak, A. Alameen, and M. S. Kolhar, “Feature Selection Using Golden Jackal Optimization for Software Fault Prediction,” Mathematics, vol. 11, 2023.

[16] Y. Zhao, K. Damevski, and H. Chen, “A Systematic Survey of Just-in-Time Software Defect Prediction,” ACM Comput. Surv., vol. 55, pp. 1–35, 2022, [Online]. Available: https://api.semanticscholar.org/CorpusID:252910605

[17] and H. A. Alazba, Amal, “applied sciences Software Defect Prediction Using Stacking Generalization of,” 2022.

[18] A. Abdu, Z. Zhai, R. Algabri, H. A. Abdo, K. Hamad, and M. A. Al-antari, “Deep {Learning-Based} Software Defect Prediction via Semantic Key Features of Source {Code---Systematic} Survey,” Mathematics, vol. 10, 2022.

[19] M. Ali, T. Mazhar, A. Al-Rasheed, T. Shahzad, Y. Ghadi, and M. A. Khan, “Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning,” PeerJ Comput. Sci., 2024.

[20] N. S. Thomas and S. Kaliraj, “An Improved and Optimized Random Forest Based Approach to Predict the Software Faults,” SN Comput. Sci., 2024.

[21] N. A. A. Khleel and K. Nehéz, “A novel approach for software defect prediction using {CNN} and {GRU} based on {SMOTE} Tomek method,” J. Intell. Inf. Syst., 2023.

Downloads

Published

2026-02-24

Issue

Section

Articles

How to Cite

Deep Learning-Based Software Defect Detection: A Comparative Study of Neural Network Architectures. (2026). Indonesian Applied Research Computing and Informatics, 1(2), 47-59. https://doi.org/10.64479/iarci.v1i2.65

Similar Articles

You may also start an advanced similarity search for this article.