Loading...

Elaris Computing Nexus

Elaris Computing Nexus


Automated Code Review and Technical Debt Detection in Agile Development Using Natural Language Processing on Commit Histories


Elaris Computing Nexus

Received On : 18 October 2025

Revised On : 25 November 2025

Accepted On : 08 December 2025

Published On : 06 January 2026

Volume 02, 2026

Pages : 001-012


Abstract

A significant issue in the recent development is the growing concern with the quality of code and low levels of technical debt in the agile software development environments that can lead to an absence of adequate manual reviews of the code. Recent automated review systems primarily rely on the application of static code analysis and they fail to give credit to contextual and semantic information found in commit histories. This paper demonstrates a severe lack in the exploitation of the information stored in natural language and which developers add to their commits to detect the early indications of technical debt trends. To overcome this drawback, a new framework that presents the utilization of Natural Language Processing (NLP) tools to handle commit messages and other metadata to generate an automatic code review and debt analysis is proposed. The methodology is a combination of transformer-based language model, semantic similarity analysis and classification alongside the application of keywords to detect debt-inducing changes. The experimental background is rooted in the examination of publicly available Git repositories comprising annotated examples of technical debt and evaluates performance with the measures of accuracy, recall, and F1-score. The F1-score of the specified method improves the conventional tools of the statical analysis by 18.6% and the precision and the recall values are more than 0.89 and 0.87, respectively. The main component of this work is the availability of environment-sensitive, NLP-based framework, which enhances automated review of the code and the proactive management of technical debt of the agile development cycles.

Keywords

Automated Code Review, Technical Debt Detection, Natural Language Processing, Commit History Analysis, Agile Software Development, Transformer Models.

  1. M. Ahmed, S. U. R. Khan, and K. A. Alam, “An NLP-based quality attributes extraction and prioritization framework in Agile-driven software development,” Automated Software Engineering, vol. 30, no. 1, Jan. 2023, doi: 10.1007/s10515-022-00371-9.
  2. M. A. Quintana, R. R. Palacio, G. B. Soto, and S. González-López, “Agile Development Methodologies and Natural Language Processing: A Mapping Review,” Computers, vol. 11, no. 12, p. 179, Dec. 2022, doi: 10.3390/computers11120179.
  3. R. Younisse and M. Azzeh, “Application of Natural Language Processing Techniques in Agile Software Project Management: A Survey,” 2023 14th International Conference on Information and Communication Systems (ICICS), pp. 01–06, Nov. 2023, doi: 10.1109/icics60529.2023.10330468.
  4. A. M. Radwan, M. A. Abdel-Fattah, and W. Mohamed, “Smart Agile Prioritization and Clustering: An AI-Driven Approach for Requirements Prioritization,” IEEE Access, vol. 13, pp. 127335–127350, 2025, doi: 10.1109/access.2025.3589959.
  5. G. B. Herwanto, G. Quirchmayr, and A. M. Tjoa, “Leveraging NLP Techniques for Privacy Requirements Engineering in User Stories,” IEEE Access, vol. 12, pp. 22167–22189, 2024, doi: 10.1109/access.2024.3364533.
  6. B. Yalçıner, K. Dinçer, A. G. Karaçor, and M. Ö. Efe, “Enhancing Agile Story Point Estimation: Integrating Deep Learning, Machine Learning, and Natural Language Processing with SBERT and Gradient Boosted Trees,” Applied Sciences, vol. 14, no. 16, p. 7305, Aug. 2024, doi: 10.3390/app14167305.
  7. T. Catak, P. O. Durdu, and S. Ilhan Omurca, “Enhancing Agile Effort Estimation: An NLP Approach for Software Requirements Analysis,” 2024 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), pp. 1–8, May 2024, doi: 10.1109/hora61326.2024.10550870.
  8. A. Alhaizaey and M. Al-Mashari, “Automated Classification and Identification of Non-Functional Requirements in Agile-Based Requirements Using Pre-Trained Language Models,” IEEE Access, vol. 13, pp. 87401–87417, 2025, doi: 10.1109/access.2025.3570359.
  9. A. M. Almanaseer, W. Alzyadat, M. Muhairat, S. Al-Showarah, and A. Alhroob, “A proposed model for eliminating nonfunctional requirements in Agile Methods using natural language processes,” 2022 International Conference on Emerging Trends in Computing and Engineering Applications (ETCEA), pp. 1–7, Nov. 2022, doi: 10.1109/etcea57049.2022.10009796.
  10. D. Planötscher, “NLP and GenAI in Agile Project Management: A Systematic Mapping Study,” Agile Processes in Software Engineering and Extreme Programming – Workshops, pp. 41–49, Oct. 2025, doi: 10.1007/978-3-032-05799-0_5.
  11. B. Yang, X. Ma, C. Wang, H. Guo, H. Liu, and Z. Jin, “User story clustering in agile development: a framework and an empirical study,” Frontiers of Computer Science, vol. 17, no. 6, Jan. 2023, doi: 10.1007/s11704-022-8262-9.
  12. Meiliana, G. Daniella, N. Wijaya, N. G. E. Putra, and R. Efata, “Agile Software Development Effort Estimation based on Product Backlog Items,” Procedia Computer Science, vol. 227, pp. 186–193, 2023, doi: 10.1016/j.procs.2023.10.516.
  13. A. M. Radwan, M. A. Abdel-Fattah, and W. Mohamed, “AI-Driven Prioritization Techniques of Requirements in Agile Methodologies: A Systematic Literature Review,” International Journal of Advanced Computer Science and Applications, vol. 15, no. 9, 2024, doi: 10.14569/ijacsa.2024.0150983.
  14. F. Casillo, V. Deufemia, and C. Gravino, “Detecting privacy requirements from User Stories with NLP transfer learning models,” Information and Software Technology, vol. 146, p. 106853, Jun. 2022, doi: 10.1016/j.infsof.2022.106853.
  15. S.-C. Necula, F. Dumitriu, and V. Greavu-Șerban, “A Systematic Literature Review on Using Natural Language Processing in Software Requirements Engineering,” Electronics, vol. 13, no. 11, p. 2055, May 2024, doi: 10.3390/electronics13112055.
  16. O. Araque, J. F. Sánchez-Rada, and C. A. Iglesias, “GSITK: A sentiment analysis framework for agile replication and development,” SoftwareX, vol. 17, p. 100921, Jan. 2022, doi: 10.1016/j.softx.2021.100921.
CRediT Author Statement

The author reviewed the results and approved the final version of the manuscript.

Conceptualization: Zhanar Sartabanova and Minu Balakrishnan; Methodology: Zhanar Sartabanova; Software: Zhanar Sartabanova; Data Curation: Minu Balakrishnan; Writing- Original Draft Preparation: Zhanar Sartabanova and Minu Balakrishnan; Visualization: Zhanar Sartabanova and Minu Balakrishnan; Investigation: Zhanar Sartabanova and Minu Balakrishnan; Supervision: Minu Balakrishnan; Validation: Zhanar Sartabanova and Minu Balakrishnan; Writing- Reviewing and Editing: Zhanar Sartabanova and Minu Balakrishnan; All authors reviewed the results and approved the final version of the manuscript.

Acknowledgements

The authors would like to thank to the reviewers for nice comments on the manuscript.

Funding

No funding was received to assist with the preparation of this manuscript.

Ethics Declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Availability of Data and Materials

Data sharing is not applicable to this article as no new data were created or analysed in this study.

Author Information

Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.

Corresponding Author



Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit: https://creativecommons.org/licenses/by-nc-nd/4.0/

Cite this Article

Zhanar Sartabanova and Minu Balakrishnan, “Automated Code Review and Technical Debt Detection in Agile Development Using Natural Language Processing on Commit Histories”, Elaris Computing Nexus, pp. 001-012, 2026, doi: 10.65148/ECN/2026001.

Copyright

© 2026 Zhanar Sartabanova and Minu Balakrishnan. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.