REGRESSION MODELS FOR ESTIMATING THE DURATION OF BANKING SOFTWARE DEVELOPMENT
DOI:
https://doi.org/10.32782/mathematical-modelling/2026-9-1-19Keywords:
nonlinear regression model, banking software, software development duration, project effort, estimation, outlier, normalizing transformation, Box-Cox transformation, Johnson transformationAbstract
The paper addresses the problem of improving the estimation accuracy of software development duration in a tightly integrated environment of banking information systems, which is critically important for effective project planning, optimal resource allocation and ensuring business success. It is shown that existing approaches to estimating software development time, in particular the COCOMO and ISBSG models, do not provide sufficient accuracy for the specific conditions of the banking domain, which is characterized by high architectural complexity, integration of numerous external services, regulatory requirements and increased security demands. Based on data from previous studies and analysis results, the use of nonlinear regression models with prior data normalization for estimating the duration of banking software development based on project effort is justified. An approach is proposed that involves constructing separate models for single-team and multi-team projects, which allows the model to account for additional time costs due to inter-team interaction, work synchronization and organizational complexity. The empirical basis of the study consists of data from 482 real banking software development projects for the period 2014–2024, developed in Java and JavaScript. A methodology for constructing nonlinear regression models is implemented, which includes data normalization using decimal logarithm transformations, the bivariate Box-Cox transformation and the univariate Johnson transformation, detection and removal of outliers based on the prediction ellipse, construction of linear regression for normalized data, derivation of nonlinear models through inverse transformations, construction of confidence intervals and prediction intervals and identification of outliers using prediction intervals. The quality of the models is evaluated using the R², MMRE and PRED(0.25) metrics. The obtained results indicate that models constructed using logarithmic transformation are unsuitable due to a significant number of outliers. In contrast, models based on the Box-Cox and Johnson transformations demonstrate acceptable accuracy: R² values exceed 0.85, MMRE does not exceed 0.25 and PRED(0.25) is above 0.75 in most cases. The best results are achieved by models using the bivariate Box-Cox transformation, which are characterized by a smaller number of outliers and better agreement with empirical data. The proposed models can be used to improve the accuracy of predicting the duration of banking software development in Java and JavaScript based on project effort. At the same time, the limitations of the approach are identified, in particular the dependence of the models on the specifics of the application domain and the limitation of the range of project effort values.
References
Mersino A. Project Managers Fail to Help Software Projects (Standish Group Chaos 2020). URL: https://medium.com/leadership-and-agility/project-managers-fail-to-help-softwareprojects-standish-group-chaos-2020-e65d803e99f2 (дата звернення: 12.03.2025).
Pasuksmit J., Thongtanunam P., Karunasekera S. A Systematic Literature Review on Reasons and Approaches for Accurate Effort Estimations in Agile. ACM Computing Surveys. 2024. Vol. 56 (11). P. 1–37. URL: https://doi.org/10.1145/3663365
Dasanayake S., Aaramaa S., Markkula J., Oivo M. Impact of requirements volatility on software architecture: How do software teams keep up with ever‐changing requirements? Journal of Software: Evolution and Process. 2019. Vol. 31(6). P. e2160 URL: https://doi.org/10.1002/smr.2160
Ashleigh M., Ojiako U., Chipulu M., Wang J. K. Critical learning themes in project management education: Implications for blended learning. International Journal of Project Management. 2012. Vol. 30 (2). P. 153–161. URL: https://doi.org/10.1016/j.ijproman.2011.05.002
Morgenshtern O., Raz T., Dvir D. Factors affecting duration and effort estimation errors in software development projects. Information and Software Technology. 2007. Vol. 49(8). P. 827–837, URL: https://doi.org/10.1016/j.infsof.2006.09.006
Teslyuk V., Batyuk A., Voityshyn V. Method of Software Development Project Duration Estimation for Scrum Teams with Differentiated Specializations. Systems. 2022. Vol 10(4). Article 123. URL: https://doi.org/10.3390/systems10040123
Korytkowski P., Malachowski B. Competence-based estimation of activity duration in IT projects. European Journal of Operational Research. 2019. Vol. 275 (2). P. 708–720. URL: https://doi.org/10.1016/j.ejor.2018.11.061
Pospieszny P., Czarnacka-Chrobot B., Kobylinski A. An effective approach for software project effort and duration estimation with machine learning algorithms. Journal of Systems and Software. 2018. Vol. 137. P. 184–196. URL: https://doi.org/10.1016/j.jss.2017.11.066
Menzies T., Yang Y., Mathew G., Boehm B., Hihn J. Negative results for software effort estimation. Empir Software Eng. 2017. Vol. 22, P. 2658–2683. URL: https://doi.org/10.1007/s10664-016-9472-2
Nandal D., Sangwan O. P. Software cost estimation by optimizing COCOMO model using hybrid BATGSA algorithm. International Journal of Intelligent Engineering and Systems. 2018. 11(4). P. 250–263. URL: https://doi.org/10.22266/ijies2018.0831.25
Latanska L. O., Polovynka A. V. Analysis of methods and models for estimating the duration of banking software development projects. Herald of Khmelnytskyi national university. Technical sciences. 2025. Issue 3, part 2. Р.140–145. https://doi.org/10.31891/2307-5732-2025-353-18
Латанська Л. О., Половинка А. В. Математичне моделювання в задачах оцінювання тривалості розробки банківського програмного забезпечення. Комп’ютерне моделювання та оптимізація складних систем (КМОСС-2025) : матеріали ІX Міжнародної науково-технічної конференції, м. Дніпро, 5–7 листопада 2025 р. Дніпро, 2025. С. 59.
Prykhodko N. V., Prykhodko S. B. Constructing the Nonlinear Regression Models on the Basis of Multivariate Normalizing Transformations. Electronic Modeling. 2018. Vol. 40. No 6. P. 99–108. URL: https://doi.org/10.15407/emodel.40.06.101
Prykhodko S. B., Prykhodko N. V., Koltsov A. V. A Nonlinear Regression Model for Early LOC Estimation of Open-Source Kotlin-Based Applications. Radio Electronics Computer Science Control. 2024. No 1. P. 85–95. URL: https://doi.org/10.15588/1607-3274-2024-1-8





