NONLINEAR REGRESSION MODELS FOR ESTIMATING THE DURATION OF SOFTWARE DEVELOPMENT IN JAVA FOR PC BASED ON THE 2021 ISBSG DATA

Authors

  • S. B. Prykhodko Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine, Ukraine
  • A. V. Pukhalevych Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine, Ukraine
  • K. S. Prykhodko Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine, Ukraine
  • L. M. Makarova Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine, Ukraine

DOI:

https://doi.org/10.15588/1607-3274-2022-3-14

Keywords:

duration, software development, Java, personal computer, nonlinear regression model, normalizing transformation, non-Gaussian data, ISBSG.

Abstract

Context. The problem of estimating the duration of software development in Java for personal computers (PC) is important because, first, failed duration estimating is often the main contributor to failed software projects, second, Java is a popular language, and, third, a personal computer is a widespread multi-purpose computer. The object of the study is the process of estimating the duration of software development in Java for PC. The subject of the study is the nonlinear regression models to estimate the duration of software development in Java for PC.

Objective. The goal of the work is to build nonlinear regression models for estimating the duration of software development in Java for PC based on the normalizing transformations and deleting outliers in data to increase the confidence of the estimation in comparison to the ISBSG model for the PC platform.

Method. The models, confidence, and prediction intervals of nonlinear regressions to estimate the duration of software development in Java for PC are constructed based on the normalizing transformations for non-Gaussian data with the help of appropriate techniques. The techniques to build the models, confidence, and prediction intervals of nonlinear regressions are based on normalizing transformations. Also, we apply outlier removal for model construction. In general, the above leads to a reduction of the mean magnitude of relative error, the widths of the confidence, and prediction intervals in comparison to nonlinear models constructed without outlier removal application in the model construction process.

Results. A comparison of the model based on the decimal logarithm transformation with the nonlinear regression models based on the Johnson (for the SB family) and Box-Cox transformations as both univariate and bivariate ones has been performed.

Conclusions. The nonlinear regression model to estimate the duration of software development in Java for PC is constructed based on the decimal logarithm transformation. This model, in comparison with other nonlinear regression models, has smaller widths of the confidence and prediction intervals for effort values that are bigger than 900 person-hours. The prospects for further research may include the application of bivariate normalizing transformations and data sets to construct the nonlinear regression models for estimating the duration of software development in other languages for PC and other platforms, for example, mainframe.

Author Biographies

S. B. Prykhodko, Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine

Dr. Sc., Professor, Head of the Department of Software of Automated Systems

A. V. Pukhalevych, Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine

PhD, Lecturer of the Department of Software of Automated Systems

K. S. Prykhodko, Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine

PhD, Associate Professor of the Department of Information Systems and Technologies

L. M. Makarova, Admiral Makarov National University of Shipbuilding, Mykolaiv, Ukraine

PhD, Associate Professor, Associate Professor of the Department of Software of Automated Systems

References

Boehm B. W. Software engineering economics. Englewood Cliffs, NJ, Prentice Hall, 1981, 768 p.

Boehm B. W., Abts C., Brown A. W. et al. Software cost estimation with COCOMO II. Upper Saddle River, NJ: Prentice Hall PTR, 2000, 506 p.

Owais M., Ramakishore R. Effort, duration and cost estimation in agile software development, 2016 Ninth International Conference on Contemporary Computing (IC3), 2016, pp. 1–5, DOI: 10.1109/IC3.2016.7880216.

Abran A. Data collection and industry standards: the ISBSG repository, Software Project Estimation: The Fundamentals for Providing High Quality Information to Decision Makers, IEEE, 2015, pp. 161–184, DOI: 10.1002/9781118959312.ch8.

Putnam L. H. A general empirical solution to the macrosoftware sizing and estimating problem, IEEE Transactions on Software Engineering, 1978, Vol. 4, No. 2, July, pp. 345– 361.

Oligny S., Bourque P. , Abran A. , Fournier B. Exploring the relation between effort and duration in software engineering projects, Proceedings of the World Computer Congress, Aug. 2000, P. 175–178.

Prykhodko S. B., Pukhalevich A. V. Developing PC Software Project Duration Model based on Johnson transformation, Proceedings of the 12th International Conference Modern Problems of Radio Engineering, Telecommunications and Computer Science TCSET’2014, Lviv-Slavske, Ukraine. Lviv, Polytechnic National University, 2014, pp. 114–116.

Prykhodko S.B., Pukhalevich A. V. Confidence interval estimation of PC software project duration regression based on Johnson transformation, Radioelectronic and Computer Systems. Kharkiv, 2014, No. 2 (66), pp. 104–107. ISSN: 1814-4225

López-Martín C., Abran A. Neural networks for predicting the duration of new software projects, Journal of Systems and Software, 2015, Vol. 101, pp. 127–135. DOI: 10.1016/J.JSS.2014.12.002

Pospieszny P., Czarnacka-Chrobot B., Kobylinski A. An effective approach for software project effort and duration estimation with machine learning algorithms, Journal of Systems and Software, 2018, pp. 184–196. DOI: 10.1016/J.JSS.2017.11.066

Prykhodko S., Prykhodko N. Mathematical modeling of non-Gaussian dependent random variables by nonlinear regression models based on the multivariate normalizing transformations, Mathematical Modeling and Simulation of Systems : 15th International Scientific-practical Conference MODS’2020. Chernihiv, Ukraine, June 29 – July 01, 2020, selected papers. Springer, Cham, 2021, P. 166–174. (Advances in Intelligent Systems and Computing, Vol. 1265). DOI: 10.1007/978-3-030-58124-4_16

Bates D. M., Watts D. G. Nonlinear regression analysis and its applications. New York, John Wiley & Sons, 1988, 384 p. DOI:10.1002/9780470316757

Seber G.A.F., C. J. Wild Nonlinear regression. New York, John Wiley & Sons, 1989, 768 p. DOI: 10.1002/0471725315

Ryan T. P. Modern regression methods. New York, John Wiley & Sons, 1997, 529 p. DOI: 10.1002/9780470382806

Drapper N. R., Smith H. Applied regression analysis. New York, John Wiley & Sons, 1998, 736 p.

Johnson R. A., Wichern D. W. Applied multivariate statistical analysis, Pearson Prentice Hall, 2007, 800 p.

Chatterjee S., Simonoff J. S. Handbook of regression analysis. New York, John Wiley & Sons, 2013, 236 p. DOI: 10.1002/9781118532843

Prykhodko S., Prykhodko N., Makarova L., et al. Detecting Outliers in Multivariate Non-Gaussian Data on the basis of Normalizing Transformations, Electrical and Computer Engineering : the 2017 IEEE First Ukraine Conference (UKRCON) «Celebrating 25 Years of IEEE Ukraine Section», Kyiv, Ukraine, May 29 – June 2, 2017 : proceedings. Kyiv, IEEE, 2017, pp. 846–849. DOI: 10.1109/UKRCON.2017.8100366

Prykhodko S., Prykhodko N., Makarova L. et al. Application of the Squared Mahalanobis Distance for Detecting Outliers in Multivariate Non-Gaussian Data, Radioelectronics, Telecommunications and Computer Engineering : 14th International Conference on Advanced Trends (TCSET). LvivSlavske, Ukraine, February 20–24, 2018 : proceedings, pp. 962–965. DOI: 10.1109/TCSET.2018.8336353

Prykhodko S., Prykhodko N., Knyrik K. Estimating the efforts of mobile application development in the planning phase using nonlinear regression analysis, Applied Computer Systems, 2020, Vol. 25, No. 2, pp. 172–179. DOI: 10.2478/acss-2020-0019

Mardia K.V. Measures of multivariate skewness and kurtosis with applications, Biometrika, 1970, Vol. 57, pp. 519– 530. DOI: 10.1093/biomet/57.3.519

Mardia K.V. Applications of some measures of multivariate skewness and kurtosis in testing normality and robustness studies, Sankhya: The Indian Journal of Statistics, Series B (1960–2002), 1974, Vol. 36, Issue 2, pp. 115–128.

Campbell S. L., Chancelier J.-P., Nikoukhah R.Modeling and simulation in Scilab/Scicos. Springer, 2005, 313 p.

Foss T., Stensrud E., Kitchenham B., Myrtveit I. A simulation study of the model evaluation criterion MMRE, IEEE Transactions on software engineering, 2003, Vol. 29, Issue 11, pp. 985–995. DOI: 10.1109/TSE.2003.1245300

Port D., Korte M. Comparative studies of the model evaluation criterions MMRE and PRED in software cost estimation research, Empirical Software Engineering and Measurement, the 2nd ACM-IEEE International Symposium ESEM, Kaiserslautern, Germany, October, 2008 : proceedings. New York, ACM, 2008, pp. 51–60

Downloads

Published

2022-10-17

How to Cite

Prykhodko, S. B., Pukhalevych, A. V., Prykhodko, K. S., & Makarova, L. M. (2022). NONLINEAR REGRESSION MODELS FOR ESTIMATING THE DURATION OF SOFTWARE DEVELOPMENT IN JAVA FOR PC BASED ON THE 2021 ISBSG DATA . Radio Electronics, Computer Science, Control, (3), 144. https://doi.org/10.15588/1607-3274-2022-3-14

Issue

Section

Progressive information technologies