A Comprehensive Analysis of Proportional Intensity-Based Software Reliability Models with Covariates
Abstract
:1. Introduction
2. Related Works
3. NHPP-Based Software Reliability Modeling
- (i)
- ;
- (ii)
- has independent increment;
- (iii)
- ;
- (iv)
- ,
- (i)
- Software faults are detected at independent and identically distributed (i.i.d.) random times with the non-degenerate cumulative distribution function (CDF), , where is a free parameter vector.
- (ii)
- The total number of software faults remaining in software before testing, say, at time , is a Poisson random variable with parameter .
4. Proportional Intensity Model
4.1. Model Description
4.2. Maximum Likelihood Estimation
5. Numerical Examples
5.1. Goodness-of-Fit Performance
5.2. Predictive Performance
- Case I:
- All the test/development metric data are completely known through the testing phase in advance, so the software testing expenditures are exactly given in the testing.
- Case II:
- The test/development metrics data do not change from the observation point in the future.
- Case III:
- The test/development metrics data experienced in the future are regarded as independent random variables and predictable by any statistical method.
5.3. Software Reliability Assessment
6. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Lyu, M. (Ed.) Handbook of Software Reliability Engineering; McGraw Hill: New York, NY, USA, 1996. [Google Scholar]
- Musa, J.D.; Iannino, A.; Okumoto, K. Software Reliability Measurement, Prediction, Application; McGraw-Hill: New York, NY, USA, 1987. [Google Scholar]
- Pham, H. Software Reliability; Springer: London, UK, 2000. [Google Scholar]
- Goel, A.L.; Okumoto, K. Time-dependent error-detection rate model for software reliability and other performance measures. IEEE Trans. Reliab. 1979, R-28, 206–211. [Google Scholar] [CrossRef]
- Yamada, S.; Ohba, M.; Osaki, S. S-shaped reliability growth modeling for software error detection. IEEE Trans. Reliab. 1983, R-32, 475–478. [Google Scholar] [CrossRef]
- Zhao, M.; Xie, M. On maximum likelihood estimation for a general non-homogeneous Poisson process. Scand. J. Stat. 1996, 23, 597–607. [Google Scholar]
- Abdel-Ghaly, A.A.; Chan, P.Y.; Littlewood, B. Evaluation of competing software reliability predictions. IEEE Trans. Softw. Eng. 1986, SE-12, 950–967. [Google Scholar] [CrossRef]
- Ohba, M. Inflection S-shaped software reliability growth model. In Stochastic Models in Reliability Theory; Springer: New York, NY, USA, 1984; pp. 144–162. [Google Scholar]
- Gokhale, S.S.; Trivedi, K.S. Log-logistic software reliability growth model. In Proceedings of the Third IEEE International High-Assurance Systems Engineering Symposium (HASE 1998), Washington, DC, USA, 13–14 November 1998; pp. 34–41. [Google Scholar]
- Okamura, H.; Dohi, T.; Osaki, S. Software reliability growth models with normal failure time distributions. Reliab. Eng. Syst. Saf. 2013, 116, 135–141. [Google Scholar] [CrossRef]
- Achcar, J.A.; Dey, D.K.; Niverthi, M. A Bayesian approach using nonhomogeneous Poisson processes for software reliability models. In Frontiers in Reliability; World Scientific: Singapore, 1998; pp. 1–18. [Google Scholar]
- Ohishi, K.; Okamura, H.; Dohi, T. Gompertz software reliability model: Estimation algorithm and empirical validation. J. Syst. Softw. 2009, 82, 535–543. [Google Scholar] [CrossRef] [Green Version]
- Rinsaka, K.; Shibata, K.; Dohi, T. Proportional Intensity-Based Software Reliability Modeling with Time-Dependent Metrics. In Proceedings of the 30th Annual International Computer Software and Applications Conference (COMPSAC’06), Chicaco, IL, USA, 17–21 September 2006; Volume 1, pp. 369–376. [Google Scholar]
- Shibata, K.; Rinsaka, K.; Dohi, T. PISRAT: Proportional Intensity-Based Software Reliability Assessment Tool. In Proceedings of the 13th Pacific Rim International Symposium on Dependable Computing (PRDC 2007), Melbourne, VIC, Australia, 17–19 December 2007; pp. 43–52. [Google Scholar]
- Okamura, H.; Dohi, T. SRATS: Software reliability assessment tool on spreadsheet (Experience report). In Proceedings of the 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE 2013), Pasadena, CA, USA, 4–7 November 2013; pp. 100–107. [Google Scholar]
- McCabe, T.J. A complexity measure. IEEE Trans. Softw. Eng. 1976, SE-2, 308–320. [Google Scholar] [CrossRef]
- Halstead, M.H. Elements of Software Science; Elsevier: New York, NY, USA, 1977. [Google Scholar]
- Putnam, L.H. A general empirical solution to the macro software sizing and estimating problem. IEEE Trans. Softw. Eng. 1978, SE-4, 345–367. [Google Scholar] [CrossRef]
- Takahashi, M.; Kamayachi, Y. An empirical study of a model for program error prediction. In Proceedings of the 8th International Conference on Software Engineering, London, UK, 28–30 August 1985; pp. 330–336. [Google Scholar]
- Pillai, K.; Nair, V.S.S. A model for software development effort and cost estimation. IEEE Trans. Softw. Eng. 1997, 23, 485–497. [Google Scholar] [CrossRef]
- Khoshgoftaar, T.M.; Munson, J.C. Predicting software development errors using software complexity metrics. IEEE J. Sel. Areas Commun. 1990, 8, 253–261. [Google Scholar] [CrossRef]
- Khoshgoftaar, T.M.; Bhattacharyya, B.B.; Richardson, G.D. Predicting software errors, during development, using nonlinear regression models: A comparative study. IEEE Trans. Reliab. 1992, 41, 390–395. [Google Scholar] [CrossRef]
- Khoshgoftaar, T.M.; Munson, J.C.; Bhattacharyya, B.B.; Richardson, G.D. Predictive modeling techniques of software quality from software measures. IEEE Trans. Softw. Eng. 1992, 18, 979–987. [Google Scholar] [CrossRef]
- Khoshgoftaar, T.M.; Pandya, A.; Lanning, D. Application of neural networks for predicting program fault. Ann. Softw. Eng. 1995, 1, 141–154. [Google Scholar] [CrossRef]
- Schnieidewind, N.F. Software metrics model for integrating quality control and prediction. In Proceedings of the Eighth International Symposium on Software Reliability Engineering, Washington, DC, USA, 2–5 November 1997; pp. 402–415. [Google Scholar]
- Schneidewind, N.F. Measuring and evaluating maintenance process using reliability, risk, and test metrics. IEEE Trans. Softw. Eng. 1999, 25, 768–781. [Google Scholar] [CrossRef]
- Li, P.L.; Shaw, M.; Herbsleb, J.; Ray, B.; Santhanam, P. Empirical evaluation of defect projection models for widely-deployed production software systems. In Proceedings of the 12th ACM SIGSOFT Symposium on Foundations of Software Engineering, Newport Beach, CA, USA, 31 October–6 November 2004; pp. 263–272. [Google Scholar]
- Khoshgoftaar, T.M.; Gao, K.; Szabo, R. Comparing software fault predictions of pure and zero-inflated Poisson regression models. Int. J. Syst. Sci. 2005, 36, 705–715. [Google Scholar] [CrossRef]
- Amasaki, S.; Yoshitomi, T.; Mizuno, O.; Takagi, Y.; Kikuno, T. A new challenge for applying time series metrics data to software quality estimation. Softw. Qual. J. 2005, 13, 177–193. [Google Scholar] [CrossRef]
- Ascher, H. Proportional hazards modelling of software failure data. In Software Reliability; State of the Art Report; Bendell, A., Mellor, P., Eds.; Pergamon Infotech: Berkshire, UK, 1986; pp. 229–263. [Google Scholar]
- Ascher, H. The use of regression techniques for matching reliability models to the real world. In Software System Design Methods, NATO ASI Series; Skwirzynski, J.K., Ed.; Springer: Berlin/Heidelberg, Germany, 1986; Volume F22, pp. 366–378. [Google Scholar]
- Bendell, A. The use of exploratory data analysis techniques for software reliability assessment and prediction. In Software System Design Methods, NATO ASI Series; Skwirzynski, J.K., Ed.; Springer: Berlin/Heidelberg, Germany, 1986; Volume F22, pp. 337–351. [Google Scholar]
- Evanco, W.M.; Lacovara, R. A model-based framework for the integration of software metrics. J. Syst. Softw. 1995, 26, 75–84. [Google Scholar] [CrossRef]
- Evanco, W.M. Using a proportional hazards model to analyze software reliability. In Proceedings of the 9th International Conference Software Technology & Engineering Practice, Pittsburgh, PA, USA, 2 September 1999; pp. 134–141. [Google Scholar]
- Nishio, Y.; Dohi, T. Determination of the optimal software release time based on proportional hazards software reliability growth models. J. Qual. Maint. Eng. 2003, 9, 48–65. [Google Scholar] [CrossRef]
- Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. 1972, B-34, 187–220. [Google Scholar] [CrossRef]
- Murphy, S.; Sen, P. Time-dependent coefficients in a Cox type regression model. Stoch. Process. Their Appl. 1991, 39, 153–180. [Google Scholar] [CrossRef] [Green Version]
- Tian, T.; Zucker, D.; Wei, L.J. On the Cox model with time-varying regression coefficient. J. Am. Stat. Assoc. 2005, 100, 172–183. [Google Scholar] [CrossRef] [Green Version]
- Musa, J.D. Software Reliability Data, Technical Report, Data and Analysis Center for Software; Rome Air Development Center: New York, NY, USA, 1979. [Google Scholar]
- Shibata, K.; Rinsaka, K.; Dohi, T. Metrics-Based Software Reliability Models Using Non-homogeneous Poisson Processes. In Proceedings of the 2006 17ths International Symposium on Software Reliability Engineering, Raleigh, NC, USA, 7–10 November 2006; pp. 52–61. [Google Scholar]
- Okamura, H.; Etani, Y.; Dohi, T. A Multi-factor Software Reliability Model Based on Logistic Regression. In Proceedings of the 2010 IEEE 21st International Symposium on Software Reliability Engineering, San Jose, CA, USA, 1–4 November 2010; pp. 31–40. [Google Scholar]
- Kuwa, D.; Dohi, T. Generalized Logit Regression-Based Software Reliability Modeling with Metrics Data. In Proceedings of the 2013 IEEE 37th Annual Computer Software and Applications Conference, Kyoto, Japan, 22–26 July 2013; pp. 246–255. [Google Scholar]
- Kuwa, D.; Dohi, T.; Okamura, H. Generalized Cox Proportional Hazards Regression-Based Software Reliability Modeling with Metrics Data. In Proceedings of the 2013 IEEE 19th Pacific Rim International Symposium on Dependable Computing, Vancouver, BC, Canada, 2–4 December 2013; pp. 328–337. [Google Scholar]
- Nagaraju, V.; Jayasinghe, C.; Fiondella, L. Optimal test activity allocation for covariate software reliability and security models. J. Syst. Softw. 2020, 168, 110643. [Google Scholar] [CrossRef]
- Lawless, J.F.L. Regression methods for Poisson process data. J. Am. Stat. Assoc. 1987, 82, 808–815. [Google Scholar] [CrossRef]
- Goel, A.L. Software reliability models: Assumptions, limitations, and applicability. IEEE Trans. Softw. Eng. 1985, SE-11, 1411–1423. [Google Scholar] [CrossRef]
Models | |
---|---|
Exponential distribution (exp) [4] | |
Gamma distribution (gamma) [5,6] | |
Pareto distribution (pareto) [7] | |
Truncated normal distribution (tnorm) [10] | |
Log-normal distribution (lnorm) [10,11] | |
Truncated logistic distribution (tlogist) [8] | |
Log-logistic distribution (llogist) [9] | |
Truncated extreme-value maximum distribution (txvmax) [12] | |
Log-extreme-value max maximum distribution (lxvmax) [12] | |
Truncated extreme-value minimum distribution (txvmin) [12] | |
Log-extreme-value minimum distribution (lxvmin) [46] |
Data | No. Faults | Testing Days |
---|---|---|
GDS1 | 136 | 21 |
GDS2 | 54 | 17 |
GDS3 | 38 | 14 |
GDS4 | 53 | 16 |
Metrics Data: | Failure identification work, Execution time, Computer time-failure identification. |
Combination I | |
Combination II | |
Combination III | |
Combination IV | |
Combination V | |
Combination VI | |
Combination VII | |
Execution time, Failure identification work. Computer time-failure identification. |
(i) Best proportional intensity model (cumulative metrics data) | ||||
---|---|---|---|---|
Model | AIC | MSE | ||
GDS1 | tlogist-VI | 110.114 | 0.470 | |
GDS2 | tlogist-III | 69.785 | 0.282 | |
GDS3 | txvmin-II | 57.281 | 0.289 | |
GDS4 | exp-I | 81.059 | 0.612 | |
(ii) Best proportional intensity model (non-cumulative metrics data) | ||||
GDS1 | txvmin-II | 109.015 | 0.721 | |
GDS2 | llogist-II | 67.352 | 0.261 | |
GDS3 | gamma-II | 50.696 | 0.221 | |
GDS4 | exp-VI | 81.131 | 0.450 | |
(iii) Best SRATS (no metrics data) | ||||
GDS1 | tlogist | 116.891 | 0.820 | - |
GDS2 | llogist | 73.053 | 0.501 | - |
GDS3 | lxvmax | 61.694 | 0.481 | - |
GDS4 | txvmin | 79.761 | 0.530 | - |
GDS1 | ||
---|---|---|
Best model | PMSE | |
Case I (cumulative) | tlogist-III | 6.409 |
Case I (non-cumulative) | tlogist-II | 4.014 |
Case II (cumulative) | lxvmax-II | 2.160 |
Case II (non-cumulative) | txvmax-IV | 4.931 |
Case III (cumulative): Linear regression | exp-IV | 4.146 |
Case III (cumulative): Exponential regression | txvmin-V | 19.213 |
Case III (non-cumulative): Linear regression | txvmax-II | 3.916 |
SRATS | tnorm | 3.408 |
Best model | PMSE | |
Case I (cumulative) | tlogist-II | 0.816 |
Case I (non-cumulative) | tnorm-III | 0.799 |
Case II (cumulative) | gamma-II | 0.742 |
Case II (non-cumulative) | txvmax-II | 0.407 |
Case III (cumulative): Linear regression | tlogist-IV | 0.616 |
Case III (cumulative): Exponential regression | tnorm-III | 1.644 |
Case III (non-cumulative): Linear regression | tlogist-IV | 0.780 |
SRATS | tlogist | 1.769 |
GDS3 | ||
Best model | PMSE | |
Case I (cumulative) | tlogist-II | 2.676 |
Case I (non-cumulative) | txvmax-III | 0.481 |
Case II (cumulative) | exp-VII | 0.467 |
Case II (non-cumulative) | pareto-VI | 1.506 |
Case III (cumulative): Linear regression | llogist-II | 0.748 |
Case III (cumulative): Exponential regression | lxvmax-VI | 1.842 |
Case III (non-cumulative): Linear regression | lxvmax-VII | 1.769 |
SRATS | exp | 1.836 |
GDS4 | ||
Best model | PMSE | |
Case I (cumulative) | tlogist-III | 2.088 |
Case I (non-cumulative) | pareto-II | 1.506 |
Case II (cumulative) | exp-I | 0.495 |
Case II (non-cumulative) | tnorm-VI | 0.425 |
Case III (cumulative): Linear regression | txvmax-VI | 1.139 |
Case III (cumulative): Exponential regression | exp-II | 0.688 |
Case III (non-cumulative): Linear regression | lxvmin-I | 0.703 |
SRATS | tlogist | 1.754 |
GDS1 | ||
---|---|---|
Best model | PMSE | |
Case I (cumulative) | tnorm-II | 2.482 |
Case I (non-cumulative) | txvmax-III | 1.768 |
Case II (cumulative) | txvmax-VII | 2.142 |
Case II (non-cumulative) | txvmax-V | 2.903 |
Case III (cumulative): Linear regression | tnorm-II | 1.033 |
Case III (cumulative): Exponential regression | tlogist-VII | 3.159 |
SRATS | txvmin | 1.218 |
GDS2 | ||
Best model | PMSE | |
Case I (cumulative) | pareto-IV | 0.488 |
Case I (non-cumulative) | gamma-V | 0.277 |
Case II (cumulative) | lnorm-VII | 0.399 |
Case II (non-cumulative) | pareto-I | 0.466 |
Case III (cumulative): Linear regression | exp-IV | 0.455 |
Case III (cumulative): Exponential regression | llogist-VI | 0.499 |
Case III (non-cumulative): Linear regression | llogist-IV | 0.508 |
SRATS | lnorm | 0.531 |
GDS3 | ||
Best model | PMSE | |
Case I (cumulative) | tnorm-II | 0.326 |
Case I (non-cumulative) | txvmax-II | 0.150 |
Case II (cumulative) | txvmax-IV | 0.330 |
Case II (non-cumulative) | lxvmax-II | 0.982 |
Case III (cumulative): Linear regression | lxvmin-I | 0.340 |
Case III (cumulative): Exponential regression | txvmin-VI | 1.484 |
Case III (non-cumulative): Linear regression | pareto-III | 0.293 |
SRATS | exp | 0.295 |
GDS4 | ||
Best model | PMSE | |
Case I (cumulative) | exp-I | 0.213 |
Case I (non-cumulative) | lxvmin-V | 0.227 |
Case II (cumulative) | tnorm-IV | 0.220 |
Case II (non-cumulative) | tnorm-II | 0.206 |
Case III (cumulative): Linear regression | tlogist-II | 0.207 |
Case III (cumulative): Exponential regression | lxvmax-III | 0.273 |
Case III (non-cumulative): Linear regression | tlogist-VII | 0.220 |
SRATS | gamma | 0.230 |
(i) Best proportional intensity model (cumulative metrics data) | ||
---|---|---|
Model | Reliability | |
GDS1 | tlogist-VI | 2.969 × 10 |
GDS2 | tlogist-III | 9.260 × 10 |
GDS3 | txvmin-II | 9.998 × 10 |
GDS4 | exp-I | 5.455 × 10 |
(ii) Best proportional intensity model (non-cumulative metrics data) | ||
GDS1 | txvmin-II | 4.393 × 10 |
GDS2 | llogist-II | 1.984 × 10 |
GDS3 | gamma-II | 2.945 × 10 |
GDS4 | exp-VI | 4.324 × 10 |
(iii) Best SRATS (no metrics data) | ||
GDS1 | tlogist | 6.977 × 10 |
GDS2 | llogist | 4.152 × 10 |
GDS3 | lxvmax | 7.236 × 10 |
GDS4 | txvmin | 9.559 × 10 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, S.; Dohi, T.; Okamura, H. A Comprehensive Analysis of Proportional Intensity-Based Software Reliability Models with Covariates. Electronics 2022, 11, 2353. https://doi.org/10.3390/electronics11152353
Li S, Dohi T, Okamura H. A Comprehensive Analysis of Proportional Intensity-Based Software Reliability Models with Covariates. Electronics. 2022; 11(15):2353. https://doi.org/10.3390/electronics11152353
Chicago/Turabian StyleLi, Siqiao, Tadashi Dohi, and Hiroyuki Okamura. 2022. "A Comprehensive Analysis of Proportional Intensity-Based Software Reliability Models with Covariates" Electronics 11, no. 15: 2353. https://doi.org/10.3390/electronics11152353
APA StyleLi, S., Dohi, T., & Okamura, H. (2022). A Comprehensive Analysis of Proportional Intensity-Based Software Reliability Models with Covariates. Electronics, 11(15), 2353. https://doi.org/10.3390/electronics11152353