Sensitivity of Bayesian Networks to Errors in Their Structure
Abstract
:1. Introduction
2. Materials and Methods
2.1. Medical Data Sets
2.2. Bayesian Network Models
2.3. Experimental Design
2.4. Node Removal
2.5. Edge Removal
2.6. Edge Reversal
3. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
ACC | Diagnostic accuracy (the proportion of correct diagnoses and a measure of a model’s quality) |
ANB | Augmented Naive Bayes (a structure learning algorithm) |
AI | Artificial Intelligence |
API | Application Programming Interface |
AUC | Area Under the (ROC) Curve (a measure of model’s ability to detect a single class) |
BN | Bayesian Network |
BSA | Bayesian Search (a structure learning algorithm) |
CPT | Conditional Probability Table |
VOI | Value Of Information (a measure of the worth/importance of a potential observation variable) |
References
- Henrion, M.; Breese, J.S.; Horvitz, E.J. Decision Analysis and Expert Systems. AI Mag. 1991, 12, 64–91. [Google Scholar]
- Morgan, M.G.; Henrion, M. Uncertainty: A Guide to Dealing with Uncertainty in Quantitative Risk and Policy Analysis; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar]
- Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Morgan Kaufmann Publishers, Inc.: San Mateo, CA, USA, 1988. [Google Scholar]
- Henrion, M.; Pradhan, M.; del Favero, B.; Huang, K.; Provan, G.; O’Rorke, P. Why Is Diagnosis Using Belief Networks Insensitive to Imprecision In Probabilities? In Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-1996), Portland, OR, USA, 1–4 August 1996; pp. 307–314. [Google Scholar]
- Oniśko, A.; Druzdzel, M.J. Sensitivity of Bayesian Networks to Noise in Their Parameters. Entropy 2024, 26, 963. [Google Scholar] [CrossRef]
- Pradhan, M.; Henrion, M.; Provan, G.; del Favero, B.; Huang, K. The Sensitivity of Belief Networks to Imprecise Probabilities: An Experimental Investigation. Artif. Intell. 1996, 85, 363–397. [Google Scholar] [CrossRef]
- Oniśko, A.; Druzdzel, M.J. Impact of Bayesian network model structure on the accuracy of medical diagnostic systems. In Artificial Intelligence and Soft Computing, Proceedings of the 13th International Conference (ICAISC 2014), Zakopane, Poland, 1–5 June 2014; Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M., Eds.; Reprinted in Springer Lecture Notes in Computer Science; Lecture Notes in Artificial Intelligence, LNAI 8468; Springer: Berlin/Heidelberg, Germany, 2014; pp. 167–178. [Google Scholar]
- Druzdzel, M.J.; van der Gaag, L.C. Building probabilistic networks: ‘Where do the numbers come from?’ Guest editors’ introduction. IEEE Trans. Knowl. Data Eng. 2000, 12, 481–486. [Google Scholar] [CrossRef]
- Oniśko, A.; Druzdzel, M.J. Impact of precision of Bayesian networks parameters on accuracy of medical diagnostic systems. Artif. Intell. Med. 2013, 57, 197–206. [Google Scholar] [CrossRef] [PubMed]
- Kelly, M.; Longjohn, R.; Nottingham, K. The UCI Machine Learning Repository. Available online: https://archive.ics.uci.edu (accessed on 1 October 2024).
- Oniśko, A.; Druzdzel, M.J.; Wasyluk, H. An experimental comparison of methods for handling incomplete data in learning parameters of Bayesian networks. In Intelligent Information Systems 2002, Proceedings of the IIS’2002 Symposium, Sopot, Poland, 3–6 June 2002; Klopotek, M., Wierzchon, S.T., Michalewicz, M., Eds.; Advances in Soft Computing Series; Physica-Verlag (A Springer-Verlag Company): Berlin/Heidelberg, Germany, 2002; pp. 351–360. [Google Scholar]
- Zwitter, M.; Soklic, M. Breast Cancer [Dataset]; UCI Machine Learning Repository. 1988. Available online: https://archive.ics.uci.edu/dataset/14/breast+cancer (accessed on 1 October 2024).
- Campos, D.; Bernardes, J. Cardiotocography [Dataset]; UCI Machine Learning Repository. 2000. Available online: https://archive.ics.uci.edu/dataset/193/cardiotocography (accessed on 1 October 2024).
- Ilter, N.; Guvenir, H. Dermatology [Dataset]; UCI Machine Learning Repository. 1998. Available online: https://archive.ics.uci.edu/dataset/33/dermatology (accessed on 1 October 2024).
- Lichtinghagen, R.; Klawonn, F.; Hoffmann, G. HCV Data [Dataset]; UCI Machine Learning Repository. 2020. Available online: https://archive.ics.uci.edu/dataset/571/hcv+data (accessed on 1 October 2024).
- Hepatitis [Dataset]; UCI Machine Learning Repository. 1983. Available online: https://archive.ics.uci.edu/dataset/46/hepatitis (accessed on 1 October 2024).
- Kononenko, I. Inductive and Bayesian learning in medical diagnosis. Appl. Artif. Intell. 1993, 7, 317–337. [Google Scholar] [CrossRef]
- Kurgan, L.; Cios, K.J.; Tadeusiewicz, R.; Ogiela, M.R.; Goodenday, L.S. Knowledge discovery approach to automated cardiac SPECT diagnosis. Artif. Intell. Med. 2001, 23, 149–169. [Google Scholar] [CrossRef] [PubMed]
- Cooper, G.F.; Herskovits, E. A Bayesian Method for the Induction of Probabilistic Networks from Data. Mach. Learn. 1992, 9, 309–347. [Google Scholar] [CrossRef]
- Koiter, J.R. Visualizing Inference in Bayesian Networks. Master’s Thesis, Delft University of Technology, Delft, The Netherlands, 2006. [Google Scholar]
- Henrion, M. Some practical issues in constructing belief networks. In Uncertainty in Artificial Intelligence 3; Kanal, L.N., Levitt, T.S., Lemmer, J.F., Eds.; Elsevier Science Publishers: Amsterdam, The Netherlands, 1989; pp. 161–173. [Google Scholar]
Data Set | Citation | Instances | Variables | Variable Types | Classes | Missing Values |
---|---|---|---|---|---|---|
Breast Cancer | [12] | 286 | 10 | Categorical | 2 | 0.31% |
Cardiotocography | [13] | 2126 | 22 | Categorical, real | 3 | — |
Dermatology | [14] | 366 | 35 | Categorical, real | 6 | 0.06% |
HCV | [15] | 615 | 13 | Categorical, real | 5 | 0.2% |
Hepatitis | [16] | 155 | 20 | Categorical, real | 2 | 5.4% |
Lymphography | [17] | 148 | 19 | Categorical, integer | 4 | — |
Primary Tumor | [17] | 339 | 18 | Categorical, integer | 20 | 3.7% |
SPECT Heart | [18] | 267 | 23 | Categorical | 2 | — |
Model | Number of Nodes | Average Number of States | Mean in-Degree | Number of Edges | Number of Parameters |
---|---|---|---|---|---|
Breast Cancer | 10 | 4.50 | 1.40 | 14 | 200 |
Cardiotocography | 22 | 2.91 | 2.86 | 63 | 13,347 |
Dermatology | 35 | 3.94 | 0.83 | 29 | 2032 |
HCV | 13 | 3.15 | 1.38 | 18 | 312 |
Hepatitis | 20 | 2.50 | 1.90 | 38 | 465 |
Lymphography | 19 | 3.00 | 1.05 | 20 | 300 |
Primary Tumor | 18 | 3.17 | 1.83 | 33 | 877 |
SPECT Heart | 23 | 2.00 | 2.26 | 52 | 290 |
Model | Number of Classes | Prevalence of Various Classes in the Data Set |
---|---|---|
Breast Cancer | 2 | (70.3%, 29.7%) |
Cardiotocography | 3 | (77.8%, 13.9%, 8.3%) |
Dermatology | 6 | (30.6%, 19.7%, 16.7%, 14.2%, 13.4%, 5.5%) |
HCV | 5 | (86.7%, 4.9%, 3.9%, 3.4%, 1.1%) |
Hepatitis | 2 | (79.2%, 20.8%) |
Lymphography | 4 | (54.5%, 41.1%, 2.9%, 1.5%) |
Primary Tumor | 20 | (24.8%, 11.5%, 8.6%, 8.3%, 7.1%, 7.1%, 5.9%, 4.7%, 4.1%, 4.1%, 2.9%, 2.7%, 2.4%, 2.1%, 1.8%, 0.6%, 0.6%, 0.3%, 0.3%, 0.3%) |
SPECT Heart | 2 | (79.4%, 20.6%) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Onisko, A.; Druzdzel, M.J. Sensitivity of Bayesian Networks to Errors in Their Structure. Entropy 2024, 26, 975. https://doi.org/10.3390/e26110975
Onisko A, Druzdzel MJ. Sensitivity of Bayesian Networks to Errors in Their Structure. Entropy. 2024; 26(11):975. https://doi.org/10.3390/e26110975
Chicago/Turabian StyleOnisko, Agnieszka, and Marek J. Druzdzel. 2024. "Sensitivity of Bayesian Networks to Errors in Their Structure" Entropy 26, no. 11: 975. https://doi.org/10.3390/e26110975