Studies in the extensively automatic construction of large odds-based inference networks from structured data. Examples from medical, bioinformatics, and health insurance claims data
- PMID: 29500985
- DOI: 10.1016/j.compbiomed.2018.02.013
Studies in the extensively automatic construction of large odds-based inference networks from structured data. Examples from medical, bioinformatics, and health insurance claims data
Abstract
Theoretical and methodological principles are presented for the construction of very large inference nets for odds calculations, composed of hundreds or many thousands or more of elements, in this paper generated by structured data mining. It is argued that the usual small inference nets can sometimes represent rather simple, arbitrary estimates. Examples of applications in clinical and public health data analysis, medical claims data and detection of irregular entries, and bioinformatics data, are presented. Construction of large nets benefits from application of a theory of expected information for sparse data and the Dirac notation and algebra. The extent to which these are important here is briefly discussed. Purposes of the study include (a) exploration of the properties of large inference nets and a perturbation and tacit conditionality models, (b) using these to propose simpler models including one that a physician could use routinely, analogous to a "risk score", (c) examination of the merit of describing optimal performance in a single measure that combines accuracy, specificity, and sensitivity in place of a ROC curve, and (d) relationship to methods for detecting anomalous and potentially fraudulent data.
Keywords: Anomaly detection; Bayes Net; Big Data; Bioinformatics; Clinical decision support; Data mining; Fraud detection; Hyperbolic Dirac Net; Inference net; Machine learning.
Copyright © 2018 Elsevier Ltd. All rights reserved.
Similar articles
-
Implementation of a web based universal exchange and inference language for medicine: Sparse data, probabilities and inference in data mining of clinical data repositories.Comput Biol Med. 2015 Nov 1;66:82-102. doi: 10.1016/j.compbiomed.2015.07.015. Epub 2015 Jul 28. Comput Biol Med. 2015. PMID: 26386548
-
Studies in the use of data mining, prediction algorithms, and a universal exchange and inference language in the analysis of socioeconomic health data.Comput Biol Med. 2019 Sep;112:103369. doi: 10.1016/j.compbiomed.2019.103369. Epub 2019 Jul 25. Comput Biol Med. 2019. PMID: 31377681
-
Suggestions for a Web based universal exchange and inference language for medicine.Comput Biol Med. 2013 Dec;43(12):2297-310. doi: 10.1016/j.compbiomed.2013.09.010. Epub 2013 Sep 20. Comput Biol Med. 2013. PMID: 24211018
-
Data mining in clinical big data: the frequently used databases, steps, and methodological models.Mil Med Res. 2021 Aug 11;8(1):44. doi: 10.1186/s40779-021-00338-z. Mil Med Res. 2021. PMID: 34380547 Free PMC article. Review.
-
Hyperbolic Dirac Nets for medical decision support. Theory, methods, and comparison with Bayes Nets.Comput Biol Med. 2014 Aug;51:183-97. doi: 10.1016/j.compbiomed.2014.03.014. Epub 2014 Apr 8. Comput Biol Med. 2014. PMID: 24954566 Review.
Cited by
-
Towards faster response against emerging epidemics and prediction of variants of concern.Inform Med Unlocked. 2022;31:100966. doi: 10.1016/j.imu.2022.100966. Epub 2022 May 20. Inform Med Unlocked. 2022. PMID: 35611320 Free PMC article. Review.
-
The use of knowledge management tools in viroinformatics. Example study of a highly conserved sequence motif in Nsp3 of SARS-CoV-2 as a therapeutic target.Comput Biol Med. 2020 Oct;125:103963. doi: 10.1016/j.compbiomed.2020.103963. Epub 2020 Aug 13. Comput Biol Med. 2020. PMID: 32828990 Free PMC article.
-
Bioinformatics studies on a function of the SARS-CoV-2 spike glycoprotein as the binding of host sialic acid glycans.Comput Biol Med. 2020 Jul;122:103849. doi: 10.1016/j.compbiomed.2020.103849. Epub 2020 Jun 8. Comput Biol Med. 2020. PMID: 32658736 Free PMC article.
-
COVID-19 Coronavirus spike protein analysis for synthetic vaccines, a peptidomimetic antagonist, and therapeutic drugs, and analysis of a proposed achilles' heel conserved region to minimize probability of escape mutations and drug resistance.Comput Biol Med. 2020 Jun;121:103749. doi: 10.1016/j.compbiomed.2020.103749. Epub 2020 Apr 11. Comput Biol Med. 2020. PMID: 32568687 Free PMC article. Review.
-
Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus.Comput Biol Med. 2020 Apr;119:103670. doi: 10.1016/j.compbiomed.2020.103670. Epub 2020 Feb 26. Comput Biol Med. 2020. PMID: 32209231 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources