Data Governance in the Health Industry: Investigating Data Quality Dimensions within a Big Data Context
Abstract
:1. Introduction
- Big Data is being used to improve decision making in the healthcare industry by increasing the potential of evidence-based medicine’s (EBM’s) “small data” [4]. EBM is defined as “the conscientious, explicit, and judicious use of current best evidence in making decisions about the care of individual patients” [5]. Personalized decision support systems (PDSS) are enhancing personalized medicine or evidence-based medicine through big data analytics [6].
- Healthcare frauds are very serious issues in many countries. Big Data, with the help of data mining combined with machine learning, can play a major role in fraud detection. Data mining can identify some fraud as soon as it happens and therefore positively increase prevention.
- Big Data analytics is being applied with the aim of reducing patients’ readmission numbers. Patient readmission is not only very expensive for hospitals, but the ratio of patients’ dying after readmission is alarmingly high [7].
- Big data is proving to be a very useful tool for medical research. As there are many very large medical datasets, such as the human genomic dataset, pharmaceutical companies are harnessing the power of Big Data analytics to discover new medicines and understand diseases.
- Through the use of Internet of Things (IoT) in healthcare, data is retrieved in a pervasive manner. The data collected through IoT governs the daily life of the patient. Through integrating Big Data and IoT with healthcare, both patients and health facilities cut down costs by reducing the repetition of tests, so they benefit from more accurate diagnoses.
2. Literature Review
2.1. Data Governance and Data Quality
2.2. Data Quality Dimensions
2.3. Dimensions of Data Quality for Big Data
- Contextual consistency refers to how far big datasets are used within the same domain of interest independently of data format, size, and velocity of the production of data. For the current research, the domain of interest refers to health data. Relevancy, credibility, ease of understanding, accuracy, and confidentiality are key DQDs for this type of consistency.
- Temporal consistency conveys the idea that data needs to be understood in a consistent time slot such that the same data might not be comparable if it is from another time slot. Time concurrency, availability, and currency are deemed to be essential for temporal consistency.
- Operational consistency brings in the operational influence of technology on the production and use of data. The sources of data could be more than one in Big Data scenarios; hence, operational consistency is crucial for ensuring the veracity of data. Availability, portability, precision, completeness, and traceability are considered the main connected dimensions here. Table 2 shows a mapping of the 3Vs of Big Data (volume, variety, velocity) to the 3Cs (contextual consistency, temporal consistency, operational consistency) of data quality.
2.4. Data Quality Dimensions Specific for the Healthcare Industry
3. Research Methodology
3.1. Description of IHC
3.2. Conceptual Reasoning
- RQ1:
- Should the Intrinsic category of DQDs still be applicable in the health industry context?
- RQ2:
- How does the breadth of health data use cases impact the Contextual category of DQDs?
- RQ3:
- Is the Representational category of DQDs negatively affected by the variety characteristic of Big Data and the fact that the quality of data would depend upon the aims of data analysis?
- RQ4:
- Does the fact that health datasets are very often publicly available and voluminous data slows down access and retrieval of data negatively impact the Accessibility category of DQDs?
4. Work Undertaken
5. Discussions
6. Conclusions and Implications
Author Contributions
Funding
Conflicts of Interest
References
- O’Driscoll, A.; Daugelaite, J.; Sleator, R.D. ‘Big data’, Hadoop and cloud computing in genomics. J. Biomed. Inform. 2013, 46, 774–781. [Google Scholar] [CrossRef] [PubMed]
- Panahy, P.; Sidi, F.; Affendey, L.; Jabar, M.; Ibrahim, H.; Mustapha, A. Discovering dependencies among data quality dimensions: A validation of instrument. J. Appl. Sci. 2013, 13, 95–105. [Google Scholar] [CrossRef]
- Raghupathi, V.; Raghupathi, W. Big data analytics in healthcare: Promise and challenges. Health Inf. Sci. Syst. 2014, 2, 3. [Google Scholar] [CrossRef] [PubMed]
- Handler, D.J. Small Data-Thinking Kills Big Data-Aspirations. Available online: http://www.wired.com/insights/2013/01/small-data-thinking-kills-big-data-aspirations/ (accessed on 27 April 2017).
- Sackett, D.L.; Rosenberg, W.M.; Gray, J.A.; Haynes, R.B.; Richardson, W.S. Evidence based medicine: What it is and what it isn’t. BMJ 1996, 312, 71–72. [Google Scholar] [CrossRef] [PubMed]
- Yesha, Y.; Janeja, V.; Rishe, N.; Yesha, Y. Personalized Decision Support System to Enhance Evidence Based Medicine through Big Data Analytics. In Proceedings of the 2014 IEEE International Conference on Healthcare Informatics (ICHI), Verona, Italy, 15–17 September 2014. [Google Scholar]
- Zolfaghar, K.; Meadem, N.; Teredesai, A.; Roy, S.B.; Chin, S.C.; Muckian, B. Big data solutions for predicting risk-of-readmission for congestive heart failure patients. In Proceedings of the 2013 IEEE International Conference on Big Data, Santa Clara, CA, USA, 6–9 October 2013; pp. 64–79. [Google Scholar]
- Batini, C.; Rula, A.; Scannapieco, M.; Viscusi, G. From data quality to big data quality. J. Database Manag. 2015, 1, 60–82. [Google Scholar] [CrossRef]
- Cai, L.; Zhu, Y. The Challenges of Data Quality and Data Quality Assessment in the Big Data Era. Data Sci. J. 2015, 14, 2. [Google Scholar] [CrossRef] [Green Version]
- Huang, H.; Stvilia, B.; Bass, H. Prioritization of Data Quality Dimensions and Skills Requirements in Genome Annotation Work. J. Am. Soc. Inf. Sci. Technol. 2012, 63, 195–207. [Google Scholar] [CrossRef]
- Weiskopf, N.G.; Weng, C. Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research. J. Am. Med. Inf. Assoc. 2013, 20, 144–151. [Google Scholar] [CrossRef] [PubMed]
- Juddoo, S. Overview of Big Data quality challenges. In Proceedings of the IEEE ICCCS Conference, Pointe au Piments, Mauritius, 4–5 December 2015. [Google Scholar] [CrossRef]
- Wang, R.; Strong, D. Beyond Accuracy: What Data Quality Means to Data Consumers. J. Manag. Inf. Syst. 1996, 12, 5–33. [Google Scholar] [CrossRef]
- Russom, P. Data Governance Strategies. Bus. Intell. J. 2008, 13, 13–14. [Google Scholar]
- Thomas, G. Data Governance Institute. Available online: www.DataGovernance.com (accessed on 8 July 2017).
- Barakhan, B. Drive towards Data Governance. Available online: http://www.ewweb.com (accessed on 11 July 2016).
- IBM. IBM Data Governance Council Maturity Model. Available online: www.ibm.com/software/data/information/trust-governance.html (accessed on 7 May 2016).
- Khatri, V.; Brown, C. Designing data governance. Commun. ACM 2012, 53, 148–152. [Google Scholar] [CrossRef]
- Malik, P. Governing Big Data: Principles and Practices. IBM J. Res. Dev. 2013, 57, 1. [Google Scholar] [CrossRef]
- Pipino, L.; Yang, L.; Wang, R. Data Quality Assessment. Commun. ACM 2002, 45, 211–218. [Google Scholar] [CrossRef]
- Islam, M.S. An Assessment for focusing the change of data quality (DQ) with timeliness in Information Manufacturing systems. In Proceedings of the SDIWC Second International Conference on Digital Enterprise and Information Systems (DEIS2013), Kuala Lumpur, Malaysia, 4–6 March 2013. [Google Scholar]
- Cappiello, C.; Francalanci, C.; Pernici, B. Data quality assessment from the user’s perspective. In Proceedings of the International Workshop on Information Quality in Information Systems, Paris, France, 18 June 2004. [Google Scholar]
- Caballero, I.; Serrano, M.; Piattinni, M. A Data Quality in Use Model for Big Data. In ER 2014: Advances in Conceptual Modeling; Springer: Cham, Switzerland, 2014; pp. 65–74. [Google Scholar]
- Dong, X.L.; Srivastava, D. Big Data Integration; Morgan and Claypool Publishers: San Rafael, CA, USA, 2015. [Google Scholar] [CrossRef]
- Saha, B.; Srivastava, D. Data Quality: The other face of Big Data. In Proceedings of the 2014 IEEE 30th International Conference on Data Engineering (ICDE), Chicago, IL, USA, 31 March–4 April 2014. [Google Scholar]
- Soares, S. Big Data Quality, Big Data Governance: An Emerging Imperative; MC Press: Boise, ID, USA; pp. 101–112.
- Geisler, S.; Weber, S.; Quix, C. Ontology-Based Data Quality Framework for Data Streams. In Proceedings of the 16th International Conference on Information Quality, Adelaide, Australia, 18–20 November 2011. [Google Scholar]
- CIHI. Canadian Institute for Health Information. Available online: https://www.cihi.ca/en/data-and-standards/data-quality (accessed on 20 August 2018).
- Langley, J.; Stephenson, S.; Thorpe, C.; Davie, G. Accuracy of injury coding under ICD-9 for New Zealand public hospital discharges. Injury Prev. 2006, 12, 58–61. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- O’Reilly, G.; Gabbe, B.; Moore, L.; Cameron, P. Classifying, measuring and improving the quality of data in trauma registries: A review of literature, Injury. Int. J. Care Injured 2016, 47, 559–567. [Google Scholar] [CrossRef] [PubMed]
- Varshney, K.R.; Wei, D.; Ramamurthy, K.N.; Mojsilovic, A. Data Challenges in Disease Response: The 2014 Ebola Outbreak and beyond. ACM J. Data Inf. Qual. 2015, 6, 2–3. [Google Scholar] [CrossRef]
- Almutiry, A.; Wills, G.; Alwabel, A.; Crowder, R.; Walters, R. Toward A Framework For Data Quality in Cloud-Based Health Information System. In Proceedings of the IEEE International Conference on Information Society, London, UK, 10–12 November 2014. [Google Scholar]
- Richard, T. Writing Integrative Literature Reviews: Guidelines and Examples. Hum. Resour. Dev. Rev. 2005, 4, 356–367. [Google Scholar] [CrossRef]
- Whittemore, R.; Knafl, K. The integrative review: Updated methodology. J. Adv. Nurs. 2005, 5, 546–553. [Google Scholar] [CrossRef] [PubMed]
- Kahn, M.; Raebel, M.A.; Glanz, J.M.; Riedlinger, K.; Steiner, J. A Pragmatic Framework for Single-site and Multisite Data Quality Assessment in Electronic Health Record-based Clinical Research. Med. Care 2012. [Google Scholar] [CrossRef] [PubMed]
- Jones, K.; Zenk, S.; Tarlov, E.; Powell, L.; Matthews, S.; Horoi, I. A step-by-step approach to improve data quality when using commercial business lists to characterize retail food environments. BMC Res. Notes 2017, 10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Amoakoh-Coleman, M.; Kayode, G.A.; Brown-Davies, C.; Agyepong, I.A.; Grobbee, D.E.; Klipstein-Grobusch, K.; Ansah, E.K. Completeness and accuracy of data transfer of routine maternal health services data in thegreater Accra region. BMC Res. Notes 2015. [Google Scholar] [CrossRef] [PubMed]
- Jacke, C.O.; Kalder, M.; Wagner, U.; Albert, U. Valid comparisons and decisions based on clinical registers and population based cohort studies: Assessing the accuracy, completeness and epidemiological relevance of a breast cancer query database. BMC Res. Notes 2012, 5, 700. [Google Scholar] [CrossRef] [PubMed]
- Serhani, M.A.; Kassabi, H.T.; Taleb, I.; Nujum, A. An Hybrid Approach to Quality Evaluation across Big Data Value Chain. In Proceedings of the 2016 IEEE International Congress on Big Data, San Francisco, CA, USA, 27 June–2 July 2016. [Google Scholar]
- Batini, C.; Scanapieca, M. Data Quality. In Data Centric Systems and Applications; Springer: Berlin, Germany, 2006. [Google Scholar]
- Xiao, Y.; Bochner, A.F.; Makunike, B.; Holec, M.; Xaba, S.; Tshimanga, M.; Chitimbire, V.; Barnhart, S.; Feldacker, C. Challenges in data quality: The influence of data quality assessments on data availability and completeness in a voluntary medical male circumcision programme in Zimbabwe. BMJ Open 2017, 7, e013562. [Google Scholar] [CrossRef] [PubMed]
- Giarrizzo-Wilson, S.; Maxwell-Downing, D.; Bianco, J. Data quality and the electronic health record (EHR). AORN J. 2011, 94. [Google Scholar] [CrossRef]
- Giarrizzo-Wilson, S.; Maxwell-Downing, D.; Bianco, J. Pre-charting patient care information. AORN J. 2011, 94. [Google Scholar] [CrossRef]
- Li, X.; Shi, Y.; Li, J.; Zhang, P. Data Mining Consulting Improve Data Quality. Data Sci. J. 2007, 6, S658–S666. [Google Scholar] [CrossRef]
- Sidi, F.; Jabar, M.; Ibrahim, H.; Mustapha, A. Data Quality: A survey of data quality dimensions. In Proceedings of the 2012 International Conference on Information Retrieval & Knowledge Management, Kuala Lumpur, Malaysia, 13–15 March 2012. [Google Scholar] [CrossRef]
- Weber, J.; Price, M.; Davies, I. Data Quality by Contract—Towards an Architectural View for Data Quality in Health Information Systems. In Proceedings of the Conference on Artificial Intelligence in Medicine in Europe, Pavia, Italy, 17–20 June 2015; pp. 143–157. [Google Scholar] [CrossRef]
- Leon, A.; Reyes, J.; Burriel, V.; Valverde, F. Data Quality Problems when Integrating Genomic Information. In Proceedings of the ER 2016 Workshops, Gifu, Japan, 14–17 November 2016; pp. 173–182. [Google Scholar] [CrossRef]
- Pinto, M. Data representation factors and dimensions from the quality function deployment (QFD) perspective. J. Inf. Sci. 2006. [Google Scholar] [CrossRef]
- Arts, D.G.T.; De Keizer, N.F.; Scheffer, G. Defining and Improving Data Quality in Medical Registries. Am Med. Inform. Assoc. 2002, 9, 600–611. [Google Scholar] [CrossRef]
- Todoran, I.-G.; Lecornu, L.; Kenchaf, A.; Le Caillac, J.M. A Methodology to Evaluate Important Dimensions of Information quality in systems. J. Data Inf. Qual. 2015, 6, 11. [Google Scholar] [CrossRef]
- Nystrom, M.; Andersson, R.; Holmqvist, K.; Weijer, J. The influence of calibration method and eye physiology on eyetracking data quality. Behav. Res. 2013, 45, 272–288. [Google Scholar] [CrossRef] [PubMed]
- Aday, L.A.; Cynamon, M. Health Survey Research Methods. In Proceedings of the 9th Conference on Health Survey Research Methods, Centers for Disease Control and Prevention, National Center for Health Statistics, Hyattsville, MD, USA, May 2010. [Google Scholar]
- Cure, O. Improving the data quality of drug databases using conditional dependencies and ontologies. ACM J. Data Inform. Qual. 2012, 4, 3. [Google Scholar] [CrossRef] [Green Version]
- Vattulainen, M. Improving the Predictive Power of Business Performance measurement systems by Constructed Data Quality Features? Five cases. In Advances in Data Mining: Applications and Theoretical Aspects; Springer: Cham, Switzerland, 2015; pp. 3–16. ISBN 978-3-319-20910-4. [Google Scholar]
- Byrd, W.L.; Byrd, T.A. Contrasting the Dimensions of Information Quality in their Effects on Healthcare Quality in Hospitals. In Proceedings of the 46th Hawaii International Conference on System Sciences, Wailea, Maui, HI, USA, 7–10 January 2013. [Google Scholar]
- Lin, Y.; Wang, H.; Zhang, S.; Li, J.; Gao, H. Efficient quality-driven source selection from massive data sources. J. Syst. Softw. 2016, 118, 221–233. [Google Scholar] [CrossRef]
- Gibson, N. Measuring the Quality of Patient Data with Particular Reference to Data Accuracy; University of Keele: Keele, UK, 1997. [Google Scholar]
- Vetro, A.; Canova, L.; Torchiano, M.; Iemma, R. Open data quality measurement framework: Definition and application to Open Government Data. Gov. Inf. Q. 2016, 33, 325–337. [Google Scholar] [CrossRef]
- Craswell, A.; Moxham, L.; Broadbent, M. Does use of computer technology for perinatal data collection influence data quality? Health Inform. J. 2016, 22, 293–303. [Google Scholar] [CrossRef] [PubMed]
- Lee, H.S.; Haider, A. Identifying Relationships of Information Quality Dimensions. In Proceedings of the 2013 Technology Management for Emerging Technologies (PICMET’13), San Jose, CA, USA, 28 July–1 August 2013. [Google Scholar]
- Lima, C.R.; Schramm, J.M.; Coeli, C.M.; da Silva, M.E. Review of data quality dimensions and applied methods in the evaluation of health information systems. Cad. Saúde Pública 2009, 25, 2095–2109. [Google Scholar] [CrossRef] [PubMed]
- CDC. Joint Meeting of International Collaborative Effort on Injury Statistics and the Global Burden of Disease-Injury Expert Group. Available online: http://www.cdc.gov/nchs/injury/ice/boston2009/boston2009_proceedings.htm#proceeding_14 (accessed on 13 January 2017).
- Blake, R.; Mangiameli, P. The Effects and Interactions of Data Quality and Problem Complexity on Classification. ACM J. Data Inf. Qual. 2011, 2. [Google Scholar] [CrossRef]
- Salati, M.; Falcoz, P.E.; Decaluwe, H.; Rocco, G.; Van Raemdonck, D.; Varela, G.; Brunelli, A.; ESTS Database Committee. The European thoracic data quality project: An Aggregate Data Quality score to measure the quality of international multi-institutional databases. Eur. J. Cardiothorac. Surg. 2016, 49, 1470–1475. [Google Scholar] [CrossRef] [PubMed]
- Rhodegero, J. The use of big data in manual physiotherapy. Man. Ther. 2014, 19, 509–510. [Google Scholar] [CrossRef] [PubMed]
- White, P. A Pilot Ontology for a Large, Diverse Set of National Health Service Healthcare Quality Indicators. Unpublished Ph.D. Thesis, City University London, London, UK, 2014. [Google Scholar]
Dimensions | Definitions |
---|---|
Accessibility | Extent to which data is available, or easily and quickly retrievable |
Appropriate amount of data | Extent to which volume of data is appropriate for the task at hand |
Believability | Extent to which data is regarded as true and credible |
Completeness | Extent to which data is not missing and is of sufficient breadth and depth for the task at hand |
Consistent representation | Extent to which data is presented in the same format |
Ease of manipulation | Extent to which data is easy to manipulate and apply to different tasks |
Free-of-error | Extent to which data is correct and reliable |
Interpretability | Extent to which data is in the appropriate languages, symbols, and units, and the definitions are clear |
Objectivity | Extent to which data is unbiased, unprejudiced, and impartial |
Relevancy | Extent to which data is applicable and helpful for the task at hand |
Reputation | Extent to which data is highly regarded in terms of its source and content |
Security | Extent to which access to data is restricted appropriately to maintain its security |
Timeliness | Extent to which data is sufficiently up-to-date |
Understandability | Extent to which data is easily comprehended |
Value-added | Extent to which data is beneficial and provides advantages from its uses |
Velocity | Volume | Variety | |
---|---|---|---|
Contextual | Consistency, Credibility, Confidentiality | Completeness, Credibility | Accuracy, Consistency, Understandability |
Temporal | Consistency, credibility, Currentness, Availability | Availability | Consistency, Currentness, Compliance |
Operational | Completeness, Accessibility, Efficiency, Traceability, Availability, Recoverability | Completeness, Accessibility, Efficiency, Availability, Recoverability | Accuracy, Compliance, Accessibility, Efficiency, Traceability, Availability, Recoverability, Precision |
Category | Description and Main DQ Dimensions |
---|---|
Intrinsic | Is explained by data having quality in their own right. (accuracy, objectivity, believability, and reputation) |
Contextual | Highlights the idea that data quality is a factor of the task at hand (value-added, relevancy, timeliness, completeness, and appropriate amount of data) |
Representational | Includes aspects linked with the format and meaning of data (interpretability, ease of understanding, representational consistency, and concise representation) |
Accessibility | Emphasizes the role of getting access to data (accessibility and access security) |
Research Article USED | DQ Dimensions | Weight |
---|---|---|
(Panahy et al., 2013) [2] | Accuracy, Completeness, Consistency, Timeliness. | L |
(Khan et al., 2012) [35] | Consistency, Completeness, Accuracy | M |
(Jones et al., 2017) [36] | Accuracy | L |
(Amoakoh-Coleman et al., 2015) [37] | Accuracy, Completeness | L |
(Jacke et al., 2012) [38] | Accuracy, Completeness | M |
(Langley et al., 2006) [29] | Accuracy | L |
(Serhani et al., 2016) [39] | Accuracy, Completeness, Consistency, Timeliness | H |
(Batini et al., 2006) [40] | Accuracy, Completeness, Accessibility, Trust, Readability, Consistency | M |
(Xiao et al., 2017) [41] | Availability, Completeness | L |
(O’Reilly et al., 2016) [30] | Accuracy, Capture, Completeness | M |
(Giarrizzo-Wilson et al., 2011) [42] | Accuracy, Completeness, Trust, Legibility | L |
(Giarrizzo-Wilson et al., 2011) [43] | Accuracy, Completeness | L |
(Varshney et al., 2015) [31] | Accuracy, Privacy and Security, Heterogeneity, Provenance and Trust, Availability, Completeness | M |
(Li et al., 2007) [44] | Completeness, Reliability, Correctness, Consistency, ‘minimality’ | L |
(Sidi et al., 2012) [45] | Accuracy, Completeness, Consistency | L |
(Weber et al., 2015) [46] | Correctness, Provenance, Currency, Plausibility | M |
(Leon et al., 2016) [47] | Accuracy, Completeness, Consistency, Currency, Reliability, Uniqueness | H |
(Pinto, 2006) [48] | Relevance, Consistency, Accuracy, Currency, Comprehensiveness, Format | L |
(Arts et al., 2002) [49] | Accuracy, Completeness, Clarity, Format | M |
(Weiskopf and Weng, 2012) [11] | Completeness, Correctness, Plausibility Concordance, Currency | M |
(Geisler et al., 2011) [27] | Accuracy, Completeness, Consistency, Timeliness, Confidence, Data Volume | M |
(Todoran et al., 2015) [50] | Accuracy, Completeness, Currency, Reliability | H |
(Nystrom et al., 2013) [51] | Accuracy, Precision | L |
(Huang et al., 2012) [10] | Accuracy, Usefulness, Accessibility, Relevance, Security | H |
(Aday and Cynamon, 2010) [52] | Accuracy, Reliability, Consistency | H |
(Cure, 2012) [53] | Accuracy, Completeness | M |
(Vattulainen, 2015) [54] | Completeness, Redundancy, Accuracy, Representativeness, Consistency | L |
(Byrd and Byrd, 2013) [55] | Accuracy, Completeness, Timeliness | M |
(Lin et al., 2016) [56] | Completeness, Consistency, Coincidence | M |
(Gibson, 1997) [57] | Accuracy, Completeness, Precision, Verifiability, Validity, Plausibility | M |
(Vetro et al., 2016) [58] | Accuracy, Completeness, Understandability, Traceability, Compliance | M |
(Craswell et al., 2016) [59] | Accuracy, Consistency, Clarity | M |
(Lee and Haider, 2013) [60] | Believability, Security, Accuracy, Timeliness | L |
(Lima et al., 2009) [61] | Reliability, Validity, Coverage, Accuracy, Completeness | M |
(CDC, 2009) [62] | Consistency, Accuracy, Plausibility | M |
(Cai and Zhu, 2015) [9] | Availability, Usability, Reliability, Relevance, Presentation Quality | M |
(Blake and Mangiameli, 2011) [63] | Accuracy, Completeness, Consistency, Timeliness | L |
(Salati et al., 2016) [64] | Completeness, Reliability | L |
(Rhodegero, 2014) [65] | Completeness, Accuracy | M |
(Kahn et al., 2012) [35] | Format, Availability, Timeliness, Consistency | L |
(White, 2014) [66] | Consistency, Conciseness, Completeness, Expandability, Sensitivity. | M |
DQD | Total Weighted Count | DQD | Total Weighted Count |
---|---|---|---|
Accuracy | 58 | Readability | 2 |
Completeness | 52 | Capture | 2 |
Consistency | 30 | Privacy | 2 |
Reliability | 15 | Heterogeneity | 2 |
Timeliness | 11 | Provenance | 2 |
Currency | 8 | Comprehensiveness | 2 |
Availability | 6 | Concordance | 2 |
Accessibility | 5 | Confidence | 2 |
Trust | 5 | Data volume | 2 |
Security | 5 | Coincidence | 2 |
Correctness | 5 | Verifiability | 2 |
Plausibility | 4 | Understandable | 2 |
Relevance | 4 | Traceability | 2 |
Clarity | 4 | Compliance | 2 |
Validity | 4 | Coverage | 2 |
Uniqueness | 3 | Usability | 2 |
Format | 3 | Presentation quality | 2 |
Precision | 3 | Expandability | 2 |
Usefulness | 3 | Sensitivity | 2 |
Category | Individual Dimensions | Count |
---|---|---|
Intrinsic | Accuracy, Trust, Plausibility, Precision, Compliance, Traceability, Verifiability, Provenance, Confidence, Concordance, Correctness | 87 |
Contextual | Completeness, Timeliness, Currency, Reliability, Availability, Uniqueness, Relevance, Validity, Expandability, Sensitivity, Coverage, Data volume, Comprehensiveness, Heterogeneity | 108 |
Representational | Consistency, Format, Usefulness, Readability, Capture, Coincidence, Understandable, Usability, Presentation quality | 48 |
Accessibility | Accessibility, Security, Privacy, Compliance | 14 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Juddoo, S.; George, C.; Duquenoy, P.; Windridge, D. Data Governance in the Health Industry: Investigating Data Quality Dimensions within a Big Data Context. Appl. Syst. Innov. 2018, 1, 43. https://doi.org/10.3390/asi1040043
Juddoo S, George C, Duquenoy P, Windridge D. Data Governance in the Health Industry: Investigating Data Quality Dimensions within a Big Data Context. Applied System Innovation. 2018; 1(4):43. https://doi.org/10.3390/asi1040043
Chicago/Turabian StyleJuddoo, Suraj, Carlisle George, Penny Duquenoy, and David Windridge. 2018. "Data Governance in the Health Industry: Investigating Data Quality Dimensions within a Big Data Context" Applied System Innovation 1, no. 4: 43. https://doi.org/10.3390/asi1040043
APA StyleJuddoo, S., George, C., Duquenoy, P., & Windridge, D. (2018). Data Governance in the Health Industry: Investigating Data Quality Dimensions within a Big Data Context. Applied System Innovation, 1(4), 43. https://doi.org/10.3390/asi1040043