Next Article in Journal
Targeted Tick-Borne Disease Recognition: Assessing Risk for Improved Public Health
Previous Article in Journal
Caregiver Knowledge, Attitude, and Behavior toward Care of Children with Cerebral Palsy: A Saudi Arabian Perspective
Previous Article in Special Issue
The Role of Technology in Online Health Communities: A Study of Information-Seeking Behavior
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Data-Driven Approach to Defining Risk-Adjusted Coding Specificity Metrics for a Large U.S. Dementia Patient Cohort

by
Kaylla Richardson
1,2,
Sankari Penumaka
2,
Jaleesa Smoot
1,2,
Mansi Reddy Panaganti
2,
Indu Radha Chinta
2,
Devi Priya Guduri
2,
Sucharitha Reddy Tiyyagura
2,
John Martin
3,
Michael Korvink
3 and
Laura H. Gunn
1,2,4,*
1
Department of Public Health Sciences, University of North Carolina at Charlotte (UNC Charlotte), Charlotte, NC 28223, USA
2
School of Data Science, University of North Carolina at Charlotte (UNC Charlotte), Charlotte, NC 28223, USA
3
ITS Data Science, Premier, Inc., Charlotte, NC 28277, USA
4
School of Public Health, Faculty of Medicine, Imperial College London, London W6 8RP, UK
*
Author to whom correspondence should be addressed.
Healthcare 2024, 12(10), 983; https://doi.org/10.3390/healthcare12100983
Submission received: 26 March 2024 / Revised: 1 May 2024 / Accepted: 4 May 2024 / Published: 10 May 2024
(This article belongs to the Special Issue Data Driven Insights in Healthcare)

Abstract

:
Medical coding impacts patient care quality, payor reimbursement, and system reliability through the precision of patient information documentation. Inadequate coding specificity can have significant consequences at administrative and patient levels. Models to identify and/or enhance coding specificity practices are needed. Clinical records are not always available, complete, or homogeneous, and clinically driven metrics to assess medical practices are not logistically feasible at the population level, particularly in non-centralized healthcare delivery systems and/or for those who only have access to claims data. Data-driven approaches that incorporate all available information are needed to explore coding specificity practices. Using N = 487,775 hospitalization records of individuals diagnosed with dementia and discharged in 2022 from a large all-payor administrative claims dataset, we fitted logistic regression models using patient and facility characteristics to explain the coding specificity of principal and secondary diagnoses of dementia. A two-step approach was produced to allow for the flexible clustering of patient-level outcomes. Model outcomes were then used within a Poisson binomial model to identify facilities that over- or under-specify dementia diagnoses against healthcare industry standards across hospitalizations. The results indicate that multiple factors are significantly associated with dementia coding specificity, especially for principal diagnoses of dementia (AUC = 0.727). The practical use of this novel risk-adjusted metric is demonstrated for a sample of facilities and geospatially via a U.S. map. This study’s findings provide healthcare facilities with a benchmark for assessing coding specificity practices and developing quality enhancements to align with healthcare industry standards, ultimately contributing to better patient care and healthcare system reliability.

1. Introduction

The precise recording, evaluation, and documentation of patient information through medical coding play a large role in the quality of care delivered, reimbursement from payors, and the reliability of healthcare systems [1]. The process of clinical coding has multiple facets, such as accuracy, completeness, and appropriate levels for the specificity of diagnostic coding. Coding specificity, an important aspect of the coding process, refers to the level of granularity at which a clinical diagnosis is recorded [2].
A widely used medical coding system is the International Classification of Diseases Clinical Modification (or ICD-CM) [3]. ICD-10-CM (herein ICD-10) refers to the 10th revision of the taxonomy and serves to categorize diseases and health conditions at varying degrees of specificity, from coarse or unspecified more general diagnoses to more granular ones representing a deeper level of knowledge of the clinical condition. Medical coding helps standardize documentation across healthcare systems, serves as a tool for medical billing, standardizes risk factors in risk-adjusted quality measures, and helps guide healthcare policy by accurately identifying disease prevalence and supporting national and international decision making [3]. The coding system also helps organize and find medical information easily, impacting how we understand and use medical data, including assessing in real time the spread or prevalence of diseases and the optimization of resource allocations [3,4,5,6].
When considering coding practice enhancements, it is important to assess how healthcare professionals in hospitals document the presence of a specific condition or disease. Transcription—potentially including voice transcription for electronic health records (EHRs)—errors, missing information in patient charts, and illegible handwriting all contribute to inadequate specificity in coding [7]. It constitutes a shared responsibility among all parties involved to appropriately code to the highest level of specificity [8]. The move from ICD-9 to ICD-10 on 1 October 2015 led to a fivefold increase in the number of codes, exacerbating the complexity of coders’ work and the potential for coding errors following this transition [3]. Clinical specialties have been affected differently, with this ICD-10 transition representing an uneven burden across facilities and patients depending on the diagnosis family and the facility’s level of specialization [9,10].
A lower coding specificity is a potential burden for Medicare and other payors, which are billed for potentially under-specified diagnoses when a higher specificity (i.e., enhanced diagnosis quality) could be available [3,8]. Medicare is the health insurance coverage provided by the United States (U.S.) government for individuals 65 years and older. Medicare within the acute care inpatient setting refers to payments reimbursed through the Inpatient Prospective Payment System (IPPS). Through the IPPS, hospitalizations are grouped into medical severity diagnosis-related groups (MS-DRGs) based largely on the presence of principal and secondary ICD-10-CM diagnosis codes, and in some cases, ICD-10-PCS procedure codes. The MS-DRG grouping is associated with a weight that further adjusts the base hospital payment (determined by a set of hospital-level characteristics) to determine the final discharge-level reimbursement. There is a tradeoff between productivity and coding quality, as enhancing the coding specificity can be time intensive in some instances, requiring the coder to explore whether more specific coding is appropriate given the information provided in the patient medical record. At the facility level, the associated costs to the healthcare payor as well as the impact on patients’ clinical history, treatment, and resulting health outcomes must also be considered [8]. Coding specificity is especially relevant when a diagnosis is unrelated to the principal cause of hospitalization, or when diagnoses are not made by specialists, as added specification for these codes may not affect the hospital’s rate of reimbursement for the hospitalization. Thus, coding specificity is not only important at the administrative level but also has the potential to impact both healthcare facilities and patients.
At the facility level, coding that accurately captures clinical diagnoses ensures that healthcare facilities maintain effective billing operations. Coding specificity has the potential to impact a facility’s financial capital and allocation of resources, since facilities, after a short grace period that ended in October 2016, can be denied Medicare-based claims based on insufficient diagnostic specificity [8,11]. ICD-10 codes are the foundation of hospital billing processes, so misdiagnoses or misclassifications of codes can impact hospital reimbursement and insurance eligibility, including Medicare reimbursements [12]. Inaccurate coding further has the potential to affect facilities’ reputations and pay-for-performance incentive payments, as such hospital-ranking programs evaluate quality performance using risk-adjusted outcomes that rely on ICD-10 coding [13,14,15,16,17,18].
From the patient perspective, accurate documentation ensures that the patient is receiving an adequate treatment plan tailored to the specifics of their diagnosis and needs, both during their inpatient stay and after discharge [8]. In addition to individuals who may go undiagnosed, those with an under-specified diagnosis may also suffer worsened clinical outcomes. Among other factors, practitioners’ unconscious biases as well as inappropriate facility practices could result in deviations from universal documentation and coding standards, thus potentially exacerbating health disparities [19], which may manifest through specificity gaps across subpopulations. A lack of specificity can lead to patients’ clinical histories being affected, thus resulting in potential variations in care and resulting outcomes across social strata [8]. From the standpoint of coding specificity, the literature lacks a wider understanding of how decreased (or increased) levels of specificity may be associated with sociodemographic factors, thus potentially exacerbating the aforementioned healthcare disparities [19].
Not all unspecified diagnoses are inappropriate. In fact, unspecified diagnosis codes are recommended by the United States (U.S.) Centers for Medicare and Medicaid Services (CMS): “When sufficient clinical information is not known or available about a particular health condition to assign a more specific code, it is acceptable to report the appropriate unspecified code” [20]. Hence, there is a balance between the necessary level of unspecificity and the unnecessary level of unspecificity that needs to be considered, since a pure minimization of unspecified codes could also lead to incorrectly specified diagnoses. Conversely, achieving a higher level of specificity may require additional clinical tests or interventions, which may be subject to additional considerations regarding cost-effectiveness [21,22,23], especially when the primary cause of the inpatient stay is not related to nor affected by the unspecified diagnosis. Also, higher levels of specificity may not be warranted by clinical diagnoses. Incorrect levels of both specificity and unspecificity can lead to inappropriate treatment. Thus, when a diagnosis is not confirmed, it is appropriate to provide an initial, temporary unspecified diagnosis [20] until further tests can be performed, if clinically recommended.
While our approach is generalizable and can be applied across clinical strata, our motivating example consists of a large patient cohort across the U.S. of nearly 500,000 unique inpatient individuals who were diagnosed with dementia and discharged in 2022. In 2020, dementia affected the lives of over 55 million people across the world, which is close to 1% of the global population [24]. Projections suggest that this number will experience nearly twofold growth every 20 years, surging to 78 million by 2030 and about 139 million by 2050 [24]. This number is further increased by those providing caregiving and other family members indirectly suffering from this debilitating disease. In the absence of enhanced treatments or preventive measures, adverse outcomes associated with dementia will persistently rise [25]. Many patients are likely to receive unspecified dementia diagnoses when seeing a primary care provider compared to when seeing a specialized provider (like a neurologist or geriatrician) [26,27]. Thus, dementia represents an important disease within an aging population that is likely to be of increased relevance as treatment interventions are developed, and enhanced coding specificity is needed in this area to identify resources properly [25]. In a 2017 study, researchers reviewed the medical records of dementia ICD-10 code cases, and they discovered that many of the cases lacked specific descriptions that would aid in confirming the diagnosis of specific types of dementia [28]. This study revealed that 63% of cases did not provide a specific diagnosis of dementia in the medical records, but instead considered other conditions as the likely explanation of the patient’s hospitalization [28]. More generally, mental health conditions have been identified among conditions suffering from higher rates of unspecified diagnoses [8].
Models have been developed for assessing risk-adjusted coding intensity for both diagnoses and procedures, as well as identifying facilities that over- or under-code [29,30]. This area tangentially relates to coding specificity. However, the literature still lacks risk-adjusted approaches that account for factors potentially associated with coding specificity, adjusting for patient and facility characteristics, with only some initial work developed in the area of depression [31], but none, to our knowledge, in the area of dementia or other neurocognitive diseases. The aim of this study is to provide a novel risk-adjusted metric, demonstrated through a population-based dementia patient cohort in the U.S., to estimate dementia ICD-10 coding specificity by facility upon adjusting for a set of commonly available facility- and patient-level characteristics.
While enhancements in coding specificity practices are possible through other means, such as through the clinical identification of potential coding specificity inaccuracies or increased training, such approaches are not cost-effective if they need to be performed at the population level. There is a need for cost-effective approaches that serve to pre-screen and identify facilities which may need such enhancements the most. Clinical assessments may be possible if electronic health records are available, but this is not always the case. In this case, data-driven approaches may provide insights into how coding specificity can vary across patients and facilities and whether these variations occur in ways that may depart from anticipated randomness. Our proposed data-driven metric can serve facilities to self-assess variation in coding specificity compared with their healthcare peers and can provide a benchmark to identify facilities that could benefit from a further analysis of diagnostic coding specificity practices.

2. Materials and Methods

2.1. Data and Variables

De-identified data sourced and provided by Premier, Inc.’s private database serve as the foundation of this analysis [32]. The dataset is composed of N = 487,775 observations containing information on the first inpatient hospitalization for each patient with a principal or secondary diagnosis related to dementia who was discharged in the year 2022 using the F ICD-10 diagnosis codes provided in Supplementary Tables S1 and S2. The ICD-10 codes corresponding to these diagnoses were identified by an expert team of medical coders at Premier, Inc. Patients who were admitted prior to 2022 were also included if they were discharged in 2022.
The data were further categorized into three types of variables: (1) outcome variables; (2) patient characteristics; and (3) facility characteristics. Outcome variables include coding specificity of principal diagnosis of dementia codes and coding specificity of secondary diagnosis of dementia codes. Principal diagnosis specificity denotes whether the ICD-10 dementia-related principal diagnosis code was specified (versus unspecified), and for secondary diagnoses, a specified diagnosis is assumed when at least one secondary diagnosis related to dementia was specified. In addition to masked patient IDs, patient characteristics for this study include the following: age group; sex; race; length of stay; primary payor; point of origin; discharge status; number of procedure codes; ICD-10 coding period (2022 for coding prior to 1 October 2022 and 2023 for codes from 1 October 2022); five Centers for Disease Control and Prevention’s Agency for Toxic Substances and Disease Registry’s (ATSDR) social vulnerability indices [33]; a COVID-19 indicator; and Medicare Severity Diagnosis Related Group (MS-DRG) type for the inpatient stay. In addition to masked facility IDs, facility characteristics include the following: three facility status variables (teaching, academic, and urban); ownership; size (bed count, grouped); case mix index (CMI); and U.S. state.

2.2. Statistical Analysis

Descriptive statistics were calculated and tabulated. Variables for which certain subgroups had limited representation (e.g., charity and indigent payors) were grouped together. Patients under 45 years old were grouped together due to their low counts. Discharge status codes indicating that the patient expired were collapsed into a single category. A diverse set of categories representing patients’ points of origin with low counts were grouped into a single ‘other’ category.
Univariate and multivariate logistic regression analyses were utilized to identify associations between patient- and facility-level characteristics and each of the two outcomes (specificity of dementia principal and secondary diagnoses per patient hospitalization). Univariate and adjusted odds ratios (ORs), as well as corresponding 95% confidence intervals (Cls) and p-values, were computed and tabulated. Receiver operating characteristic (ROC) curves were calculated and depicted, and area under the curve (AUC) values were extracted to demonstrate the multivariate models’ fitted performances to explain principal and secondary dementia diagnoses.
Clustering of this metric is demonstrated at the facility level, though other clustering factors are possible. Importantly, as opposed to variables used for constructing the patient-specific metric, clustering variables do not need to be observable for the full sample. A facility-specific metric of diagnostic coding specificity was also calculated from the risk-adjusted probabilities of specificity. Let Y i , j be the binary variable denoting coding specificity of the principal or secondary diagnosis for hospitalization i at facility j. This variable follows a Bernoulli (Ber) distribution with estimated probability p ^ i , j as shown below:
Y i , j   ~   B e r p ^ i , j .
The set p ^ i , j was estimated from the multivariate logistic regression model which was adjusted for patient and facility characteristics. Assuming that each hospitalization’s coding specificity was independently, though not identically, distributed per facility, the total count of facility-specific coding specificity follows a Poisson binomial (PoiBin) distribution with probability vector p ^ j = p ^ 1 , j , p ^ 2 , j , , p ^ n j , j for n j hospitalizations in facility j as shown as follows:
i = 1 n j Y i , j   ~   P o i B i n   p ^ j
Facility-specific 95% CIs were extracted through the Poisson binomial facility-specific cumulative distribution functions (CDFs). These were used to identify facilities which under- (p < 0.025) and over- (p > 0.975) specified in their coding versus facilities’ peers using the estimated CDF for the specificity count.
Error bars were constructed to demonstrate the facility-specific metric for a sample of 20 facilities for both dementia principal and secondary diagnoses. Among these facilities, the coding specificity of dementia diagnosis indicator variable was defined, and an observed count (dots) was plotted for each facility (X-axis). A 95% CI for each facility, built on the basis of the Poisson binomial model, was added to identify these facilities’ adjusted levels of coding specificity against peers. Over- and under-coding risk-adjusted specificity practices were then identified by the facility.
Finally, geospatial U.S. maps were created to display adjusted ORs of principal and secondary diagnosis coding specificity by state against the reference of New York, which is the state with the highest per capita healthcare expenditure in the U.S. [34].

3. Results

Table 1 provides a summary of the descriptive statistics for N = 487,775 hospitalization records and patients, since each patient is only observed once due to the cohort definition (the first hospitalization for each patient within the year). The dataset comprised observations from 866 facilities, with an average of 563.25 patients per facility. The distribution of age among this dementia patient cohort is naturally skewed, with 61% of individuals being 80 years and older. Females constituted 58% of the patients, and the majority of the patients identified as White (76%). The median length of stay, which was log-transformed due to its large right skewness, was 5 days, and the most common primary payor was Medicare traditional (53%). The point of origin was predominantly non-healthcare facilities (79%), and the discharge status varied, with 19% of the patients being discharged to home or self-care, while the majority were transferred to other healthcare facilities, often skilled nursing facilities (36%). The average number of procedures during inpatient stays was 2.7, with surgical MS-DRGs representing 15% of hospitalizations. Additionally, 13% were COVID-19-positive patients. Most of the facilities were non-teaching (78%) and non-academic (85%). Urban facilities were more prevalent (86%) than rural ones. Voluntary non-profit private was the most common ownership status (64%). The bed capacity varied, with 1–50 beds (3%) and >400 beds (39%) being the least and most common facility sizes, respectively. The mean case mix index was 1.7. The dataset represented multiple states, with New York (9%) and Florida (12%) being the top states in the number of hospitalizations.
Table 2 contains the adjusted ORs, 95% CIs, and p-values for the univariate and multivariate logistic regression analyses for modeling the coding specificity of dementia-related principal diagnoses. Younger patients were generally associated with higher odds of coding specificity than patients in the oldest age group (85+). Males experienced 45% higher odds of dementia-related principal diagnosis coding specificity than females (OR = 1.454; 95% CI: 1.301–1.625). Race was generally non-significant, except for Black patients, who experienced significantly higher odds of principal diagnosis coding specificity than White patients (OR = 1.237; 95% CI: 1.058–1.446). The log-length of stay was significant, with longer stays associated with higher odds of coding specificity (OR = 1.124; 95% CI: 1.060–1.191), but primary payor and point of origin were generally not significant. Patients with certain discharge statuses experienced significantly higher odds of coding specificity than those discharged to home or self-care, namely patients discharged to hospice homes, hospice medical facilities, or psychiatric hospitals (OR ≥ 1.354). The number of procedures was also significant, with each additional procedure performed associated with 23% increased odds of coding specificity (OR = 1.230; 95% CI: 1.179–1.283). The CMS fiscal year was highly significant, indicating 73.6% lower odds of specificity for 2023 (discharges occurring between 1 October and 31 December 2022) compared to 2022 (discharges between 1 January and 30 September 2022) (OR = 0.264; 95% CI: 0.224–0.311). Social vulnerability indices were not significant at the multivariate level, though some were significant univariately, indicating that some of the information content may be present in other patient characteristics. COVID-19 status and MS-DRG type were not statistically significant, except for at the univariate level, at which the latter showed surgical MS-DRGs associated with increased odds of specificity. At the facility level, patients in facilities whose teaching status was not available experienced 65.1% lower odds of specificity than those in non-teaching facilities (OR = 0.349; 95% CI: 0.142–0.856). Neither academic nor rural/urban status showed significant variability at the multivariate level. Most ownership categories were not significantly different from the voluntary non-profit private reference, except for other non-profit voluntary (OR = 0.605; 95% CI: 0.442–0.827) and local government (OR = 2.104; 95% CI: 1.401–3.159). Patients from facilities with bed counts lower than the reference category (>400) experienced lower odds of coding specificity, though only three categories were statistically significant. The case mix index was significant, with each unit increase accompanied by 57.9% increased odds of dementia-related principal diagnosis coding specificity (OR = 1.579; 95% CI: 1.188–2.100). Finally, most states demonstrated no statistically significant differences in principal diagnosis coding specificity compared to New York, with the exception of Hawaii, Louisiana, Minnesota, Oregon, Pennsylvania, and Virginia (which had a higher odds) as well as Illinois and Tennessee (which had lower odds of coding specificity).
Table 3 reports the univariate and multivariate logistic regression results (ORs, 95% CIs, and p-values) for the specificity of secondary dementia diagnoses’ outcome. For the multivariate results, all age groups experienced higher odds of specificity of dementia secondary diagnoses than the reference group of ages 85+ (OR ≥ 1.316; p < 0.001). Male patients had significantly higher odds of dementia secondary diagnosis specificity compared to females (OR = 1.224, 95% CI: 1.209–1.239; p < 0.001). Individuals identifying as Black were associated with lower odds of dementia secondary diagnosis specificity (OR = 0.955; 95% CI: 0.937–0.973) compared to White patients, while the opposite was found for those identifying as other races (OR = 1.069; 95% CI: 1.040–1.099). For some categories, primary payor, patient origin, and discharge status also showed significant associations with dementia secondary diagnosis coding specificity (see Table 3). Length of stay (in log terms) was also associated with higher odds of dementia secondary diagnosis specificity (OR = 1.017; 95% CI: 1.008–1.025). Those undergoing a larger number of procedures experienced higher odds of dementia secondary diagnosis specificity (OR = 1.039; 95% CI: 1.036–1.042). The CMS fiscal year was not substantially different, with those who were hospitalized in the new 2023 fiscal year experiencing 1.4% higher odds of specificity (OR = 1.014; 95% CI: 1.000–1.028). Patient socioeconomic (OR = 0.829; 95% CI: 0.717–0.958) and racial/ethnic minority (OR = 1.09; 95% CI: 1.03–1.154) statuses within the social vulnerability indices were significantly associated with decreased and increased, respectively, odds of dementia secondary diagnosis specificity. COVID-19-positive patients were associated with lower odds of dementia secondary diagnosis specificity (OR = 0.948; 95% CI: 0.930–0.965). Patients undergoing a surgical MS-DRG experienced 14% lower odds of dementia secondary diagnosis specificity compared to those undergoing a medical MS-DRG (OR = 0.859; 95% CI: 0.844–0.875). Academic facilities demonstrated higher odds of dementia secondary diagnosis specificity (OR = 1.052; 95% CI: 1.020–1.085), whereas those in rural settings experienced lower odds of dementia secondary diagnosis specificity (OR = 0.976; 95% CI: 0.955–0.997). Patients at facilities of different ownership types also experienced differing odds of dementia secondary diagnosis specificity (see Table 3). Lower bed counts were generally associated with lower odds of dementia secondary diagnosis specificity (OR ≤ 0.954) than those in the largest cluster of hospitals (>400 beds), with the exception of facilities with 51–100 beds and those with 351–400 beds. Substantial differences in the odds of dementia secondary diagnosis specificity were found by state when compared to the reference state of New York.
Figure 1 panel (a) shows the ROC curve for the multivariate model of the coding specificity of a principal diagnosis related to dementia. The estimated AUC was 0.7269, representing the good reliability of the multivariate model in assessing the coding specificity of dementia-related principal diagnoses. Panel (b) shows the ROC curve corresponding to the multivariate logistic regression analysis for assessing the coding specificity of secondary dementia diagnoses. The corresponding AUC was 0.5919, demonstrating a worse model performance when compared to that of the model assessing the coding specificity of primary dementia diagnoses.
Figure 2 represents a subset of the facilities’ observed dementia-related principal diagnosis coding specificity (a) and secondary diagnosis coding specificity (b) relative to industry standards. The p-values (and the 95% CIs, which are represented as error bars) from the estimated Poisson binomial distribution are used so that under-specificity versus peers (p < 0.025) is represented in blue; specificity in line with peers (0.025 ≤ p ≤ 0.975) is represented in black; and over-specificity versus peers (p > 0.975) is represented in orange.
Figure 3 represents the adjusted ORs for the coding specificity of a dementia-related principal diagnosis (a) and secondary diagnosis (b). All of the adjusted ORs are represented against New York as the reference state. Only a few states demonstrate statistically different adjusted odds of coding specificity of a dementia-related principal diagnosis versus New York, while a larger amount of variability is observed for states’ secondary diagnosis coding specificity. The gray states had non-significant adjusted ORs of coding specificity.

4. Discussion

The literature on diagnostic coding specificity remains scarce, with healthcare facilities and practitioners limited in their ability to self-evaluate against healthcare industry standards of practice. It is also unclear whether non-clinical characteristics can explain variability in specificity practices. To address this gap, a novel approach was demonstrated to evaluate facility-specific practices for the dementia-related coding specificity of principal and secondary diagnoses upon making risk adjustments for commonly available patient and facility characteristics. A logistic regression was applied to make risk adjustments to the probability of receiving a specified dementia diagnosis. The statistical output is used in a two-step approach, building on a Poisson binomial model, to evaluate the performance of healthcare facilities in providing specified dementia-related principal, or at least one secondary, diagnoses. This metric can be used to identify facilities that perform differently (under- or over-specifying) compared to their healthcare industry peers and can provide an objective standard against which the coding specificity practices of facilities can be evaluated. These findings offer valuable insights for healthcare stakeholders and quality-control personnel, facilitating the identification of facilities that may benefit from targeted interventions to enhance the levels of specificity of dementia-related diagnosis coding.
Our results indicate that the coding specificity of dementia diagnoses is associated with a range of patient and facility characteristics, particularly for primary diagnoses, as demonstrated through a higher AUC value. Younger patients were generally associated with a higher odds of coding specificity for dementia-related principal and secondary diagnoses. While dementia has been found to be more easily identifiable among older patients [35], our findings indicate that, conditional on a dementia diagnosis, the odds of coding specificity are higher among younger patients. However, it is unclear whether there is a clinical association between the prevalence of specified cases of dementia and age, particularly when comparing age groups with those at least 85 years old.
Prior studies have found that the prevalence of types of dementia is different by sex [36], which could also be due to environmental and behavioral differences according to sex. Males had approximately 22% (secondary) and 45% (principal) higher odds of dementia diagnosis specificity compared to females, though this could be confounded with age. Black patients demonstrated a significantly higher odds of principal diagnosis coding specificity than White patients. However, the reverse is observed for secondary diagnosis specificity. In both cases, there could be confounders due to collinearity with other factors, including social vulnerability indices. Patients have been shown to experience differences in the prevalence of dementia and its associated symptoms and severity by race [37], which could potentially have an association with the ability of doctors to provide a specified dementia diagnosis.
The significant association between longer hospital stays and higher odds of both principal and secondary coding specificity could be due to the additional inpatient time which allows for more comprehensive evaluations, diagnoses, and documentation. Patients discharged to specific destinations, such as hospice homes, hospice medical facilities, or psychiatric hospitals, exhibited significantly higher odds of principal and secondary diagnosis specificity. This could be related to the severity of their case or their prior history, which could, in turn, be associated with a potentially more accurate clinical diagnosis. Patients undergoing more procedures had higher odds of receiving a specified principal or secondary diagnosis. Though the cause of this association is unclear, this could be related to there being more resources allocated for identifying a patient’s disease when procedures are necessary during their inpatient stay. While a COVID-19 diagnosis was not associated with differing odds of principal diagnosis specificity, it was associated with lower odds of secondary diagnosis specificity. However, it is unclear whether the association between the severity of patients’ COVID-19 symptoms and age could be a confounder [38].
While the differences by CMS fiscal year in secondary diagnosis specificity were minor and are probably clinically irrelevant, the differences were more substantial among those with a dementia primary diagnosis. However, this could be due to seasonal confounders. The new fiscal year, denoted as 2023, was only measured in the October–December 2022 period, which may also be a period with seasonally over-burdened hospitals and less time for healthcare personnel to perform more in-depth diagnoses of patients.
From a payor perspective, none of the payor types were associated with differing odds of principal diagnosis specificity when compared to that of Medicare traditional. This is encouraging, as it indicates that principal diagnosis specificity may not be attributable to healthcare payor type. However, the substantial differences in the univariate results indicate that some complex associations may be embedded, though this is unclear, since the patient mix would not be homogeneous across payor types. For example, age could be acting as a proxy for Medicare status. Also, some differences were found when assessing odds of secondary diagnosis specificity. Some of these differences could be due to other patient characteristics. For example, those receiving Medicare traditional may be in widely different age groups than those for whom the payor comes from a direct employer contract or who receives workers’ compensation. Thus, health insurance coverage may be substantially different across patients, leading to the different propensities of patients to seek hospitalization [39].
Additionally, the ownership status of the facilities displayed some significant differences, with local government-owned facilities showing notably higher odds of principal and secondary diagnosis specificity. Again, the non-clinical patient characteristics by facility and facility ownership could differ widely. The case mix index of the hospital was significantly, positively associated with the specificity of principal diagnosis, indicating that the overall complexity of patients’ needs in a facility is related to higher degrees of specificity provided during a hospitalization. However, no significant association of specificity and the facility case mix index was found when the dementia diagnosis was secondary during the inpatient stay. Substantial differences were also found by state, particularly for secondary diagnoses. These differences could stem from the population mix or could be related to a substantially larger sample size for this analysis. Differences in health care provision by state across multiple metrics, such as care setting and type of disease/clinical area, have been documented [40]. However, we cannot link the coding specificity with the quality of care directly, since a low quality of care can occur when there are low levels of specificity state-wide but also when there are high levels of specificity and such excessive level of specificity is not clinically warranted.
These variations in coding practices demonstrate the potential influence of organizational characteristics or state-wide standards of practice on coding specificity. State-level variations may be attributed to regional variations in healthcare infrastructure, regulatory frameworks, insurance-related expectations/requirements, or coding practices. Also, there is state clustering of hospitals with a common health system, which may share a coding department and/or coding standards. However, they could also be influenced by the patient mix and other correlated factors in these states, given the socioeconomic, racial, and age differences across states, which may reflect the underlying reasons for non-idiosyncratic specificity disparities [41].
Providing high levels of coding specificity, when possible and appropriate, supports the accuracy and completeness of health records for patients, potentially enhancing their subsequent health outcomes. However, high coding standards require both time and educational/training resources for coders to conduct efficient and consistent coding practices that are current and accurate. Unspecified diagnoses may sometimes be a consequence of insufficient knowledge about all possible ICD-10 codes available related to a condition. Over-specified diagnoses may be a consequence of miscoding. Therefore, there is a tradeoff between the cost of specificity-related accuracy (oftentimes paid by the provider) and the cost of specificity-related inaccuracy (oftentimes a burden for the payor and the patient). Our approach demonstrates that facilities with dementia-related hospitalizations can be compared against a common/industry standard in a risk-adjusted form, so that facilities over- or under-specifying can be identified and their coding standards of practice can be adjusted, when needed.
While the proposed approach is demonstrated with an example of clustering at the facility level, for which full information is available for all patients, clustering by other factors is also possible. For example, clustering by zip code can allow for geospatial analyses of coding specificity. Also, clustering factors do not need to be available for all observations, allowing for more flexible analyses. For example, some hospitals may collect information about patients or systems that other hospitals do not collect. Clustering analyses are possible in such instances, and it is one of the core advantages of the two-step approach of performing patient-level analyses and subsequently clustering by any desired factor.
Our findings emphasize the association of multiple patient and facility characteristics with coding specificity. The relative significance of the evaluated variables in explaining the variability in coding specificity further underscores the importance of risk-adjusted performance metrics when comparing healthcare outcomes and facility performances.

Strengths and Limitations

A large comprehensive dataset with nearly 488,000 patient observations related to dementia was used for this study, which represents, to our knowledge, the largest dementia-related study approaching the topic of diagnostic coding specificity. Developing and utilizing the proposed risk-adjusted metric allows for a fair assessment of coding specificity among healthcare facilities while producing an extrapolatable approach that allows for the incorporation of any available information about patient hospitalizations.
Though the dataset contains the most recently completed year (2022), it only encompasses a single year of discharges, yielding temporal limitations since coding policies and practices can be updated yearly. However, due to these potential dynamics, it is important to have a recent dataset that reflects current practices. The demonstrated method, however, can be applied on a rolling basis, so that facilities can assess their practices over time and evaluate any adjustments made along the way.
While the dataset comprises a large portion of U.S. hospitalizations, there could be data imbalances by state or other factors not considered in the study. This may affect our ability to measure associations with some variables with low counts, such as some of the states. However, this would not affect our results as long as the data imbalances are not directly related to the coding specificity. Also, the cohort definition includes only a subset of dementia-related codes (F ICD-10 diagnosis codes). A more expansive cohort definition is possible, but it would not affect the approach taken, since the cohort definition is common across facilities.
We utilize administrative claims data for explaining a substantial portion of the coding specificity variability in healthcare facilities. While this is insufficient to explain the full variability of diagnostic coding specificity, it is noteworthy that this explanatory power was achieved with minimal access to patients’ clinical characteristics, such as those provided in EHRs, many of which are not commonly available in claims data. This indicates that the model provides a baseline from which substantial improvements are possible if additional information is available, such as the granularity and clinical details found in EHRs. However, by making EHRs an optional input, our model gains generalizability, since there is no need for a clinical metric against which to measure the ‘correctness’ of the degree of coding specificity. Thus, while such clinical metric would be ideal, it is also unfeasible. Therefore, our approach should only be used as a metric to compare against industry standards and averages or against aspirational peer facilities.
Our approach assumes that patients are provided homogeneous treatments within facilities conditional on the set of variables used in the multivariate logistic regression. However, this assumption could be relaxed by introducing additional clustering factors/variables, such as the physicians within facilities, which may explain additional sources of coding specificity variability. The assumption of independence across hospitalizations could also be questionable, since there will be a substantial number of unmeasured factors that could contribute to a lack of independence (e.g., how busy the facilities were during the hospitalizations, who provided treatment, what the commonalities of the unmeasured clinical components across patients were, etc.). However, the model provides an initial metric to flag facilities with the potential for non-standard specificity practices, which can then be investigated more thoroughly by quality-control personnel.
The inclusion of random effects in the model was first considered across a range of facility-level characteristics, particularly the facility identifier. However, for the purpose of this study, we did not include random effects for multiple reasons as follows: (1) Computational complexity—for example, facility-specific random effects added hundreds of random effects in this particular dataset and potentially thousands or tens of thousands for other cohorts, leading to memory limitations. The proposed approach still required nearly 8 Gb RAM. Additionally, if even larger computational resources are needed, then the ability of quality-control personnel to use this approach could be substantially limited; (2) The reduced level of extrapolatability for even more complex or larger datasets, as administrative data may contain few observations or just one observation per facility, particularly if the tool is used for ‘live’ monitoring purposes; (3) Assumptions behind a random effects approach would be highly questionable, since the random effects would likely be correlated with some of the patient-level characteristics; and (4) While random effects and other modeling enhancements (e.g., different machine learning approaches or semiparametric models with spline components for some of the continuous variables, such as the log-length of stay) could have been considered for variables with lower numbers of categories, the purpose of our approach is not to find the optimal model for a particular cohort/year or set of variables. Instead, this manuscript aims to demonstrate the methodology and utility of administrative information in explaining diagnostic coding specificity variability among patients diagnosed with dementia. The purpose of the two-step approach (first at the patient visit level and then aggregated at any level) is to also provide tools that can be used in different forms, both in a disaggregated form for patient visit monitoring and in an aggregated form for facility monitoring.
The presence of multicollinearity among risk adjustment factors can complicate coefficient interpretation. Alternative approaches that map the information content to smaller sets of uncorrelated factors may be viable to reduce variance inflation, though they would be highly complex to construct due to the mostly categorical structure of the explanatory variable set. Such alternatives could also reduce interpretability. Multicollinearity, however, does not impact the main outcome of this manuscript, which is the estimation of a probability metric for coding specificity at the hospitalization level and a subsequent facility-level aggregation to measure facilities against healthcare industry standards. The goodness of fit or model use for prediction are not affected by collinearity, which allows for wide arrays of explanatory variables to be combined, regardless of potential information overlap in these variables. Thus, the focus of this manuscript is the metrics at the hospitalization and facility levels and their utility in identifying hospitalizations and facilities whose outcomes may substantially differ from industry practices, rather than the specific associations between explanatory variables and outcomes.
Finally, some quantitative data were provided in grouped categories for confidentiality purposes (e.g., age and bed size), and additional variables were not included to maintain the confidentiality of the records. This additional granularity and information could prove to enhance model outcomes within healthcare facility settings.

5. Conclusions

Medical coding is a very important component of healthcare systems, with an extensive impact on patient care quality, reimbursement, and system reliability. An understudied aspect of coding accuracy relates to coding specificity to the highest precision clinically possible. Our study focused on dementia coding specificity in the U.S. and demonstrates that a large number of readily available patient- and facility-level characteristics can be used to make risk adjustments to the odds of coding specificity and thus provide a standardized metric against which facilities can compare their coding specificity practices and standards. This study provides healthcare facilities with a valuable tool to enhance and assess variations in coding specificity, thus contributing to improved healthcare system reliability and financial efficiency as well as improved patient care in an era when accuracy and precision are of the utmost importance. The method demonstrated in this manuscript fills a significant gap in the literature, and its adaptability across patient cohorts, health conditions, and clusters of healthcare provision makes it a valuable tool for quality control and performance assessment. Our results indicate that the variability in the coding specificity of principal diagnoses of dementia can be better explained than the variability in the specificity of secondary diagnoses of dementia. This study addresses a critical need by making risk adjustments for factors that influence coding practices, ultimately contributing to our understanding of coding specificity disparities.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/healthcare12100983/s1, Table S1: List of specified dementia ICD-10 codes and corresponding descriptions; Table S2: List of unspecified dementia ICD-10 codes and corresponding descriptions.

Author Contributions

Conceptualization, J.M., M.K. and L.H.G.; methodology, L.H.G.; formal analysis, S.P., K.R., J.S., I.R.C., D.P.G., M.R.P., S.R.T. and L.H.G.; data curation, M.K.; writing—original draft preparation, K.R., J.S., S.P., I.R.C., D.P.G., M.R.P., S.R.T. and L.H.G.; writing—review and editing, K.R., J.S., S.P., I.R.C., D.P.G., M.R.P., S.R.T., J.M., M.K. and L.H.G.; visualization, S.P., K.R., J.S., I.R.C., D.P.G., M.R.P., S.R.T. and L.H.G.; supervision, L.H.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were exempt by the UNC Charlotte Institutional Review Board due to use of existing, de-identified secondary data.

Informed Consent Statement

Data were de-identified and provided by Premier, Inc. No informed consent was needed.

Data Availability Statement

Data were provided by Premier, Inc. and can be requested via https://www.pinc-ai.com/.

Conflicts of Interest

Michael Korvink and John Martin work for Premier, Inc. and own stock in the company. All other authors declare no conflicts of interest.

References

  1. Adane, K.; Gizachew, M.; Kendie, S. The role of medical data in efficient patient care delivery: A review. Risk Manag. Healthc. Policy 2019, 12, 67–73. [Google Scholar] [CrossRef] [PubMed]
  2. Tang, K.L.; Lucyk, K.; Quan, H. Coder perspectives on physician-related barriers to producing high-quality administrative data: A qualitative study. CMAJ Open 2017, 5, E617–E622. [Google Scholar] [CrossRef] [PubMed]
  3. Centers for Disease Control and Prevention (CDC). International Classification of Diseases, (ICD-10-CM/PCS) Transition—Background. CDC. November 2015. Available online: https://www.cdc.gov/nchs/icd/icd10cm_pcs_background.htm (accessed on 23 October 2023).
  4. Asadi, F.; Hosseini, M.A.; Almasi, S. Reliability of trauma coding with ICD-10. Chin. J. Traumatol. 2022, 25, 102–106. [Google Scholar] [CrossRef] [PubMed]
  5. Wu, J.T.; Leung, K.; Lam, T.T.Y. Nowcasting epidemics of novel pathogens: Lessons from COVID-19. Nat. Med. 2021, 27, 388–395. [Google Scholar] [CrossRef] [PubMed]
  6. World Health Organization (WHO). International Statistical Classification of Diseases and Related Health Problems, 10th Revision (ICD-10). WHO. September 2012. Available online: https://www.cdc.gov/nchs/data/icd/icdinformationsheet.pdf (accessed on 23 October 2023).
  7. Hsia, D.C.; Krushat, W.M.; Fagan, A.B.; Tebbutt, J.A.; Kusserow, R.P. Accuracy of diagnostic coding for Medicare patients under the prospective payment system. N. Engl. J. Med. 1988, 318, 352–355. [Google Scholar] [CrossRef] [PubMed]
  8. Zegan, J.; Improving Specificity in ICD-10 Diagnosis Coding. American Health Information Management Association. Available online: https://library.ahima.org/doc?oid=302473 (accessed on 23 October 2023).
  9. Boyd, A.D.; Li, J.J.; Burton, M.D.; Jonen, M.; Gardeux, V.; Achour, I.; Luo, R.Q.; Zenku, I.; Bahroos, N.; Brown, S.B.; et al. The discriminatory cost of ICD-10-CM transition between clinical specialties: Metrics, case study, and mitigating tools. J. Am. Med. Inform. Assoc. 2013, 20, 708–717. [Google Scholar] [CrossRef] [PubMed]
  10. Grasso, M.A.; Dezman, Z.D.W.; Jerrard, D.A. Coding disparity and specificity during emergency department visits after transitioning to the tenth version of the International Classification of Diseases. AMIA Annu. Symp. Proc. 2022, 2022, 495–501. [Google Scholar] [CrossRef]
  11. Centers for Medicare and Medicaid Services, Department of Health and Human Services. ICD-10-CM Official Guidelines for Coding and Reporting FY 2017. Available online: https://www.cms.gov/medicare/coding/icd10/downloads/2017-icd-10-cm-guidelines.pdf (accessed on 23 October 2023).
  12. Horsky, J.; Drucker, E.A.; Ramelson, H.Z. Accuracy and completeness of clinical coding using ICD-10 for ambulatory visits. AMIA Annu. Symp. Proc. 2017, 2018, 912–920. [Google Scholar]
  13. Centers for Medicare and Medicaid Services (CMS). Overall Star Rating for Hospitals. CMS. Available online: https://www.medicare.gov/care-compare/resources/hospital/overall-star-rating (accessed on 23 October 2023).
  14. Centers for Medicare and Medicaid Services (CMS). Hospital Value-Based Purchasing Program. Available online: https://www.cms.gov/Medicare/Quality-Initiatives-Patient-Assessment-Instruments/HospitalQualityInits/Hospital-Value-Based-Purchasing (accessed on 30 November 2023).
  15. Centers for Medicare and Medicaid Services (CMS). Hospital Readmissions Reduction Program (HRRP). Available online: https://www.cms.gov/Medicare/Medicare-Fee-for-Service-Payment/AcuteInpatientPPS/Readmissions-Reduction-Program (accessed on 30 November 2023).
  16. Centers for Medicare and Medicaid Services (CMS). Hospital-Acquired Conditions. Available online: https://www.cms.gov/Medicare/Medicare-Fee-for-Service-Payment/HospitalAcqCond/Hospital-Acquired_Conditions (accessed on 30 November 2023).
  17. Austin, J.M.; D’Andrea, G.; Birkmeyer, J.D.; Leape, L.L.; Milstein, A.; Pronovost, P.J.; Romano, P.S.; Singer, S.J.; Vogus, T.J.; Wachter, R.M. Safety in Numbers: The Development of Leapfrog’s Composite Patient Safety Score for U.S. Hospitals. J. Patient Saf. 2014, 10, 64–71. [Google Scholar] [CrossRef]
  18. Fortune and PINC AI. The 2023 Fortune/PINC AI 100 Top Hospitals. Available online: https://fortune.com/article/100-top-hospitals-2023-pinc-ai (accessed on 30 November 2023).
  19. Torres, J.M.; Hessler-Jones, D.; Yarbrough, C.; Tapley, A.; Jimenez, R.; Gottlieb, L.M. An online experiment to assess bias in professional medical coding. BMC Med. Inform. Decis. Mak. 2019, 19, 115. [Google Scholar] [CrossRef]
  20. Department of Health and Human Services. Information and Resources for Submitting Correct ICD-10 Codes to Medicare. MLN Matters 2014. Available online: https://www.hhs.gov/guidance/sites/default/files/hhs-guidance-documents/SE1518.pdf (accessed on 23 October 2023).
  21. American Hospital Association (AHA). Using the X-ray Report for Specificity; AHA Coding Clinic for ICD-10-CM and ICD-10-PCS (First Quarter 2013), 28; AHA Central Office: Chicago, IL, USA, 2013. [Google Scholar]
  22. American Hospital Association (AHA). Use of Imaging Reports for Greater Specificity; AHA Coding Clinic for ICD-10-CM and ICD-10-PCS (Third Quarter 2014), 5; AHA Central Office: Chicago, IL, USA, 2014. [Google Scholar]
  23. American Hospital Association (AHA). Use of X-ray to Determine Site of Pain; AHA Coding Clinic for ICD-10-CM and ICD-10-PCS (Fourth Quarter 2016), 143; AHA Central Office: Chicago, IL, USA, 2016. [Google Scholar]
  24. Alzheimer’s Disease International. Dementia Statistics. Available online: https://www.alzint.org/about/dementia-facts-figures/dementia-statistics (accessed on 23 October 2023).
  25. Arvanitakis, Z.; Shah, R.C.; Bennett, D.A. Diagnosis and management of dementia: Review. JAMA 2019, 322, 1589–1599. [Google Scholar] [CrossRef] [PubMed]
  26. Butler, D.; Kowall, N.W.; Lawler, E.; Gaziano, J.M.; Driver, J.A. Underuse of diagnostic codes for specific dementias in the Veterans Affairs New England healthcare system. J. Am. Geriatr. Soc. 2012, 60, 910–915. [Google Scholar] [CrossRef] [PubMed]
  27. Drabo, E.F.; Barthold, D.; Joyce, G.; Ferido, P.; Chang, C.H.; Zissimopoulos, J. Longitudinal analysis of dementia diagnosis and specialty care among racially diverse Medicare beneficiaries. Alzheimers Dement. 2019, 15, 1402–1411. [Google Scholar] [CrossRef] [PubMed]
  28. Fujiyoshi, A.; Jacobs, D.R., Jr.; Alonso, A.; Luchsinger, J.A.; Rapp, S.R.; Duprez, D.A. Validity of death certificate and hospital discharge ICD codes for dementia diagnosis: The Multi Ethnic Study of Atherosclerosis. Alzheimer Dis. Assoc. Disord. 2017, 31, 168–172. [Google Scholar] [CrossRef] [PubMed]
  29. Rios, N.G.; Oldiges, P.E.; Lizano, M.S.; Daucet-Wadford, D.S.; Quick, D.L.; Martin, J.; Korvink, M.; Gunn, L.H. Modeling coding intensity of procedures in a U.S. population-based hip/knee arthroplasty inpatient cohort adjusting for patient- and facility-level characteristics. Healthcare 2022, 10, 1368. [Google Scholar] [CrossRef] [PubMed]
  30. Mishra, R.; Verman, H.; Aynala, V.B.; Arredondo, P.R.; Martin, J.; Korvink, M.; Gunn, L.H. Diagnostic coding intensity among a pneumonia inpatient cohort using a risk-adjustment model and claims data: A U.S. population-based study. Diagnostics 2022, 12, 1495. [Google Scholar] [CrossRef] [PubMed]
  31. Glass, A.; Melton, N.C.; Moore, C.; Myrick, K.; Thao, K.; Mogaji, S.; Howell, A.; Patton, K.; Martin, J.; Korvink, M.; et al. A novel method for assessing risk-adjusted diagnostic coding specificity for depression using a U.S. cohort of over one million patients. Diagnostics 2024, 14, 426. [Google Scholar] [CrossRef] [PubMed]
  32. PINC AI Applied Sciences. PINC AI Healthcare Database White Paper: Data That Informs and Performs; Premier Inc.: Charlotte, NC, USA, 2023; Available online: https://offers.premierinc.com/rs/381-NBB-525/images/PINC_AI_Healthcare_Data_White_Paper.pdf (accessed on 23 October 2023).
  33. Centers for Disease Control and Prevention (CDC), Agency for Toxic Substances and Disease Registry. CDC SVI Documentation 2020. Available online: https://www.atsdr.cdc.gov/placeandhealth/svi/documentation/SVI_documentation_2020.html (accessed on 10 November 2023).
  34. Centers for Medicare & Medicaid Services, Office of the Actuary, National Health Statistics Group. National Health Expenditure Data: Health Expenditures by State of Residence, August 2022. Available online: https://www.cms.gov/data-research/statistics-trends-and-reports/national-health-expenditure-data/state-residence (accessed on 10 November 2023).
  35. Bradford, A.; Kunik, M.E.; Schulz, P.; Williams, S.P.; Singh, H. Missed and delayed diagnosis of dementia in primary care: Prevalence and contributing factors. Alzheimer Dis. Assoc. Disord. 2009, 23, 306–314. [Google Scholar] [CrossRef] [PubMed]
  36. Podcasy, J.L.; Epperson, C.N. Considering sex and gender in Alzheimer disease and other dementias. Dialogues Clin. Neurosci. 2016, 18, 437–446. [Google Scholar] [CrossRef]
  37. Lennon, J.C.; Aita, S.L.; Bene, V.A.D.; Rhoads, T.; Resch, Z.J.; Eloi, J.M.; Walker, K.A. Black and White individuals differ in dementia prevalence, risk factors, and symptomatic presentation. Alzheimers Dement. 2022, 18, 1461–1471. [Google Scholar] [CrossRef]
  38. Herrera-Esposito, D.; de Los Campos, G. Age-specific rate of severe and critical SARS-CoV-2 infections estimated with multi-country seroprevalence studies. BMC Infect. Dis. 2022, 22, 311. [Google Scholar] [CrossRef] [PubMed]
  39. Zhao, Y.; Paschalidis, I.C.; Hu, J. The impact of payer status on hospital admissions: Evidence from an academic medical center. BMC Health Serv. Res. 2021, 21, 930. [Google Scholar] [CrossRef] [PubMed]
  40. Agency for Healthcare Research and Quality. Health Care Quality: How Does Your State Compare? Available online: https://www.ahrq.gov/data/infographics/state-compare-text.html (accessed on 14 November 2023).
  41. Aranda, M.P.; Kremer, I.N.; Hinton, L.; Zissimopoulos, J.; Whitmer, R.A.; Hummel, C.H.; Trejo, L.; Fabius, C. Impact of dementia: Health disparities, population trends, care interventions, and economic costs. J. Am. Geriatr. Soc. 2021, 69, 1774–1783. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Receiver operating characteristic (ROC) curve of the multivariate logistic regression model for the specificity of a dementia-related principal diagnosis (a) and secondary diagnosis (b).
Figure 1. Receiver operating characteristic (ROC) curve of the multivariate logistic regression model for the specificity of a dementia-related principal diagnosis (a) and secondary diagnosis (b).
Healthcare 12 00983 g001
Figure 2. Observed counts of indicators of principal diagnosis coding specificity (a) and secondary diagnosis coding specificity (b) for dementia diagnoses by facility (dots) and 95% confidence intervals based on the Poisson binomial metric (error bars), with colors denoting over-specificity (orange), under-specificity (blue), and specificity in line with peers (black).
Figure 2. Observed counts of indicators of principal diagnosis coding specificity (a) and secondary diagnosis coding specificity (b) for dementia diagnoses by facility (dots) and 95% confidence intervals based on the Poisson binomial metric (error bars), with colors denoting over-specificity (orange), under-specificity (blue), and specificity in line with peers (black).
Healthcare 12 00983 g002
Figure 3. Geographical U.S. map of adjusted odds ratios (ORs) of coding specificity of dementia-related principal (a) and secondary (b) diagnoses by state, with a reference state of New York. Odds ratios that were not statistically significant are shown in gray.
Figure 3. Geographical U.S. map of adjusted odds ratios (ORs) of coding specificity of dementia-related principal (a) and secondary (b) diagnoses by state, with a reference state of New York. Odds ratios that were not statistically significant are shown in gray.
Healthcare 12 00983 g003
Table 1. Descriptive statistics of the dementia-related principal and secondary diagnosis coding specificity outcomes as well as patient and facility characteristics (counts and means/proportions and corresponding percentages/standard deviations).
Table 1. Descriptive statistics of the dementia-related principal and secondary diagnosis coding specificity outcomes as well as patient and facility characteristics (counts and means/proportions and corresponding percentages/standard deviations).
Study VariablesCount or Mean/Proportion (% or Standard Deviation (SD))
Outcomes
Specificity of dementia principal diagnosis (count, proportion)1788 (17%)
Specificity of dementia secondary diagnoses (count, proportion)186,300 (39%)
Patient Characteristics
Age (Years)
    0–44809 (<1%)
    45–542735 (1%)
    55–595492 (1%)
    60–6413,512 (3%)
    65–6927,970 (6%)
    70–7453,037 (11%)
    75–7983,694 (17%)
    80–84103,805 (21%)
    85+196,721 (40%)
Sex
    Female282,090 (58%)
    Male205,685 (42%)
Race
    Asian12,539 (3%)
    Black68,784 (14%)
    Other26,675 (5%)
    Unable to determine10,463 (2%)
    White369,314 (76%)
Log(Length of Stay) (Days) (mean, SD)1.6 (0.85)
Primary Payor
    Charity/Indigent200 (<1%)
    Commercial indemnity4613 (1%)
    Direct employer contract217 (<1%)
    Managed care capitated417 (<1%)
    Managed care non-capitated9370 (2%)
    Medicaid managed care capitated 1028 (<1%)
    Medicaid managed care non-capitated7807 (2%)
    Medicaid traditional5512 (1%)
    Medicare managed care capitated20,202 (4%)
    Medicare managed care non-capitated165,874 (34%)
    Medicare traditional258,584 (53%)
    Other2713 (1%)
    Other government payors9097 (2%)
    Self-pay 1950 (<1%)
    Workers’ compensation191 (<1%)
Point of Origin
    Clinic16,400 (3%)
    Court/Law enforcement215 (<1%)
    Information not available3174 (1%)
    Non-healthcare facility 385,271 (79%)
    Other425 (<1%)
    Transfer from ambulatory surgery center 2767 (1%)
    Transfer from dept unit in same hospital, separate claim302 (<1%)
    Transfer from health facility 7337 (2%)
    Transfer from hospice and under hospice care131 (<1%)
    Transfer from hospital (different facility)27,141 (6%)
    Transfer from SNF 1 or ICF 244,612 (9%)
Discharge Status
    Acute inpatient readmission847 (<1%)
    Discharged to home health organization89,392 (18%)
    Discharged to home or self-care91,843 (19%)
    Discharged to hospice home25,625 (5%)
    Discharged to hospice medical facility24,993 (5%)
    Discharged/Transferred to another rehab facility15,425 (3%)
    Discharged/Transferred to cancer center/children’s hospital209 (<1%)
    Discharged/Transferred to court/law enforcement247 (<1%)
    Discharged/Transferred to critical access hospital71 (<1%)
    Discharged/Transferred to federal hospital279 (<1%)
    Discharged/Transferred to ICF 212,891 (3%)
    Discharged/Transferred to long-term care hospital5448 (1%)
    Discharged/Transferred to nursing facility1732 (<1%)
    Discharged/Transferred to other facility5270 (1%)
    Discharged/Transferred to other health institute not in list1391 (<1%)
    Discharged/Transferred to psychiatric hospital2341 (<1%)
    Discharged/Transferred to SNF 1177,780 (36%)
    Discharged/Transferred to swing bed1605 (<1%)
    Expired28,217 (6%)
    Information not available232 (<1%)
    Left against medical advice1901 (<1%)
    Still a patient—expected to return36 (<1%)
Count of Procedures (mean, SD)(2.7) 2.2
CMS 3 Fiscal Year
    2022359,803 (74%)
    2023127,972 (26%)
Social Vulnerability Indices (mean, SD)
    Household characteristics0.52% (0.25)
    Housing type and transportation0.62% (0.24)
    Overall 0.61% (0.25)
    Racial and ethnic minority status0.71% (0.23)
    Socioeconomic status0.56% (0.26)
COVID-19 Status
    Not identified425,762 (87%)
    Positive62,013 (13%)
MS-DRG 4 Type
    Medical412,512 (85%)
    Surgical75,263 (15%)
Facility Characteristics
Teaching Status
    No378,768 (78%)
    Not available6221 (1%)
    Yes102,786 (21%)
Academic Status
    No415,274 (85%)
    Yes72,501 (15%)
Rural/Urban Status
    Rural66,204 (14%)
    Urban421,571 (86%)
Ownership
    Government—federal1118 (<1%)
    Government—hospital district/authority32,417 (7%)
    Government—local11,798 (2%)
    Government—state3611 (1%)
    Not available2248 (<1%)
    Physician1032 (<1%)
    Proprietary25,459 (5%)
    Voluntary non-profit—church71,085 (15%)
    Voluntary non-profit—other 26,015 (5%)
    Voluntary non-profit—private312,992 (64%)
Bed Count
    1–5012,300 (3%)
    51–10025,881 (5%)
    101–15038,616 (8%)
    151–20032,908 (7%)
    201–25047,519 (10%)
    251–30050,418 (10%)
    301–35053,077 (11%)
    351–40039,018 (8%)
    >400188,038 (39%)
Case Mix Index (mean, SD)1.76 (0.26)
State Abbreviation
    AK253 (<1%)
    AL3966 (1%)
    AR3789 (1%)
    AZ12,029 (2%)
    CA27,281 (6%)
    CO3578 (1%)
    CT5982 (1%)
    DE1125 (<1%)
    FL60,713 (12%)
    GA7318 (2%)
    HI6038 (1%)
    IA4759 (1%)
    ID28 (<1%)
    IL18,082 (4%)
    IN6646 (1%)
    KS2856 (1%)
    KY10,899 (2%)
    LA3505 (1%)
    MA4549 (1%)
    MD7131 (1%)
    ME21 (<1%)
    MI22,526 (5%)
    MN3431 (1%)
    MO3382 (1%)
    MS6744 (1%)
    MT1374 (<1%)
    NC27,936 (6%)
    ND676 (<1%)
    NE1965 (<1%)
    NH57 (<1%)
    NJ9184 (2%)
    NM2360 (<1%)
    NV5537 (1%)
    NY45,302 (9%)
    OH22,769 (5%)
    OK7968 (2%)
    OR7750 (2%)
    PA24,500 (5%)
    RI15 (<1%)
    SC10,009 (2%)
    SD1194 (<1%)
    TN16,120 (3%)
    TX32,134 (7%)
    UT38 (<1%)
    VA16,695 (3%)
    VT163 (<1%)
    WA7833 (2%)
    WI9889 (2%)
    WV9472 (2%)
    WY204 (<1%)
1 SNF: Skilled nursing facility; 2 ICF: Intermediate care facility; 3 CMS: U.S. Centers for Medicare and Medicaid Services; 4 MS-DRG: Medicare Severity Diagnosis Related Group.
Table 2. Univariate and multivariate logistic regression results including odds ratios (ORs), corresponding 95% confidence intervals (CIs), and p-values for specificity of a dementia-related principal diagnosis.
Table 2. Univariate and multivariate logistic regression results including odds ratios (ORs), corresponding 95% confidence intervals (CIs), and p-values for specificity of a dementia-related principal diagnosis.
Univariate Analysis Multivariate Analysis
VariableOR95% CIp-ValueOR95% CIp-Value
Intercept---0.0300.016–0.057<0.001
Age (Ref: 85+)
    0–445.6982.111–15.3780.0012.5990.781–8.6460.119
    45–545.9763.679–9.706<0.0014.8452.731–8.596<0.001
    55–593.3372.276–4.894<0.0012.4271.577–3.735<0.001
    60–643.2812.546–4.227<0.0012.6011.930–3.507<0.001
    65–692.7222.254–3.288<0.0012.2661.843–2.786<0.001
    70–741.5831.330–1.884<0.0011.4831.232–1.785<0.001
    75–791.7261.482–2.009<0.0011.6631.414–1.955<0.001
    80–841.5561.338–1.809<0.0011.4841.266–1.740<0.001
Sex (Ref: Female)
    Male1.5911.436–1.763<0.0011.4541.301–1.625<0.001
Race (Ref: White)
    Asian1.4851.084–2.0350.0141.3080.907–1.8880.151
    Black1.3851.213–1.581<0.0011.2371.058–1.4460.008
    Other1.2150.986–1.4980.0680.9070.713–1.1540.428
    Unable to determine0.8110.570–1.1540.2440.8010.551–1.1660.247
Log(Length of Stay)1.2721.212–1.336<0.0011.1241.060–1.191<0.001
Primary Payor (Ref: Medicare traditional)
    Charity/Indigent1.3050.146–11.6940.8120.4620.038–5.5760.543
    Commercial indemnity1.3340.914–1.9480.1361.0540.694–1.6010.805
    Direct employer contract0.0000.000-Inf0.9430.0000.000-Inf0.989
    Managed care capitated0.8700.105–7.2380.8980.6040.062–5.9190.665
    Managed care non-capitated1.4491.107–1.8980.0071.1950.887–1.6100.240
    Medicaid managed care capitated2.9011.537–5.4770.0010.9240.436–1.9560.836
    Medicaid managed care non-capitated1.8751.350–2.604<0.0011.0830.738–1.5900.684
    Medicaid traditional2.4371.677–3.541<0.0011.2360.802–1.9070.337
    Medicare managed care capitated1.0960.829–1.4490.5210.9020.659–1.2350.520
    Medicare managed care non-capitated1.0910.972–1.2240.1401.0480.924–1.1900.463
    Other1.5550.852–2.8380.1501.3110.690–2.4920.409
    Other government payors1.7071.234–2.3620.0011.1290.779–1.6370.521
    Self-pay1.0930.531–2.2500.8090.7570.353–1.6240.475
    Workers’ compensation2.6110.236–28.8270.4341.4160.105–19.1170.793
Point of Origin (Ref: Non-healthcare facility)
    Clinic1.3390.996–1.7990.0531.3750.994–1.9020.055
    Court/Law enforcement1.7880.569–5.6230.3201.3940.411–4.7240.594
    Information not available3.6592.308–5.802<0.0013.9492.321–6.719<0.001
    Other1.6390.17–15.7690.6691.4730.141–15.3830.746
    Transfer from ambulatory surgery center0.8190.099–6.8120.8540.6880.080–5.9300.734
    Transfer from dept unit in same hospital, separate claim1.4140.868–2.3060.1641.4750.858–2.5360.160
    Transfer from health facility0.9220.556–1.5300.7531.0010.586–1.7090.997
    Transfer from hospice and under hospice program0.0000.000-Inf0.9460.0000.000-Inf0.984
    Transfer from hospital (different facility)1.3021.012–1.6770.0401.0260.776–1.3570.856
    Transfer from SNF 1 or ICF 21.0970.882–1.3640.4041.1330.893–1.4370.304
Discharge Status (Ref: Discharged to home or self-care)
    Acute inpatient readmission0.7160.213–2.4040.5880.8400.238–2.9610.786
    Discharged to home health organization1.1060.929–1.3160.2571.1440.948–1.3820.161
    Discharged to hospice home1.2020.921–1.5680.1761.3671.023–1.8280.035
    Discharged to hospice medical facility1.2320.937–1.6200.1351.3541.006–1.8230.046
    Discharged/Transferred to another rehab facility1.1310.785–1.6280.5101.2500.845–1.8480.264
    Discharged/Transferred to court/law enforcement15.7411.633–151.7800.0173.3750.293–38.8120.329
    Discharged/Transferred to federal hospital2.0990.460–10.8620.3772.0890.368–11.8730.406
    Discharged/Transferred to ICF 21.2180.913–1.6240.1801.1420.831–1.5700.414
    Discharged/Transferred to long-term care hospital0.5620.280–1.1290.1050.5190.244–1.1040.089
    Discharged/Transferred to nursing facility1.3310.772–2.2950.3031.1850.636–2.2090.594
    Discharged/Transferred to other facility1.0880.637–1.8570.7581.1320.641–2.0000.669
    Discharged/Transferred to other health institute not in list1.1990.552–2.6080.6461.2170.531–2.7900.642
    Discharged/Transferred to psychiatric hospital1.3441.013–1.7850.0411.4571.069–1.9860.017
    Discharged/Transferred to SNF 11.1270.979–1.2970.0961.1440.979–1.3380.091
    Discharged/Transferred to swing bed1.1660.251–5.4200.8451.2270.234–6.4400.809
    Expired1.2820.901–1.8240.1680.9910.672–1.4620.965
    Information not available0.0000.000-Inf0.9540.0000.000-Inf0.986
    Left against medical advice0.7770.367–1.6480.5110.8880.400–1.9690.770
    Still a patient—expected to return0.0000.000-Inf0.9620.0000.000-Inf0.989
Count of Procedures1.1451.109–1.183<0.0011.2301.179–1.283<0.001
CMS 3 Fiscal Year (Ref: 2022)
    20230.3360.290–0.388<0.0010.2640.224–0.311<0.001
Social Vulnerability Index
    Household characteristics1.5061.238–1.831<0.0011.6250.873–3.0250.126
    Housing type and transportation1.3291.080–1.6360.0071.3620.584–3.1770.474
    Overall1.3261.085–1.6210.0060.3610.037–3.5610.383
    Racial and ethnic minority status1.0470.830–1.3210.6980.8290.491–1.3980.482
    Socioeconomic status1.2651.045–1.5320.0162.0030.600–6.6850.259
COVID-19 Status (Ref: Not identified)
    Positive1.1300.927–1.3770.2270.9780.788–1.2150.842
MS-DRG 4 Type (Ref: Medical)
    Surgical2.0891.482–2.943<0.0011.1950.794–1.8000.393
Teaching Status (Ref: No)
    Not Available0.5450.249–1.1900.1270.3490.142–0.8560.021
    Yes1.3491.208–1.506<0.0011.0470.839–1.3070.685
Academic Status (Ref: No)
    Yes1.2701.124–1.435<0.0010.9000.701–1.1560.411
Rural/Urban Status (Ref: Urban)
    Rural0.9100.775–1.0680.2490.9950.803–1.2320.963
Ownership (Ref: Voluntary non-profit—private)
    Government—federal0.0000.000-Inf0.9340.0000.000-Inf0.981
    Government—hospital district/authority0.9450.763–1.1700.6020.8470.652–1.1010.214
    Government—local1.5911.166–2.1700.0032.1041.401–3.159<0.001
    Government—state0.7180.389–1.3230.2280.6410.304–1.3520.243
    Not available1.8710.774–4.5220.1641.7580.679–4.5530.245
    Physician2.2720.206–25.0820.5033.2290.267–39.0960.357
    Proprietary0.9430.762–1.1660.5860.9840.752–1.2890.908
    Voluntary non-profit—church0.7820.661–0.9250.0040.8690.711–1.0630.173
    Voluntary non-profit—other0.7230.545–0.9580.0240.6050.442–0.8270.002
Bed Count (Ref: >400)
    1–500.6050.403–0.9080.0150.6870.426–1.1070.123
    51–1000.5920.460–0.760<0.0010.6710.492–0.9160.012
    101–1500.7950.649–0.9750.0280.8420.652–1.0860.185
    151–2000.6560.514–0.8380.0010.7060.530–0.9390.017
    201–2500.6190.510–0.751<0.0010.8180.645–1.0370.097
    251–3000.6030.501–0.725<0.0010.7040.559–0.8860.003
    301–3500.7270.604–0.8760.0010.8650.688–1.0860.212
    351–4000.7540.608–0.9350.0100.9330.718–1.2110.601
Case Mix Index1.9121.574–2.322<0.0011.5791.188–2.1000.002
State Abbreviation (Ref: NY)
    AK2.7080.245–29.9750.4174.5690.382–54.8240.230
    AL1.5040.836–2.7070.7131.1870.625–2.2540.600
    AR0.5240.224–1.2240.1360.4130.163–1.0440.062
    AZ0.6330.313–1.2790.2030.5540.265–1.1570.116
    CA1.2160.931–1.5880.1521.2420.901–1.7140.186
    CO0.4780.146–1.5680.2230.4700.316–1.6210.232
    CT1.6580.916–3.0020.0951.7700.934–3.3520.080
    DE0.0000.000-Inf0.9520.0000.000-Inf0.966
    FL0.9280.748–1.1520.5000.8600.654–1.1300.279
    GA1.5110.943–2.4220.8600.8740.492–1.5560.648
    HI1.8841.050–3.3790.0342.2391.019–4.9200.045
    IA1.6250.944–2.7980.0801.2560.676–2.3360.471
    IL0.6680.465–0.9600.0290.6440.428–0.6960.035
    IN1.2360.755–2.0230.3991.0670.621–1.8350.814
    KS1.4890.754–2.9410.2510.8790.412–1.8730.738
    KY1.1460.751–1.7490.5261.0170.621–1.6670.945
    LA4.4312.343–8.379<0.0012.7941.358–5.7480.005
    MA1.3890.957–2.0150.0841.2560.819–1.9280.297
    MD0.9310.581–1.4920.7661.1030.653–1.8620.714
    ME0.0000.000-Inf0.9840.0000.000-Inf0.989
    MI0.9470.726–1.2360.6900.8400.599–1.1780.313
    MN1.7431.115–2.7240.0151.8421.127–3.010.015
    MO0.6020.237–1.5310.2860.3620.128–1.0210.055
    MS0.7330.405–1.3290.3070.5820.306–1.1090.100
    MT0.2260.030–1.6760.1460.1480.018–1.2050.074
    NC1.6451.282–2.111<0.0011.1010.796–1.5230.561
    ND2.4070.736–7.8750.1463.6640.972–13.8160.055
    NE0.4920.150–1.6170.2430.6610.912–2.2770.512
    NJ0.9610.700–1.3180.8050.9880.692–1.4130.949
    NM1.4440.753–2.7680.2681.5130.736–3.1130.260
    NV0.7820.515–1.1870.2480.9060.542–1.5130.705
    OH1.2871.001–1.6540.0491.0770.791–1.4670.637
    OK1.2810.910–1.8020.1561.0440.676–1.6130.845
    OR2.4621.567–3.868<0.0012.5521.530–4.257<0.001
    PA2.0031.623–2.472<0.0012.1461.656–2.782<0.001
    SC1.0440.699–1.5590.8330.9270.592–1.4500.739
    SD0.0000.000-Inf0.9440.0000.000-Inf0.962
    TN0.7520.534–1.0590.1020.5980.399–0.8970.013
    TX0.9740.696–1.3610.8760.9290.627–1.3740.711
    UT0.0000.000-Inf0.9880.0000.000-Inf0.992
    VA1.4771.097–1.9890.0101.5061.052–2.1550.025
    VT0.0000.000-Inf0.9770.0000.000-Inf0.986
    WA0.8960.554–1.4480.6530.9650.562–1.6560.897
    WI1.5291.026–2.2760.0371.5000.952–2.3630.080
    WV0.9210.590–1.4370.7170.7260.434–1.2160.224
    WY0.0000.000-Inf0.9740.0000.000-Inf0.982
1 SNF: Skilled nursing facility; 2 ICF: Intermediate care facility; 3 CMS: U.S. Centers for Medicare and Medicaid Services; 4 MS-DRG: Medicare Severity Diagnosis Related Group.
Table 3. Univariate and multivariate logistic regression results including odds ratios (ORs), corresponding 95% confidence intervals (CIs), and p-values for specificity of a dementia-related secondary diagnosis.
Table 3. Univariate and multivariate logistic regression results including odds ratios (ORs), corresponding 95% confidence intervals (CIs), and p-values for specificity of a dementia-related secondary diagnosis.
Univariate Analysis Multivariate Analysis
VariableOR95% CIp-ValueOR95% CIp-Value
Intercept---0.3510.327–0.378<0.001
Age (Ref: 85+)
    0–441.9561.701–2.248<0.0011.9341.676–2.233<0.001
    45–541.7361.607–1.875<0.0011.7341.601–1.877<0.001
    55–591.7451.652–1.843<0.0011.7361.640–1.838<0.001
    60–641.5321.477–1.588<0.0011.5261.468–1.586<0.001
    65–691.4591.421–1.497<0.0011.4221.384–1.461<0.001
    70–741.4381.409–1.467<0.0011.4211.392–1.451<0.001
    75–791.4101.386–1.435<0.0011.4021.377–1.426<0.001
    80–841.3161.295–1.337<0.0011.3161.295–1.338<0.001
Sex (Ref: Female)
    Male1.2691.254–1.284<0.0011.2241.209–1.239<0.001
Race (Ref: White)
    Asian0.9770.941–1.0150.2301.0090.967–1.0530.680
    Black0.9890.972–1.0060.2020.9550.937–0.973<0.001
    Other1.0361.009–1.0630.0081.0691.040–1.099<0.001
    Unable to determine0.9690.930–1.0100.1390.9730.933–1.0150.210
Log(Length of Stay)1.0691.062–1.077<0.0011.0171.008–1.025<0.001
Primary Payor (Ref: Medicare traditional)
    Charity/Indigent1.2040.903–1.6040.2060.9760.729–1.3080.873
    Commercial indemnity1.1211.055–1.192<0.0010.9790.919–1.0420.499
    Direct employer contract2.9342.229–3.861<0.0012.4491.854–3.234<0.001
    Managed care capitated1.0870.889–1.3280.4160.8630.703–1.0600.161
    Managed care non-capitated1.0901.044–1.139<0.0011.0110.967–1.0570.639
    Medicaid managed care capitated1.1300.993–1.2850.0630.8960.785–1.0230.105
    Medicaid managed care non-capitated1.1291.077–1.183<0.0010.8960.852–0.942<0.001
    Medicaid traditional1.0500.993–1.1110.0870.8440.796–0.896<0.001
    Medicare managed care capitated1.0180.987–1.0490.2580.9790.948–1.0120.214
    Medicare managed care non-capitated0.9730.961–0.986<0.0010.9450.933–0.958<0.001
    Other0.9630.888–1.0430.3540.9090.837–0.9860.022
    Other government payors1.0811.034–1.1290.0010.9150.875–0.958<0.001
    Self-pay0.8790.798–0.9680.0090.8190.742–0.904<0.001
    Workers’ compensation0.4670.327–0.667<0.0010.4950.344–0.711<0.001
Point of Origin (Ref: Non-healthcare facility)
    Clinic0.9230.893–0.955<0.0010.9540.921–0.9870.007
    Court/Law enforcement1.0290.771–1.3720.8480.9490.698–1.2910.739
    Information not available1.0080.936–1.0850.8381.0450.968–1.1290.256
    Other1.0710.878–1.3060.4971.1590.946–1.4190.154
    Transfer from ambulatory surgery center0.7690.599–0.9880.0400.7150.555–0.9210.009
    Transfer from dept unit in same hospital, separate claim0.9760.901–1.0570.5430.9990.921–1.0840.982
    Transfer from health facility1.0030.956–1.0530.8960.9500.904–0.9990.044
    Transfer from hospice and under hospice program1.2880.904–1.8340.1611.2480.871–1.7870.227
    Transfer from hospital (different facility)0.9500.926–0.975<0.0010.8840.860–0.908<0.001
    Transfer from SNF 1 or ICF 21.1181.096–1.141<0.0011.0821.059–1.106<0.001
Discharge Status (Ref: Discharged to home or self-care)
    Acute inpatient readmission0.9660.835–1.1170.6371.0330.891–1.1970.666
    Discharged to home health organization1.0261.006–1.0460.0111.0641.043–1.085<0.001
    Discharged to hospice home1.2391.204–1.275<0.0011.3121.274–1.352<0.001
    Discharged to hospice medical facility1.1801.146–1.215<0.0011.2281.191–1.265<0.001
    Discharged/Transferred to another rehab facility1.0100.974–1.0470.5891.0190.982–1.0580.321
    Discharged/Transferred to cancer ctr/children’s hospital0.9380.702–1.2530.6661.1090.827–1.4880.490
    Discharged/Transferred to court/law enforcement1.2490.965–1.6160.0921.1280.856–1.4860.392
    Discharged/Transferred to critical access hospital0.6120.355–1.0560.0780.5810.335–1.0080.054
    Discharged/Transferred to federal hospital1.0270.800–1.3190.8331.0270.797–1.3220.839
    Discharged/Transferred to ICF 21.2891.240–1.339<0.0011.2491.201–1.300<0.001
    Discharged/Transferred to long-term care hospital1.1081.046–1.173<0.0011.0020.944–1.0630.950
    Discharged/Transferred to nursing facility1.5321.389–1.690<0.0011.5441.396–1.706<0.001
    Discharged/Transferred to other facility0.9780.922–1.0380.4630.9470.891–1.0050.074
    Discharged/Transferred to other health institute not in list1.1891.064–1.3280.0021.1961.068–1.3380.002
    Discharged/Transferred to psychiatric hospital1.8491.694–2.019<0.0011.6851.542–1.842<0.001
    Discharged/Transferred to SNF 11.0501.032–1.068<0.0011.0751.056–1.095<0.001
    Discharged/Transferred to swing bed1.0840.977–1.2020.1271.1511.034–1.2800.010
    Expired0.9360.910–0.963<0.0010.8960.870–0.923<0.001
    Information not available0.6910.514–0.9280.0140.8300.613–1.1240.228
    Left against medical advice0.8500.769–0.9400.0020.8290.749–0.917<0.001
    Still a patient—expected to return1.2040.603–2.4050.5981.0060.498–2.0340.987
Count of Procedures1.0381.036–1.041<0.0011.0391.036–1.0420.001
CMS 3 Fiscal Year (Ref: 2022)
    20231.0301.016–1.044<0.0011.0141.000–1.0280.049
Social Vulnerability Index
    Household characteristics0.8080.790–0.827<0.0010.8290.717–0.9580.011
    Housing type and transportation0.9360.914–0.958<0.0010.9630.892–1.0400.336
    Overall0.9120.889–0.936<0.0011.0901.030–1.1540.003
    Racial and ethnic minority status0.9110.889–0.934<0.0010.9310.844–1.0270.156
    Socioeconomic status0.8360.817–0.857<0.0011.1770.895–1.5480.245
COVID-19 Status (Ref: Not identified)
    Positive0.9600.943–0.977<0.0010.9480.930–0.965<0.001
MS-DRG 4 Type (Ref: Medical)
    Surgical0.9320.917–0.947<0.0010.8590.844–0.875<0.001
Teaching Status (Ref: No)
    Not Available1.1531.095–1.215<0.0011.1301.066–1.197<0.001
    Yes1.0671.051–1.082<0.0010.9750.949–1.0010.061
Academic Status (Ref: No)
    Yes1.0781.061–1.096<0.0011.0521.020–1.085<0.001
Rural/Urban Status (Ref: Urban)
    Rural0.9690.952–0.986<0.0010.9760.955–0.9970.025
Ownership (Ref: Voluntary non-profit—private)
    Government—federal0.9520.841–1.0780.4370.9150.802–1.0420.181
    Government—hospital district/authority1.0481.024–1.074<0.0011.0331.006–1.0620.017
    Government—local1.1021.060–1.145<0.0011.1461.097–1.196<0.001
    Government—state0.8980.837–0.9630.0030.8810.815–0.9510.001
    Not available0.9050.829–0.9890.0280.8470.771–0.9300.001
    Physician1.3641.206–1.544<0.0011.1911.045–1.3580.009
    Proprietary0.8370.814–0.860<0.0010.8720.845–0.899<0.001
    Voluntary non-profit—church0.8860.871–0.902<0.0010.9040.887–0.921<0.001
    Voluntary non-profit—other0.9610.935–0.9870.0030.8900.864–0.916<0.001
Bed Count (Ref: >400)
    1–500.9400.904–0.9760.0020.9540.912–0.9970.038
    51–1001.0351.007–1.0640.0011.0711.037–1.105<0.001
    101–1500.8490.829–0.869<0.0010.8780.987–1.040<0.001
    151–2000.9520.928–0.975<0.0010.9510.854–0.902<0.001
    201–2500.8680.849–0.886<0.0010.9310.924–0.978<0.001
    251–3000.9140.895–0.933<0.0010.9450.909–0.955<0.001
    301–3500.9080.889–0.926<0.0010.9390.922–0.969<0.001
    351–4000.9240.903–0.945<0.0011.0130.917–0.9610.329
Case Mix Index1.0281.005–1.0520.0180.9810.951–1.0120.227
State Abbreviation (Ref: NY)
    AK1.3921.081–1.7930.0101.3351.034–1.7250.027
    AL0.7310.679–0.787<0.0010.7230.669–0.780<0.001
    AR0.6260.579–0.677<0.0010.6650.612–0.722<0.001
    AZ1.0110.968–1.0550.6331.0390.992–1.0890.104
    CA0.9460.916–0.9770.0010.9490.915–0.9840.005
    CO0.8570.795–0.923<0.0010.8410.778–0.909<0.001
    CT1.2401.173–1.312<0.0011.2351.166–1.309<0.001
    DE0.8170.716–0.9320.0030.8250.720–0.9450.005
    FL0.9120.888–0.936<0.0010.9220.894–0.952<0.001
    GA0.9330.885–0.9840.0110.8810.829–0.935<0.001
    HI1.1501.087–1.217<0.0011.2191.136–1.309<0.001
    IA1.3421.261–1.427<0.0011.2981.216–1.386<0.001
    ID1.3070.612–2.7920.4891.4880.693–3.1970.308
    IL1.0611.023–1.1010.0021.0871.044–1.131<0.001
    IN1.0931.035–1.1540.0011.0951.033–1.1610.002
    KS0.6970.639–0.760<0.0010.6850.626–0.750<0.001
    KY1.1961.144–1.249<0.0011.2251.167–1.287<0.001
    LA1.3301.239–1.428<0.0011.2871.194–1.388<0.001
    MA1.0250.960–1.0950.4540.9620.899–1.0310.275
    MD0.9690.918–1.0230.2540.9500.897–1.0070.083
    ME1.2430.515–3.0000.6281.3720.563–3.3470.487
    MI1.0060.972–1.0410.7411.0270.987–1.0690.182
    MN2.1622.014–2.321<0.0012.0771.930–2.234<0.001
    MO0.9910.919–1.0680.8090.9020.833–0.9770.012
    MS1.0050.951–1.0620.8610.9920.936–1.0520.796
    MT0.7980.708–0.900<0.0010.8480.749–0.9600.009
    NC1.3051.265–1.347<0.0011.2441.199–1.290<0.001
    ND1.4951.280–1.746<0.0011.7231.469–2.022<0.001
    NE1.6671.521–1.827<0.0011.8081.643–1.990<0.001
    NH0.4830.250–0.9330.0300.5140.265–0.9960.049
    NJ0.8630.821–0.907<0.0010.8740.830–0.921<0.001
    NM0.8800.803–0.9630.0060.8390.763–0.921<0.001
    NV0.7930.745–0.845<0.0010.8730.815–0.936<0.001
    OH1.2871.244–1.331<0.0011.3561.306–1.408<0.001
    OK1.4071.339–1.478<0.0011.3501.277–1.426<0.001
    OR1.7561.672–1.845<0.0011.8811.784–1.984<0.001
    PA1.4031.358–1.450<0.0011.4391.389–1.492<0.001
    RI3.0301.078–8.5160.0352.7630.977–7.8130.055
    SC1.0821.033–1.1330.0011.0801.028–1.1350.002
    SD1.4091.252–1.586<0.0011.3931.233–1.573<0.001
    TN0.8070.776–0.840<0.0010.8260.790–0.865<0.001
    TX1.2391.203–1.277<0.0011.2471.203–1.292<0.001
    UT0.8550.422–1.7300.6630.8810.433–1.7890.726
    VA1.3741.324–1.426<0.0011.3651.309–1.422<0.001
    VT0.7970.564–1.1270.1990.8530.601–1.2110.373
    WA1.2251.165–1.288<0.0011.2811.215–1.352<0.001
    WI1.5091.443–1.578<0.0011.4961.426–1.569<0.001
    WV0.9710.926–1.0190.2290.9810.930–1.0360.492
    WY1.4161.067–1.8790.0161.4711.104–1.9600.008
1 SNF: Skilled nursing facility; 2 ICF: Intermediate care facility; 3 CMS: Centers for Medicare and Medicaid Services; 4 MS-DRG: Medicare Severity Diagnosis Related Group.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Richardson, K.; Penumaka, S.; Smoot, J.; Panaganti, M.R.; Chinta, I.R.; Guduri, D.P.; Tiyyagura, S.R.; Martin, J.; Korvink, M.; Gunn, L.H. A Data-Driven Approach to Defining Risk-Adjusted Coding Specificity Metrics for a Large U.S. Dementia Patient Cohort. Healthcare 2024, 12, 983. https://doi.org/10.3390/healthcare12100983

AMA Style

Richardson K, Penumaka S, Smoot J, Panaganti MR, Chinta IR, Guduri DP, Tiyyagura SR, Martin J, Korvink M, Gunn LH. A Data-Driven Approach to Defining Risk-Adjusted Coding Specificity Metrics for a Large U.S. Dementia Patient Cohort. Healthcare. 2024; 12(10):983. https://doi.org/10.3390/healthcare12100983

Chicago/Turabian Style

Richardson, Kaylla, Sankari Penumaka, Jaleesa Smoot, Mansi Reddy Panaganti, Indu Radha Chinta, Devi Priya Guduri, Sucharitha Reddy Tiyyagura, John Martin, Michael Korvink, and Laura H. Gunn. 2024. "A Data-Driven Approach to Defining Risk-Adjusted Coding Specificity Metrics for a Large U.S. Dementia Patient Cohort" Healthcare 12, no. 10: 983. https://doi.org/10.3390/healthcare12100983

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop