1. Introduction
The global average health care overpayment loss is estimated to be more than 6 % of overall spending, or more than
$450 billion. In the U.S., national health care spending reached
$3.6 trillion in 2018, or
$11,172 per person [
1]. The Federal Bureau of Investigation estimates that up to ten percent of annual health care spending is lost to fraud, waste and abuse. Despite the increasing attention and resources for fraud detection, the extent of health care fraud remains an issue. For instance, in Florida, in 2012 a whistle blower exposed billing fraud in the millions of dollars at seven major hospitals (
https://www.radiologybusiness.com/topics/business-intelligence/whistleblower-saw-imaging-billing-fraud-7-florida-hospitals, accessed on 13 August 2019). In 2016, a
$1 billion alleged Medicare fraud, money laundering scheme, led to many arrests after its detection, using analytical methods (
https://www.cnbc.com/2016/07/22/1-billion-alleged-medicare-fraud-money-laundering-scheme-leads-to-florida-arrests.html, accessed on 13 August 2019). In September 2020, nineteen more defendants were charged in Florida as part of a national crackdown on fraudulent submissions to private insurers and federal health programs to the tune of
$6 billion (
https://www.justice.gov/usao-mdfl/pr/national-health-care-fraud-and-opioid-takedown-results-largest-enforcement-action, accessed on 13 August 2019).
Health care overpayments are generally classified into three categories—provider, patient and insurer. Among these, overpayments as a result of provider activities, involving fraudulent protocols by hospitals and physicians, are the most common. Examples include, but are not limited to, the submission of false claims, kickback payments, self-referrals, etc.; see [
2] for a comprehensive discussion. This paper focuses on upcoding, which is also referred to as overcharging or aggressive billing. Examples of upcoding include the following: (a) exaggerating the time to perform a procedure; (b) deliberately mis-specifying the equipment needed in the procedure; (c) misrepresenting the credentials of the staff performing the procedure in order to increase payment rates; and (d) lying that the procedure happened when in fact it did not.
Upcoding is a concern for both private and government insurance programs. The most straightforward way to detect potential upcoding is through manual audits by using the medical charts. However, the size and complexity make comprehensive audits challenging, if not impossible [
3]. Therefore, statistical methods are widely used to detect fraudulent transactions; see [
4,
5,
6] for overviews. Standard practice of upcoding detection typically uses outlier detection to isolate providers with more aggressive billing patterns as potential audit targets. For any procedure and/or provider, one could compare suspicious billings with historical claims data. For instance, Silverman and Skinner [
7] show that profit driven hospitals biased their claims toward higher-paying diagnoses to maximize reimbursement while [
8] discuss cases of upcoding at non-profit hospitals. However, as [
9] noted, identifying upcoding in claims data is difficult due to confounding variables such as patient risk. Kc and Terwiesch [
10] argue that patient selection bias is a potential issue in such studies. Indeed, Ata et al. [
11] show that payment models could offer incentives to hospitals to admit short-lived patients. Particular payment plan characteristics could also motivate providers toward aggressive billing. Brunt [
12] show that drastic changes in the Medicare fee differentials for similar procedures increase the likelihood of upcoding. Zafari and Ekin [
13] and Ekin et al. [
14] propose a combination of Bayesian hierarchical and outlier detection approaches to group and detect unusual provider billings. Zafari et al. [
15] present an integrated multivariate outlier detection and decision modeling framework.
While identification is crucial and helps with recovery of overpayments, it does not change the overall culture of upcoding; hence, long term system integrity is compromised. This motivates policy makers to be interested in the provider billing characteristics as well as understanding the common factors that drive conservative and/or aggressive behavior. For instance, Cooper et al. [
16] point out the low correlation between the growth in hospital referral region-level private health spending and growth in fee-for-service Medicare spending; they argue that different factors may be driving the growth in spending for different systems. This study focuses on U.S. Medicare fee for services systems and uses claims data to study the key factors that can explain differences in billing patterns. Many factors contribute to determination of payment amounts for Medicare Part B fee for services. These include, but are not limited to, the 20% co-payment; non-participating status and limiting charge; facility & non-facility rates; geographic adjustments; and multiple procedure payment reductions. Among these, geographical adjustments are one of the main reasons for payment differences since, typically, urban areas have
to
higher payment rates than national averages. Medicare allowed payment amounts for each procedure in each geographical region are publicly available at [
17]; see, for instance, [
18] for a county level analysis. Although these payment amounts are determined with respect to well-specified fixed formulas that account for geographic and billing type variations, the billing patterns among providers may still differ even for the same procedure. For instance, underlying common factors for providers that request higher reimbursement for same set of services could be different than the providers with conservative billing behaviors. Even if the factors are similar, their marginal impact could be different. Understanding such differences in billing patterns could enable policy makers offer specific policies. This, in turn, can also lead to better upcoding detection by decreasing the amount of false positives.
A variety of statistical approaches are proposed to study the common factors underlying billing patterns, see [
19] for a discussion. Linear regression models are often used. Jürges and Köberlein [
20] use regression to determine relationships between upcoded cases and expected payment differences; Bowblis and Brunt [
21] analyze the impact of geographic adjustments on profit margins; Fang and Gong [
22] attempt to detect potential overbilling in Medicare reimbursement. Such oft-use of linear models is somewhat surprising, given the complexity of the data. Indeed, since billing data distributions are asymmetric and/or leptokurtic, the use of ordinary least square regression could lead to serious biases in fraud detection. A limited number of hybrid studies use regression methods along with unsupervised methods [
23]. Lorence and Spink [
24] analyze survey-based variations in billing patterns via descriptive analytics. A logistic Bayesian model is developed to detect questionable claims and estimate probability of upcoding [
25].
We introduce quantile regression models to the domain of health care fraud detection. The use of quantile regression in health care utilization and fraud assessment has been limited. The only relevant quantile regression based application examines health care expenditures during the Great Recession [
26]. To the best of our knowledge, our paper is the first study that investigates the underlying characteristics of both aggressive and conservative billing by providers in health care systems via quantile regression and variable selection methods. Under this approach, providers at lower quantiles of the aggressiveness distribution could be argued to be more conservative or efficient, or even potentially underpaid. Their efficient behavior can serve as a benchmark for other providers. On the other hand, providers at the higher quantiles of the aggressiveness distribution could be upcoding the system. Characteristics unique to this latter group may help improve our understanding of their billing behavior. A related issue is the set of covariates that could influence aggressive billing could be quite large and may have differing impacts on the data distribution’s quantiles. We propose using the Bayesian Information Criterion (BIC) to select key variables at several quantiles of the billing distribution. This quantile variable selection framework is a first in the statistical modeling of fraud detection.
In order to conduct the empirical analysis; in addition to using the fee for service payment data from Medicare Part B, we construct a large database by combining a number of public data sources. With these data, for the mammography procedure, we find that the variables which impact the various quantiles of the billing aggressiveness distribution are very different than those obtained from a linear regression. Moreover, the set of variables differ across the various quantiles of the billing distribution. We discuss how our findings can be used in health care fraud prevention and detection. The paper proceeds as follows.
Section 2 discusses the data, quantile regression and variable selection methodologies. This is followed by an empirical analysis in
Section 3. A discussion and an overview in
Section 4 and
Section 5 conclude the paper.
2. Materials and Methods
2.1. Data
Data Sources. The major data set of use is the
The Physician and Other Supplier Public Use File [
27], which contains 100% final-action physician Medicare Part B services and procedures provided to fee for service (FFS) beneficiaries. It includes measurements for utilization, payment and submitted charges organized by the
National provider identifier (NPI),
Healthcare common procedure coding system (HCPCS) code, and place of service. We chose the year 2014 to take into account the major payment reduction adjustment after April 1, 2013, and the addition of the variable “standardized payment”. For fair comparison, we only use observations for individual entities that have facility as place of services.
The raw health care claims data has 9,316,307 observations in which each row corresponds to the billing summary of a given procedure by a particular provider. It is straightforward to use our approach for data of any procedure, as well as any given provider or provider-procedure pairs. We focus on the fee for service claim billings for Procedure 77055, which corresponds to computer-aided detection applied to a diagnostic mammogram, in order to illustrate the proposed method. Following the recent Affordable Care Act, starting 1 January 2020, mammography was reclassified as a preventive, not diagnostic, screening procedure. Developing a dataset for the analysis of fraudulent billing behavior is a formidable challenge due to the complexity and size of the source databases. In the following, we describe how we accomplished this task in general, keeping in mind we used the process to isolate the upcoding practice for Procedure 77055.
The characteristics of a given procedure-provider pair (with a unique HCPCS service code and a unique NPI) are the focal variables. These include the average Medicare allowed payment amount (the Medicare payment, deductible and copay averaged over procedure); average submitted charge amount; average Medicare payment amount after deduction of deductible and coinsurance; and average Medicare standardized payment amount that removes geographical differences in payment rates. The claims data also include number of unique beneficiaries served; number of services provided; and number of distinct Medicare beneficiary/per day services. Provider-specific information, such as the state, zip-code, address and provider type, are also used. In order to measure the impact of provider type for claims on a given procedure, we computed the mean and standard deviation of the aggressiveness ratios that capture the relative frequency and deviation.
Next utilized data source is the
Fee Schedules to map the zip codes to locality information [
28]. The current
Physician Fee Schedule locality structure was developed and implemented in 1997. At present, there are 89 localities of which 34 are statewide areas,. which account for the geographical variation in payment patterns; see [
29] for details. Then, we extract the geographical practice cost indices for each locality [
17]. In particular, the relative costs of physician work, practice expense, and malpractice expense in a specific area, compared to the national average costs, for each component are retrieved; see [
30] for details. Lastly, we employ a public use file that provides demographic, spending, utilization, and quality indicators at the state level. This can help evaluate geographic variation in the utilization and quality of health care services for the Medicare fee for service population; see [
31] for details.
Response variable. “Billing aggressiveness of the provider” for the mammography procedure is defined as a non-negative measurement for which larger values potentially reflect upcoding for the same procedure. It is the ratio of the average submitted charged amount and average payment amount for all claims submitted for a given procedure [
9]. The first row of
Table 1 displays the skewness of the standardized payments for Procedure 77055. There are a total of 685 providers that billed for this procedure. It can be seen that the standard deviation of standardized payments is approximately ten percent of the mean amount for all providers. The skewness and kurtosis values are high. Per our earlier discussions, such discrepancies in the standardized average payment amounts could be explained by geographical variations and patient related factors. However, we should emphasize that our focus is not on payments per se, rather our analysis aims to investigate the potential reasons for differences in the aggressiveness of providers for the same procedure. The second row of
Table 1 exhibits the right skew in the distribution of the aggressiveness of the providers. Both skewness and kurtosis values are very high. The skewness and the existence of outliers are also displayed in the box-plot and histogram of billing aggressiveness; see
Figure 1. The table and graphs point to the non-normality of the billing distribution for this procedure.
Covariates. We consider the following covariates.
Line Service Count: This corresponds to the number of services provided by the particular provider of interest for that procedure.
Unique Beneficiary Count: This is the number of distinct Medicare beneficiaries receiving the service. This is expected to be related to the Line Service Count, however it can provide additional insights especially for fraudulent and/or aggressive provider billing behavior.
Average Medicare Allowed Payment: This is the sum of payment amount by Medicare, the deductible and coinsurance amounts paid by the beneficiary, and any amounts that a third party is responsible for paying.
Average Medicare Standardized Payment: This variable captures the average standardized payment amount by Medicare after the deduction of beneficiary deductible and coinsurance amounts and geographical standardization. Standardization removes geographic differences in payment rates for individual services due to local prices and makes Medicare payments across geographic areas comparable. This allows to reflect variations and capture underlying factors of providers’ billing patterns [
32].
GPCI Work: The Geographical Practice Cost Index (GPCI) has been established for every Medicare payment locality for each of the three components of a procedure’s relative value unit (RVU) for work, practice expense, and malpractice. Since these are highly correlated, we picked the RVU for work.
MA Participation Rate: These are state rates for the Medicare Advantage (MA) participation. This is chosen since it can provide a glimpse of the billing patterns within that state, in particular of the aggressiveness of provider billing.
Average Age: This is the numerical variable of age, which is mostly above 65 given the nature of Medicare program. In general, the older patients are taken more advantage of, so this can help understand billing patterns.
Percent Female: This is coded as a numerical variable that provides the ratio of female out of all beneficiaries. We aim to investigate if the gender of the beneficiaries has an association with billing patterns.
Percent non-Hispanic White: We aim to investigate if the racial demographic of the beneficiaries has an association with billing patterns.
Percent Eligible for Medicaid: This is the state rate for the percent of Medicare beneficiaries that are eligible for Medicaid. Dual Medicare-Medicaid (Medi-Medi) access are available to those who meet certain criteria. This has been an area of upcoding, and hence motivated the initation of Medi-Medi Data Match Program [
33].
Average HCC Score: The Hierarchical Condition Category (HCC) score is a risk-adjusted value that ranks diagnoses into categories which represent conditions with similar cost patterns. Higher categories represent higher predicted health care costs, resulting in higher risk scores. Average HCC Score represents the averaqe risk score for Medicare beneficiaries of all ages in each region. The national average is 1.0.
Standardized Risk-adjusted Per Capita Costs: This corresponds to the cost of Medicare procedures adjusted to reflect all beneficiaries for each state.
Mean Aggressiveness of Provider Type: This is defined as the average ratio of the average submitted charged amount and average payment amount for that particular provider type. Billing for certain procedures are expected to be more aggressive for some provider types, so we aim to capture that.
Standard Deviation of Aggressiveness of Provider Type: This is defined as the standard deviation of the average submitted charged amount and average payment amount for that particular provider type. Some provider types could include various distinct types of provider billing patterns, so this variable aims to help capture that volatility. It can be expected that a provider with a type that has low standard deviation of aggressiveness, would behave within billing norms associated with that provider type.
The asymmetric and/or leptokurtic distributions of billing data need to be considered while modeling the the underlying characteristics of provider billing patterns. Our first hypothesis is that different quantiles of the distribution of billing aggressiveness may be affected by different covariates. Our second hypothesis is that the magnitude of the impact of the covariates differ at quantiles of the distribution of billing aggressiveness. In order to check for those hypotheses, we use the statistical methods of quantile regression and quantile variable selection, which are described in the following.
2.2. Quantile Regression
A practical option for upcoding detection is to flag any provider with aggressiveness higher than a certain threshold as an outlier for audits. However, in order to ensure program integrity, we also need to understand the underlying characteristics of aggressive providers and upcoding. Health care billing is a complex phenomena, and causality is difficult to attain. We emphasize that we do not claim causal relationships between the response and independent variables in our model. A major limitation of existing classical linear regression models is their inability to quantify the differing impacts of explanatory variables on different parts of the data distribution. They might provide acceptable explanatory power “on average” but may be poor at modeling abnormal swings. Consider
Table 1. The factors that impact the lower quantiles of the aggressiveness distribution may be different than those at the upper quantiles. And even if the factors coincide, their marginal impacts at the lower and upper quantiles could be different. A change in the “payment amount” may have little impact if billing aggressiveness is low, but it could result in a great impact in billing aggressiveness in case it is already high. These varying relationships, and differing impacts of variables, on different levels of aggressiveness cannot be captured by mean linear regression models. For settings with many variables, a variable selection method can be valuable to identify the best possible subset of covariates in the quantile regression model. In addition to handling non-normal data distributions, the contextual relevance of understanding what covariates influence the entire data distribution motivates us to employ quantile regression variable selection techniques.
Following [
34,
35,
36,
37], the linear quantile regression model for the
ith observation of the response variable
y is given by:
where the error terms,
are i.i.d;
for almost every independent variable
;
; and
.
Given that our aggressive billing data is highly asymmetric and heavy-tailed, following [
38], we assume that each
follows an asymmetric Laplace distribution with density function
where
,
is independent of
, and
I is the indicator function. Thus, the conditional
-quantile of
given
is
.
2.3. Quantile Variable Selection
The Bayesian Information Criterion (BIC) method [
39] is known to provide best subset selection as it identifies the true model consistently [
40]. Modified forms of the BIC to account for large model spaces and large sample sizes are presented to have sound statistical properties [
41,
42]. These improvements to BIC are based on embedding robust shrinkage estimation methods such as the least absolute shrinkage and selection operator (LASSO) and smoothly clipped absolute deviation (SCAD); see [
38,
43].
The aim is to consider only
independent variables among the
s, implying
covariates to be set as zero in Equation (
1). Adapting the notation from [
38], let
denote a candidate model corresponding to the independent variables
: denote
;
to be the cardinality
d of
S;
to be the maximum likelihood estimator of
. Then, the BIC for linear quantile regression is given by:
where
is a positive constant that diverges to infinity as
n increases. A discussion related to the theoretical properties of Equation (
3) can be found in [
38]; these authors also demonstrate how the BIC successfully identifies the true model in high-dimensional quantile regression models.
Let
. The process of selecting the best subset model proceeds by choosing
as follows:
The selected subset
includes the variables that are selected at the
th quantile of the distribution of the dependent variable
y. In order to find the estimators for
, we use the LASSO procedure in R [
44]. We let
vary between 0.01 and 1. If a particular independent variable’s coefficient estimate is less than 0.001, that is set to zero, which means it is not selected in the model. For the optimal
, the module presents the corresponding estimates for
and the BIC values at each of the selected quantiles. The following quantiles,
, are selected: 0.01, 0.05, 0.25, 0.50, 0.75, 0.95 and 0.99; collectively, these quantiles better capture the impact of the covariates at various positions along the entire distribution of aggressiveness; importantly, the impact at the tails of the aggressive billing distribution are also accounted for in the process.
3. Results
We first estimate the parameters corresponding to the covariates via the OLS mean regression model which could be used in fraud detection. We then employ the proposed methodology on the same data and variables.
3.1. OLS Estimation
Consider
Table 2. Average Medicare standardized payment amount and mean aggressiveness of the provider type are significant factors. Most of the providers that are billing for this procedure are diagnostic radiologists; this is crucial in understanding upcoding. For instance, the average aggressiveness of general surgeons is 6.06, which is higher than the overall mean of 4.00. The billing pattern of a general surgeon is likely to be more aggressive than average. Another finding is that providers exhibit higher aggressiveness when the amount of average standardized payment is lower (
p = 0.000). Lastly, it should be noted that an increase in average HCC score, which indicates aggregate sickness level of patients, also increases billing aggressiveness (
p = 0.082).
However, not surprisingly, the residuals from the OLS analysis violate the normality and iid assumptions. The Lilliefors test for non-normality is significant (
p = 2.133 × 10
), which is also evidenced in the q-q plot shown in
Figure 2. In short, the analysis shown in
Table 2 could lead to a poor understanding of providers that overcharge.
3.2. Quantile Estimation
Setting aside variable selection for the moment, consider
Table 3 which presents the quantile regression coefficients and their significance levels at the quantiles {0.01, 0.05, 0.25, 0.5, 0.75, 0.95, 0.99}. The varying impacts of variables at the different quantiles of the aggressive billing distribution are evident. While average Medicare standardized payment amount is significant for every quantile of interest, please note that the mean aggressiveness of the provider type is only significant for three quantiles. Let us examine just a few of these relationships below in the interests of brevity.
First, the average Medicare standardized payment amount is significant for all quantiles, albeit with varying beta coefficients at different quantiles. For higher percentiles of aggressiveness, the relationship becomes stronger and more negative. The providers with increasing aggressive billing patterns become less aggressive for higher standardized payment amounts. Second, average HCC score, has a positive impact on aggressive billing, but is only significant at percentiles of 0.05 and 0.95. Similarly, the variable GPCI Work, which measures the geographic variation adjustment of practice expense, is only significant at 25th percentile. Third, consider the impact of standard deviation of aggressiveness for each provider type. This variable’s marginal effect trends from negative to positive with increasing values. Fourth, upcoding tends to be higher among less aggressive providers that bill in states where patient populations have higher proportions of dual eligibility—those that are eligible for Medicaid and Medicare. This relationship becomes inverse for providers with increasing aggressiveness. Upcoding is also higher in states with more non-Hispanic, white demographic patients; although this is not significant.
To further demonstrate the value of quantile regressions in this context, we graphically compare the confidence intervals and the point estimates given in
Table 2 and
Table 3. Consider
Figure 3 that presents the parameter estimates for the variables “
average Medicare standardized payment amount”, “
GPCI Work”, “
average HCC score”, and “
mean aggressiveness of provider type”.
The first takeaway is the
non-linear, non-constant nature of the relationship between the covariates and aggressive billing data; the
linear OLS estimation misses these true relationships entirely. Second, at first glance, it would seem that since some quantile estimates for each of the covariates are contained in the corresponding OLS-based 95% confidence intervals, there is no major difference in the OLS and quantile estimates. This inference would be misguided. The 95% OLS-based interval is
always equidistant from the OLS
mean estimate of a parameter. The quantile estimates are
localized regression values. To illustrate this, consider the top right panel for the GPCI Work estimates. Most of the quantile estimates are within the 95 % confidence interval of the OLS estimate but consider, for instance, the quantile regression point estimate of GPCI Work at the first quartile. The
localized marginal effect of GPCI Work on the billing data that fall at this quantile is twice as large as the OLS mean effect; the latter’s effect on billing data is
always, on average, 15.49 from
Table 2, no matter where the data are on the billing distribution. Practically speaking, the impact of the relative cost of physicians’ work is, in fact, twice as large at the lower spectrum of the aggressive billing distribution; the OLS estimate grossly underestimates the impact of this covariate on a sizable number of billing data. A similar argument can be made for average HCC score. Thus, even if the quantile estimates are within the equal-tailed 95% confidence interval of the OLS estimate, they vary significantly and non-linearly across the billing data distribution. Third, consider the top left panel for the average Medicare standardized payment amount; see
Table 3 for the corresponding numerical values that quantify the varying nature of the marginal effect of this covariate. In contrast, the linear mean effect for this covariate is fixed throughout the distribution of aggressiveness. The numerical estimates only intersect slightly at the upper 95 % confidence interval of the 75th percentile. This is also evident with the impact of “mean aggressiveness of the provider type”. Although this mean effect is significant for OLS regression, for all quantiles of the quantile regression method, its estimates vary across quantiles, and some are not significant. Finally, the non-linear quantile interval estimates provide evidence of changing standard errors at the different quantiles, but in the OLS estimation the standard error is always fixed; this is implausible given the skew in the billing data distribution shown in
Table 1 as well as the plots in
Figure 1.
Similar results hold for the other covariates; for brevity, we only present the plots for the variables discussed above. These results validate our key empirical claim that mean regressions are ill-suited to explore the impact of covariates on the billing aggressiveness data as a whole.
3.3. Variable Selection
Our hypothesis is that different quantiles of the distribution of billing aggressiveness may be affected by different covariates. To validate this, we investigate which of the 14 covariates are selected by the BIC procedure at each of the quantiles ( = 0.01, 0.05, 0.25, 0.50, 0.75, 0.95, 0.99).
Table 4 presents the Maximum Likelihood Estimates (MLEs) for the selected variables in the LASSO quantile regression, which quantifies the marginal effects of each variable. As expected, average standardized payment amount (avg. std. payment amount) is kept in each model. The coefficients for avg. std. payment amount display a negative relationship with aggressiveness, which gets stronger for increasing quantiles. The directions of the relationships for other MLE estimates are also in line with previous results. The MA participation rate is kept in models at five quantiles, while its value is larger for higher quantiles. The mean aggressiveness of provider type (mean agg PT) is significant and positive with increasing values for percentiles starting at 0.50. The average allowed payment amount is always positively related to aggressiveness. The marginal effects of the “GPCI Work”, “percent efmal” “services provided” (line service count) vary among quantiles where they are significant. These findings emphasize the advantages of using Quantile Regression versus OLS results in
Table 2.
4. Discussion
The primary impact of our study is to enable auditors understand billing behavior more accurately by an improved siphoning of providers of each medical procedure. Consider the OLS estimate for “standardized payment amount”. While this variable is negatively (and significantly) associated with billing aggressiveness as displayed by both OLS and quantile regression outputs; the amount of its effect varies over the quantiles of billing distribution. A unit change in the average standardized payment amount has more than 30 times impact for highly aggressive billing data compared to very low aggressiveness. Using OLS, an auditor will not be able to focus on providers accordingly considering their billing aggressiveness; the focus would be on the impact on mean aggressiveness. The second major impact is that mean aggressiveness of the provider type is not significant for quantiles corresponding to conservatively billing providers. When the auditor uses OLS; the assumption is that the whole billing distribution is impacted. Similar to the other variables, the magnitude of the impact also varies. These could result in a large number of false positives. Based on the BIC selection, we can also glean insights about very conservative providers. When the standardized payment amount for the procedure is high, its impact on aggressiveness is lower at at low quantiles. Increasing standardized payment amount have a higher negative impact at more aggressive providers. In other words, the most conservative providers, in terms of billing impact, affect provider payment systems differently. This conclusion is glaringly absent in the OLS mean regression approach.
The analysis also points to potential differences on the impact of covariates on high quantile payment amounts versus median and low quantile payment amounts. While the significant covariates for highly aggressive providers may include mean aggressiveness of provider type, average HCC score and percent of female beneficiaries; MA participation rate is significant for low quantiles.
When we analyze the most aggressive individual providers, we would typically merely red flag the top providers for audit, number depending on our resources. But now, our analysis can include the crucial insight of the (statistically significant) patients with low average HCC score offered by this provider when compared to the rest of the providers at that quantile. Moreover, this provider may be providing services in a state with high percent of female beneficiaries. In conjunction with the mean aggressiveness values among that provider type (diagnostic radiologists), such information could be used as potential clues to detect cases of upcoding.
5. Conclusions
The proposed quantile regression and variable selection framework is utilized on a combined CMS Medicare data set to determine the key factors that reveal billing aggressiveness patterns with respect to the geography, provider and procedure type. It can also help with potential audits of suspicious providers, decrease the amount of false positives, and identify the underlying determinants of upcoding.
We used data from Medicare Part B, which is a fee for service (FFS) payment system. In FFS systems, the insurer reimburses the provider for every procedure or test done on the patient. CMS has been actively experimenting with novel payment models. For instance, the alternative of bundled payments are utilized in programs such as Medicare Advantage: these plans receive a fixed monthly payment-per-beneficiary to cover potential health care costs. They are incentivized to improve the patients’ health at minimum cost. With these policy changes and novel experiments, it will be even more important to understand the billing patterns of providers in payment systems. It could be worthwhile to extend our methodology to other programs’ payment data. For instance, Medicare Advantage programs employ risk adjustment payment systems. These also have cases of upcoding; see [
45].
Our approach can also be extended for a type-procedure pair to see the differences among billing relationships. For instance, we can take a subset of all cardiologists who billed for a particular procedure (say 99213—office outpatient visit). The skewness for such billings and non-linear nature of the varying impacts of the covariates can be captured with the proposed methods.
One of the main limitations of our paper is the lack of patient-specific characteristics and data for quality of care. Although we tried to capture that impact by using average HCC scores, it is still only at an aggregate level and may not reflect a patient’s well-being. In order to capture the impact of quality of care, CMS has established a budget-neutral “value modifier”. This measure provides differential payments to providers based on the quality of care compared to the cost of care furnished to Medicare FFS beneficiaries during a performance period. Incorporating that to our analysis can improve the results.