Next Article in Journal
Relational “Dance” Between Mother and Moderately Preterm Infant at 6 and 9 Months of Correct Age: Possible Risk and Protective Factors
Next Article in Special Issue
Demographic and Geographic Characteristics Associated with the Type of Prescription and Drug Expenditure: Real World Evidence for Greece During 2015–2021
Previous Article in Journal
Psychometric Validation of the CLN2 Quality of Life Questionnaire in Participants with CLN2 Disease Treated with Cerliponase Alfa
Previous Article in Special Issue
Achieving Robust Medical Coding in DRGs Systems: Innovative Actions Adopted in Greece
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Efficiency of Primary Health Services in the Greek Public Sector: Evidence from Bootstrapped DEA/FDH Estimators

by
Angeliki Flokou
1,*,
Vassilis H. Aletras
1,2,
Chrysovalantis Miltiadis
1,
Dimitris Charalambos Karaferis
3 and
Dimitris A. Niakas
1,3
1
School of Social Sciences, Hellenic Open University, 26335 Patra, Greece
2
Department of Business Administration, University of Macedonia, 54636 Thessaloniki, Greece
3
Medical School, National and Kapodistrian University of Athens, 11527 Athens, Greece
*
Author to whom correspondence should be addressed.
Healthcare 2024, 12(22), 2230; https://doi.org/10.3390/healthcare12222230
Submission received: 12 October 2024 / Revised: 4 November 2024 / Accepted: 4 November 2024 / Published: 8 November 2024
(This article belongs to the Special Issue Efficiency, Innovation, and Sustainability in Healthcare Systems)

Abstract

:
Strengthening primary healthcare (PHC) is vital for enhancing efficiency and improving access, clinical outcomes, and population well-being. The World Health Organization emphasizes the role of effective PHC in reducing healthcare costs and boosting productivity. With growing healthcare demands and limited resources, efficient management is critical. Background/Objectives: Building on this point, this study aimed to evaluate the efficiency of PHC units across Greece, focusing on Health Centers (HCs) and Local Health Units (ToMYs). The objective was to assess their efficiency levels and identify factors contributing to observed inefficiencies. This study explores a novel research area by being the first to assess the efficiency of restructured primary healthcare facilities in Greece, utilizing 2019 data—the first year operational data became available for the newly established ToMY facilities following recent healthcare reforms. Methods: We applied a comprehensive suite of non-parametric methods, including Data Envelopment Analysis (DEA) under variable, constant, increasing, and decreasing returns to scale (VRS, CRS, IRS/NDRS, DRS/NIRS) assumptions, along with the Free Disposal Hull (FDH) model, all oriented toward output maximization. Efficiency scores were refined using bootstrapping to calculate 95% confidence intervals, and efficient units were ranked via the super-efficiency model. Outliers were identified and removed through the data cloud algorithm. For the first time at this scale, the final sample included the vast majority of PHC facilities in Greece—234 Health Centers and 94 Local Health Units—with inputs categorized into three human resource types: medical, nursing/paramedical, and administrative/other staff. Outputs encompassed scheduled visits, emergency visits, and pharmaceutical prescription visits. This diverse and comprehensive application of DEA methods represents a novel approach to evaluating PHC efficiency in Greece, with potential relevance to broader healthcare contexts. Results: The analysis revealed significant inefficiencies and differences in technical efficiency between HCs and ToMYs. HCs could nearly double their outputs (VRS score: 1.92), while ToMYs could increase theirs by 58% (VRS score: 1.58). Scale efficiency scores were closer, with HCs slightly more aligned with their optimal scale (1.17 vs. 1.20 for ToMYs). Conclusions: There is significant potential to improve efficiency in PHC, with variations depending on unit characteristics and regional differences. This evaluation provides a foundation for policymakers to identify areas for improvement and enhance the overall performance of healthcare services in Greece.

1. Introduction

As stated in the Astana Declaration, primary healthcare (PHC) is the cornerstone of health systems, essential for achieving universal health coverage and meeting the health-related Sustainable Development Goals of the United Nations [1]. According to the World Health Organization [2], PHC aims to ensure the highest possible level of health and well-being for society based on people’s needs and encompasses a wide range of services, from information and prevention to treatment and rehabilitation. Strengthening PHC services is a vital investment to improve health equity, access, clinical outcomes, and population well-being while reducing healthcare and social costs and enhancing population productivity [3].
Sufficient funding and the reinforcement of PHC are crucial objectives for health systems as they play key roles in enhancing healthcare efficiency, service quality, resource utilization, and equitable access for all citizens. Historically, PHC in Greece has been marked by a lack of centralized planning and comprehensive policies, leading to fragmented healthcare services; understaffing of specialized personnel; significant shortages in diagnostic equipment, especially in rural areas; limited community-based prevention and health promotion services; poor coordination with secondary care; and high levels of public dissatisfaction [4,5,6,7].
The absence of mandatory referral (gatekeeping) mechanisms from primary to secondary care and the failure to establish a functional connection between PHC services and hospital care result in systemic inefficiencies, further contributing to public dissatisfaction [7]. In 2010, during the economic crisis, PHC became a target for reforms as part of the economic adjustment program. The reforms aimed to reduce and rationalize costs while improving efficiency and effectiveness [8,9,10,11].
Law 3918/2011 [12] established the National Organization for the Provision of Health Services (EOPYY) by merging primary social security funds (IKA, OGA, OAEE, OPAD). Law 4238/2014 [13] introduced the National Primary Care Network (PEDY), which integrated IKA’s PHC services into the National Health System (ESY), transforming EOPYY into a healthcare purchaser, and, additionally, all public PHC facilities were placed under the jurisdiction of the Regional Health Authorities (RHA/YPE) [8,14,15]. Later, Law 4486/2017 [16] renamed PEDY Health Units as Health Centers and established Local Health Units (ToMYs) as decentralized branches of these Health Centers, aiming to provide holistic, efficient, and patient-centered care tailored to the population’s needs.
Currently, in Greece, primary healthcare (PHC) services in the public sector are provided through structures of the Greek National Health System (ESY), including Health Centers (HCs) and their affiliated Regional Medical Offices, Multipurpose Regional Medical Offices, Specialized Regional Medical Offices, Local Medical Offices, Local Health Units (ToMYs), and outpatient clinics in hospitals [17].
Between 2010 and 2019, there was a 24.5% decrease in medical personnel and a 29.5% reduction in non-medical personnel in Health Centers, contrasted by a 26.6% increase in nursing personnel [17,18]. Ambulatory care expenditures decreased by 41.68%, with public spending down by 21.03% and private spending down by 52.25%, reflecting declining disposable incomes [19,20,21,22]. Despite these reductions, private expenditures still constituted a larger share than public spending.
As healthcare needs grow and resources remain constrained, efficient management becomes crucial. Modern methods for assessing the efficiency of PHC units, such as Data Envelopment Analysis (DEA), offer valuable tools for policymakers. DEA is a non-parametric method widely used to evaluate the efficiency of healthcare units, including hospitals and PHC facilities [23,24,25,26,27,28,29,30,31,32,33].

1.1. Related Studies

In Greece, studies utilizing Data Envelopment Analysis (DEA) have primarily focused on evaluating the efficiency of primary health facilities, namely, the NHS’s Health Centers (HCs) and IKA’s healthcare centers. [4,34,35,36,37]. One early study [35] analyzed 133 of the 242 IKA’s healthcare centers, which, at that time, served as the primary public health insurance provider. The analysis employed output data from 1998 (including patient visits and laboratory tests) and staffing data from September 1999. The results were categorized by the population served, with groups defined as up to 10,000, 10,000–25,000, 25,000–50,000, and over 50,000. The findings indicated that units equipped with technological resources for laboratory and radiological examinations demonstrated higher efficiency. Moreover, mid-sized units performed better than larger or smaller units. IKA HCs serving populations between 10,000 and 50,000 individuals were notably efficient, with none in the 25,001–50,000 range scoring below 50% efficiency.
Another study [4] evaluated the efficiency of 194 primary healthcare facilities using data from 2004, which included 103 NHS Health Centers (HCs), representing 67.9% of the total, and 91 IKA healthcare centers, representing 57.8% of the total. The facilities were categorized by the population served into three groups: small (<15,000), medium (15,000–30,000), and large (>30,000). Additionally, they were classified by location as urban/semi-urban or remote/island-based. The results showed that IKA healthcare centers were generally more efficient than NHS Health Centers (84.9% vs. 70.1%). Smaller units outperformed medium and large ones (84.2%, 72.4%, and 74.3%, respectively), and remote/island units were more efficient than urban centers (81.1% vs. 75.7%). In terms of scale efficiency, IKA units again outperformed NHS units (89.7% vs. 85.9%), with large and urban units showing higher efficiency scores (96.3% and 91.9%, respectively) than medium/small and remote units (90.9%–75.9% and 75.3%). These findings suggested a potential influence of facility size, location, and administrative affiliation on primary healthcare efficiency.
A separate study [36] assessed the efficiency of 152 NHS Health Centers using data from 2005. The findings indicated that 61 centers (39%) were technically efficient. Regarding scale efficiency, 96 centers (62.7%) demonstrated increasing returns to scale, while 15 (9.8%) exhibited decreasing returns. The results indicated that larger and urban Health Centers performed more effectively, whereas smaller and rural Health Centers faced scale inefficiencies.
A recent study [34] evaluated the efficiency of 42 Health Centers within the sixth Regional Health Authority, using data from 2010. The average efficiency scores were 0.57 for overall technical efficiency (TE), 0.67 for pure technical efficiency (PTE), and 0.87 for scale efficiency (SE). Only two Health Centers reached full technical efficiency, while nine achieved full pure technical efficiency and five attained full-scale efficiency. The study identified technical inefficiency as the primary source of overall inefficiency, suggesting that these Health Centers could potentially increase output by an average of 33% without additional resources by optimizing current production capacities.
Given these past findings, it is essential to reassess the current efficiency level of Greece’s primary healthcare (PHC) structures to evaluate resource utilization, identify inefficiencies, assess recent reforms, examine regional variations, and inform new policy recommendations.

1.2. Application Context

This study, based on 2019 data, offers the only post-crisis, pre-COVID-19 efficiency assessment of Health Centers (HCs) and Local Health Units (ToMYs) in Greece. The year 2019 is significant as it marks the first time operational data became available for the newly established ToMY facilities. Additionally, the number of HCs increased substantially following Law 4486/2017, which reclassified PEDY Health Units as Health Centers and established the ToMY units. The significance of this research is underscored by the ongoing healthcare reforms aimed at implementing a new national health map, with a primary focus on strengthening the PHC sector to reduce pressure on overburdened secondary facilities, conserve resources, and enhance the quality and accessibility of healthcare services.
Identifying the potential for output expansion, specifically the additional capacity for service delivery that Health Centers (HCs) and ToMYs could provide, is a significant goal of this study. This research is the first to quantify the “additional capacity” of the country’s primary healthcare structures using existing resources, employing a comparative evaluation approach.
To our knowledge, this is the only study that (a) includes the largest sample of Health Centers to date, covering over 75% of all centers with full geographic distribution across the country; (b) represents the first analysis of ToMY structures, examining nearly all units; and (c) uses the most recent data. This study’s methodological innovation extends beyond a conventional, naïve application of DEA models, employing a suite of methods to address the inherent limitations of basic DEA applications. This extensive methodological framework provides a detailed presentation of each method’s theoretical underpinnings and implications, equipping managers and policymakers with a deeper understanding of these analytical techniques.
This study addresses potential biases by identifying and excluding outliers and applying the bootstrap method to assess the robustness of results at specific statistical significance levels. Additionally, the super-efficiency method is used to classify efficient units and identify those with unique production profiles that warrant closer examination, while the Free Disposal Hull (FDH) model highlights indisputable benchmark units that outperform others across all dimensions. Furthermore, this study is the first to report results across regions stratified by levels of urbanization, using both Eurostat [38,39] and FAO [40] typologies, and by geographical area as defined by the administrative boundaries of the seven Health Regional Authorities (RHAs/YPE) in the Greek health system.
Key motivations for this approach include (a) assessing efficiency at the national level as well as within distinct geographic areas to determine the potential for additional services within the current resource framework, which could alleviate pressure on secondary care facilities, and (b) identifying individual efficiency scores for targeted interventions at the unit level.
The remainder of this paper is structured as follows: Section 2 details the methods and provides a brief overview of the theoretical background of the techniques employed to measure efficiency, including the software tools used, data selection process, model specification, DEA application, and urbanization and administrative classification levels chosen. Section 3 presents the results of the analysis; Section 4 discusses the findings, limitations, and policy implications; and, finally, Section 5 concludes with a summary and recommendations.

2. Materials and Methods

2.1. Methods

This research used cross-sectional data from 2019 to assess the relative efficiency of primary healthcare facilities in Greece. This study applied Data Envelopment Analysis (DEA) to benchmark Health Centers (HCs) and Local Health Units (ToMYs) based on resource utilization and service outputs. The non-parametric Data Envelopment Analysis (DEA) method was employed based on the assumptions of the constant returns to scale (CRS), variable returns to scale (VRS), increasing returns to scale (IRS/NDRS), and decreasing returns to scale (DRS/NIRS) models, as well as the Free Disposal Hull (FDH) model, all of them oriented toward output maximization. Efficiency scores were adjusted using a bootstrap technique to derive 95% confidence intervals. A suitable bootstrap method was also utilized to determine the type of global returns to scale for the underlying technology. Comparative evaluation and ranking of the efficient units were performed using the super-efficiency model. Additionally, the results were presented as aggregated scores (weighted averages) across various geographical areas and the seven Regional Health Authorities (RHAs/YPE) in the country and statistically analyzed to identify any significant differences.

2.1.1. Data Envelopment Analysis (DEA)

Data Envelopment Analysis (DEA) is a non-parametric method introduced by Charnes et al. in 1978 [41]. It utilizes linear programming to benchmark the efficiency of a group of similar production units, known as decision-making units (DMUs). These DMUs use a common set of inputs to produce a common set of outputs. The evaluation is relative, meaning that units showing no potential for improved efficiency are assigned a score of 1, indicating 100% efficiency. For all other units, a score below or above 100% is assigned. This score reflects, for each unit, the percentage by which inputs must be reduced (while maintaining current output levels) or the percentage by which outputs must be increased (while maintaining current input levels) to achieve 100% efficiency. In the former case, the efficiency is input-oriented, and, in the latter, it is output-oriented.
In geometric terms, each production unit can be represented as a point within the input–output space. Units with the same output levels can be compared based on their distance from the origin if they are positioned along the same radial direction in the input space. The unit closest to the origin is the most efficient, receiving an efficiency score of 1 (or 100%), while units farther from the origin are less efficient, with scores below 100%. This efficiency score is calculated as the ratio of the distance of the most efficient unit to the origin compared to that of the less efficient unit. Similarly, for units with the same input levels along the same radial direction in the output space, the unit farthest from the origin is deemed the most efficient, also receiving a score of 1. In this case, units closer to the origin have efficiency scores exceeding 100%. This score is determined by the ratio of the efficient unit’s distance from the origin to that of each less efficient unit. The assessment of efficiency mentioned above relies on what is known as the Farrell measure of efficiency [42].
However, this method can only be applied when there are at least two units along the same radial direction in the input or output space, sharing identical input or output levels. Only then is it possible to compare their distances and calculate efficiency scores. However, in practice, given that samples under evaluation consist of only a limited, discrete number of production units, it is highly improbable that units will align perfectly along the same radial direction in the input or output space, considering the infinite number of possible directions. To enable meaningful comparisons, it becomes necessary to determine the most efficient point along each radial direction.
This can be achieved by applying certain assumptions that allow transitioning from the discrete set of production possibilities in the sample to a complete (dense) set. From this full set, the most efficient operating point for each radial direction is identified. Geometrically, this corresponds to defining the feasible production possibilities region and, consequently, its outer boundary, known as the efficiency or best-practice frontier, which separates feasible from infeasible production possibilities. The points on this frontier represent the most efficient units. The intersection of this surface with the radial directions of the units being evaluated identifies the benchmarks, against which the efficiency scores of other units can be measured. The assumptions that are typically made include the assumption of the free disposability of inputs and outputs, the convexity of the production possibility set, and the type, if any, of prevailing returns to scale (non-increasing, non-decreasing, or constant). The construction of the production possibility set P under the aforementioned assumptions is derived mathematically as follows [43,44]:
Consider a set of n decision-making units (DMUs), where each unit p utilizes varying amounts of different m inputs
x 1 p , x 2 p , , x m p
and produces varying amounts of the total of s outputs
y 1 p ,   y 2 p ,   , y s p
where inputs and outputs are considered non-negative, without all being zero (i.e., at least one input and at least one output are strictly positive numbers) [45]:
x i p   0 ,   y j p   0     i , j , p
i = 1 m x i p > 0 ,   j = 1 s y j p > 0
Each such decision-making unit p is called an activity and will henceforth be generally denoted by the pair x , y or the pair x r , y r (when there is a need to refer to the specific index r that indicates a particular DMU).
Starting from the set of the n total units mentioned above, for which the corresponding n productive activities are (empirically) known
x r , y r , r = 1 , 2 , , n
a more comprehensive set of feasible production possibilities is constructed by incorporating all additional production possibilities derived from one or more of the following assumptions:
Assumption of Free Disposability of Inputs and Outputs: If an activity x , y belongs to P, then any activity x , y with x x and y y will also belong to P.
Assumption of Convexity of Set P: Any weighted average of activities in P also belongs to P
x , y P ,   x , y P     λ x , y + 1 λ x , y P ,   λ 0 , 1
Assumption of the Type of Returns to Scale: If an activity x , y belongs to P then the activity λ x , λ y = λ x , y also belongs to P, where
i.
λ 0 , 1 under the assumption of non-increasing returns to scale (NIRS)
ii.
λ 1 , + under the assumption of non-decreasing returns to scale (NDRS)
iii.
λ 0 , + under the assumption of constant returns to scale (CRS)
Based on these assumptions, the production possibility set P is given by the following construct:
P P P S = x , y   R + m × R + s |     λ r Λ ( P P S ) :   x r = 1 n λ r x r ,   y r = 1 n λ r y r
where PPS values are given by
P P S C R S ,   N I R S , N D R S , V R S ,   F D H
and the sets Λ(PPS) are defined according to the specifications provided in Table 1 below:
The input-oriented (-IO) efficiency of unit p is calculated by solving the next linear programming problem
T E p P P S I O = m i n θ p : θ p x p r = 1 n λ r p x r ,   y p     r = 1 n λ r p y r ,   λ r p Λ P P S
whereas the output-oriented efficiency (-OO) is derived respectively by the following problem:
T E p P P S O O = m a x Φ p : x p r = 1 n λ r p x r ,   y p Φ p     r = 1 n λ r p y r ,   λ r p Λ P P S
The assumptions listed in the first three rows of Table 1 correspond to the three DEA models, each defined by their assumption regarding returns to scale: DEA-CRS (constant returns to scale), DEA-NIRS (non-increasing returns to scale), and DEA-NDRS (non-decreasing returns to scale). The model in the fourth row is known as DEA-VRS (variable returns to scale), while the final row represents the FDH (Free Disposal Hull) model. For completeness of terminology, CRS and VRS are also referred to as CCR and BCC [46], with their scores representing overall efficiency and pure technical efficiency, respectively.

2.1.2. Free Disposal Hull (FDH)

The key distinction between FDH and DEA models lies in their assumptions about the production possibility set. In the FDH model [47], the sole assumption is the free disposal of inputs and outputs, meaning that the production possibility set is determined strictly by the actual production activities of the units being evaluated. Consequently, FDH benchmarking is based on comparison with a specific, existing unit rather than a theoretical production activity created from a linear combination of other units. In contrast, DEA benchmarks against a composite unit formed by aggregating portions of multiple existing units.
This characteristic is reflected in the set of possible values, denoted by Λ(FDH). Consequently, the only permissible linear combinations used for benchmarking in the FDH model are those involving a single existing unit with a coefficient of unity. This is because only in this manner does the condition λ r N + | r = 1 n λ r = 1 hold.
From a managerial perspective, it is important to note that the identification of a set of dominant production units through the presentation of actual implemented production activities that are demonstrably more efficient lends credibility to the inefficiency scores, which often lack such substantiation when reference is made to an abstract frontier [48].
It should also be noted that, specifically for FDH, the calculation of efficiency can be performed analytically, making it unnecessary to solve the linear programming problem. Specifically, the FDH efficiency score of unit p, with an input orientation (-IO) or output orientation (-OO), can be calculated by solving the following simple minimax problems [44]:
T E p F D H I O = min r :   y r y p   max i = 1 , 2 , , m x i r x i p
T E p F D H O O = max r :   x r x p   min j = 1 , 2 , , s y j r y j p

2.1.3. Peer Units

By solving the above linear programming problems for each unit p, the (input or output oriented) efficiency score, ( θ p ) or ( Φ p ), is calculated along with the weighting factors λ r p , which form a linear combination of fully efficient units [44].
r = 1 n λ r p x r , r = 1 n λ r p y r
This linearly combined activity corresponds to the most efficient point against which, if present, the evaluated unit p would ultimately be compared. In other words, this is the point on the efficient frontier where the production point under evaluation is radially projected. In this context, the units of the sample that have been assigned positive weighting factors are referred to as the “peer units” of unit p:
P e e r   U n i t s   o f   p = x r , y r ,   r   1 , 2 , , n   |   λ r p > 0

2.1.4. Super Efficiency

A limitation of the DEA evaluation method is that all units forming the efficiency frontier receive a score of 1, making it difficult to rank efficient units. To differentiate between them, the super-efficiency model, introduced by Andersen and Petersen [49], calculates how much a unit can “underperform” while still remaining efficient. This margin is based on either the maximum possible increase in inputs or decrease in outputs until the unit’s score reaches 1. Larger margins indicate greater relative efficiency.
In the super-efficiency model, each unit is excluded from the efficiency frontier calculation. If efficient, the unit’s removal shifts its position, resulting in a score either greater or less than 1. This score allows for the ranking of efficient units based on their distance from the frontier. In output-oriented models, a score below 1 means that the unit can produce more output than other efficient units, with larger deviations indicating higher super-efficiency [49,50,51].
Therefore, the super-efficiency DEA model enhances traditional DEA by ranking efficient units, detecting outliers, identifying untapped potential, and providing more accurate benchmarks for improvement. It offers decision-makers detailed insights into both the robustness and the vulnerabilities of their units, helping them optimize resource allocation, improve operational efficiency, and set more effective performance targets for both efficient and inefficient units.
To compute super-efficiency scores, the two linear programming problems, as well as the sets Λ(CRS), Λ(NIRS), Λ(NDRS), and Λ(VRS), are reformulated, with the sole modification being the exclusion of the unit being evaluated, denoted as p. Practically, this means that in all summations where the index r ranges from r = 1 to r = n , the value corresponding to the evaluated unit p is excluded ( r p ) .
r = 1 n r = 1 r p n

2.1.5. Weighted Average Efficiency Score

When assessing the efficiency of production units, it is often valuable to consider not only the individual efficiency scores but also the overall efficiency of all units, represented by a comprehensive score for an average representative unit. Using a simple arithmetic mean of the individual efficiency scores can be misleading [52] and, therefore, a weighted mean is recommended. For an output-oriented model with a sample of n units using s outputs, the weighted mean is calculated as follows:
T E ¯ a v g P P S O O = p = 1 n w p T E p P P S O O
where the weighting w p of each unit p is calculated as
w p = 1 s y 1 p r = 1 n y 1 r + y 2 p r = 1 n y 2 r + + y s p r = 1 n y s r = 1 s j = 1 s y j p r = 1 n y j r p = 1 , 2 , , n

2.1.6. Outliers

Outliers are observations that deviate substantially from the majority of observations in a sample dataset, either being much higher or lower than typical values, often falling outside the “expected range” of the data. These anomalies may result from natural variability, measurement errors, or rare occurrences and can significantly influence the accuracy of models, particularly in the context of Data Envelopment Analysis (DEA), where they can distort efficiency estimates and skew results. Outliers in a DEA dataset can significantly distort the construction of the corresponding best-practice frontier, which, in turn, can heavily influence the efficiency scores of certain units. As such, it is crucial to identify and remove these outliers. Although there are several methods for detecting outliers in non-parametric frontier models, no single method can identify all outliers under any circumstances. For this reason, it is often recommended to use multiple methods when analyzing data. In this study, the data cloud method, as outlined by Wilson [53], was chosen as the primary approach for detecting outliers. This method assesses the geometric volume D formed by the DMUs and analyzes how sensitive this volume is to changes when subsets of DMUs—such as individual units, pairs, or triplets—are excluded from the dataset. A group of r DMUs are considered potentially outlying if their removal results in a significant change in the volume.

2.1.7. Returns to Scale Assumption

In this study, the method proposed by Simar and Wilson [54] was utilized to identify the appropriate returns to scale type. This approach, which employs bootstrapping, consists of two sequential tests. The first test (test 1) evaluates the null hypothesis that the production function has globally constant returns to scale (CRS) against the alternative hypothesis that the technology exhibits globally variable returns to scale (VRS):
Test 1: [H0: technology is globally CRS] versus [H1: technology is VRS].
If H0 is rejected, the second test is conducted. Test 2 assesses whether the technology displays globally non-increasing returns to scale (NIRS) compared to the alternative of globally VRS:
Test 2: [H’0: technology is globally NIRS] versus [H’1: technology is VRS].
If both H0 and H’0 are rejected, this indicates that the underlying technology demonstrates globally variable returns to scale (VRS). Among the various test statistics available for these tests, the following ratios of sums (across all units) were chosen for application:
t s 1 = i = 1 n T E i C R S i = 1 n T E i V R S , t s 2 = i = 1 n T E i N I R S i = 1 n T E i V R S
These two statistics can be interpreted as representing the ratios of the average distances from CRS to VRS frontiers (for test 1) and from NIRS to VRS frontiers (for test 2).

2.1.8. Scale Efficiency: Most Productive Scale Size (MPSS)

The Most Productive Scale Size (MPSS) is defined [55] as a production possibility x 0 , y 0 P whose average productivity of its input–output mix is maximized. At the MPSS, the average output is at its peak, and any deviation from this point of operation results in reduced average output. Formally, x 0 , y 0 P is an MPSS for its input and output mix if, and only if, for all α x 0 , β y 0 P   α β
The loss from not operating at the MPSS is quantified by scale efficiency (SE), which is calculated as
S E = T E C R S T E V R S
The relationship between MPSS and scale efficiency is captured by the following proposition: A production is an MPSS for its input and output mix if, and only if, T E V R S   = 1 and S E = 1 . Therefore, if x 0 , y 0 is an MPSS, then constant returns to scale prevail at x 0 , y 0 .
The Most Productive Scale Size (MPSS) is the point where constant returns to scale prevail, and it reflects how close a unit is to achieving its optimal scale of operation. Units with increasing returns to scale (IRS) fall before the MPSS, while those with decreasing returns to scale (DRS) come after it on the efficiency frontier. It is important to note, however, that the MPSS is specific to the particular input and output mix being evaluated.

2.1.9. Bootstrap

A notable criticism of DEA is its underlying assumption that the entire distance of a DMU from the efficiency frontier is attributed solely to inefficiency. In practice, this distance also captures noise, as input and output data are typically prone to errors. Since DEA uses non-statistical linear programming methods, it cannot evaluate the accuracy of the efficiency estimates. Additionally, the estimated production technology is only a partial representation of the true, unknown technology, meaning that DEA efficiency scores tend to be higher than the actual values. This inherent upward bias in DEA scores is a structural issue. To mitigate this problem, Simar and Wilson [56,57] proposed a bootstrap-based methodology that helps assess the statistical properties of DEA estimators. In this study, their bootstrapping method was applied to compute bias-corrected efficiency scores and provide estimates of their confidence intervals.
Figure 1 presents the methodological workflow for the efficiency analysis, including outlier detection, returns-to-scale testing, DEA with various scale assumptions, super-efficiency, FDH, statistical validation via bootstrap, and statistical inference using Kruskal–Wallis tests.

2.2. Software Tools

For this study, existing software applications were used to solve specific models (EMS, DEA SOLVER, R packages—“Benchmarking” and “FEAR”) and SPSS 29.0 (IBM Corp., Armonk, NY, USA) for statistical tests. Additionally, custom Excel routines were developed where needed.

2.3. Data and Model Specification

2.3.1. Sample

The study sample comprised Health Centers and ToMYs from 2019. Primary data were sourced from the Ministry of Health (following a formal request) for all facilities, totaling 307 Health Centers and 117 ToMYs nationwide. Facilities with incomplete input/output data (66 Health Centers and 17 ToMYs) were excluded, resulting in a sample of 241 Health Centers and 100 ToMYs, representing 78.5% and 85.5% of the total units, respectively.
The data were then examined through distribution analysis and pairwise correlations, as illustrated in the scatter plot matrix in Figure 2. A preliminary review of these plots revealed the presence of outlier units, which, as discussed extensively in a later section, were excluded from the analysis to ensure that the remaining data were representative and robust for subsequent analysis. Using an appropriate outlier detection algorithm, 7 Health Centers and 6 ToMYs were identified and removed, resulting in a final sample of 234 Health Centers and 94 ToMYs, representing 76.2% and 80.3% of the original totals, respectively. These sample sizes were considered more than satisfactory, offering both comprehensive and current data coverage with extensive geographic representation across all regions of the country.
The flowchart in Figure 3 outlines the steps to derive the final sample sizes for Health Centers (HCs) and Local Health Units (ToMYs), beginning with the initial dataset of 424 units. Units were excluded due to missing input/output data, and additional outliers were removed, resulting in a final sample of 234 HCs and 94 ToMYs.

2.3.2. Inputs–Outputs

As evidenced in the pertinent literature, the appropriate selection of inputs and outputs is of paramount importance for ensuring the validity and reliability of results when utilizing Data Envelopment Analysis for benchmarking purposes.
Measuring efficiency in primary healthcare (PHC) is particularly challenging due to its holistic role, providing long-term, continuous support to individuals and families [32]. Unlike hospitals, where admissions and discharges create defined boundaries, PHC operates as an open, community-based system with fluid and often indefinite service parameters, complicating economic modeling [58]. Due to limited data on long-term health outcomes, researchers frequently adopt “activity-oriented” models, using metrics such as counts of visits or consultations, vaccinations, prescriptions, minor surgeries undertaken, referrals, and so on as proxies for output [33,59]. In Greece, assessing PHC efficiency is further complicated by variability in service delivery and the absence of a standardized framework. Greek PHC services vary widely, with little consensus on standardized definitions for resources, procedures, or outputs [34]. Different stakeholders often hold conflicting views on what constitutes effective care, while ambiguity in targets and resource definitions further complicates efficiency assessments. Consequently, input and output selection for benchmarking Greek PHC must be meticulously adapted to capture this variability and complexity.
In this study, the inputs selected for benchmarking Greek PHC facilities included (i1) medical staff, (i2) nursing and paramedical staff, and (i3) administrative and other support staff, each representing a distinct component of healthcare resources. The outputs measured were (o1) scheduled patient visits, (o2) emergency patient visits, and (o3) pharmaceutical prescription visits, capturing key activities and service demands. This selection aimed to provide a robust and contextually relevant measure of efficiency, mindful of the unique complexities and operational variability inherent in Greek primary healthcare.
The selected variables in this study were deemed to sufficiently represent the production process of PHC services and were consistent with those employed in similar studies [4,33,34,35,36,59,60,61,62], as shown in Table 2.

2.3.3. Discriminatory Power

With respect to discriminatory power, this study fully satisfied the empirical rule for sample adequacy in DEA. According to this rule of thumb [63], the number of decision-making units (DMUs) should exceed either three times the sum of inputs and outputs or twice their product. With three inputs and three outputs, the minimum threshold qs 18 DMUs (3 × (3 + 3) = 18, 2 × (3 × 3) = 18). The samples analyzed—241 Health Centers (HCs) and 94 Local Health Units (ToMYs)—exceeded this requirement by factors of 13 and 5.4, respectively, providing the DEA model with sufficient discriminatory power to avoid result distortion and minimize the likelihood of misclassifying inefficient units as efficient.
To rank fully efficient units, the super-efficiency model was applied [49] as an indirect method of enhancing the discriminatory capability of standard DEA. Although Principal Component Analysis (PCA) could potentially reduce the number of variables—and thereby decrease the number of units deemed efficient, enhancing discrimination [64]—this approach was not selected for several reasons. First, it was essential to retain all relevant variables to capture the unique contributions of each. The three selected input variables represented distinct components of human resources, each included in disaggregated form due to their complementary yet fundamentally different roles. Second, even after reducing the number of fully efficient units, a subset of efficient units would still require relative ranking. Third, and most importantly, PCA+DEA has inherent limitations: PCA often produces both positive and negative component values, which are incompatible with DEA’s requirement for non-negative data. A workaround—adding the absolute value of the minimum component plus one to each value [65]—would require the DEA models to be translation-invariant. However, this condition was not met by the CRS model, and in the output-oriented VRS model, translation-invariance applied only to the input side [66]. Consequently, in this study, PCA could only be applied to reduce the input dimensions by one, as the cumulative variance of the first two principal components accounted for 86.9% of the total. This reduction would have altered the input space while leaving the output space unchanged, complicating meaningful interpretation and, most importantly, limiting comparability with standard (non-PCA-based) DEA models from other studies, whether past or future.
Moreover, our results, presented in the next section, reveal substantial inefficiencies across DMUs, with mean efficiency scores indicating room for improvement. Given these findings, further refinement to increase discriminative power is unnecessary at this stage. Such adjustments may become relevant in future analyses, once the primary inefficiencies have been addressed.

2.3.4. Orientation

In terms of selecting between input minimization (reflecting a financial perspective) or output maximization (reflecting a social perspective), an output-oriented approach was chosen to identify potential areas for improvement, given the well-documented challenges of resource scarcity and the increasing demand for primary health services. In this context, it is crucial to evaluate current service levels relative to the available inputs. Recent years have seen a focus on strengthening primary care services for prevention and health promotion at the community level, with the goal of progressively developing an integrated healthcare model. Thus, assessing the capacity to expand service provision within the existing network of Health Centers (HCs) and Local Health Units (ToMYs) is essential for any future system redesign. Moreover, it is widely acknowledged that these healthcare units inherently aim to maximize population coverage, with a significant portion of their activities focusing on health promotion, disease prevention, and control. The objective, therefore, is to serve a larger proportion of the population with the available resources, rather than maintaining the same output levels with fewer resources [34,59,61,62,67,68,69,70,71].

2.4. Classification by Urbanization Levels

Most of the results for the Health Centers (HCs) will be presented based on urbanization levels, classified according to both Eurostat [38,39] and FAO [40] typologies, as well as their distribution across Regional Health Authorities (RHAs), whereas the results for the Local Health Units (ToMYs) will be shown solely according to their division among the RHAs.
Eurostat and FAO both classify urbanization levels into three categories, but with different criteria. Eurostat categorizes regions with over 1500 inhabitants per square kilometer as urban (‘1’), those with 300–1500 inhabitants per square kilometer as intermediate (‘2’), and those with fewer than 300 inhabitants per square kilometer as rural (‘3’). FAO classifies urban areas as settlements with over 5000 people, peri-urban areas as those with 1000 to 5000 people, and rural areas as having fewer than 1000 people, typically focused on agriculture. Both classifications aim to analyze population distribution and inform planning and development strategies.

3. Results

3.1. Input–Output Summary Statistics

Descriptive statistics of the input and output variables for Health Centers (HCs) and Local Health Centers (ToMYs) that were included in the final sample are presented in Table 3. For HCs, which included 234 units, the mean values indicated that the average HC employs 11.4 medical staff, 15.7 nursing/paramedical staff, and 8.2 administrative personnel. In terms of outputs, the average HC manages 15,992.4 scheduled visits, 9255.6 emergency visits, and 10,409.4 prescription-related visits. The high standard deviations across inputs and outputs suggested significant variation among HCs, with larger centers vastly outstripping smaller ones. For instance, while the median number of scheduled visits was 11,917, the maximum reached a notable 75,381 visits, indicating the presence of some very large centers. This was further evidenced by the range values, with scheduled visits varying by as much as 75,040 across different HCs. The percentage distribution of inputs and outputs across regions, categorized by Eurostat’s classification (‘1’, ‘2’, ‘3’) and FAO’s classification (rural, p[eri]-urban, urban), as well as by Regional Health Authorities (RHA 1-7), are depicted in Figure 4.
In contrast, ToMYs, which consist of 94 units, operate on a smaller scale. The average ToMY employs 2.6 medical staff, 3.1 nursing/paramedical staff, and 2.5 administrative staff. The output averages for scheduled visits (4944.7), emergency visits (1025.9), and prescription-related visits (2860.5) are significantly lower than those for HCs. The lower standard deviations and range values for ToMYs reflected more consistency and less variability among these units, suggesting that they are relatively homogeneous in terms of staffing and patient visits. The maximum number of scheduled visits for ToMYs was 14,161, significantly lower than for HCs, implying a smaller operational scope.
Key observations include the stark difference in scale between the two types of healthcare entities, with HCs serving many more patients and employing larger staff teams. HCs also show much greater variability, as indicated by the larger standard deviations and the presence of outliers, especially for scheduled patient visits. ToMYs, though smaller, still contribute substantially in terms of patient care, particularly in terms of prescription visits. The sum values reveal that across all units, HCs manage a total of about 3.7 million scheduled visits, compared to 464,801 for ToMYs, further illustrating the scale disparity. In summary, the statistical data in the table underscore the distinct operational profiles of the larger, more diverse Health Centers (HCs) compared to the smaller, more homogenous Local Health Units (ToMYs). These differences justified treating them as two separate groups in our analysis, as their underlying technologies appeared to differ significantly.

3.2. Global Returns to Scale

Regarding the type of global returns to scale, the application of Simar and Wilson’s two-part test (Table 4) resulted in the rejection of both the H0 (ts = 1.1526, p-value < 0.001) and H’0 (ts = 1.0256, p-value = 0.005) null hypotheses at the 1% significance level, following 2000 bootstrap replications. Consequently, variable returns to scale (VRS) was considered the appropriate returns to scale type for the underlying technology of the Health Centers under evaluation. Nonetheless, some results under the constant, non-increasing, and non-decreasing returns to scale (CRS, NIRS, NDRS) assumptions are also provided for comparison and completeness.

3.3. VRS Results

Table 5 presents DEA VRS results for HCs across various regions, highlighting their pure technical efficiency. The scores indicate how much the output of each unit could be expanded using the same level of inputs, with values closer to 1 representing higher efficiency. The table also includes mean values for uncorrected and bootstrapped (bias-corrected) efficiency scores, along with weighted means, offering insight into the efficiency across different regions and health authority areas.
The overall uncorrected mean efficiency score across all regions was 1.97, indicating that, on average, units could nearly double their outputs while maintaining the same input levels. After applying bootstrapping, the bias-corrected mean efficiency score rose to 2.31, as expected, since standard DEA scores tend to be overly optimistic. A maximum score of 8.04 revealed significant inefficiency in some units, suggesting substantial potential for output increases. The weighted mean efficiency score of 1.92 reflected slightly better performance when output adjustments were considered. In Figure 5b, the ordinary, bootstrapped, and weighted mean VRS scores are depicted across all subdivisions, alongside the CRS scores shown in Figure 5a, while the boxplots in Figure 6 provide a visual representation of the distribution of bootstrapped VRS scores, illustrating the median, quartiles, and potential outliers.
Examining the regions, Region ‘1’ had the lowest weighted mean efficiency score, with 1.69, implying that its units could enhance their outputs by 69%. The region’s lower maximum value (5.05) and smaller standard deviation suggested more consistent performance. In contrast, while Region ‘3’ had a weighted mean efficiency score of 2.00, indicating higher inefficiency, 57.9% of the efficient units were still located in this region. However, the maximum value of 8.04 for Region ‘3’ highlighted notable inefficiencies among some units.
Regarding urbanization levels, urban areas exhibited the highest efficiency, with a weighted mean score of 1.52. The low standard deviation and maximum score of 4.18 reflected more consistent performance. Peri-urban areas showed higher inefficiency, with a weighted mean score of 1.94 and greater variation in performance, while rural areas exhibited the highest inefficiency, with a weighted mean score of 2.42. This indicates significant potential for output improvement across all non-urban areas.
At the level of Regional Health Authorities, RHA 1 showed the highest efficiency with a mean score of 1.30, meaning that outputs could be increased by 30%. However, this region had only one efficient unit, indicating that few units reached top performance. In contrast, RHA 6 had a mean score of 1.88, suggesting great potential for output improvement, even though it hosted 42.1% of all efficient units. RHA 4 had the highest bootstrapped mean score (2.67), indicating greater inefficiency and variability among its units.
Similarly, Table 6 shows the corresponding figures for ToMYs across the seven Regional Health Authorities (RHAs). Overall, the mean efficiency score was 1.74, indicating that, on average, units could increase their outputs by 74%. The bootstrapped mean rose to 1.98, and the maximum score of 10.55 highlighted significant inefficiencies in certain units. The weighted mean of 1.58 reflected slightly better performance when adjusted for outputs. Since the weighted mean efficiency scores accounted for output levels, the lower score suggests that, on average, larger or higher-output units tend to be more efficient than smaller ones.
The standard deviation of 1.60 for RHA 5, one of the highest among RHAs, highlighted considerable variability in efficiency across its units. This suggested that while some units in RHA 5 performed near the average, others were significantly inefficient. Similarly, RHA 6 showed high variability, with a StDev of 1.83, further evidenced by its maximum score of 10.55, indicating significant disparities in efficiency within the region. In contrast, RHA 1 and RHA 7 exhibited more consistent performance, with lower standard deviations of 0.59 and 0.73, respectively, suggesting less variation in efficiency across their units. These findings suggest that regions with higher variability, such as RHA 5 and RHA 6, may require more targeted interventions to address inefficiencies, whereas RHA 1 and RHA 7 demonstrate more uniform, though still improvable, performance. In Figure 7, the mean values of ordinary (uncorrected) CRS/VRS scores and bias-corrected scores derived from Simar and Wilson’s bootstrap method are presented, both unweighted and weighted, for the seven Regional Health Authorities. The number of efficient facilities per RHA is shown as bars superimposed on the graph.

3.4. Super-Efficiency, Benchmarks, and Peer Counts

Table 7 presents the super-efficiency, output-oriented DEA scores for Health Centers, listing all of them that were super-efficient in at least one of the four models: (CRS), (VRS), (NIRS), and (NDRS). In an output-oriented framework, a super-efficiency score below 1 reflects the distance of a unit from the “reduced” efficiency frontier, which is formed by excluding that unit. In essence, this score indicates the extent of a unit’s superior performance relative to all other units that reside on the frontier. The DMUs in the table are ranked according to their super-efficiency scores in the VRS model, with additional columns showing peer counts and relative ranks (based on super-efficiency scores) across all four models.
First, The VRS super-efficiency scores ranged from 0.071 to 0.999, indicating a wide variability in the efficiency levels of the HCs. The highest-ranked unit in the VRS model was D-041, with a score of 0.071, indicating exceptional super-efficiency. Closely following was D-184, with a VRS score of 0.211, further demonstrating a high level of efficiency. Both units served as benchmarks for other inefficient units. However, they each acted as peers for only two other units, which may indicate that while they exhibit strong individual performance, their specific operational characteristics may not be as broadly applicable to improving the efficiency of other units.
HCs with high peer counts, such as D-173 (108 peers under CRS) and D-159 (67 peers), served as key benchmarks for other less efficient units. These HCs set the standard for performance, being frequently referenced by other units for comparison. In contrast, HC D-173, while ranking ninth in VRS with a score of 0.412, had a notable peer count of 61, indicating its significant influence as a benchmark. Similarly, D-159 (ranked 11th in VRS) and D-048 (ranked 10th) had significant peer counts, influencing 62 and 14 units, respectively, indicating their strategic importance within the system despite their mid-range VRS rankings.
Some DMUs, like D-048 and D-178, showed high super-efficiency scores despite lower peer counts, indicating that while less frequently used as benchmarks, their output performance was still exceptional. The variation in super-efficiency scores illustrates the differing capacities of DMUs to increase output, with units like D-238 (0.982) and D-176 (0.992) having limited room for further output increases compared to top-ranked units like D-173 and D-159. Overall, the analysis identified top performers and key benchmarks while highlighting the variation in super-efficiency across different models.
In the table, DMUs D-199 and D-212 have missing super-efficiency scores under the VRS model because the linear programming (LP) problem was not solvable, resulting in an infeasible solution. This issue typically arises when the DMU in question lies on the boundary of the efficiency frontier and removing it from the reference set causes the LP model to lack a feasible reference point for comparison. Essentially, this means that the DMU is located in an “area” where no other units can serve as a benchmark to evaluate its performance under the VRS assumption.

3.5. FDH Results

The Free Disposal Hull (FDH) scores presented in Table 8 provide a detailed analysis of output-oriented efficiency across different regions. Looking at all DMUs, the average FDH score was 1.34, indicating that, on average, DMUs operated 34% away from the efficient frontier, which implied a significant degree of inefficiency in terms of output generation. The bootstrapped mean score of 1.51 (with a standard deviation of 0.56) further highlighted the considerable variation in efficiency across DMUs, with some units significantly underperforming. Among the 234 DMUs analyzed, 115 were classified as efficient, yet only 35 of them served as benchmarks for the remaining 119 inefficient DMUs. The weighted mean efficiency of 1.39 reinforced the fact that although a large proportion of DMUs operated inefficiently, a few highly efficient units helped balance the overall efficiency.
Regionally, there were clear distinctions. Region ‘2’ demonstrated the highest mean FDH score (1.40), indicating relatively lower efficiency, with the bootstrapped mean score of 1.58 and the highest standard deviation of 0.67 reflecting significant variation in performance across DMUs within this region. Conversely, Region ‘1’ showed the lowest mean FDH score (1.25), making it the most efficient region on average. However, despite its higher efficiency, Region ‘1’ did not serve as a major source of benchmarks for other regions as it had a smaller number of efficient DMUs compared to Region ‘2’.
When comparing the different areas according to FAO classification, urban DMUs exhibited the highest efficiency, with a mean FDH score of 1.17, suggesting that urban centers, on average, need only a 17% increase in their current output levels to reach the efficiency frontier, making them the most efficient category. The bootstrapped scores for urban DMUs also showed the lowest variability (StDev = 0.31), implying greater consistency in efficiency across urban units. Suburban and rural DMUs had similar mean FDH scores of 1.34 and 1.41, respectively, but rural DMUs showed more variability in their performance, as indicated by the wide range of bootstrapped scores (from 1.13 to 3.45).
Among the Regional Health Authorities, RHA 4 and RHA 5 stood out for their relatively low efficiency, with mean FDH scores of 1.53 and 1.39, respectively. The high standard deviations in the bootstrapped scores for these regions suggested significant variation within them. On the other hand, RHA 1 was the most efficient, with a mean FDH score of 1.05 and very little variability (StDev = 0.10). Despite its high efficiency, RHA 1 did not provide any benchmark DMUs, indicating that it may not be heavily relied upon as a reference point for other inefficient DMUs in the dataset. Figure 8a presents the visualization of ordinary (uncorrected) FDH scores averaged across areas categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7), alongside the bootstrapped bias-corrected scores in both unweighted and weighted forms. Additionally, Figure 8b displays the total number of efficient facilities, along with the subset that served as benchmarks.
On an individual basis, Table 9 provides an overview of the 35 Health Centers identified as benchmark units under FDH, alongside the frequency of their appearances as peers for inefficient units. These data points are categorized by Eurostat regions (‘1’,’2’,’3’), FAO urbanization levels (urban, p[eri]-urban, rural), and Regional Health Authorities (RHA 1–7). The shaded cells in each row indicate the region of origin for the corresponding benchmark unit, offering a visual representation of the subcategory source and its respective RHA. The table also summarizes the total number of peer appearances for all benchmark units and the number of benchmarks within each subcategory.
D-037 stood out as the most frequently cited benchmark unit, appearing 19 times as a peer for inefficient Health Centers. This unit primarily originated from Region ‘3’ and peri-urban areas, with significant representation across five RHAs, particularly RHAs 3 and 5. Other notable benchmark units included D-260 and D-172, which appeared 11 and 10 times, respectively, and were also from Region ‘3’. The fact that Region ‘3’ contributed the largest number of benchmarks, with 26 units, indicates that this region hosts many of the most efficient production units, making it a focal point for operational learning and model replication. Conversely, Regions ‘1’ and ‘2’ had a smaller number of benchmark units, with only 3 and 6 benchmarks, respectively.
Peri-urban areas were the largest contributors, with 24 benchmark units, highlighting that these areas have found an optimal balance between operational complexity and resource management. This balance made peri-urban areas more likely to serve as benchmarks for less efficient units. By contrast, urban areas contributed only four benchmarks, while rural areas provided seven, suggesting that strategies implemented in peri-urban areas may be more adaptable and scalable across different settings. RHA 6 led in benchmark contributions with 20 units, signaling that its operational practices could serve as effective models for other RHAs. On the other hand, RHA 1, with no benchmarks, and RHA 7, with only four, showed room for significant improvement.
Similarly, Table 10 presents output-oriented FDH efficiency scores for ToMYs across the seven RHAs. The mean FDH score of 1.36 indicated a moderate level of inefficiency, suggesting that ToMYs have the potential to increase output by 36% with their existing resources. The bootstrapped mean score of 1.55, with a standard deviation of 0.81, highlighted significant variability in efficiency across DMUs. Bootstrapped scores ranged from 1.08 to 6.58, revealing both highly efficient and substantially inefficient units. Despite this, 52 DMUs were classified as efficient, though only 15 of these served as benchmarks for the remaining 42 units. The weighted mean efficiency score of 1.34 further supported the conclusion that while some DMUs perform well, there is considerable room for overall improvement.
RHA 2 stood out as the most efficient, with a mean FDH score of 1.16, indicating that ToMYs in this region operate near the efficiency frontier. Its bootstrapped mean of 1.33 and low standard deviation (0.41) further suggested consistent performance. In contrast, RHA 3 was the least efficient, with a mean FDH score of 1.64 and no efficient DMUs, indicating significant inefficiencies and a lack of benchmarks for other regions.
RHA 5 and RHA 6 also exhibited high inefficiency, with mean FDH scores of 1.61 and 1.45, respectively. RHA 6, in particular, showed a wide range of bootstrapped scores (from 1.10 to 6.58), highlighting significant disparities in performance among its units. Despite having the most efficient DMUs (18 units), only 4 of these served as benchmarks, suggesting that these high-performing units are underutilized for broader improvement efforts. In contrast, RHA 1 and RHA 2 demonstrated more consistent performance, with mean FDH scores of 1.25 and 1.16, respectively, and a higher proportion of efficient DMUs. Notably, in RHA 5, five benchmark units served as peers for 20 inefficient DMUs across all RHAs.
The 15 benchmark units are listed in Table 11 along with their frequency of appearance as peers for inefficient units across the seven RHAs. Again, the shaded cells indicate the region of their origin, and the total numbers of peer appearances and benchmarks per RHA are summarized at the bottom.
TM-047, originating from RHA 5, was the most frequently cited benchmark, appearing 10 times as a peer across all regions except 1 and 7. TM-021 (from RHA 2) followed with six appearances, spanning RHA 1, 2, 5 and 6. These benchmarks demonstrated high versatility and performance across regions.
RHA 5 emerged as a top performer, with five benchmark units and 20 peer appearances, indicating strong operational efficiency. RHA 6 also stood out, with four benchmark units that appeared six times as peers, though only one of these served as a peer within its own RHA, despite a total of 12 peer appearances.

3.6. Kruskal–Wallis Test

Table 12 presents the results of Kruskal–Wallis and median tests applied to all models—CRS, VRS, IRS, DRS, and FDH—both in their conventional form and bootstrapped versions. These tests assessed whether there were statistically significant differences across regions (‘1’, ‘2’, ‘3’), urbanization levels (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7). The Kruskal–Wallis test showed significant differences in the DRS model for regions, with a p-value of 0.05, indicating regional variations in efficiency with the decreasing returns to scale model. The bootstrapped DRS version showed near significance (p = 0.06), suggesting potential regional differences. For urbanization levels, both VRS and DRS models revealed significant differences, with p-values < 0.001, indicating strong efficiency variation between urban, peri-urban, and rural units, confirmed by the bootstrapped results. Similarly, the analysis of RHAs showed significant differences in the FDH (p = 0.01) and VRS (p = 0.02) models, reflecting efficiency variations across RHAs. Bootstrapped results for FDH, VRS, and DRS further reinforced these findings, with consistently significant p-values.
In the median test, the VRS model showed significant regional differences (p = 0.05), while the DRS model approached significance (p = 0.06), indicating potential regional efficiency variation. For urbanization levels, DRS showed significant differences (p = 0.01), suggesting that efficiency varied notably between urban, peri-urban, and rural units, which was confirmed by the bootstrapped results. Regarding RHAs, FDH (p = 0.01), VRS (p = 0.05), and DRS (p = 0.04) all showed significant differences, with bootstrapped results supporting these findings.

3.7. Scale Efficiency

The summary statistics of scale efficiency scores for Health Centers (HCs) and Local Health Centers (ToMYs) in Table 13 and Table 14 offered insights into how efficiently—on average—these units operated relative to their optimal size (MPSS). A score of 1 indicated full scale efficiency, while values greater than 1 suggested inefficiency, which could be caused by operating either above or below the optimal scale.
The mean scale efficiency score across all HCs was 1.17, showing that, on average, Health Centers operate at 17% inefficiency relative to their optimal scale. This inefficiency was further highlighted by the weighted mean of 1.19, suggesting that larger or higher output centers tend to be slightly less efficient overall. The median score of 1.05 indicated that most centers were relatively close to the optimal scale, but the maximum score of 2.71 revealed the presence of highly inefficient outliers, particularly in ‘3’/Rural/RHA 6 regions. Urban Health Centers exhibited the highest mean score of 1.30, reflecting the challenges of scale in larger, more complex centers. Conversely, peri-urban (mean 1.16) and rural centers (mean 1.12) demonstrated somewhat better efficiency, though rural centers still showed some inefficiencies, likely related to resource constraints in smaller, isolated populations.
Variability in scale efficiency was also notable. The standard deviation for all HCs was 0.27, with urban centers showing the highest variability (StDev = 0.33), highlighting substantial variations in the efficiency with which these larger centers operate at scale. Among the Regional Health Authorities, RHA 7 showed the highest variability (StDev = 0.35), while RHA 4 exhibited the least (StDev = 0.13), suggesting that some regions have more consistent performance while others vary greatly in efficiency.
The findings for Local Health Centers (ToMYs) presented a similar trend. The overall mean scale efficiency score for ToMYs was 1.20, whereas the weighted mean was lower, at 1.06. This discrepancy suggested that smaller or lower-output units may have been less efficient on average, pulling up the mean score, while larger or higher-output units tended to perform closer to the optimal scale, leading to a lower weighted mean score. The weighted mean accounted for the size and output of the units, emphasizing the efficiency of more productive ToMYs, and implied that although there were significant inefficiencies in the system overall, larger or busier ToMYs were more efficient.
Furthermore, the data in the table show that there was considerable regional variation. For RHA 2, the gap between the mean (1.39) and weighted mean (1.10) was particularly pronounced, indicating that smaller or less productive ToMYs in this region were significantly less efficient, skewing the mean upward. In contrast, the larger units in this region operated closer to their optimal scale. Similar trends could be observed for RHA 6, where the mean was 1.33, but the weighted mean was lower at 1.08, further reinforcing that larger ToMYs in this region were more scale-efficient compared to smaller ones. On the other hand, RHAs 1, 4, 5, and 7 showed minimal differences between their mean and weighted mean scale efficiency scores, indicating more uniform efficiency distribution across ToMYs in these regions. Both smaller and larger units appeared to perform similarly in terms of scale efficiency. Additionally, the fact that both the mean and weighted mean scores in these RHAs were close to 1 suggested that the units in these regions operated near their optimal scale.

3.8. Outlier Influence: Bias

Table 15 compares the efficiency characteristics of two samples: the 241 HCs sample (original dataset with 241 DMUs) and the 234 HCs sample (reduced dataset after removing seven outlier DMUs). This comparison illustrates the effects of outlier removal on efficiency scores and peer selection within a DEA VRS output-oriented model (where lower scores—being closer to 1—represent higher efficiency). The key findings are as follows.
In the 241 HCs sample, the mean efficiency score of all DMUs was 2.33, while, in the 234 HCs sample, it decreased significantly to 1.97, indicating a substantial improvement in overall performance. This finding suggests that the presence of outliers in the original sample inflated the efficiency scores across DMUs. Additionally, the total number of efficient DMUs increased from 31 in the 241 HCs sample to 38 in the 234 HCs sample. This change indicates that certain DMUs achieve efficiency only after the influence of outlier peers is removed, further highlighting the effect of outliers in skewing efficiency measurements and potentially introducing errors where truly efficient units may be misclassified as inefficient.
In the 241 HCs sample, four out of the seven total outliers were identified as efficient, while the remaining three were classified as inefficient. Removing these outliers helped prevent efficiency assessments from being disproportionately influenced by extreme values, resulting in a clearer distinction between efficient and inefficient DMUs. At first glance, it may seem unusual that the data cloud method flagged three DMUs as outliers, despite their inefficiency. However, closer examination of their scores—1.02, 1.01, and 1.09 for facilities D-009, D-011, and D-069, respectively—showed that they were very close to the efficient threshold. Although these DMUs were not classified as efficient in the initial analysis, their proximity to the (extreme) efficiency boundary caused a noticeable shift in the “cloud” volume when they were removed, leading to their classification as outliers. This finding, we believe, demonstrates the robustness of the data cloud method’s underlying concept.
However, a subset of 27 DMUs remained efficient in both samples, suggesting that these units maintained their efficiency status regardless of the presence of outliers. This consistency provided robustness to the analysis, indicating a core set of efficient DMUs that acted as stable benchmarks across sample variations.
In the 241 HCs sample, 183 DMUs used at least one efficient outlier as a peer, and 90.1% of inefficient DMUs relied on efficient outliers for benchmarking. This high dependence on outliers provides strong evidence of their substantial influence on the efficiency scores of other DMUs. Efficient outliers accounted for 32.1% of all peer occurrences in the 241 HCs sample, underscoring the significant role these outliers played in shaping the efficiency landscape. In the reduced 234 HCs sample, this reliance on outliers as peers was eliminated, potentially resulting in a more balanced and unbiased peer structure.
Finally, turning the lights on the subset of 180 DMUs common to both samples (those that used at least one efficient outlier as a peer in the 241 HCs sample), the mean efficiency score improved from 2.65 to 2.14 following outlier removal. This significant reduction indicated that outliers contributed to higher efficiency scores within this group. The standard deviation also decreased from 1.23 in the 241 HCs sample to 0.96 in the 234 HCs sample, suggesting a reduction in score variability and a more stable efficiency distribution without the influence of outliers.
Regarding extreme values, the minimum efficiency score of the 180 common DMUs subset approached 100% in the 234 HCs sample (100.0%), indicating that some DMUs achieved nearly optimal efficiency. The maximum score, however, decreased from 8.07 to 5.68, highlighting that removing outliers limited extreme values and provided a more conservative and realistic view of the efficiency range.
In Figure 9, the efficiency scores of the subset of 180 common DMUs are presented in ascending order (green line), as calculated for the 241 HCs sample. The relative size of the orange area, representing the score differences of these DMUs when calculated within the 234 HCs sample, compared to the total area under the green line visually illustrates the extent of influence exerted by the removed outliers.
In summary, our findings demonstrate that outliers significantly impacted efficiency scores and peer relationships in the DEA model. By removing outliers, the 234 HCs sample provided a more reliable efficiency evaluation, reducing bias from questionably extreme performers. The more consistent range and reduced variability in the subset of 180 common DMUs underscored the benefits of this adjustment, suggesting that the 234 HCs sample offered a more accurate foundation for benchmarking and policy recommendations in healthcare efficiency.

4. Discussion

Strengthening primary healthcare (PHC) services is linked to improved equity in access, better clinical outcomes and population well-being, reduced healthcare and social costs, and increased productivity [3]. However, the growing healthcare needs of citizens—mainly due to increased life expectancy, population aging, technological innovations, and climate change—combined with limited available resources to meet these needs, make the efficient management of these resources imperative [32]. From a managerial perspective, understanding how scarce resources are utilized within the public healthcare system is essential for policymakers when making informed decisions [72].
In Greece, PHC development has been a central focus of reforms over the past decades, as it is considered essential for improving efficiency and access. To address these challenges, implementing modern methods to assess the efficiency of PHC units can provide invaluable insights to policymakers when developing health strategies. These methods facilitate the identification of inefficiencies and support corrective actions, contributing to ongoing performance monitoring and improvement. This study aimed to evaluate the efficiency of PHC units across Greece by assessing the efficiency levels of Health Centers (HCs) and Local Health Units (ToMYs) and identifying underlying factors contributing to observed inefficiencies. A variety of non-parametric methods were employed, primarily focusing on Data Envelopment Analysis (DEA), along with super-efficiency and Free Disposal Hull (FDH) techniques. Utilizing data from 2019, this study stands out as the only evaluation of Greek PHC units conducted after the financial crisis and before the COVID-19 pandemic.
The analysis of the data revealed significant inefficiency and differences in pure technical efficiency (VRS scores) between HCs and ToMYs. HCs, which operate with a much larger volume of activity, could nearly double their outputs while maintaining current input levels (mean weighted VRS bootstrapped score of 1.92), whereas ToMYs could increase their outputs by 58% (mean weighted VRS bootstrapped score of 1.58). This gap highlights the differing capacities of the two types of healthcare facilities to manage resources and serve their populations efficiently. The larger size and greater variability in outputs among HCs suggest more complex operational challenges, leading most likely to divergent operational profiles, which may be among the primary causes of their significant inefficiencies. In terms of scale efficiency, the scores showed better performance, with 1.17 for HCs and 1.20 for ToMYs, both very close to each other, indicating a slightly better proximity of HCs to their MPSS of operation.
Other findings from similar studies align with these observations. For instance, a study [34] of primary healthcare units in southern and western Greece revealed that technically efficient units were typically those covering larger populations and located near major cities. In contrast, less efficient units were in remote, sparsely populated areas that predominantly focused on acute medical services rather than preventive care. This mirrors the results of our study, where rural units generally exhibited higher inefficiencies, as these areas often face resource constraints and serve smaller populations.
In our analysis, HCs classified as ‘1’ and urban emerged as the most efficient (weighted mean VRS bootstrapped scores of 1.69 and 1.51, respectively), followed by the intermediate (‘2’) and peri-urban regions. Interestingly, the upward scaling of inefficiency was aligned along the transition from urban to rural areas in both categorization typologies (1 → 2 → 3, urban → peri-urban → rural). This finding, with respect to the FAO’s classification, was also found to be statistically significant according to the Kruskal–Wallis test.
The inefficiencies observed in rural and remote areas are consistent with the findings of one study [37] that reported that small, rural Health Centers, particularly those located on islands, exhibited lower efficiency, especially when lacking preventive care services. Similarly, another study [35] found that healthcare centers serving populations between 10,001 and 50,000 were the most efficient, whereas smaller centers serving fewer than 10,000 individuals demonstrated significantly lower efficiency. These findings suggest that an optimal population size, combined with proximity to urban centers, may enhance operational efficiency. Contrarily, other subsequent studies [4,36] highlighted that rural or remote and island-based health centers exhibited higher technical efficiency compared to urban and suburban centers, though the latter demonstrated superior scale efficiency. Our findings indicate that scale efficiency improves progressively from urban to rural areas.
Access to healthcare facilities is typically not a major issue for populations in large urban areas; however, residents of rural and remote regions face considerable barriers, often requiring them to travel to other areas or seek care from private practitioners. This may be attributed to the better staffing and equipment levels typically found in larger primary healthcare (PHC) units in urban areas. The challenges faced in rural settings are multifactorial, involving not only geographical and economic barriers but also differences in sociodemographic factors, perceptions, and healthcare needs, all of which influence access to and utilization of PHC services [4,6,73,74].
In our study, rural areas displayed the highest inefficiency, with a mean VRS score of 2.42, indicating the potential for a 142% increase in outputs. These findings underscore the persistent inefficiency challenges in rural settings, where geographic isolation, small population sizes, and limited resources present significant operational obstacles. This dichotomy was further emphasized in our results, where urban centers exhibited inefficiencies largely due to over-scaling, while semi-urban centers achieved a more balanced alignment between scale and resource management.
The findings across RHAs highlighted significant disparities in efficiency among them. RHA 1 emerged as the most efficient, with mean weighted VRS (pure technical) efficiency scores of 1.51 for its Health Centers (HCs) and 1.46 for its Local Health Units (ToMYs). However, the region had only a single HC classified as efficient, suggesting that top-level performance was achieved by only a small fraction of its facilities. On the other hand, RHA 3 and RHA 4 showed the highest inefficiency among HCs, with weighted mean scores of 2.12 and 2.24, respectively. RHA 6, despite contributing 42.1% of all efficient units, had a weighted mean score of 1.86, indicating significant room for output improvement. A previous study [34] reported a better efficiency level for RHA 6, suggesting that efficiency levels may have declined over time or that results may differ based on the methodologies used. Furthermore, examining the FDH results, RHA 6 accounted for 20 out of the 35 benchmark units (57%), offering strong evidence that its operational practices could serve as effective models not only for other RHAs but also for improving its own performance. On the other hand, RHA 1, with no benchmarks, and RHA 7, with only four, showed room for significant improvement. To address these disparities, it is recommended to replicate best practices from highly efficient areas in less efficient regions. Additionally, further investigation into outliers, particularly in rural areas, may reveal specific factors contributing to inefficiencies.
Regarding ToMYs, their VRS inefficiency levels across the seven RHA regions appeared to be more consistent, averaging around 1.60 and falling within a range of approximately 20 percentage points. Similarly, their FDH scores appeared to be consistent across all RHAs, remaining close to the overall weighted mean value (1.34). In terms of benchmarks, RHAs 5 and 6 accounted for the majority, hosting 5 and 4 out of the total 15, highlighting these regions as leaders in efficiency. Policymakers should focus on transferring the successful operational practices from these high-performing regions to underperforming areas. Doing so could substantially enhance the overall efficiency of the healthcare system.
In the context of super-efficiency results, several interesting points emerged from a policymaking perspective. First, the infeasibility of calculating super-efficiency VRS scores for DMUs D-199 and D-212 may suggest that these units are highly efficient and play a critical role in shaping the efficiency frontier. However, without a solvable super-efficiency score, it is difficult to directly compare their performance with other top-performing units. These units are considered hyper-efficient [75] and require a thorough examination to understand their unique operational performance. Policymakers might interpret this as an indication that D-199 and D-212 represent unique or specialized units within the dataset, potentially operating at an optimal level that cannot be surpassed by other DMUs under VRS. This finding should be considered alongside the fact that these two units had a notable presence as peers in the VRS model. As such, these units could serve as models of efficiency within the system, but their specific operational circumstances should be further analyzed to ensure sustained high performance. Second, certain DMUs such as D-041, and D-184 were ranked as super-efficient, yet they did not appear as peers for any inefficient units. This suggests that while these units operate exceptionally well, their specific input–output combinations may not be directly comparable to those of other DMUs, making them unsuitable as benchmarks. From a managerial perspective, this implies that these DMUs may be achieving efficiency through unique circumstances or specialized processes that are difficult to replicate. Although these DMUs excel in performance, their lack of peer appearances suggests limited potential for other units to model their practices, warranting further investigation into the factors driving their idiosyncratic efficiency.
The efficiency deficit identified in this study can be attributed to resource shortages, such as the understaffing of medical and nursing personnel, particularly general practitioners, due to inadequate financial incentives, educational programs, and career progression opportunities [4,9,76,77,78]. Additionally, there is an unequal distribution of healthcare professionals between urban and rural areas [8] and a lack of facilities and diagnostic equipment in primary healthcare (PHC) centers [5,6], which limits service delivery, particularly in rural areas [79,80].
These challenges lead to gaps in preventive and health promotion services [8], barriers to access, increased waiting times, citizen dissatisfaction [15,80], and unmet healthcare needs [74,81]. As a result, many individuals turn to private doctors [9,82,83], increasing out-of-pocket health costs, among the highest in the EU [15,84,85]. Greece’s hospital-centered system further contributes to inefficiency, with PHC services often provided through hospital emergency departments without mandatory referrals from PHC structures [5,15,86]. Though the EOPYY, established in 2011, aimed to reduce hospital overload by directing PHC cases to public units, patients frequently bypass PHC facilities and opt for hospitals, viewing the referral process as bureaucratic [74]. Mandatory referrals (gatekeeping system) from general practitioners, proven to increase efficiency and lower costs in other countries [87,88], are still not implemented in Greece, where general practitioners make up only 7% of the total physicians [84]. However, the COVID-19 pandemic saw more citizens utilizing PHC units to avoid going to hospital emergency departments [89].
Despite reforms, integration remains low, particularly in prevention and public health services [8], with Greece spending just 1.4% of its health budget on these areas, compared to the EU27 average of 2.9% [84]. A study in Crete and Epirus [81] revealed gaps in health promotion activities and the limited implementation of e-health, EHR, and telemedicine services. However, later, during the COVID-19 pandemic, the push for digitalization helped PHC units meet citizen needs, particularly through remote services [89,90,91].
In the present analysis, there are some limitations that should be considered. First, due to data inconsistency and/or unavailability, the results are based on only a portion of the total PHC facilities (76.2% of Health Centers and 80.3% of ToMYs), leaving out a significant portion of the total volume of inputs and outputs. Although this study included the majority of PHC facilities, providing a reliable overall picture, there may still be differences at an individual level and in conclusions drawn from aggregated indicators when referring to smaller groups/subdivisions. These variations could affect the interpretation of results for specific regions or units. Second, while the quantitative variations in technical efficiency were thoroughly addressed, equally important variations related to the quality of services offered were not captured due to a lack of necessary data. Ideally, the volume of outputs should be accompanied by a measure of service quality, as high efficiency scores do not necessarily imply high-quality outcomes. However, incorporating quality measures into DEA models presents certain challenges and limitations. This remains a critical area for future research, contingent upon the availability of relevant data.

5. Conclusions

The findings of this study revealed that there is room for improving the efficiency of primary healthcare units by expanding or strengthening the services offered to better meet healthcare needs.
Primary healthcare (PHC) serves as the gateway to the healthcare system, playing a crucial role in the effective treatment of patients and the efficient management of resources. These objectives are of central importance, especially in a context where healthcare needs are continuously rising, while available resources remain limited.
By broadening the range of outputs—through the development of new services and the enhancement of existing ones, in collaboration with the other services of the National Health System—PHC could increase its efficiency and make a significant contribution to addressing healthcare needs, alleviating the burden on hospitals and reducing wasteful use of healthcare resources.

Author Contributions

Conceptualization, A.F.; methodology, A.F.; software, A.F.; validation, A.F. and V.H.A.; formal analysis, A.F. and C.M.; investigation, A.F., D.A.N. and V.H.A.; data curation, A.F. and C.M.; writing—original draft preparation, A.F.; writing—review and editing, A.F., D.A.N., V.H.A. and D.C.K.; supervision, A.F.; project administration, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the availability of these data. Data were obtained from the Ministry of Health (Greece) and are available from the corresponding author with the permission of MoH.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. World Health Organization. Declaration of Astana. Available online: https://www.who.int/publications/i/item/WHO-HIS-SDS-2018.61 (accessed on 5 October 2024).
  2. Primary Health Care. Available online: https://www.who.int/news-room/fact-sheets/detail/primary-health-care (accessed on 4 October 2024).
  3. World Health Organization. Building the Economic Case for Primary Health Care: A Scoping Review. Available online: https://www.who.int/publications/i/item/WHO-HIS-SDS-2018.48 (accessed on 5 October 2024).
  4. Kontodimopoulos, N.; Moschovakis, G.; Aletras, V.H.; Niakas, D. The effect of environmental factors on technical and scale efficiency of primary health care providers in Greece. Cost Eff. Resour. Alloc. 2007, 5, 14. [Google Scholar] [CrossRef]
  5. Lionis, C.; Symvoulakis, E.K.; Markaki, A.; Vardavas, C.; Papadakaki, M.; Daniilidou, N.; Souliotis, K.; Kyriopoulos, I. Integrated primary health care in Greece, a missing issue in the current health policy agenda: A systematic review. Int. J. Integr. Care 2009, 9, e88. [Google Scholar] [CrossRef] [PubMed]
  6. Oikonomidou, E.; Anastasiou, F.; Dervas, D.; Patri, F.; Karaklidis, D.; Moustakas, P.; Andreadou, N.; Mantzanas, E.; Merkouris, B. Rural primary care in Greece: Working under limited resources. Int. J. Qual. Health Care J. Int. Soc. Qual. Health Care 2010, 22, 333–337. [Google Scholar] [CrossRef] [PubMed]
  7. Economou, C. Greece: Health system review. Health Syst. Transit. 2010, 12, 1–177. [Google Scholar] [PubMed]
  8. Myloneros, T.; Sakellariou, D. The effectiveness of primary health care reforms in Greece towards achieving universal health coverage: A scoping review. BMC Health Serv. Res. 2021, 21, 628. [Google Scholar] [CrossRef]
  9. Lionis, C.; Symvoulakis, E.K.; Markaki, A.; Petelos, E.; Papadakis, S.; Sifaki-Pistolla, D.; Papadakakis, M.; Souliotis, K.; Tziraki, C. Integrated people-centred primary health care in Greece: Unravelling Ariadne’s thread. Prim. Health Care Res. Dev. 2019, 20, e113. [Google Scholar] [CrossRef]
  10. Kondilis, E.; Smyrnakis, E.; Gavana, M.; Giannakopoulos, S.; Zdoukos, T.; Iliffe, S.; Benos, A. Economic crisis and primary care reform in Greece: Driving the wrong way? Br. J. Gen. Pract. J. R. Coll. Gen. Pract. 2012, 62, 264–265. [Google Scholar] [CrossRef]
  11. Niakas, D. Greek economic crisis and health care reforms: Correcting the wrong prescription. Int. J. Health Serv. 2013, 43, 597–602. [Google Scholar] [CrossRef]
  12. Hellenic Republic. Law 3918: Structural Changes in the Health System and Other Provisions; Hellenic Republic: Athens, Greece, 2011.
  13. Hellenic Republic. Law 4238: Primary National Health Network (PEDY), Change of EOPYY’s Purpose and Other Provisions; Hellenic Republic: Athens, Greece, 2014; Volume 38, p. 28.
  14. OECD; European Observatory on Health Systems and Policies. State of Health in the EU. In Greece: Country Health Profile 2017; OECD: Paris, France, 2017. [Google Scholar] [CrossRef]
  15. Karaferis, D.C.; Niakas, D.A.; Balaska, D.; Flokou, A. Valuing Outpatients’ Perspective on Primary Health Care Services in Greece: A Cross-Sectional Survey on Satisfaction and Personal-Centered Care. Healthcare 2024, 12, 1427. [Google Scholar] [CrossRef]
  16. Hellenic Republic. Law 4486: Reform of Primary Health Care, Urgent Regulations of the Ministry of Health and Other Provisions; Hellenic Republic: Athens, Greece, 2017; Volume 115.
  17. Hellenic Statistical Authority. Census of Health Centers and Other Units Providing Primary Health Care Services: Year 2019. Available online: https://www.statistics.gr/documents/20181/a4d681a4-f8e2-df6f-95b9-3569321f0ce8 (accessed on 5 October 2024).
  18. Hellenic Statistical Authority. Hospitals and Local Health Centres (Beds, Staff, Equipment), 2010–2018. Available online: https://www.statistics.gr/en/statistics/-/publication/SHE06/2018 (accessed on 5 October 2024).
  19. Hellenic Statistical Authority. System of Health Accounts (SHA) of Year 2015. Available online: https://www.statistics.gr/en/statistics/-/publication/SHE35/2015 (accessed on 5 October 2024).
  20. Hellenic Statistical Authority. System of Health Accounts (SHA) of Year 2019. Available online: https://www.statistics.gr/en/statistics/-/publication/SHE35/2019 (accessed on 5 October 2024).
  21. Hardouvelis, G.A.; Karalas, G.; Karanastasis, D.; Samartzis, P. Economic Policy Uncertainty, Political Uncertainty and the Greek Economic Crisis. SSRN Electron. J. 2018, 100–104. [Google Scholar] [CrossRef]
  22. Economou, C.; Kaitelidou, D.; Kentikelenis, A.; Maresso, A.; Sissouras, A. The impact of the crisis on the health system and health in Greece. In Economic Crisis, Health Systems and Health in Europe: Country Experience [Internet]; European Observatory on Health Systems and Policies: Brussels, Belgium, 2015. Available online: https://www.ncbi.nlm.nih.gov/books/NBK447857/ (accessed on 5 October 2024).
  23. Hollingsworth, B. The measurement of efficiency and productivity of health care delivery. Health Econ. 2008, 17, 1107–1128. [Google Scholar] [CrossRef] [PubMed]
  24. Cantor, V.J.M.; Poh, K.L. Integrated Analysis of Healthcare Efficiency: A Systematic Review. J. Med. Syst. 2018, 42, 8. [Google Scholar] [CrossRef]
  25. Flokou, A.; Kontodimopoulos, N.; Niakas, D. Employing post-DEA Cross-evaluation and Cluster Analysis in a Sample of Greek NHS Hospitals. J. Med. Syst. 2011, 35, 1001–1014. [Google Scholar] [CrossRef]
  26. Aletras, V.; Kontodimopoulos, N.; Zagouldoudis, A.; Niakas, D. The short-term effect on technical and scale efficiency of establishing regional health systems and general management in Greek NHS hospitals. Health Policy Amst. Neth. 2007, 83, 236–245. [Google Scholar] [CrossRef] [PubMed]
  27. Kohl, S.; Schoenfelder, J.; Fügener, A.; Brunner, J.O. The use of Data Envelopment Analysis (DEA) in healthcare with a focus on hospitals. Health Care Manag. Sci. 2019, 22, 245–286. [Google Scholar] [CrossRef]
  28. Flokou, A.; Aletras, V.; Niakas, D. Awindow-DEA based efficiency evaluation of the public hospital sector in Greece during the 5-year economic crisis. PLoS ONE 2017, 12, e0177946. [Google Scholar] [CrossRef] [PubMed]
  29. Flokou, A.; Aletras, V.; Niakas, D. Decomposition of potential efficiency gains from hospital mergers in Greece. Health Care Manag. Sci. 2017, 20, 467–484. [Google Scholar] [CrossRef]
  30. Pelone, F.; Kringos, D.S.; Romaniello, A.; Archibugi, M.; Salsiri, C.; Ricciardi, W. Primary care efficiency measurement using data envelopment analysis: A systematic review. J. Med. Syst. 2015, 39, 156. [Google Scholar] [CrossRef]
  31. Marathe, S.; Wan, T.T.H.; Zhang, J.; Sherin, K. Factors influencing community health centers’ efficiency: A latent growth curve modeling approach. J. Med. Syst. 2007, 31, 365–374. [Google Scholar] [CrossRef]
  32. Neri, M.; Cubi-Molla, P.; Cookson, G. Approaches to Measure Efficiency in Primary Care: A Systematic Literature Review. Appl. Health Econ. Health Policy 2022, 20, 19–33. [Google Scholar] [CrossRef]
  33. Zakowska, I.; Godycki-Cwirko, M. Data envelopment analysis applications in primary health care: A systematic review. Fam. Pract. 2020, 37, 147–153. [Google Scholar] [CrossRef] [PubMed]
  34. Oikonomou, N.; Tountas, Y.; Mariolis, A.; Souliotis, K.; Athanasakis, K.; Kyriopoulos, J. Measuring the efficiency of the Greek rural primary health care using a restricted DEA model; the case of southern and western Greece. Health Care Manag. Sci. 2016, 19, 313–325. [Google Scholar] [CrossRef] [PubMed]
  35. Zavras, A.I.; Tsakos, G.; Economou, C.; Kyriopoulos, J. Using DEA to evaluate efficiency and formulate policy within a Greek national primary health care network. Data Envelopment Analysis. J. Med. Syst. 2002, 26, 285–292. [Google Scholar] [CrossRef]
  36. Mitropoulos, P.; Kounetas, K.; Mitropoulos, I. Factors affecting primary health care centers’ economic and production efficiency. Ann. Oper. Res. 2016, 247, 807–822. [Google Scholar] [CrossRef]
  37. Kontodimopoulos, N.; Nanos, P.; Niakas, D. Balancing efficiency of health services and equity of access in remote areas in Greece. Health Policy Amst. Neth. 2006, 76, 49–57. [Google Scholar] [CrossRef]
  38. European Commission, Statistical Office of the European Union, Eurostat. Urban Europe: Statistics on Cities, Towns and Suburbs: 2016 Edition; 2016. Available online: https://data.europa.eu/doi/10.2785/91120 (accessed on 5 October 2024).
  39. Eurostat. Rural Development. Methodology. Available online: https://ec.europa.eu/eurostat/web/rural-development/methodology (accessed on 5 October 2024).
  40. Food and Agriculture Organization of the United Nations. Guidelines on Defining Rural Areas and Compiling Indicators. Available online: https://openknowledge.fao.org/server/api/core/bitstreams/5fc6bb8d-32a7-4c5e-ac71-81c1f19d13e4/content (accessed on 5 October 2024).
  41. Charnes, A.; Cooper, W.W.; Rhodes, E. Measuring the efficiency of decision making units. Eur. J. Oper. Res. 1978, 2, 429–444. [Google Scholar] [CrossRef]
  42. Farrell, M.J. The Measurement of Productive Efficiency. J. R. Stat. Soc. Ser. Gen. 1957, 120, 253–290. [Google Scholar] [CrossRef]
  43. Bogetoft, P.; Otto, L. Production Models and Technology. In Benchmarking with DEA, SFA, and R; Bogetoft, P., Otto, L., Eds.; Springer: New York, NY, USA, 2011; pp. 57–80. [Google Scholar]
  44. Bogetoft, P.; Otto, L. Data Envelopment Analysis DEA. In Benchmarking with DEA, SFA, and R; Bogetoft, P., Otto, L., Eds.; Springer: New York, NY, USA, 2011; pp. 81–113. [Google Scholar]
  45. Cooper, W.W.; Seiford, L.M.; Tone, K. The CCR Model and Production Correspondence. In Data Envelopment Analysis: A Comprehensive Text with Models, Applications, References and DEA-Solver Software; Cooper, W.W., Seiford, L.M., Tone, K., Eds.; Springer: New York, NY, USA, 2007; pp. 41–85. [Google Scholar]
  46. Banker, R.D.; Charnes, A.; Cooper, W.W. Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis. Manag. Sci. 1984, 30, 1078–1092. [Google Scholar] [CrossRef]
  47. Deprins, D.; Simar, L.; Tulkens, H. Measuring Labor-Efficiency in Post Offices. LIDAM Reprint CORE. 1984. Available online: https://ideas.repec.org//p/cor/louvrp/571.html (accessed on 10 October 2024).
  48. Tulkens, H. On FDH efficiency analysis: Some methodological issues and applications to retail banking, courts, and urban transit. J. Product. Anal. 1993, 4, 183–210. [Google Scholar] [CrossRef]
  49. Andersen, P.; Petersen, N.C. A Procedure for Ranking Efficient Units in Data Envelopment Analysis. Manag. Sci. 1993, 39, 1261–1264. [Google Scholar] [CrossRef]
  50. Adler, N.; Friedman, L.; Sinuany-Stern, Z. Review of ranking methods in the data envelopment analysis context. Eur. J. Oper. Res. 2002, 140, 249–265. [Google Scholar] [CrossRef]
  51. The Measurement of Efficiency. In Advanced Robust and Nonparametric Methods in Efficiency Analysis: Methodology and Applications; Daraio, C.; Simar, L. (Eds.) Springer: Boston, MA, USA, 2007; pp. 13–42. [Google Scholar]
  52. Fare, R.; Zelenyuk, V. On aggregate Farrell efficiencies. Eur. J. Oper. Res. 2003, 146, 615–620. [Google Scholar] [CrossRef]
  53. Wilson, P.W. Detecting Outliers in Deterministic Nonparametric Frontier Models with Multiple Outputs. J. Bus. Econ. Stat. 1993, 11, 319–323. [Google Scholar] [CrossRef]
  54. Simar, L.; Wilson, P.W. Non-parametric tests of returns to scale. Eur. J. Oper. Res. 2002, 139, 115–132. [Google Scholar] [CrossRef]
  55. Banker, R.D. Estimating most productive scale size using data envelopment analysis. Eur. J. Oper. Res. 1984, 17, 35–44. [Google Scholar] [CrossRef]
  56. Simar, L.; Wilson, P.W. Sensitivity Analysis of Efficiency Scores: How to Bootstrap in Nonparametric Frontier Models. Manag. Sci. 1998, 44, 49–61. [Google Scholar] [CrossRef]
  57. Simar, L.; Wilson, P.W. Statistical Inference in Nonparametric Frontier Models: The State of the Art. J. Product. Anal. 2000, 13, 49–78. [Google Scholar] [CrossRef]
  58. Murillo-Zamorano, L.R.; Petraglia, C. Technical efficiency in primary health care: Does quality matter? Eur. J. Health Econ. HEPAC Health Econ. Prev. Care 2011, 12, 115–125. [Google Scholar] [CrossRef]
  59. da Encarnação Filipe Amado, C.A.; Santos, S.P.D. Challenges for performance assessment and improvement in primary health care: The case of the Portuguese health centres. Health Policy Amst. Neth. 2009, 91, 43–56. [Google Scholar] [CrossRef]
  60. González-de-Julián, S.; Barrachina-Martínez, I.; Vivas-Consuelo, D.; Bonet-Pla, Á.; Usó-Talamantes, R. Data Envelopment Analysis Applications on Primary Health Care Using Exogenous Variables and Health Outcomes. Sustainability 2021, 13, 1337. [Google Scholar] [CrossRef]
  61. Capeletti, N.M.; Amado, C.A.F.; Santos, S.P. Performance assessment of primary health care services using data envelopment analysis and the quality-adjusted malmquist index. J. Oper. Res. Soc. 2024, 75, 361–377. [Google Scholar] [CrossRef]
  62. Miclos, P.V.; Calvo, M.C.M.; Colussi, C.F. Evaluation of the performance of actions and outcomes in primary health care. Rev. Saude Publica 2017, 51, 86. [Google Scholar] [CrossRef]
  63. Dyson, R.G.; Allen, R.; Camanho, A.S.; Podinovski, V.V.; Sarrico, C.S.; Shale, E.A. Pitfalls and protocols in DEA. Eur. J. Oper. Res. 2001, 132, 245–259. [Google Scholar] [CrossRef]
  64. Santamato, V.; Tricase, C.; Faccilongo, N.; Marengo, A.; Pange, J. Healthcare performance analytics based on the novel PDA methodology for assessment of efficiency and perceived quality outcomes: A machine learning approach. Expert Syst. Appl. 2024, 252, 124020. [Google Scholar] [CrossRef]
  65. Adler, N.; Golany, B. Evaluation of deregulated airline networks using data envelopment analysis combined with principal component analysis with an application to Western Europe. Eur. J. Oper. Res. 2001, 132, 260–273. [Google Scholar] [CrossRef]
  66. Pastor, J.T. Chapter 3 Translation invariance in data envelopment analysis: A generalization. Ann. Oper. Res. 1996, 66, 91–102. [Google Scholar] [CrossRef]
  67. Pereira, M.A.; Dinis, D.C.; Ferreira, D.C.; Figueira, J.R.; Marques, R.C. A network Data Envelopment Analysis to estimate nations’ efficiency in the fight against SARS-CoV-2. Expert Syst. Appl. 2022, 210, 118362. [Google Scholar] [CrossRef]
  68. Kirigia, J.M.; Sambo, L.G.; Renner, A.; Alemu, W.; Seasa, S.; Bah, Y. Technical efficiency of primary health units in Kailahun and Kenema districts of Sierra Leone. Int. Arch. Med. 2011, 4, 15. [Google Scholar] [CrossRef]
  69. Adang, E.M.M.; Gerritsma, A.; Nouwens, E.; van Lieshout, J.; Wensing, M. Efficiency of the implementation of cardiovascular risk management in primary care practices: An observational study. Implement. Sci. IS 2016, 11, 67. [Google Scholar] [CrossRef]
  70. González-de-Julián, S.; Vivas-Consuelo, D.; Barrachina-Martínez, I. Modelling efficiency in primary healthcare using the DEA methodology: An empirical analysis in a healthcare district. BMC Health Serv. Res. 2024, 24, 982. [Google Scholar] [CrossRef] [PubMed]
  71. Beiter, D.; Koy, S.; Flessa, S. Improving the technical efficiency of public health centers in Cambodia: A two-stage data envelopment analysis. BMC Health Serv. Res. 2023, 23, 912. [Google Scholar] [CrossRef] [PubMed]
  72. Ramírez-Valdivia, M.T.; Maturana, S.; Salvo-Garrido, S. A multiple stage approach for performance improvement of primary healthcare practice. J. Med. Syst. 2011, 35, 1015–1028. [Google Scholar] [CrossRef] [PubMed]
  73. Mariolis, A.; Mihas, C.; Alevizos, A.; Mariolis-Sapsakos, T.; Marayiannis, K.; Papathanasiou, M.; Gizlis, V.; Karanasios, D.; Merkouris, B. Comparison of primary health care services between urban and rural settings after the introduction of the first urban health centre in Vyronas, Greece. BMC Health Serv. Res. 2008, 8, 124. [Google Scholar] [CrossRef]
  74. Economou, C. Barriers and Facilitating Factors in Access to Health Services in Greece. Available online: https://www.who.int/publications/m/item/barriers-and-facilitating-factors-in-access-to-health-services-in-greece (accessed on 5 October 2024).
  75. Bogetoft, P.; Otto, L. Additional Topics in DEA. In Benchmarking with DEA, SFA, and R; Bogetoft, P., Otto, L., Eds.; Springer: New York, NY, USA, 2011; pp. 115–153. [Google Scholar]
  76. Simou, E.; Karamagioli, E.; Roumeliotou, A. Reinventing primary health care in the Greece of austerity: The role of health-care workers. Prim. Health Care Res. Dev. 2015, 16, 5–13. [Google Scholar] [CrossRef] [PubMed]
  77. Dresios, C.; Rachiotis, G.; Symvoulakis, E.K.; Rousou, X.; Papagiannis, D.; Mouchtouri, V.; Hadjichristodoulou, C. Nationwide Epidemiological Study of Knowledge, Attitudes, and Practices Study of Greek General Practitioners Related to Screening. Int. J. Prev. Med. 2019, 10, 199. [Google Scholar] [CrossRef]
  78. World Health Organization. Monitoring and Documenting Systemic and Health Effects of Health Reforms in Greece: Assessment Report. Available online: https://iris.who.int/handle/10665/346262 (accessed on 5 October 2024).
  79. Peritogiannis, V.; Manthopoulou, T.; Gogou, A.; Mavreas, V. Mental Healthcare Delivery in Rural Greece: A 10-year Account of a Mobile Mental Health Unit. J. Neurosci. Rural Pract. 2017, 8, 556–561. [Google Scholar] [CrossRef]
  80. Kampouraki, M.; Paraskevopoulos, P.; Fachouri, E.; Terzoudis, S.; Karanasios, D. Building Strong Primary Healthcare Systems in Greece. Cureus 2023, 15, e41333. [Google Scholar] [CrossRef]
  81. Sbarouni, V.; Petelos, E.; Kamekis, A.; Tzagkarakis, S.I.; Symvoulakis, E.K.; Lionis, C. Discussing issues of health promotion and research in the context of primary care during the ongoing austerity period: An exploratory analysis from two regions in Greece. Med. Pharm. Rep. 2020, 93, 69–74. [Google Scholar] [CrossRef]
  82. Tountas, Y.; Karnaki, P.; Pavi, E.; Souliotis, K. The “unexpected” growth of the private health sector in Greece. Health Policy Amst. Neth. 2005, 74, 167–180. [Google Scholar] [CrossRef]
  83. Oikonomou, N.; Tountas, Y.; Kyriopoulos, J. The need for measurement of efficiency in Greek primary healthcare: The case for rural Southern and Western Greece. Appl. Health Econ. Health Policy 2012, 10, 143–144. [Google Scholar] [CrossRef]
  84. OECD; European Observatory on Health Systems and Policies. State of Health in the EU. In Greece: Country Health Profile 2021; OECD: Paris, France, 2021. [Google Scholar] [CrossRef]
  85. Kringos, D.; Boerma, W.; Bourgueil, Y.; Cartier, T.; Dedeu, T.; Hasvold, T.; Hutchinson, A.; Lember, M.; Oleszczyk, M.; Rotar Pavlic, D.; et al. The strength of primary care in Europe: An international comparative study. Br. J. Gen. Pract. J. R. Coll. Gen. Pract. 2013, 63, e742–e750. [Google Scholar] [CrossRef] [PubMed]
  86. Sifaki-Pistolla, D.; Chatzea, V.-E.; Markaki, A.; Kritikos, K.; Petelos, E.; Lionis, C. Operational integration in primary health care: Patient encounters and workflows. BMC Health Serv. Res. 2017, 17, 788. [Google Scholar] [CrossRef] [PubMed]
  87. Moran, V.; Suhrcke, M.; Nolte, E. Exploring the association between primary care efficiency and health system characteristics across European countries: A two-stage data envelopment analysis. BMC Health Serv. Res. 2023, 23, 1348. [Google Scholar] [CrossRef] [PubMed]
  88. Blöndal, B.; Ásgeirsdóttir, T.L. Costs and efficiency of gatekeeping under varying numbers of general practitioners. Int. J. Health Plann. Manag. 2019, 34, 140–156. [Google Scholar] [CrossRef] [PubMed]
  89. Tsimtsiou, Z.; Fragkoulis, E.; Koupidis, S.; Theodorakis, P. World Health Organization. Greece: Introducing Paperless, Remote ePRESCRIPTION—A Game-Changer for Primary Care Services (2021). Available online: https://www.who.int/europe/publications/m/item/greece-introducing-paperless-remote-eprescription-a-game-changer-for-primary-care-services-(2021) (accessed on 5 October 2024).
  90. Tsampouri, E.; Kapetaniou, K.; Missiou, A.; Bakola, M.; Willems, S.; Van Poel, E.; Tatsioni, A. Measures during the COVID-19 pandemic in public primary health care in Greece: Is there still a missing link to universal health coverage? BMC Prim. Care 2024, 24, 287. [Google Scholar] [CrossRef]
  91. Michalaki, F.; Triantafillopoulou, K.M.; Pagkozidis, I.; Tirodimos, I.; Dardavesis, T.; Tsimtsiou, Z. The impact of COVID-19 pandemic on Primary Health Care through “health providers” eyes’: A qualitative study of focus groups and individual interviews in Greece. Eur. J. Gen. Pract. 2024, 30, 2382218. [Google Scholar] [CrossRef]
Figure 1. Workflow of methods used for efficiency analysis.
Figure 1. Workflow of methods used for efficiency analysis.
Healthcare 12 02230 g001
Figure 2. Scatter plot matrix of the three inputs and three outputs for the 241 HCs dataset. Pairwise relationships (in the off-diagonal plots) and individual variable distributions (on the main diagonal) are shown. Inputs: (i1) number of medical staff, (i2) number of nursing and paramedical staff, (i3) number of administrative and other staff. Outputs: (o1) number of scheduled patient visits, (o2) number of emergency patient visits, (o3) number of patient visits for obtaining pharmaceutical prescriptions.
Figure 2. Scatter plot matrix of the three inputs and three outputs for the 241 HCs dataset. Pairwise relationships (in the off-diagonal plots) and individual variable distributions (on the main diagonal) are shown. Inputs: (i1) number of medical staff, (i2) number of nursing and paramedical staff, (i3) number of administrative and other staff. Outputs: (o1) number of scheduled patient visits, (o2) number of emergency patient visits, (o3) number of patient visits for obtaining pharmaceutical prescriptions.
Healthcare 12 02230 g002
Figure 3. Data selection process for primary healthcare (PHC) units (Health Centers and ToMYs) in Greece, 2019.
Figure 3. Data selection process for primary healthcare (PHC) units (Health Centers and ToMYs) in Greece, 2019.
Healthcare 12 02230 g003
Figure 4. Percentage distribution of Health Center (HC) inputs and outputs across regions categorized by (a) Eurostat’s typology (‘1’, ‘2’, ‘3’) and FAO’s classification (rural, peri-urban, urban) and (b) Regional Health Authorities (RHA 1-7).
Figure 4. Percentage distribution of Health Center (HC) inputs and outputs across regions categorized by (a) Eurostat’s typology (‘1’, ‘2’, ‘3’) and FAO’s classification (rural, peri-urban, urban) and (b) Regional Health Authorities (RHA 1-7).
Healthcare 12 02230 g004
Figure 5. Comparison of the number of efficient Health Centers (HCs) and mean efficiency scores across regions. The graphs display the ordinary (uncorrected), bootstrapped (corrected), and weighted mean efficiency scores for (a) constant returns to scale (CRS) and (b) variable returns to scale (VRS). Regions are categorized according to Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
Figure 5. Comparison of the number of efficient Health Centers (HCs) and mean efficiency scores across regions. The graphs display the ordinary (uncorrected), bootstrapped (corrected), and weighted mean efficiency scores for (a) constant returns to scale (CRS) and (b) variable returns to scale (VRS). Regions are categorized according to Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
Healthcare 12 02230 g005
Figure 6. Boxplots of bootstrapped VRS efficiency scores for Health Centers (HCs) across regions. Panel (a) shows distributions categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’) and FAO’s classification (urban, peri-urban, rural), while panel (b) presents distributions by Regional Health Authorities (RHA 1–7).
Figure 6. Boxplots of bootstrapped VRS efficiency scores for Health Centers (HCs) across regions. Panel (a) shows distributions categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’) and FAO’s classification (urban, peri-urban, rural), while panel (b) presents distributions by Regional Health Authorities (RHA 1–7).
Healthcare 12 02230 g006
Figure 7. Comparison of the number of efficient ToMYs and mean efficiency scores across RHAs. The graphs display ordinary (uncorrected), bootstrapped (corrected), and weighted mean efficiency scores for (a) constant returns to scale (CRS) and (b) variable returns to scale (VRS) across Regional Health Authorities (RHA 1–7).
Figure 7. Comparison of the number of efficient ToMYs and mean efficiency scores across RHAs. The graphs display ordinary (uncorrected), bootstrapped (corrected), and weighted mean efficiency scores for (a) constant returns to scale (CRS) and (b) variable returns to scale (VRS) across Regional Health Authorities (RHA 1–7).
Healthcare 12 02230 g007
Figure 8. Distribution of Health Center (HC) FDH efficiency and benchmark units across regions, categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7). Panel (a) shows the number of efficient facilities alongside the ordinary (uncorrected), bootstrapped (corrected), and weighted mean FDH scores. Panel (b) presents the count and percentage of efficient HCs that served as benchmarks.
Figure 8. Distribution of Health Center (HC) FDH efficiency and benchmark units across regions, categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7). Panel (a) shows the number of efficient facilities alongside the ordinary (uncorrected), bootstrapped (corrected), and weighted mean FDH scores. Panel (b) presents the count and percentage of efficient HCs that served as benchmarks.
Healthcare 12 02230 g008
Figure 9. Efficiency scores for the subset of 180 common DMUs in ascending order, based on the 241 HCs sample (green line). The orange area represents score differences observed when calculated for the 234 HCs sample, illustrating the influence of removed outliers on efficiency score distribution.
Figure 9. Efficiency scores for the subset of 180 common DMUs in ascending order, based on the 241 HCs sample (green line). The orange area represents score differences observed when calculated for the 234 HCs sample, illustrating the influence of removed outliers on efficiency score distribution.
Healthcare 12 02230 g009
Table 1. Production possibility set (PPS) assumptions and constraints for different DEA and FDH models. Summary of assumptions and structural constraints for each PPS.
Table 1. Production possibility set (PPS) assumptions and constraints for different DEA and FDH models. Summary of assumptions and structural constraints for each PPS.
ScalingFree DisposabilityConvexityReturns to Scale
Λ C R S = λ r   R +   |   r = 1 n λ r > 0
(CRS)
Λ N I R S = λ r   R +   |   r = 1 n λ r 1
(NIRS)
Λ N D R S = λ r   R +   |   r = 1 n λ r 1  
(NDRS)
Λ V R S = λ r   R +   |   r = 1 n λ r = 1 -
Λ F D H = λ r   N +   |   r = 1 n λ r = 1 --
Table 2. Cross-reference of PHC DEA efficiency analysis studies utilizing specified input–output variables, including data sources and relevant literature references.
Table 2. Cross-reference of PHC DEA efficiency analysis studies utilizing specified input–output variables, including data sources and relevant literature references.
InputsOutputsRef.
i1i2i3o1o2o3
[59]
[61]
[60]
[4]
[62]
[36]
[34]
[33] *
[35]
Data Sources
MoHMoHMoHMoHMoHMoH
Note: (i1) No. of medical staff, (i2) no. of nursing and paramedical staff, (i3) no. of administrative and other staff, (o1) no. of scheduled patient visits, (o2) no. of emergency patient visits, (o3) no. of patient visits for obtaining pharmaceutical prescriptions. (*) Systematic review article, MoH = Ministry of Health.
Table 3. Descriptive statistics for input and output variables of Health Centers (HCs) and ToMYs.
Table 3. Descriptive statistics for input and output variables of Health Centers (HCs) and ToMYs.
EntityStatisticInputsOutputs
i1i2i3o1o2o3
HCs (N = 234)
Mean11.415.78.215,992.49255.610,409.4
St Dev8.710.04.913,975.18845.27185.5
Median8.013.57.511,917.06333.08529.5
Min11134175364
Max47533175,38154,15636,929
Range46523075,04054,08136,565
Sum2665367519083,742,2112,165,8212,435,798
ToMYs (N = 94)
Mean2.63.12.54944.71025.92860.5
St Dev1.20.90.73123.5845.22019.9
Median3.03.03.04491.0786.02419.0
Min11146512109
Max55514,16138128737
Range44413,69638008628
Sum242293234464,80196,438268,890
Note: (i1) No. of medical staff, (i2) no. of nursing and paramedical staff, (i3) no. of administrative and other staff, (o1) no. of scheduled patient visits, (o2) no. of emergency patient visits, (o3) no. of patient visits for obtaining pharmaceutical prescriptions.
Table 4. Results of the global returns-to-scale (RTS) determination test for Health Centers (HCs) using a homogenous bootstrap approach (B = 2000 replications).
Table 4. Results of the global returns-to-scale (RTS) determination test for Health Centers (HCs) using a homogenous bootstrap approach (B = 2000 replications).
TestNull vs.StatisticSignificance Level
Alternative(ts1, ts2)α = 0.01α = 0.05
test-1H0: CRS1.1526p < 0.001p < 0.001
H1: VRS
test-2H’0: NRS1.0256p = 0.005p = 0.0025
H’1: VRS
Table 5. Summary statistics of Health Centers’ (HCs) VRS bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient DMUs. Data are categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
Table 5. Summary statistics of Health Centers’ (HCs) VRS bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient DMUs. Data are categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
RegionMeanBootstrapped Scores (B = 2000 repls.)Efficient DMUsWeighted Mean Eff.
MeanStDevMedianMinMaxNo.% Total
All1.972.311.132.021.138.0438100.00%1.92
‘1’1.671.970.921.621.195.05513.20%1.69
‘2’2.022.371.212.011.136.191128.90%1.99
‘3’2.022.371.132.101.168.042257.90%2.00
Urban1.521.790.821.361.194.18615.80%1.51
P-Urban1.912.230.982.011.136.632565.80%1.94
Rural2.392.831.482.251.228.04718.40%2.42
RHA 11.301.510.321.451.192.2312.60%1.51
RHA 21.982.321.241.781.206.63718.40%1.83
RHA 32.042.350.722.381.213.6725.30%2.12
RHA 42.312.671.302.371.176.1925.30%2.24
RHA 52.062.411.072.241.135.92615.80%2.05
RHA 61.882.241.152.011.168.041642.10%1.86
RHA 71.591.890.771.541.223.55410.50%1.64
Table 6. Summary statistics of ToMYs’ VRS bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient DMUs. Data are categorized by Regional Health Authorities (RHA 1–7).
Table 6. Summary statistics of ToMYs’ VRS bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient DMUs. Data are categorized by Regional Health Authorities (RHA 1–7).
RHAMeanBootstrapped Scores (B = 2000 repls.)Efficient DMUsWeighted Mean Eff.
MeanStDevMedianMinMaxNo.% Total
All1.741.981.331.481.1110.5526100.00%1.58
RHA 11.411.600.591.321.133.03311.50%1.46
RHA 21.481.700.651.481.113.53623.10%1.55
RHA 31.641.87 1.871.871.8700.00%1.87
RHA 41.651.880.951.521.133.94311.50%1.55
RHA 52.082.361.601.591.146.54415.40%1.69
RHA 61.912.191.831.511.1510.55726.90%1.63
RHA 71.581.770.731.381.163.03311.50%1.48
Table 7. Super-efficiency scores of efficient HCs, relative ranks, and peer counts as benchmarks under four different returns-to-scale (RTS) assumptions. The table is ordered in ascending order based on VRS ranks. Minimum and maximum values are denoted in bold.
Table 7. Super-efficiency scores of efficient HCs, relative ranks, and peer counts as benchmarks under four different returns-to-scale (RTS) assumptions. The table is ordered in ascending order based on VRS ranks. Minimum and maximum values are denoted in bold.
DMUSuper-Eff CRS Peer
Count
Super-Eff VRSPeer
Count
Supper-Eff NDRSPeer
Count
Super-Eff NIRSPeer
Count
ScoreRankScoreRankScoreRankScoreRank
D-1990.956190-134-1340.956260
D-212 -111-111
D-041 0.071320.07132
D-184 0.211420.21142
D-1680.8381050.310530.310550.838143
D-043 0.314600.31460
D-243 0.391720.39172
D-153 0.3938160.393816
D-1730.47611080.4129610.41291030.476166
D-0480.7457330.42810140.42810350.745812
D-1590.5983670.55011620.59813610.550268
D-1720.6114940.55512660.55511940.611566
D-0370.6145660.59013690.61414610.590374
D-2520.5962700.59414700.59412670.596473
D-1780.798990.6231530.6231580.798134
D-2560.6956970.68216700.69517940.682673
D-211 0.6931700.693160
D-035 0.7391831 0.739731
D-058 0.7741929 0.774929
D-0930.7958220.77520160.77518220.7951216
D-1890.8681150.7802110.8682140.780102
D-1660.87113590.78122810.87122560.7811184
D-1110.8751420.7912330.7911920.875193
D-044 0.842248 0.842158
D-2250.86912320.85225180.85220350.8691814
D-0570.87916430.85326330.87925380.8531638
D-007 0.858279 0.858179
D-2600.87615480.87528230.87624470.8752024
D-1620.8881770.8762910.8762350.888233
D-028 0.8853021 0.8852121
D-1550.9201830.8853150.9202610.885227
D-203 0.9103225 0.9102425
D-120 0.924338 0.924258
D-262 0.9593410.959271
D-133 0.965353 0.965273
D-238 0.982363 0.982283
D-176 0.992371 0.992291
D-152 0.999382 0.999302
# DMUs 19 38 27 30
Note: Min and max scores are shown in bold.
Table 8. Summary statistics of Health Centers’ (HCs) FDH bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient and benchmark DMUs. Data are categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
Table 8. Summary statistics of Health Centers’ (HCs) FDH bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient and benchmark DMUs. Data are categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
RegionMeanBootstrapped Scores (B = 2000 repls.)Efficient DMUsBenchmarksWeighted Mean Eff.
MeanStDevMedianMinMaxNo.% TotalNo.Peer Count
All1.341.510.561.221.094.28115100.0%351191.39
‘1’1.251.410.451.201.102.891916.5%3211.31
‘2’1.401.580.671.221.094.283328.7%6121.43
‘3’1.331.500.521.221.093.476354.8%26861.41
Urban1.171.330.311.201.102.261513.0%4241.25
P-Urban1.341.510.571.221.094.287767.0%24681.41
Rural1.411.590.601.221.133.452320.0%7271.48
RHA 11.051.180.101.151.111.4465.2%001.18
RHA 21.301.470.551.211.103.441916.5%2221.33
RHA 31.491.640.471.681.133.0732.6%221.54
RHA 41.531.710.691.331.093.451916.5%5111.55
RHA 51.391.550.671.251.094.281412.2%241.47
RHA 61.241.420.431.221.093.314438.3%20631.33
RHA 71.211.390.471.211.122.69108.7%4171.29
Table 9. Frequency distribution of Free Disposal Hull (FDH) benchmark appearances as peers for Health Centers (HCs) across regions, categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
Table 9. Frequency distribution of Free Disposal Hull (FDH) benchmark appearances as peers for Health Centers (HCs) across regions, categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
CountBenchmarkNumber of Benchmark Appearances as Peers
All‘1’‘2’‘3’UrbanP-UrbanRuralRegional Health Authority
1234567
1D-037195311·18112545 2
2D-26011 47 65 4 2 41
3D-17210 37 82 12214
4D-2039 3618 1413
5D-1738 26 53 17
6D-1666 33 6 11121
7D-2255 23221 311
8D-1784 13 13 1 12
9D-25243·1 311 3 ·
10D-0283 21 3 · 2 1
11D-0913 3 3 21·
12D-09231 2 3 1 ·11
13D-1523 1212 111
14D-1042 2 11 ·11
15D-1182 1111 1· 1
16D-2082 2· 11 1 1
17D-2102 211 2·
18D-2222 11 2· 11
19D-2232 2 ·2 2
20D-2322 ·2 11 1 ·1
21D-2362 11 ·2 11
22D-23822 ·2 1 1 ·
23D-0571·1 1 · 1
24D-05811 1 1 ·
25D-0881 1 ·1 1 ·
26D-1591 1·1 ·1
27D-1621 1 1 1
28D-1681 1 1 1·
29D-1761 1 1 1 ·
30D-1831 1 1 1 ·
31D-2111 ·1 ·1 1 ·
32D-2161 ·1 1 1·
33D-2331 1 ·1 1·
34D-2501 1 1· 1·
35D-25311· 1· 1 ·
TotalAppearences11913317598228414132525335
Benchmarks353626424702252204
Note: The shaded cells in each row indicate the sub(category) of origin (i.e., column header) for the corresponding benchmark unit.
Table 10. Summary statistics of ToMYs’ FDH bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient and benchmark DMUs. Data are categorized by Regional Health Authorities (RHA 1–7).
Table 10. Summary statistics of ToMYs’ FDH bootstrapped efficiency scores, conventional and bootstrapped weighted means, and number and percentage of efficient and benchmark DMUs. Data are categorized by Regional Health Authorities (RHA 1–7).
RegionMeanBootstrapped Scores (B = 2000 repls.)Efficient DMUsBenchmarksWeighted
Mean Eff.
MeanStDevMedianMinMaxNo. % TotalNo.Peer Count
All1.361.550.811.211.086.5852100.0%15421.34
RHA 11.251.420.391.221.142.25611.5%261.34
RHA 21.161.330.411.211.082.83917.3%271.27
RHA 31.641.82 1.821.821.8200.0%001.82
RHA 41.301.470.661.201.093.47713.5%111.31
RHA 51.611.831.061.221.164.12815.4%5201.43
RHA 61.451.631.051.201.106.581834.6%461.34
RHA 71.251.410.361.241.152.0847.7%121.28
Table 11. Frequency distribution of Free Disposal Hull (FDH) benchmark appearances as peers for ToMYs across Regional Health Authorities (RHA 1–7).
Table 11. Frequency distribution of Free Disposal Hull (FDH) benchmark appearances as peers for ToMYs across Regional Health Authorities (RHA 1–7).
CountBenchmarkNumber of Benchmark Appearances as Peers
ALLRegional Health Authority (RHA/YPE)
1234567
1TM-04710 11323
2TM-021622 11
3TM-0104· 2 2
4TM-0514 22
5TM-0544 1 1·2
6TM-0733 2 1
7TM-006211
8TM-0912 11
9TM-0161 · 1
10TM-0421 1
11TM-0451 ·1
12TM-04811 ·
13TM-06411 ·
14TM-0691 ·1
15TM-0841 1 ·
TotalAppearances4258157124
Benchmarks152201541
Note: The shaded cells in each row indicate the sub(category) of origin (i.e., column header) for the corresponding benchmark unit.
Table 12. Kruskal–Wallis and median test results for HCs across DEA and FDH (conventional and bootstrapped) models. Tests assessed efficiency score differences across regions, categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
Table 12. Kruskal–Wallis and median test results for HCs across DEA and FDH (conventional and bootstrapped) models. Tests assessed efficiency score differences across regions, categorized by Eurostat’s typology (‘1’, ‘2’, ‘3’), FAO’s classification (urban, peri-urban, rural), and Regional Health Authorities (RHA 1–7).
(Sub)Categories CRSVRSIRSDRSFDHBootstrapped
TestedCRSVRSIRSDRSFDH
Kruskal–Wallis test
‘1’, ‘2’,’3’χ2 (df = 2)1.973.890.935.921.562.314.801.525.764.85
p-value0.370.140.630.050.460.320.090.470.060.09
Urban, P-Urban, Ruralχ2 (df = 2)6.0512.403.1718.283.286.1516.344.3319.105.03
p-value0.050.000.210.000.190.050.000.120.000.08
RHA 1, 2, 3, 4, 5, 6, 7χ2 (df = 6)10.6015.2012.5214.8916.379.0514.599.5214.4016.95
p-value0.100.020.050.020.010.170.020.150.030.01
Median test
‘1’, ‘2’, ‘3’χ2 (df = 2)1.655.981.655.541.651.395.981.395.544.23
p-value0.440.050.440.060.440.500.050.500.060.12
Urban, P-Urban, Ruralχ2 (df = 2)0.715.760.3510.472.001.289.321.2810.473.63
p-value0.700.060.840.010.370.530.010.530.010.16
RHA 1, 2, 3, 4, 5, 6, 7χ2 (df = 6)8.1210.878.0511.4515.928.1212.558.1113.0513.79
p-value0.230.090.230.080.010.230.050.230.040.03
Table 13. Summary tsatistics of scale efficiency scores for Health Centers (HCs) across regions categorized by Eurostat’s typology, FAO’s classification, and Regional Health Authorities (RHA 1–7).
Table 13. Summary tsatistics of scale efficiency scores for Health Centers (HCs) across regions categorized by Eurostat’s typology, FAO’s classification, and Regional Health Authorities (RHA 1–7).
StatisticAll‘1’‘2’‘3’UrbanP-UrbanRuralRegional Health Authority (RHA/YPE)
1234567
Mean1.171.231.181.151.301.161.121.341.171.291.091.181.151.21
StDev0.270.290.290.250.330.250.280.310.290.320.130.250.280.35
Median1.051.091.061.051.241.051.041.241.051.151.051.091.041.04
Min1.001.001.001.001.001.001.001.031.001.001.001.001.001.00
Max2.712.122.442.711.982.442.711.802.442.121.762.122.712.12
Weighted mean1.191.241.211.151.351.171.081.361.151.251.111.191.161.30
Note: Classifications ‘1’, ’2’, and ’3’ according to Eurostat (‘1’ = urban areas, ‘2’ = intermediate areas, ‘3’ = rural areas); classifications ‘urban’, ‘p[eri]-urban’, and ‘rural’ according to FAO.
Table 14. Summary statistics of scale efficiency scores for ToMYs across Regional Health Authorities (RHA 1–7).
Table 14. Summary statistics of scale efficiency scores for ToMYs across Regional Health Authorities (RHA 1–7).
StatisticAllRegional Health Authority (RHA/YPE)
1234567
Mean1.201.041.391.001.061.041.331.02
StDev0.660.071.07 0.080.040.810.03
Median1.021.001.051.001.001.031.031.01
Min1.001.001.001.001.001.001.001.00
Max5.431.225.431.001.251.164.021.07
Weighted mean1.061.041.101.001.051.031.081.01
Table 15. Comparison of efficiency metrics and peer relationships in two samples (241 HCs and 234 HCs) before and after the removal of outlier decision-making units (DMUs).
Table 15. Comparison of efficiency metrics and peer relationships in two samples (241 HCs and 234 HCs) before and after the removal of outlier decision-making units (DMUs).
Entity241 HCs Sample
(Original)
234 HCs Sample
(Outliers Removed)
Number of decision-making units (DMUs) in sample241234
Number of outliers7-
Number of efficient DMUs3138
Number of efficient outliers4-
Number of inefficient outliers3-
Number of efficient DMUs common to both samples2727
Number of DMUs with at least one efficient outlier as a peer183-
Percentage of inefficient DMUs using at least one efficient outlier as a peer90.1%-
Total occurrences of efficient outliers as peers261-
Total occurrences of all efficient DMUs as peers812-
Percentage of peer occurrences for efficient outliers (of total)32.1%-
Mean efficiency score of all DMUs in sample2.331.97
Number of inefficient DMUs common to both samples (using at least one efficient outlier as a peer in the 241 HCs sample)180180
Descriptive statistics of the subset (180 common DMUs)241 HCs Sample234 HCs Sample
    Mean efficiency score2.652.14
    Standard deviation1.230.96
    Minimum efficiency score1.021.00
    Maximum efficiency score8.075.68
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Flokou, A.; Aletras, V.H.; Miltiadis, C.; Karaferis, D.C.; Niakas, D.A. Efficiency of Primary Health Services in the Greek Public Sector: Evidence from Bootstrapped DEA/FDH Estimators. Healthcare 2024, 12, 2230. https://doi.org/10.3390/healthcare12222230

AMA Style

Flokou A, Aletras VH, Miltiadis C, Karaferis DC, Niakas DA. Efficiency of Primary Health Services in the Greek Public Sector: Evidence from Bootstrapped DEA/FDH Estimators. Healthcare. 2024; 12(22):2230. https://doi.org/10.3390/healthcare12222230

Chicago/Turabian Style

Flokou, Angeliki, Vassilis H. Aletras, Chrysovalantis Miltiadis, Dimitris Charalambos Karaferis, and Dimitris A. Niakas. 2024. "Efficiency of Primary Health Services in the Greek Public Sector: Evidence from Bootstrapped DEA/FDH Estimators" Healthcare 12, no. 22: 2230. https://doi.org/10.3390/healthcare12222230

APA Style

Flokou, A., Aletras, V. H., Miltiadis, C., Karaferis, D. C., & Niakas, D. A. (2024). Efficiency of Primary Health Services in the Greek Public Sector: Evidence from Bootstrapped DEA/FDH Estimators. Healthcare, 12(22), 2230. https://doi.org/10.3390/healthcare12222230

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop