Strategic Queueing Behavior of Two Groups of Patients in a Healthcare System

Liu, Youxin; Liu, Liwei; Jiang, Tao; Chai, Xudong

doi:10.3390/math12101579

Open AccessArticle

Strategic Queueing Behavior of Two Groups of Patients in a Healthcare System

¹

School of Mathematics and Statistics, Nanjing University of Science and Technology, Nanjing 210094, China

²

Department of Elementary Teaching, Wuhu Institute of Technology, Wuhu 241003, China

³

College of Economics and Management, Shandong University of Science and Technology, Qingdao 266590, China

⁴

School of Mathematics-Physics and Finance, Anhui Polytechnic University, Wuhu 241000, China

^*

Authors to whom correspondence should be addressed.

Mathematics 2024, 12(10), 1579; https://doi.org/10.3390/math12101579

Submission received: 18 April 2024 / Revised: 10 May 2024 / Accepted: 16 May 2024 / Published: 18 May 2024

(This article belongs to the Special Issue Queueing Systems Models and Their Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Long waiting times and crowded services are the current medical situation in China. Especially in hierarchic healthcare systems, as high-quality medical resources are mainly concentrated in comprehensive hospitals, patients are too concentrated in these hospitals, which leads to overcrowding. This paper constructs a game-theoretical queueing model to analyze the strategic queueing behavior of patients. In such hospitals, patients are divided into first-visit and referred patients, and the hospitals provide patients with two service phases of “diagnosis” and “treatment”. We first obtain the expected sojourn time. By defining the patience level of patients, the queueing behavior of patients in equilibrium is studied. The results suggest that as long as the patients with low patience levels join the queue, the patients with high patience levels also join the queue. As more patients arrive at the hospitals, the queueing behavior of patients with high patience levels may have a negative effect on that of patients with low patience levels. The numerical results also show that the equilibrium behavior deviates from a socially optimal solution; therefore, to reach maximal social welfare, the social planner should adopt some regulatory policies to control the arrival rates of patients.

Keywords:

healthcare system; comprehensive hospital; patients; rational queueing behavior; equilibrium analysis

MSC:

90B22; 60K25

1. Introduction

For a long time, in China, “being difficult and expensive to see a doctor” has been an important problem troubling the government, medical service institutions and patients. However, due to the inherent advantages of comprehensive hospitals in medical quality and service capacity, they usually act as high-level healthcare providers, which leads to an excessive concentration of patients in these hospitals, while the number of patients in community hospitals is relatively small, and part of the resources are scarce and part of the resources are idle. As a result, patients need to wait for a long time to see a doctor, leading to overcrowding in such hospitals.

This paper studies a queueing setting of comprehensive hospitals in which two groups of delay-sensitive patients (first-visit and referred patients) with common or chronic diseases arrive at comprehensive hospitals to get their conditions diagnosed and also receive the required treatment. Both groups line up in a common queue for service based on a first-come, first-served (FCFS) discipline. Our motivation to analyze this setting is overcrowding in comprehensive hospitals, as they serve both their own patients and also the patients referred to them, where the referred patients come from the community hospitals for treatments. For example, in comprehensive hospitals for common or chronic diseases, their own patients (e.g., the first-visit patients) need to go through two service phases of “diagnosis” and “treatment”, but the referred patients (e.g., the patients who firstly seek service at the community hospitals and cannot be cured) need to line up in a common queue with the first-visit patients, bypass the “diagnosis” and go directly to “treatment”. For these patients with common or chronic diseases, they are more concerned about their waiting time in hospitals. Therefore, service delay is an important indicator of patients’ perceived value, and the length of waiting time has an important effect on patients’ strategic decisions. Since queueing theory is the main tool for solving the queueing problem in healthcare systems, we could incorporate queueing economics in comprehensive hospitals. On the basis of the queueing setting in comprehensive hospitals, in this study, we model the hospitals as queueing systems for a mixture of two types of patients with different delay sensitivities and service rewards, where the first-visit patients go through two service phases of “diagnosis” and “treatment”, and the referred patients only need “treatment”. To the best of our knowledge, the strategic behavior of patients in such queueing systems has not been analyzed and will be a good research direction.

We present a queueing-theoretic formulation of healthcare systems. In healthcare systems, healthcare providers with a limited capacity are often faced with delay-sensitive patients. For the sake of analytical tractability, in our work, we assume that patients arrive at the system according to a Poisson process. Patients are divided into two groups according to their medical condition and their delay sensitivities as well as their valuation of service offerings. More generally, we do not set a limit on the relative magnitude of the delay sensitivity and the valuation of service offerings between the two groups of patients. Since the cost of diagnosis is relatively lower than that of treatment, it is assumed that the cost of service is the same for the two types of patients, and the influence of service cost on the decision-making of delay-sensitive patients is not considered in this paper. Therefore, based on the valuation of service offerings and the expected cost of waiting in hospitals, each patient who arrives at the comprehensive hospital decides whether to join the queue or balk. Different from the assumption in Wen et al. (2019) that the hospital provides two separate queues [1], namely a common line for first-visit patients and a green line for referred patients, in this paper, we assume that the two groups of patients line up in a common queue. Therefore, in this paper, we adopt the analysis method of Tang et al. (2018) [2] to deal with the queueing problem in comprehensive hospitals, study how the delay of the pooled queue affects patients’ queueing decisions and derive the queueing behavior of patients in equilibrium.

There are three streams in the literature related to our study: queueing behavior of customers, various customer classifications in behavioral queueing models and queueing economics in healthcare systems.

The first related stream of research is the analysis of queueing behavior of customers. In general, the literature on the queueing behavior of customers can be roughly divided into two categories: observable and unobservable queues. In the observable queues, the problem that needs to be analyzed is the dynamic control of queues, whereas in the unobservable queues, customers rely on their net service utilities to make queueing decisions. Naor (1969) [3] is the first scholar to study customers’ strategic behavior in queueing systems. In Naor’s model, the author considers an observable M/M/1 queue, where customers decide to either join the queue or balk based on the service reward and the waiting cost. Edelson and Hilderbrand (1975) [4] extended Naor’s model and considered the strategic queueing behavior of customers in an unobservable M/M/1 queueing system. Since then, by allowing the customers to make their queueing decisions, a large amount of the literature has studied both the observable and unobservable queues, and studied the relationship between socially and individually optimal queueing decisions; interested readers may refer to the good survey of studies; see, e.g., Hassin and Haviv (2003) [5], Hassin (2016) [6], Ibrahim (2018) [7] and Economou (2022) [8]. In healthcare systems, service delay and waiting time are major concerns for delay-sensitive patients; hence, we need to introduce a queueing model in the healthcare systems and study the patients’ queueing decisions.

The second related stream of research studies various customer classifications in behavioral queueing models. Roughly speaking, depending on the number of customer types in the queue, the literature on behavioral queueing models can also be divided into two categories: homogeneous and heterogeneous customers. In the first category, there is no difference between customers, i.e., all customers have the same service reward, the same sensitivity to delay, share the same information level, etc. Therefore, all customers are indistinguishable; in the observable queue, the queueing decision of a customer is independent of the strategies of all other customers, and in the unobservable queue, the game among customers is a symmetric game. The classic results can be seen in the review literature: Hassin and Haviv (2003) [5] and Hassin (2016) [6]. However, in the second category, customers are heterogeneous as they may have different sensitivities to delay, different service rewards, different psychological characteristics, heterogeneous information, etc. So far, there is also a large number of studies in the literature on various customer classifications in behavioral queueing models. For example, Ni et al. (2013) [9] classified customers according to “customer intensity” and studied the revenue-maximization service provider’s price–speed decision in customer-intensive service systems. In Zhou et al. (2014) [10], the customers are categorized into two types based on the valuation for service and sensitivity to delay, and the authors solved the optimal uniform pricing problem by using a queueing model with two types of customers. Considering whether customers have sufficient information about the service, Zhou et al. (2014) [11] divided customers into informed and uninformed customers and solved the problem of when service enterprises should provide free experience service to uninformed customers. Tang et al. (2018) [2] classified customers into two types based on workload and delay sensitivity and obtained the equilibrium queueing strategies of the two types of customers. Based on the availability of queue length information and system state, Hu et al. (2018) [12] and Wang and Wang (2019) [13] classified customers into informed and uninformed customers. In the M/M/1 queue and retrial queue, the authors, respectively, studied the equilibrium strategies of customers and the effect of information heterogeneity on throughput and social welfare. Wang and Fang (2022) [14] took the heterogeneity of priority awareness as a classification criterion; Wang and Sun (2022) [15] classified customers according to customer service experience, according to whether customers choose to stay on-site when the system provides services; and Hanukov et al. (2023) [16] also divided customers into questioning customers and trusting customers; then, the authors studied the customer equilibrium strategy and system optimization decision-making in the priority queueing system, online service queueing system, queue-inventory system and other related systems.

In comprehensive hospitals for common or chronic diseases, as they serve both their own patients and also the patients referred to them, we need to categorize patients into first-visit and referred patients, so that we can investigate the mutual influence mechanism of the decision-making process of two types of customers and deduce the equilibrium queueing strategies of patients.

Our study is also very relevant to queueing economics in healthcare systems. A lot of researchers are interested in using queueing models to study patient queueing problems in hierarchic healthcare systems. So far, according to the composition of the institution in hierarchic healthcare systems, these systems can be broadly divided into two categories: horizontal and vertical hierarchical healthcare systems. In the first category, the hierarchic healthcare systems consist of public hospitals offering free services and private hospitals offering toll services. In the second category, the hierarchic healthcare systems consist of two types of hospitals, namely comprehensive hospitals with high-quality medical resources and community or primary hospitals with low-quality medical resources, where uncured patients could be referred to comprehensive hospitals from community hospitals and cured patients could be transferred from comprehensive hospitals to community hospitals for further rehabilitation therapy.

For the first category, the related literature includes Guo et al. (2014) [17], Chen et al. (2015) [18], Hua et al. (2016) [19], Qian and Zhuang (2017) [20], Wan and Wang (2017) [21], Qian et al. (2017) [22], Zhang and Yin (2021) [23,24], Zhou et al. (2022) [25], Chen (2023) [26], and Hu et al. (2024) [27]. In Guo et al. (2014) [17], the authors analyzed the pricing and capacity decisions of a two-tier medical service system under the condition of ensuring self-financing. In Chen et al. (2015) [18] and Hua et al. (2016) [19], the authors considered two subsidy schemes and analyzed the competition between public and private hospitals in a two-tier medical service system. The main results showed that a relatively small rate could perfectly harmonize the two-tier medical service system, and at the same time, the subsidy coordination method can effectively reduce the waiting time in public hospitals and improve social welfare. In Qian and Zhuang (2017) [20], the authors studied the coordination mechanism of tax/subsidy and service capacity planning of a two-tier medical service system in terms of welfare redistribution. The results showed that tax subsidies or capacity planning for hospitals can induce patients with different time-delay sensitivities to choose different hospitals. In Wan and Wang (2017) [21], from the perspectives of patient-waiting-time minimization and social welfare maximization, the authors analyzed the optimal decision of a two-tier medical service system. In Qian et al. (2017) [22], the authors studied the strategy of reducing patients’ waiting time through a subsidy mechanism in the public medical service system. By analyzing the public medical service system, the authors obtained the optimal subsidy mechanism for subsidy and waiting time. In Zhang and Yin (2021) [23], the authors defined the mixed information and non-real-time information cases, and then based on the matrix analytic method, they proposed a computational approach to analyze the system performance and examine the joint effect of delay information and pricing on the system performance. In Zhang and Yin (2021) [24], the authors investigated a two-tier service system with customers’ asymmetric preference for charged-service and free-service providers. By constructing an M/M/1 queueing model, they derived the customers’ choice in equilibrium and found that in some cases, the two-tier service system could solve the over-congestion problem and reduce the total social cost. Moreover, in Zhou et al. (2022) [25], the authors studied a mixed duopoly service system with private and public service providers. In Chen (2023) [26], the author investigated a two-tier co-payment healthcare system under a uniform pricing and subsidy coordination mechanism. In Hu et al. (2024) [27], the authors modeled privatized public service systems as queueing systems and displayed whether the government adopting myopic adjustment plays a critical role in choosing the regulation instrument.

For the second category, the related literature includes Li et al. (2017) [28], Li et al. (2019) [29], Wen et al. (2019) [1], Zhou et al. (2021) [30], Li et al. (2021) [31], Li et al. (2021) [32], Wang et al. (2021) [33], Rajan et al. (2019) [34], and Li et al. (2023) [35,36]. For example, Li et al. (2017) [28] considered the reverse referral (upstream referral) in the tiered healthcare system with delay-sensitive patients and used the queueing approach to examine the effect of reverse referral partnerships. Li et al. (2019) [29] considered a gatekeeping system with heterogeneous patients and investigated the effects of online inquiry service on performance by using the queueing-game theory. In the hierarchical healthcare system, Wen et al. (2019) [1] proposed a stochastic tandem queueing model to obtain the optimal capacity allocation. In a referral healthcare system, Zhou et al. (2021) [30] considered gatekeeping and non-gatekeeping settings. By using the queueing approach, the authors compared the effect of the two settings on a social planner’s capacity decision. Li et al. (2021) [31] considered an operational-level control agreement framework and developed a multi-fidelity model-based optimization approach to solve the Pareto optimization for control agreement in patient referral coordination. Li et al. (2021) [32] developed two payment schemes to facilitate capacity sinking. The authors constructed a four-stage game model under the queueing-game theory framework to study the capacity reallocation in a downstream referral healthcare system. Wang et al. (2021) [33] investigated the hospital referral and capacity strategies in a downstream referral healthcare system, in which the authors obtain the equilibrium strategy of the comprehensive hospital provider’s referral rate and primary hospital provider’s capacity level. Furthermore, the authors also explored the impact of some parameters on the referral healthcare system. In addition to the literature on referral healthcare systems, Rajan et al. (2019) [34] also considered a healthcare system with heterogeneous patients. In the healthcare system, the authors investigated the effects of telemedicine technology on patient utility and healthcare provider’s operating decisions. Moreover, the authors presented some policy implications for facilitating the further development of telemedicine in chronic care. Li et al. (2023) [35,36] proposed some contract mechanisms to solve healthcare imbalance in hierarchical healthcare systems.

In contrast to these studies, we model the comprehensive hospital as a queueing system with a mixture of two types of patients and two service phases, rather than the classic M/M/1 queue with homogeneous patients, which makes the analysis more challenging. Moreover, different from the homogeneous patients who have either the same service reward or the same sensitivity for delay, we assume that the two types of patients are heterogeneous in both delay sensitivity and service reward, and the patients line up in a pooled queue rather than two dedicated queues. Under this assumption, either group of patients may have a higher incentive to join the hospital, rather than one group of patients always having a higher incentive than the other. As a result, we need to distinguish different scenarios to solve the problem under consideration. Finally, we assume that the two groups of patients arriving at the comprehensive hospital are two independent Poisson flows, so that, the proportion of one type of patient is no longer regarded as exogenous but endogenous through the two independent Poisson flows. Based on these differences, we aim to find how the delay of the pooled queue affects patients’ queueing decisions and derive the queueing behavior of patients in equilibrium.

This paper is structured as follows: Section 2 incorporates the heterogeneity of patients into a queueing system and describes the queueing model. Section 3 gives the concrete expression of the expected sojourn time of an arbitrary patient after deriving the steady-state probabilities of the system. Section 4 derives the equilibrium queueing strategies of the two groups of patients. Section 5 presents some numerical examples to verify the correctness of the theoretical results. Section 6 is a separate discussion section to brief the research findings. Section 7 concludes this study.

2. Model Formulation

Consider a monopolistic comprehensive hospital in a healthcare system that provides patients with two service phases of “diagnosis” and “treatment”. The service settings in the comprehensive hospital are shown in Figure 1. Patients are divided into two groups: the first-visit patients (labeled d) and the referred patients (labeled t). The arrival process of first-visit and referred patients is independent of each other; they arrive at the system according to a Poisson process with rates

λ_{d}

and

λ_{t}

. As a result, the total arrival rate of patients is

λ = λ_{d} + λ_{t}

, the fraction of first-visit patients is

γ = \frac{λ_{d}}{λ_{d} + λ_{t}}

and the fraction of referred patients is

\bar{γ} = 1 - γ

, which is endogenous via the two independent Poisson flows.

The two groups of patients differ in three ways. First, the two groups of patients come from different sources and have different medical experiences. The first-visit patients do not receive any medical diagnosis and treatment before reaching the comprehensive hospital. After arriving at the hospital, they need to line up in the queue and go through two service phases of “diagnosis” and “treatment”. However, the referred patients who have received medical diagnosis and treatment at community hospitals cannot be cured and need to be referred to the comprehensive hospital, where they queue up with the first-visit patients, bypass the “diagnosis” and go directly to “treatment”. Second, the two groups of patients are heterogeneous in their delay sensitivities, denoted by

θ_{d}

,

θ_{t}

. Third, the valuation of service offerings (the service reward obtained after the service is completed) for both groups of patients is different, denoted by

R_{d}

,

R_{t}

. For the last two aspects, we do not set a limit on the relative magnitude of the delay sensitivity and the valuation of service offerings between the two groups of patients.

Moreover, the “diagnosis” and “treatment” are independent. It is assumed that the service time in the “diagnosis” and “treatment” service phases follows the exponential distribution with the parameters

μ_{d}

and

μ_{t}

, respectively. The hypotheses of the Poisson arrival process and exponential service time in hospitals have been empirically tested by Kim et al. (1999) [37] and have been widely used in the relevant medical operations management literature. An arriving patient, if required, joins the queue. The discipline is FCFS and is not related to the type of service.

Actually, by this assumption, the service time in our model is type-dependent, i.e., the service time of first-visit patients is the sum of two exponential distributions of mean

1 / μ_{d}

and

1 / μ_{t}

, and that of referred patients follows an exponential distribution with mean

1 / μ_{t}

. Therefore, the problem studied can be modeled as a special M/H₂/1 queue, where the unconditional service time does not follow an exponential distribution, but rather the sum of two exponential distributions of mean

1 / μ_{d}

and

1 / μ_{t}

with probability

γ

, and an exponential distribution of mean

1 / μ_{t}

with

\bar{γ}

, which is a weighting of two distributions and represents a hyper-exponential distribution. By Pollaczek–Khintchine lemma, we may obtain the expected sojourn time of an arbitrary patient. Next, we first obtain the stability condition, then we use the queueing theory and construct a continuous-time Markov chain to obtain the expected sojourn time of an arbitrary patient. Table 1 and Table 2 list the established model parameters and the system variables in the subsequent content.

3. Performance Analysis

3.1. Stability Condition

Let

L (t)

denote the number of patients in the system at time t and

J (t)

denote the service phase provided by the hospital at time t, where

J (t) = \{\begin{cases} 0, the hospital is in the service phase of “ diagnosis ” at time t, \\ 1, the hospital is in the service phase of “ treatment ” at time t . \end{cases}

Obviously,

X (t) = \{(L (t), J (t)), t \geq 0\}

is a continuous-time Markov chain with state space

Ω = {(0)} \cup {(k, j) : k \geq 1, j = 0, 1}

. The transition rate diagram is given in Figure 2.

Considering the state change of the Markov chain, we have the following state-transition-rate matrix:

Q = (\begin{matrix} A_{0} & C_{0} & 0 & 0 & \dots \\ B_{0} & A & C & 0 & \dots \\ 0 & B & A & C & \dots \\ 0 & 0 & B & A & \dots \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋱ \end{matrix})

where

A_{0} = - λ, C_{0} = (λ γ, λ \bar{γ}), B_{0} = (\begin{matrix} 0 \\ μ_{t} \end{matrix}), A = (\begin{matrix} - (λ + μ_{d}) & μ_{d} \\ 0 & - (λ + μ_{t}) \end{matrix}), B = (\begin{matrix} 0 & 0 \\ μ_{t} γ & μ_{t} \bar{γ} \end{matrix}), C = (\begin{matrix} λ & 0 \\ 0 & λ \end{matrix}) .

We first use Theorem 1.7.1 of Neuts (1981) [38] to derive the stability condition of the underlying queueing system.

Lemma 1.

The queueing system is stable if and only if

λ_{t} μ_{d} + λ_{d} (μ_{t} + μ_{d}) < μ_{t} μ_{d} .

Proof of Lemma 1.

The proof process is provided in the Appendix A. □

3.2. Expected Sojourn Time

To obtain the expected sojourn time of an arbitrary patient, we first define the steady-state probabilities by

π_{k, j} = \lim_{t \to + \infty} P (L (t) = k, J (t) = j), k \geq 1, j = 0, 1, π_{0} = \lim_{t \to + \infty} P (L (t) = 0)

π_{k} = (π_{k, 0}, π_{k, 1}), k \geq 1, π = {π_{0}, π_{1}, π_{2}, \dots,}

Then, by constructing the balance equations and obtaining the steady-state probabilities, we could derive the expected sojourn time of an arbitrary patient. The following proposition gives the explicit solution of the expected sojourn time:

Proposition 1.

The expected sojourn time of an arbitrary patient is given by

W (λ_{d}, λ_{t}) = (\frac{λ_{d}}{μ_{d} (λ_{d} + λ_{t})} + \frac{1}{μ_{t}}) [1 + \frac{(λ_{d} + λ_{t}) [μ_{d} + μ_{t} - (λ_{d} + λ_{t})]}{μ_{t} μ_{d} - λ_{d} μ_{t} - (λ_{d} + λ_{t}) μ_{d}}] - \frac{λ_{d} + λ_{t}}{μ_{t} μ_{d}}

(1)

Proposition 1 identifies the expected sojourn time of an arbitrary patient given the arrival rates

λ_{d}

and

λ_{t}

. From the expression, we find that the expected sojourn time should be a weighted average of the sojourn time for the two types of patients. Next, we discuss the properties of the result by the following proposition.

Proposition 2.

(i) If

λ_{d}

and

λ_{t}

satisfy

λ_{d} > 0

,

λ_{t} > 0

and

λ_{t} μ_{d} + λ_{d} (μ_{t} + μ_{d}) < μ_{t} μ_{d}

, then

\frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{d}} > 0

(ii) If

λ_{d}

and

λ_{t}

satisfy

λ_{d} > 0

,

λ_{t} > 0

,

λ_{t} μ_{d} + λ_{d} (μ_{t} + μ_{d}) < μ_{t} μ_{d}

, and

λ_{d} \in [{\bar{λ}}_{d}, \frac{μ_{t} μ_{d}}{μ_{t} + μ_{d}})

, where

{\bar{λ}}_{d}

satisfies the equation

\frac{μ_{d}^{2} + {\bar{λ}}_{d} μ_{t}}{{(μ_{t} μ_{d} - {\bar{λ}}_{d} μ_{t} - {\bar{λ}}_{d} μ_{d})}^{2}} = \frac{1}{{\bar{λ}}_{d} μ_{d}}

, then

\frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{t}} > 0

.

Proof of Proposition 2.

The proof process is provided in the Appendix A. □

Proposition 2 shows how the arrival rates

λ_{d}

and

λ_{t}

affect the expected sojourn time of patients in a hospital. See Figure 3 for the illustration. As shown in Figure 3, with different arrival rates

λ_{t}

,

W (λ_{d}, λ_{t})

is monotonically increasing in

λ_{d}

. However, with different arrival rates

λ_{d}

, as

λ_{t}

increases, the expected sojourn time no longer increases monotonously. This interesting counter-intuitive phenomenon may be attributed to two factors: (1) the differences in service time between two groups of patients; and (2) whether the hospital is congested. When the arrival rates of two types of patients are very small, the hospital is not congested, and relatively speaking, the hospital has enough service capacity at this time; therefore, the waiting time of patients in the queue (queueing time) is almost negligible, and the sojourn time (the sum of waiting time in the queue and the service time) is only related to the service time. Since the service time of first-visit patients is longer than that of referred patients, and the expected sojourn time of patients is a weighted average of the two types of patients’ sojourn time, when the arrival rate of referred patients increases gradually from zero, the proportion of referred patients increases, which leads to a gradual decrease in the weighted average sojourn time. However, when the arrival rate of referred patients exceeds a threshold, the expected sojourn time increases as the arrival rate of referred patients increases.

Proposition 2 indicates that the waiting time and sojourn time have different properties in our queueing model. When the hospital has enough service capacity, patients can receive service immediately without waiting in the queue; in this case, the waiting time in the queue can be ignored, and the sojourn time is equal to the service time. Only when the hospital is congested, that is, patients need to wait for service, the sojourn time of patients in the hospital is mainly the waiting time in the queue, and the more congested queue always leads to the longer expected waiting time and expected sojourn time. In order to reduce the sojourn time of patients in hospitals, the result could provide the following management implications for medical service managers: (1) when the arrival rates of the two types of patients remain unchanged, the most direct way is to increase investment and expand service capacity; and (2) without changing the service capacity, comprehensive hospitals should control the arrival rate of first-visit patients below a certain level and adjust the arrival rate of acceptable referred patients based on the service capacity and specific threshold.

Remark 1.

\frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{t}} < \frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{d}}

, which indicates that the marginal impact of first-visit patients on the expected sojourn time is larger than that of referred patients. This is because first-visit patients need to go through two service phases of “diagnosis” and “treatment”, so they have a stronger impact on the sojourn time than referred patients.

4. Equilibrium Queueing Behavior of Two Groups of Patients

4.1. Analysis of Fully Observable Case

We begin the analysis by studying the fully observable case. In this case, each patient knows their own type and also receives exact information about the state of the queueing system upon arrival, i.e., they get informed about the number of patients in the system and the service phase being provided by the hospital.

For a first-visit patient, when they find the system at a state

(0)

upon their arrival, then, if they join the queue, their expected sojourn time is equal to

\frac{1}{μ_{t}} + \frac{1}{μ_{d}}

. When they find the system at a state

(n, 0), n > 0

, their expected sojourn time

W_{(n, 0)}^{d}

is equal to

[(n - 1) γ + 2] (\frac{1}{μ_{t}} + \frac{1}{μ_{d}}) + (n - 1) (1 - γ) \frac{1}{μ_{t}}

. When they find the system at a state

(n, 1), n > 0

, their expected sojourn time

W_{(n, 1)}^{d}

is equal to

[(n - 1) γ + 1] (\frac{1}{μ_{t}} + \frac{1}{μ_{d}}) + [(n - 1) (1 - γ) + 1] \frac{1}{μ_{t}}

.

For a referred patient, when they find the system at a state

(0)

upon their arrival, then, if they join the queue, their expected sojourn time is equal to

\frac{1}{μ_{d}}

. When they find the system at a state

(n, 0), n > 0

, their expected sojourn time

W_{(n, 0)}^{t}

is equal to

[(n - 1) γ + 1] (\frac{1}{μ_{t}} + \frac{1}{μ_{d}}) + [(n - 1) (1 - γ) + 1] \frac{1}{μ_{t}}

. When they find the system at a state

(n, 1), n > 0

, their expected sojourn time

W_{(n, 1)}^{t}

is equal to

(n - 1) γ (\frac{1}{μ_{t}} + \frac{1}{μ_{d}}) + [(n - 1) (1 - γ) + 2] \frac{1}{μ_{t}}

.

Proposition 3.

(1) In the fully observable case, the unique individual optimal pure strategy of first-visit patients is as follows:

Case 1: If

R_{d} < θ_{d} (\frac{1}{μ_{d}} + \frac{1}{μ_{t}})

, balking is the unique individual optimal pure strategy.

Case 2: If

R_{d} \geq θ_{d} (\frac{1}{μ_{d}} + \frac{1}{μ_{t}})

, there exists a unique individual optimal pure strategy which has the following forms:

q_{n, j}^{d} = \{\begin{cases} 1, n \leq n_{d}^{j} \\ 0, n > n_{d}^{j} \end{cases}, j = 0, 1,

where

n_{d}^{0} = ⌊n_{0}^{*}⌋

,

n_{0}^{*} = \frac{\frac{R_{d}}{θ_{d}} - 2 (\frac{1}{μ_{t}} + \frac{1}{μ_{d}})}{\frac{1}{μ_{t}} + \frac{γ}{μ_{d}}} + 1

,

n_{d}^{1} = ⌊n_{1}^{*}⌋

,

n_{1}^{*} = \frac{\frac{R_{d}}{θ_{d}} - (\frac{2}{μ_{t}} + \frac{1}{μ_{d}})}{\frac{1}{μ_{t}} + \frac{γ}{μ_{d}}} + 1

.

(2) In the fully observable case, the unique individual optimal pure strategy of referred patients is as follows:

Case 1: If

R_{t} < \frac{θ_{t}}{μ_{t}}

, balking is the unique individual optimal pure strategy.

Case 2: If

R_{t} \geq \frac{θ_{t}}{μ_{t}}

, there exists a unique individual optimal pure strategy which has the following forms:

q_{n, j}^{t} = \{\begin{cases} 1, n \leq n_{t}^{j} \\ 0, n > n_{t}^{j} \end{cases}, j = 0, 1,

where

n_{t}^{0} = ⌊n_{0}^{* *}⌋

,

n_{0}^{* *} = \frac{\frac{R_{t}}{θ_{t}} - (\frac{2}{μ_{t}} + \frac{1}{μ_{d}})}{\frac{1}{μ_{t}} + \frac{γ}{μ_{d}}} + 1

,

n_{t}^{1} = ⌊n_{1}^{* *}⌋

,

n_{1}^{* *} = \frac{\frac{R_{t}}{θ_{t}} - \frac{2}{μ_{t}}}{\frac{1}{μ_{t}} + \frac{γ}{μ_{d}}} + 1

.

Proof of Proposition 3.

The proof process is provided in the Appendix A. □

Proposition 3 presents the explicit equilibrium strategies of patients in the fully observable case, which is a four-threshold strategy

[n_{d}^{0}, n_{d}^{1}, n_{t}^{0}, n_{t}^{0}]

. From the results, we observe that the service time, the delay sensitivity, the service reward and the fraction of first-visit patients have an impact on the four-threshold strategy. The difference between

n_{d}^{j}

and

n_{t}^{j}, j = 0, 1

depends on the magnitude of the relationship between

R_{d} / θ_{d}

and

R_{t} / θ_{t}

.

4.2. Analysis of Fully Unobservable Case

Next, we focus on the fully unobservable case. In the unobservable case, patients cannot observe the number of patients in the system and the service phase being provided by the hospital. Thus, all patients decide whether to join the queue or balk based on the valuation of service offerings and the expected waiting cost. Given the arrival rates

λ_{d}

and

λ_{t}

, the utility of a tagged type-i patient who decides to join the queue can be defined as

U_{i} = R_{i} - θ_{i} W (λ_{d}, λ_{t}), i = d, t

.

Define

W (λ_{t}) = W (0, λ_{t}) = \frac{1}{μ_{t} - λ_{t}}

and

W (λ_{d}) = W (λ_{d}, 0) = \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}}

, we first characterize the rational equilibrium behavior of patients under two extreme cases.

Lemma 2.

If

λ_{d} = 0

, i.e., only the referred patients arrive at the comprehensive hospital, then the equilibrium queueing behavior of referred patients is given by

λ_{t}^{e} = \{\begin{cases} λ_{t}, & β_{t} \geq \frac{1}{μ_{t} - λ_{t}} \\ μ_{t} - \frac{1}{β_{t}}, & \frac{1}{μ_{t}} \leq β_{t} < \frac{1}{μ_{t} - λ_{t}} \\ 0, & β_{t} < \frac{1}{μ_{t}} \end{cases} .

Lemma 3.

If

λ_{t} = 0

, i.e., only the first-visit patients arrive at the comprehensive hospital, then the equilibrium queueing behavior of first-visit patients is given by

λ_{d}^{e} = \{\begin{cases} λ_{d}, & β_{d} \geq \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}} \\ \frac{β_{d} μ_{t} μ_{d} - (μ_{t} + μ_{d})}{β_{d} (μ_{t} + μ_{d}) - 1}, & \frac{1}{μ_{t}} + \frac{1}{μ_{d}} \leq β_{d} < \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}} \\ 0, & β_{d} < \frac{1}{μ_{t}} + \frac{1}{μ_{d}} \end{cases} .

Proof of Lemma 3.

The proof process is provided in the Appendix A. □

Lemmas 2 and 3 identify the equilibrium form of patients for the extreme cases (

λ_{d} = 0

and

λ_{t} = 0

). Under the two extreme cases, the underlying queueing system respectively degrades to a classical M/M/1 queue and an M/M/1 queue with two service phases; therefore, the equilibrium arrival rate can be easily obtained.

Next, we consider the general case. Using the method of Tang et al. (2018) [2], we define

β_{i} = \frac{R_{i}}{θ_{i}}, i = d, t

, which represents the maximum time that a type-i patient is willing to stay in the comprehensive hospital. According to the model description, although the service time in our model is type-dependent, the problem studied (the M/M/1 queue with two groups of patients) is also equivalent to an M/H₂/1 queue with two groups of patients, where H₂ represents that the service time follows a hyper-exponential distribution. Since the two groups of patients queue up in the same queue, the expected sojourn time is the same for both groups. At this time, as long as the group of patients with low patience levels choose to join the queue, the group of patients with high patience levels will join the queue (they think they will also receive service soon). For instance,

β_{d} > β_{t}

(respectively,

β_{d} < β_{t}

) means that the first-visit patients (referred patients) have a relatively higher level of patience than the referred patients (first-visit patients), as long as the referred patients (first-visit patients) decide to join the queue, then all the first-visit patients (referred patients) also join the queue.

It is worth noting that, if the two groups of patients differ in only one aspect, either in the service reward or in the sensitivity to delay, then one group of patients always has a higher patience level (a higher incentive to join the queue) than the other, i.e., either

β_{d} > β_{t}

or

β_{d} < β_{t}

; however, if the two groups of patients differ in two aspects (service reward and the sensitivity to delay), either group of patients may have a higher patience level, i.e., the relationship between

β_{d}

and

β_{t}

is uncertain. Therefore, we need to analyze the rational equilibrium queueing strategies of the two groups of patients by distinguishing the relationship between

β_{d}

and

β_{t}

.

It should be noted that, if

β_{d} = β_{t}

, since the two groups of patients queue up in the same queue, there is no difference in decision-making behavior between the two groups of patients. Therefore, when

R_{t} - θ_{t} W (λ_{d}, λ_{t}) > 0

, i.e.,

β_{t} > W (λ_{d}, λ_{t})

, all patients join the queue, the corresponding equilibrium arrival rates

(λ_{d}^{e}, λ_{t}^{e}) = (λ_{d}, λ_{t})

; when

R_{t} - θ_{t} \lim_{λ_{t} \to 0} W (λ_{t}) < 0

, i.e.,

β_{t} < 1 / μ_{t}

, all patients balk the queue, the corresponding equilibrium arrival rates

(λ_{d}^{e}, λ_{t}^{e}) = (0, 0)

. When

R_{t} - θ_{t} W (λ_{d}, λ_{t}) \leq 0

and

R_{t} - θ_{t} \lim_{λ_{t} \to 0} W (λ_{t}) \geq 0

, there exists a unique equilibrium joining probability of patients

q^{e}

that satisfies

R_{t} - θ_{t} W (λ_{d} q^{e}, λ_{t} q^{e}) = 0

. The corresponding equilibrium arrival rates

(λ_{d}^{e}, λ_{t}^{e}) = (λ_{d} q^{e}, λ_{t} q^{e})

. Below, we study the equilibrium behavior of patients for the case

β_{d} \neq β_{t}

.

Proposition 4.

When

λ_{d}

,

λ_{t}

satisfy

λ_{d} > 0, λ_{t} > 0

and

λ_{t} μ_{d} + λ_{d} (μ_{t} + μ_{d}) < μ_{t} μ_{d}

, if

β_{d} < β_{t}

, the equilibrium queueing behavior of patients can be obtained as follows.

(i) If

β_{d} < W (λ_{t}) \leq β_{t}

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = 0, λ_{t}^{e} = λ_{t}

;

(ii) If

β_{d} < β_{t} < W (λ_{t})

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = 0, λ_{t}^{e} = \max (μ_{t} - \frac{1}{β_{t}}, 0)

;

(iii) If

β_{t} > β_{d} \geq W (λ_{d}, λ_{t}) > W (λ_{t})

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = λ_{d}, λ_{t}^{e} = λ_{t}

;

(iv) If

β_{t} > β_{d} \geq W (λ_{t})

and

β_{d} < W (λ_{d}, λ_{t})

, the equilibrium arrival rates of the two groups of patients

λ_{t}^{e} = λ_{t}

,

λ_{d}^{e}

satisfies the following equation:

(\frac{λ_{d}^{e}}{μ_{d} (λ_{d}^{e} + λ_{t})} + \frac{1}{μ_{t}}) [1 + \frac{(λ_{d}^{e} + λ_{t}) [μ_{d} + μ_{t} - (λ_{d}^{e} + λ_{t})]}{μ_{t} μ_{d} - λ_{d}^{e} μ_{t} - (λ_{d}^{e} + λ_{t}) μ_{d}}] - \frac{λ_{d}^{e} + λ_{t}}{μ_{t} μ_{d}} = β_{d}

Proof of Proposition 4.

The proof process is provided in the Appendix A. □

Proposition 5.

When

λ_{d}

,

λ_{t}

satisfy

λ_{d} > 0, λ_{t} > 0

,

λ_{t} μ_{d} + λ_{d} (μ_{t} + μ_{d}) < μ_{t} μ_{d}

and

{\bar{λ}}_{d} \leq λ_{d} < \frac{μ_{t} μ_{d}}{μ_{t} + μ_{d}}

, where

{\bar{λ}}_{d}

satisfies the equation

\frac{μ_{d}^{2} + {\bar{λ}}_{d} μ_{t}}{{(μ_{t} μ_{d} - {\bar{λ}}_{d} μ_{t} - {\bar{λ}}_{d} μ_{d})}^{2}} = \frac{1}{{\bar{λ}}_{d} μ_{d}}

, if

β_{d} > β_{t}

, the equilibrium queueing behavior of patients can be obtained as follows:

(i) If

β_{t} < W (λ_{d}) \leq β_{d}

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = λ_{d}, λ_{t}^{e} = 0

;

(ii) If

β_{t} < β_{d} < W (λ_{d})

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = \max (\frac{β_{d} μ_{t} μ_{d} - (μ_{t} + μ_{d})}{β_{d} (μ_{t} + μ_{d}) - 1}, 0), λ_{t}^{e} = 0

;

(iii) If

β_{d} > β_{t} \geq W (λ_{d}, λ_{t})

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = λ_{d}, λ_{t}^{e} = λ_{t}

;

(iv) If

β_{d} > β_{t} \geq W (λ_{d})

and

β_{t} < W (λ_{d}, λ_{t})

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = λ_{d}

,

λ_{t}^{e}

satisfies the following equation:

(\frac{λ_{d}}{μ_{d} (λ_{d} + λ_{t}^{e})} + \frac{1}{μ_{t}}) [1 + \frac{(λ_{d} + λ_{t}^{e}) [μ_{d} + μ_{t} - (λ_{d} + λ_{t}^{e})]}{μ_{t} μ_{d} - λ_{d} μ_{t} - (λ_{d} + λ_{t}^{e}) μ_{d}}] - \frac{λ_{d} + λ_{t}^{e}}{μ_{t} μ_{d}} = β_{t},

where

W (λ_{d}) = \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}}

.

Proof of Proposition 5.

The proof process is provided in the Appendix A. □

In general, Propositions 4 and 5 give the queueing behavior of patients in equilibrium. According to the above results, we observe that both groups of patients decide to join the queue when their patience levels are very high. When the patience level of one group of patients is higher than the other, if the group of patients with low patience levels does not join the queue, i.e., they adopt the “all balk” strategy, then the group of patients with high patience levels might be more likely to decide to join the queue, that is, they might adopt either the “all join” strategy or a “mixed strategy”. However, if a fraction of patients with low patience levels joins the queue, i.e., the group of patients with low patience levels adopt a “mixed strategy”, then the group of patients with high patience levels must adopt the “all join” strategy. These results imply that the queueing behavior of one group of patients hinges not only on their patience level but also on their sojourn time in the system, which in turn is influenced by the joining-or-balking decision of the other group of patients.

Next, we turn to a social planner’s point of view. The problem is the maximization of social welfare, with respect to the strategy space

(q_{d}, q_{t}) \in [0, 1] \times [0, 1]

that should be imposed on the patients. Define

S W_{d} (q_{d})

and

S W_{t} (q_{t})

as the net utilities accruing to first-visit and referred patients in unit time, respectively, we then have

S W_{d} (q_{d}) = λ_{d} q_{d} [R_{d} - θ_{d} W (λ_{d} q_{d}, λ_{t} q_{t})], S W_{t} (q_{t}) = λ_{t} q_{t} [R_{t} - θ_{t} W (λ_{d} q_{d}, λ_{t} q_{t})]

Social welfare is the sum of all patients’ net utilities, that is,

S W (q_{d}, q_{t}) = λ_{d} q_{d} R_{d} + λ_{t} q_{t} R_{t} - (λ_{d} q_{d} θ_{d} + λ_{t} q_{t} θ_{t}) W (λ_{d} q_{d}, λ_{t} q_{t})

The social welfare maximization problem for social planners can be established as

\max_{q_{d}, q_{t}} S W (q_{d}, q_{t}) = λ_{d} q_{d} R_{d} + λ_{t} q_{t} R_{t} - (λ_{d} q_{d} θ_{d} + λ_{t} q_{t} θ_{t}) W (λ_{d} q_{d}, λ_{t} q_{t})

Because of the very involved nature of social welfare, it would be difficult to derive the socially optimal solutions

(q_{d}^{s o c}, q_{t}^{s o c})

in closed form; thus, the result can only be achieved numerically. For example, the two-variable optimization problem can be solved by using a sequential optimization approach. First, for any given joining probability

q_{d} \in [0, 1]

, we derive the optimal conditional probability

q_{t}^{s o c} (q_{d})

, then by plugging

q_{t}^{s o c} (q_{d})

into

S W (q_{d}, q_{t})

, the two-variable optimization problem becomes a univariate optimization problem, and the optimal

q_{d}

is established on the basis of

q_{t}^{s o c} (q_{d})

. Of course, the two-variable optimization problem can also be solved by other optimization algorithms (e.g., particle swarm optimization and genetic algorithm).

5. Numerical Examples

In this section, we will give some numerical examples to verify the accuracy of the theoretical results and provide some management implications.

Define

q_{i}^{e} = \frac{λ_{i}^{e}}{λ_{i}}, i = d, t

as the equilibrium joining probabilities of two groups of patients. Assume that

β_{d} = 1

,

β_{t} = 2

,

λ_{d} = 0.1

(

λ_{t} = 1

),

μ_{d} = 4

,

μ_{t} = 4

, i.e.,

β_{d} < β_{t}

. Proposition 4 is pictorially shown in Figure 4. From Figure 4, we find that the equilibrium joining probabilities of both groups of patients decrease as the arrival rates

λ_{t}

and

λ_{d}

increase, and the equilibrium joining probability

q_{t}^{e}

is no less than

q_{d}^{e}

as

β_{t} > β_{d}

, which is consistent with the conclusion of Proposition 4. When the first-visit patients adopt the “all balk” strategy, the referred patients might adopt either the “all join” strategy or a “mix strategy” (choose randomly between balking and joining), when the first-visit patients adopt a “mixed strategy”, the referred patients adopt the “all join” strategy. The reason is that the maximum patience time could affect the equilibrium joining probabilities of patients. As long as the expected sojourn time is less than the maximum patience time, patients are more likely to join the queue. Finally, from Figure 4a,b, as

λ_{t}

and

λ_{d}

are greater than certain thresholds, a slight increase in

λ_{t}

or

λ_{d}

may lead to a relatively large change in the equilibrium joining probabilities of first-visit patients, and compared with

λ_{t}

, the increase in

λ_{d}

has less impact on

q_{d}^{e}

.

Assume that

β_{d} = 2

,

β_{t} = 1.5

(

β_{t} = 1

),

λ_{d} = 2

(

λ_{t} = 1

),

μ_{d} = 4

,

μ_{t} = 8

, i.e.,

β_{t} < β_{d}

. Proposition 5 is pictorially shown in Figure 5. From Figure 5, we still find that the equilibrium joining probabilities of both groups of patients decrease as the arrival rates

λ_{t}

and

λ_{d}

increase, and the equilibrium joining probability

q_{d}^{e}

is no less than

q_{t}^{e}

as

β_{d} > β_{t}

, which is also consistent with the conclusion of Proposition 5. When the referred patients adopt the “all balk” strategy, the first-visit patients might adopt either the “all join” strategy or a “mix strategy” (randomize between balking and joining), when the referred patients adopt a “mixed strategy”, the first-visit patients adopt the “all join” strategy. The results are also attributed to the relationship between the maximum patience time and the expected sojourn time. Moreover, from Figure 5a,b, as

λ_{t}

and

λ_{d}

are greater than certain thresholds, a slight increase in

λ_{t}

or

λ_{d}

may also lead to a relatively large change in the equilibrium joining probabilities of referred patients, and compared with

λ_{t}

, the increase in

λ_{d}

has a greater impact on

q_{d}^{e}

, which is contrary to the conclusion of Figure 4.

From Figure 4 and Figure 5, we conclude that, with the increase in the arrival rates

λ_{t}

and

λ_{d}

, the group of patients with high patience levels may have a negative impact on the joining decision of patients with low patience levels, i.e., the rational queueing strategies of patients with high patience levels could discourage patients with low patience levels from making the joining decision. In brief, as more patients arrive at the comprehensive hospital, patients with a high patience level may crowd out the other group of patients and push them to balk the queue.

In Figure 6 and Figure 7, we, respectively, assume that

λ_{d} = 0.8

,

μ_{d} = 5

,

μ_{t} = 3

and

λ_{d} = 1

,

λ_{t} = 0.4

,

μ_{d} = 4

,

μ_{t} = 4

,

R_{t} = 1

,

θ_{d} = 2

,

θ_{t} = 1

, then Figure 6 and Figure 7 depict the effect of

λ_{t}

and

R_{d}

on the individual equilibrium strategies and social optimal strategies. From Figure 6, it can be deduced that under the cases

β_{t} < β_{d}

and

β_{t} > β_{d}

, the individual equilibrium strategies and social optimal strategies decrease as

λ_{t}

increases. It makes sense that decreasing the arrival rate could benefit patients by reducing their sojourn time; meanwhile, the new arriving patient who knows a high arrival rate can predict the higher load of the hospital, which increases their waiting cost and makes them reluctant to join the queue. However, as shown in Figure 7, the individual equilibrium strategy and social optimal strategy of first-visit patients increase with

R_{d}

, whereas the individual equilibrium strategy and social optimal strategy of referred patients decrease with

R_{d}

. This may be caused by the fact that the patients with a high patience level may crowd out the other group of patients and push them to balk the queue. Finally, from Figure 6 and Figure 7, we note that the inequality

q_{i}^{s o c} \leq q_{i}^{e}, i = d, t

holds. The main reason for the results is that multiple self-interested patients in a queueing game ignore the negative externalities of their actions on each other (e.g., the joining decision of new arriving patients would prolong the sojourn time and increase the delay cost of future patients), and one consequence of this is an equilibrium that deviates from the socially optimal solution. Therefore, when maximizing social welfare, we should consider the negative externalities, that directly lead to inequality. The results could provide managerial insights to medical managers, especially the government. Given the service reward, the delay sensitivity and the service time, in order to maximize social welfare, medical managers need to allocate patients, which will break the individual optimal equilibrium between patients, and take measures to eliminate the deviation between the individual equilibrium strategy and social optimal strategy, i.e., to guide some patients to choose primary hospitals for diagnosis and treatment so as to reduce the number of patients in comprehensive hospitals. Specific measures are as follows: First, the medical service managers could adopt a gatekeeping design to reduce the number of first-visit patients in comprehensive hospitals. For example, in China, through the medical insurance system, the government uses appropriate cost adjustment mechanisms to guide first-visit patients to preferentially choose community or primary hospitals for diagnosis and treatment. In addition, the government has improved the family doctor signing services and adopted the priority mechanism to regulate the flow of first-visit patients. Moreover, the government could also set reasonable prices for first-visit patients who first choose comprehensive hospitals so as to reduce the proportion of first-visit patients to comprehensive hospitals. Second, the medical service managers could adopt measures to reduce the number of referred patients in comprehensive hospitals. For example, the government should allocate medical resources rationally, improve the medical level of community hospitals and the cure rate of patients by the sinking of high-quality resources, and then reduce the number of patients referred to comprehensive hospitals.

6. Research Findings

The theoretical results and numerical results are summarized as follows: (1) Both groups of patients decide to join the queue when their patience levels are very high; (2) when the patience level of one group of patients are higher than the other, if the group of patients with low patience levels adopt the “all balk” strategy, then the group of patients with high patience levels might adopt either the “all join” strategy or a “mixed strategy”, whereas, if the group of patients with low patience levels adopt a “mixed strategy”, then the group of patients with high patience levels must adopt the “all join” strategy; (3) as more patients arrive at the comprehensive hospital, the group of patients with high patience levels may crowd out the other group of patients and push them to balk the queue; (4) the equilibrium behavior deviates from a socially optimal solution; therefore, to reach the maximal social welfare, the social planner should adopt some regulation measures, such as appropriate cost adjustment mechanisms, the family doctor signing services, the priority mechanism and the sinking of high-quality resources, etc., to control the arrival rates of patients in comprehensive hospitals. By doing so, they can mitigate overcrowding and reduce waiting times, ultimately improving the efficiency of the healthcare system.

7. Concluding Remarks

Motivated by the queueing problems arising in comprehensive hospitals, we constructed a game-theoretical queueing model to analyze the equilibrium queueing behavior of patients in comprehensive hospitals. We first derived the stability condition, so that we could obtain the expected sojourn time of an arbitrary patient under the stability condition. Based on the expected sojourn time, we defined the utility of patients who join the queue and derived the strategic queueing behavior of patients in equilibrium under the fully observable and unobservable cases. Finally, we derived the socially optimal solutions numerically and provided managerial insights according to the theoretical results and numerical results.

In this current work, we assume that patients could share the same information level. However, due to the lack of access to information, in today’s service industries, information homogeneity is difficult to determine, and the queueing problem with information heterogeneity is often encountered; in future work, analyzing the strategic behavior of patients with information heterogeneity would be an interesting direction. Another direction is that strategic patients exhibit bounded rationality instead of full rationality, so studying the effect of bounded rationality on the queueing behavior of patients would also be an interesting and important direction.

Author Contributions

Conceptualization, Y.L. and L.L.; methodology, T.J. and X.C.; software, T.J. and X.C.; validation, Y.L. and L.L.; formal analysis, T.J.; investigation, Y.L.; resources, Y.L.; data curation, Y.L.; writing—original draft preparation, Y.L. and T.J.; writing—review and editing, T.J. and L.L.; visualization, Y.L.; supervision, L.L.; project administration, Y.L. and L.L.; funding acquisition, Y.L., L.L., T.J. and X.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No.61773014, Grant No.12001329), National Natural Science Foundation of China (Grant 11901307) and Natural Science Foundation of Jiangsu Province, China (Grants BK20180783 and 18KJB110021), Anhui University Natural Science Research Project of China (Grant No.2022AH052209), and Scientific Research Foundation of Anhui Polytechnic University (Grant No. 2022YQQ026).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Lemma 1.

Based on Theorem 1.7.1 in Neuts (1981), the system is stable if and only if

x C e < x B e

, where

e = {(1, 1)}^{T}

,

x = (x_{0}, x_{1})

is the invariant probability vector of

Y = A + B + C = (\begin{matrix} - μ_{d} & μ_{d} \\ μ_{t} γ & - μ_{t} γ \end{matrix}),

which satisfies

x Y = 0

and

x e = 1

. From the equations, we have

x_{0} = \frac{μ_{t} γ}{μ_{d} + μ_{t} γ}

,

x_{1} = \frac{μ_{d}}{μ_{d} + μ_{t} γ}

, then, the stability condition

x C e < x B e

converts into

λ < \frac{μ_{t} μ_{d}}{μ_{d} + μ_{t} γ}

, i.e.,

λ_{t} μ_{d} + λ_{d} (μ_{t} + μ_{d}) < μ_{t} μ_{d}

. □

Proof of Proposition 1.

According to the special structure of the state-transition-rate matrix, the balance equation can be obtained as follows:

\begin{matrix} λ π_{0} = μ_{t} π_{1, 1}, \\ (λ + μ_{d}) π_{1, 0} = λ γ π_{0} + μ_{t} γ π_{2, 1}, \\ (λ + μ_{d}) π_{k, 0} = λ π_{k - 1, 0} + μ_{t} γ π_{k + 1, 1}, k \geq 2, \\ (λ + μ_{t}) π_{1, 1} = λ \bar{γ} π_{0} + μ_{d} π_{1, 0} + μ_{t} \bar{γ} π_{2, 1}, \\ (λ + μ_{t}) π_{k, 1} = λ π_{k - 1, 1} + μ_{d} π_{k, 0} + μ_{t} \bar{γ} π_{k + 1, 1}, k \geq 2 . \end{matrix}

Define

P_{0} (z) = \sum_{k = 1}^{+ \infty} π_{k, 0} z^{k}, P_{1} (z) = \sum_{k = 1}^{+ \infty} π_{k, 1} z^{k}

, (

| z | < 1)

from the above balance equations, we could easily have the following equation set:

\{\begin{cases} (λ + μ_{d} - λ z) P_{0} (z) = \frac{μ_{t} γ}{z} P_{1} (z) + π_{0} λ γ (z - 1) \\ (λ + μ_{t} - λ z - \frac{μ_{t} \bar{γ}}{z}) P_{1} (z) = μ_{d} P_{0} (z) + π_{0} λ \bar{γ} (z - 1) \end{cases} .

Solving the equation set, we can derive the expressions of

P_{0} (z), P_{1} (z)

:

P_{0} (z) = \frac{λ γ z^{2} (z - 1) [λ (1 - z) + μ_{t}]}{g_{0} (z) g_{1} (z) - μ_{d} μ_{t} γ z} π_{0}, P_{1} (z) = \frac{λ z^{2} (z - 1) [λ \bar{γ} (1 - z) + μ_{d}]}{g_{0} (z) g_{1} (z) - μ_{d} μ_{t} γ z} π_{0}

where

g_{0} (z) = λ z (1 - z) + μ_{d} z

,

g_{1} (z) = λ z (1 - z) + μ_{t} (z - \bar{γ})

,

π_{0} = 1 - ρ = 1 - λ (\frac{1}{μ_{t}} + \frac{γ}{μ_{d}})

.

According to the results, we could easily derive the expected queue length (including the patient being served):

L = \frac{d P_{0} (z)}{d z} |_{z = 1} + \frac{d P_{1} (z)}{d z} |_{z = 1} = λ (\frac{γ}{μ_{d}} + \frac{1}{μ_{t}}) [1 + \frac{λ (μ_{d} + μ_{t} - λ)}{μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d}}] - \frac{λ^{2}}{μ_{t} μ_{d}} .

By using Little’s law, we could obtain the expected sojourn time of an arbitrary patient:

W (λ, γ) = (\frac{γ}{μ_{d}} + \frac{1}{μ_{t}}) [1 + \frac{λ (μ_{d} + μ_{t} - λ)}{μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d}}] - \frac{λ}{μ_{t} μ_{d}}

Substituting

λ = λ_{d} + λ_{t}, γ = \frac{λ_{d}}{λ_{d} + λ_{t}}

into

W (λ, γ)

, we have the final expression

W (λ_{d}, λ_{t}) = (\frac{λ_{d}}{μ_{d} (λ_{d} + λ_{t})} + \frac{1}{μ_{t}}) [1 + \frac{(λ_{d} + λ_{t}) [μ_{d} + μ_{t} - (λ_{d} + λ_{t})]}{μ_{t} μ_{d} - λ_{d} μ_{t} - (λ_{d} + λ_{t}) μ_{d}}] - \frac{λ_{d} + λ_{t}}{μ_{t} μ_{d}} .

□

Proof of Proposition 2.

Based on the expression of

W (λ_{d}, λ_{t})

, we find that it is complicated to determine the relationship between

W (λ_{d}, λ_{t})

and

λ_{d}, λ_{t}

directly. In the following content, we will use the chain rule of binary functions to determine the monotonicity of

W (λ_{d}, λ_{t})

with respect to

λ_{d}, λ_{t}

. First, according to

W (λ, γ)

, we have

\frac{\partial W}{\partial λ} = \frac{γ μ_{t}^{2} + μ_{d}^{2} + γ μ_{t} μ_{d}}{{(μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d})}^{2}} > 0, \frac{\partial W}{\partial γ} = \frac{1}{μ_{d}} (1 + \frac{λ μ_{t} μ_{d} (μ_{t} + μ_{d} - λ)}{{(μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d})}^{2}}) > 0 .

From

λ = λ_{d} + λ_{t}, γ = \frac{λ_{d}}{λ_{d} + λ_{t}}

, we have

\frac{\partial λ}{\partial λ_{d}} = 1, \frac{\partial γ}{\partial λ_{d}} = \frac{λ_{t}}{{(λ_{d} + λ_{t})}^{2}}

,

\frac{\partial λ}{\partial λ_{t}} = 1, \frac{\partial γ}{\partial λ_{t}} = - \frac{λ_{d}}{{(λ_{d} + λ_{t})}^{2}}

.

According to the chain rule, we could obtain the partial derivative of

W (λ_{d}, λ_{t})

with respect to

λ_{d}

and

λ_{t}

:

\begin{matrix} \frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{d}} & = \frac{\partial W}{\partial λ} \frac{\partial λ}{\partial λ_{d}} + \frac{\partial W}{\partial γ} \frac{\partial γ}{\partial λ_{d}} \\ = \frac{γ μ_{t}^{2} + μ_{d}^{2} + γ μ_{t} μ_{d}}{{(μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d})}^{2}} + \frac{1}{μ_{d}} (1 + \frac{λ μ_{t} μ_{d} (μ_{t} + μ_{d} - λ)}{{(μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d})}^{2}}) \frac{λ_{t}}{{(λ_{d} + λ_{t})}^{2}} > 0, \\ \frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{t}} & = \frac{\partial W}{\partial λ} \frac{\partial λ}{\partial λ_{t}} + \frac{\partial W}{\partial γ} \frac{\partial γ}{\partial λ_{t}} \\ = \frac{γ μ_{t}^{2} + μ_{d}^{2} + γ μ_{t} μ_{d}}{{(μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d})}^{2}} - \frac{1}{μ_{d}} (1 + \frac{λ μ_{t} μ_{d} (μ_{t} + μ_{d} - λ)}{{(μ_{t} μ_{d} - λ γ μ_{t} - λ μ_{d})}^{2}}) \frac{λ_{d}}{{(λ_{d} + λ_{t})}^{2}} \\ = \frac{μ_{d}^{2} + λ_{d} μ_{t}}{{(μ_{t} μ_{d} - λ_{d} μ_{t} - (λ_{d} + λ_{t}) μ_{d})}^{2}} - \frac{λ_{d}}{{(λ_{d} + λ_{t})}^{2} μ_{d}} . \end{matrix}

To determine the value of

\frac{\partial W (λ_{d}, λ_{t})}{\partial λ_{t}}

, define

M (λ_{d}, λ_{t}) = \frac{μ_{d}^{2} + λ_{d} μ_{t}}{{(μ_{t} μ_{d} - λ_{d} μ_{t} - (λ_{d} + λ_{t}) μ_{d})}^{2}} - \frac{λ_{d}}{{(λ_{d} + λ_{t})}^{2} μ_{d}} .

which is not necessarily continuous at the point (0,0). Since

\frac{\partial M (λ_{d}, λ_{t})}{\partial λ_{t}} > 0

, then,

M (λ_{d}, λ_{t})

is monotonically increasing with respect to

λ_{t}

, i.e.,

M (λ_{d}, λ_{t}) > M (λ_{d}, 0)

.

From

M (λ_{d}, 0) = \frac{μ_{d}^{2} + λ_{d} μ_{t}}{{(μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d})}^{2}} - \frac{1}{λ_{d} μ_{d}}

and

μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d} > 0

, we could easily have

\frac{d M (λ_{d}, 0)}{d λ_{d}} = \frac{μ_{t} {(μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d})}^{2} + 2 (μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}) (μ_{d}^{2} + λ_{d} μ_{t}) (μ_{t} + μ_{d})}{{(μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d})}^{4}} + \frac{1}{λ_{d}^{2} μ_{d}} > 0

which means that

M (λ_{d}, 0)

is monotonically increasing with respect to

λ_{d}

.

Due to

\lim_{λ_{d} \to 0} M (λ_{d}, 0) < 0

and

\lim_{λ_{d} \to \frac{μ_{t} μ_{d}}{μ_{t} + μ_{d}}} M (λ_{d}, 0) > 0

, we find that there exists a unique

{\bar{λ}}_{d}

that makes

M ({\bar{λ}}_{d}, 0) = 0

, i.e., if

λ_{d} \in [{\bar{λ}}_{d}, \frac{μ_{t} μ_{d}}{μ_{t} + μ_{d}})

,

M (λ_{d}, 0) \geq 0

.

According to the above analysis, we could obtain the results. □

Proof of Proposition 3.

If an arriving first-visit or referred patient finds the system at a state

(n, i), i = 0, 1

, then, the utilities would be

U_{o b s}^{d} (n, i) = R_{d} - θ_{d} W_{(n, i)}^{d}

or

U_{o b s}^{t} (n, i) = R_{t} - θ_{t} W_{(n, i)}^{t}

. Since

U_{o b s}^{d} (n, i)

and

U_{o b s}^{t} (n, i)

are monotonically decreasing with respect to n, we only need to find the unique root that satisfies

U_{o b s}^{d} (n, i) = 0

or

U_{o b s}^{t} (n, i) = 0

. The results can be easily obtained. □

Proof of Lemma 2.

If

λ_{d} = 0

, the queueing system degrades into a classical M/M/1 queue, the expected sojourn time of an arbitrary patient is

W (λ_{t}) = W (λ_{t}, 0) = \frac{1}{μ_{t} - λ_{t}}

, the utility of a patient who chooses to join the queue is

U_{t} = R_{t} - θ_{t} W (λ_{t})

, then

(1) If

U_{t} = R_{t} - θ_{t} W (λ_{t}) \geq 0

, i.e.,

β_{t} \geq \frac{1}{μ_{t} - λ_{t}}

, all the referred patients join the queue, and the corresponding equilibrium arrival rate

λ_{t}^{e} = λ_{t}

;

(2) If

U_{t} = R_{t} - θ_{t} W (λ_{t}) < 0

and

U_{t} = R_{t} - θ_{t} \lim_{λ_{t} \to 0} W (λ_{t}) \geq 0

, i.e.,

\frac{1}{μ_{t}} \leq β_{t} < \frac{1}{μ_{t} - λ_{t}}

, a fraction of the referred patients joins the queue, the corresponding equilibrium arrival rate

λ_{t}^{e} = μ_{t} - \frac{1}{β_{t}}

;

(3) If

U_{t} = R_{t} - θ_{t} \lim_{λ_{t} \to 0} W (λ_{t}) < 0

, i.e.,

β_{t} < \frac{1}{μ_{t}}

, all the referred patients balk the queue, the corresponding equilibrium arrival rate

λ_{t}^{e} = 0

. □

Proof of Lemma 3.

If

λ_{t} = 0

, the queueing system degrades into an M/M/1 queue with two service phases, then the expected sojourn time of an arbitrary patient is

W (λ_{d}) = \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}}

, the first derivative

\frac{d W (λ_{d})}{d λ_{d}} = \frac{μ_{t}^{2} + μ_{d}^{2} + μ_{t} μ_{d}}{{(μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d})}^{2}} > 0

, i.e.,

W (λ_{d})

is an increasing function with respect to

λ_{d}

, and the utility of a patient who chooses to join the queue is

U_{d} = R_{d} - θ_{d} W (λ_{d})

, then,

(1) If

U_{d} = R_{d} - θ_{d} W (λ_{d}) \geq 0

, i.e.,

β_{d} \geq W (λ_{d})

, all the first-visit patients join the queue, and the corresponding equilibrium arrival rate

λ_{d}^{e} = λ_{d}

;

(2) If

U_{d} = R_{d} - θ_{d} W (λ_{d}) < 0

and

U_{d} = R_{d} - θ_{d} \lim_{λ_{d} \to 0} W (λ_{d}) \geq 0

, i.e.,

\frac{1}{μ_{t}} + \frac{1}{μ_{d}} \leq β_{d} < W (λ_{d})

, a fraction of the first-visit patients joins the queue, and the corresponding equilibrium arrival rate

λ_{d}^{e} = \frac{β_{d} μ_{t} μ_{d} - (μ_{t} + μ_{d})}{β_{d} (μ_{t} + μ_{d}) - 1}

;

(3) If

U_{d} = R_{d} - θ_{d} \lim_{λ_{d} \to 0} W (λ_{d}) < 0

, i.e.,

β_{d} < \frac{1}{μ_{t}} + \frac{1}{μ_{d}}

, all the first-visit patients balk the queue, and the corresponding equilibrium arrival rate

λ_{d}^{e} = 0

. □

Proof of Proposition 4.

If

λ_{t} > 0, λ_{d} > 0

, both groups of patients arrive at the comprehensive hospital. Since

β_{d} < β_{t}

, as long as the first-visit patients decide to join the queue, then all the referred patients also join the queue. Next, we consider the following two scenarios.

Scenario 1: All the first-visit patients balk the queue, i.e., the corresponding equilibrium arrival rate

λ_{d}^{e} = 0

. In this scenario,

W (0, λ_{t}) = W (λ_{t}) = \frac{1}{μ_{t} - λ_{t}}

, which is consistent with the expected sojourn time of the classical M/M/1 queue. Next, we consider two subcases.

Case I: If

U_{t} = R_{t} - θ_{t} W (λ_{t}) \geq 0

, i.e.,

β_{t} \geq \frac{1}{μ_{t} - λ_{t}}

, then all the referred patients join the queue. Since all the first-visit patients balk the queue, then,

U_{d} = R_{d} - θ_{d} W (λ_{t}) < 0

, that is,

β_{d} < \frac{1}{μ_{t} - λ_{t}}

. Therefore, if

β_{d} < \frac{1}{μ_{t} - λ_{t}} \leq β_{t}

, all the first-visit patients balk the queue, and all the referred patients join the queue, i.e., the equilibrium arrival rates

λ_{d}^{e} = 0, λ_{t}^{e} = λ_{t}

;

Case II: If

U_{t} = R_{t} - θ_{t} W (λ_{t}) < 0

, then only a fraction of the referred patients chooses to join the queue while others balk. In this subcase, there exists a unique equilibrium arrival rate of the referred patients makes that

R_{t} - θ_{t} W (λ_{t}^{e}) = 0

, i.e.,

λ_{t}^{e} = μ_{t} - \frac{1}{β_{t}}

. Therefore, if

β_{d} < β_{t} < \frac{1}{μ_{t} - λ_{t}}

, all the first-visit patients balk the queue, and a fraction of the referred patients joins the queue, i.e., the corresponding equilibrium arrival rates

λ_{d}^{e} = 0, λ_{t}^{e} = μ_{t} - \frac{1}{β_{t}}

;

Scenario 2. A fraction of the first-visit patients join the queue, i.e., the equilibrium arrival rate of the first-visit patients

λ_{d}^{e} > 0

. In this scenario, all the referred patients join the queue, that is,

λ_{t}^{e} = λ_{t}

. Moreover, since there are first-visit patients joining the queue, then

U_{d} = R_{d} - θ_{d} W (λ_{t}) \geq 0

, i.e.,

β_{d} \geq W (λ_{t}) = \frac{1}{μ_{t} - λ_{t}}

. Next, we also consider two subcases.

Case I: If

U_{d} = R_{d} - θ_{d} W (λ_{d}, λ_{t}) \geq 0

, i.e.,

W (λ_{d}, λ_{t}) \leq β_{d}

, since

W (λ_{d}, λ_{t})

is monotonically increasing with respect to

λ_{d}

, we then have

W (0, λ_{t}) < W (λ_{d}, λ_{t})

. Therefore, if

β_{d} \geq W (λ_{d}, λ_{t})

, all the first-visit patients join the queue, and all the referred patients join the queue, i.e., the equilibrium arrival rates

λ_{d}^{e} = λ_{d}, λ_{t}^{e} = λ_{t}

;

Case II: If

U_{d} = R_{d} - θ_{d} W (λ_{d}, λ_{t}) < 0

, i.e.,

W (λ_{d}, λ_{t}) > β_{d}

, since

W (λ_{d}, λ_{t})

is monotonically increasing with respect to

λ_{d}

, then, there exists a unique

λ_{d}^{e}

makes that

U_{d} = R_{d} - θ_{d} W (λ_{d}^{e}, λ_{t}) = 0

, i.e.,

λ_{d}^{e}

satisfies the equation

W (λ_{d}^{e}, λ_{t}) = β_{d}

. In this subcase, if

\frac{1}{μ_{t} - λ_{t}} \leq β_{d} < W (λ_{d}, λ_{t})

, the equilibrium arrival rates are

λ_{t}^{e} = λ_{t}

,

λ_{d} = λ_{d}^{e}

, where

λ_{d}^{e}

satisfies the following equation:

(\frac{λ_{d}^{e}}{μ_{d} (λ_{d}^{e} + λ_{t})} + \frac{1}{μ_{t}}) [1 + \frac{(λ_{d}^{e} + λ_{t}) [μ_{d} + μ_{t} - (λ_{d}^{e} + λ_{t})]}{μ_{t} μ_{d} - λ_{d}^{e} μ_{t} - (λ_{d}^{e} + λ_{t}) μ_{d}}] - \frac{λ_{d}^{e} + λ_{t}}{μ_{t} μ_{d}} = β_{d}

By summarizing the above analysis, we could obtain the results in Proposition 4. □

Proof of Proposition 5.

If

λ_{t} > 0, λ_{d} > 0

, both two groups of patients arrive at the comprehensive hospital. Since

β_{d} > β_{t}

, as long as the referred patients decide to join the queue, then all the first-visit patients also join the queue. Similar to the proof of Proposition 4, we consider two scenarios to obtain the results of Proposition 5 next.

Scenario 1. All the referred patients balk the queue, i.e., the equilibrium arrival rate

λ_{t}^{e} = 0

. In this scenario, from the stability condition and the expression of the expected sojourn time, we have

W (λ_{d}, 0) = W (λ_{d}) = \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}} > 0

. Next, we consider two subcases as follows:

Case I: If

U_{d} = R_{d} - θ_{d} W (λ_{d}) \geq 0

, i.e.,

β_{d} \geq W (λ_{d})

, all the first-visit patients join the queue. Since all the referred patients balk the queue, then

U_{t} = R_{t} - θ_{t} W (λ_{d}) < 0

, i.e.,

β_{t} < W (λ_{d})

. Therefore, when

β_{t} < W (λ_{d}) \leq β_{d}

, in equilibrium, all the referred patients join the queue, and all the first-visit patients balk the queue, the equilibrium arrival rates

λ_{d}^{e} = λ_{d}, λ_{t}^{e} = 0

;

Case II: If

U_{d} = R_{d} - θ_{d} W (λ_{d}) < 0

, a fraction of the first-visit patients decides to join the queue, and others balk. Since

W (λ_{d}) = \frac{μ_{t} + μ_{d} - λ_{d}}{μ_{t} μ_{d} - λ_{d} μ_{t} - λ_{d} μ_{d}}

is monotonically increasing with respect to

λ_{d}

, then, there exists a unique equilibrium arrival rate

λ_{d}^{e} = \frac{β_{d} μ_{t} μ_{d} - (μ_{t} + μ_{d})}{β_{d} (μ_{t} + μ_{d}) - 1}

makes that

R_{d} - θ_{d} W (λ_{d}^{e}) = 0

. Therefore, when

β_{t} < β_{d} < W (λ_{d})

, the equilibrium arrival rates of the two groups of patients

λ_{d}^{e} = \frac{β_{d} μ_{t} μ_{d} - (μ_{t} + μ_{d})}{β_{d} (μ_{t} + μ_{d}) - 1}, λ_{t}^{e} = 0

;

Scenario 2. A fraction of the referred patients joins the queue, i.e., the equilibrium arrival rate

λ_{t}^{e} > 0

, in this scenario, all the first-visit patients choose to join the queue, i.e.,

λ_{d}^{e} = λ_{d}

. Moreover, since a fraction of the referred patients joins the queue, we have

U_{t} = R_{t} - θ_{t} W (λ_{d}) \geq 0

, that is,

β_{t} \geq W (λ_{d})

. Next, we continue to consider two subcases.

Case I: If

U_{t} = R_{t} - θ_{t} W (λ_{d}, λ_{t}) \geq 0

, i.e.,

W (λ_{d}, λ_{t}) \leq β_{t}

, since

W (λ_{d}, λ_{t})

is a monotonically increasing function about

λ_{t}

as

λ_{d} \in [{\bar{λ}}_{d}, \frac{μ_{t} μ_{d}}{μ_{t} + μ_{d}})

; therefore,

W (λ_{d}, 0) < W (λ_{d}, λ_{t})

. When

β_{d} > β_{t} \geq W (λ_{d}, λ_{t})

, then all the first-visit patients join the queue, and all the referred patients join the queue, the equilibrium arrival rates

λ_{d}^{e} = λ_{d}, λ_{t}^{e} = λ_{t}

;

Case II: If

U_{t} = R_{t} - θ_{t} W (λ_{d}, λ_{t}) < 0

, i.e.,

W (λ_{d}, λ_{t}) > β_{t}

. Since

W (λ_{d}, λ_{t})

is a monotonically increasing function about

λ_{t}

as

λ_{d} \in [{\bar{λ}}_{d}, \frac{μ_{t} μ_{d}}{μ_{t} + μ_{d}})

, then, there exists a unique equilibrium arrival rate

λ_{t}^{e}

makes that

U_{t} = R_{t} - θ_{t} W (λ_{d}, λ_{t}^{e}) = 0

. In this subcase, when

W (λ_{d}) \leq β_{t} < W (λ_{d}, λ_{t})

, the equilibrium arrival rates

λ_{d}^{e} = λ_{d}

and

λ_{t}^{e}

satisfies

(\frac{λ_{d}}{μ_{d} (λ_{d} + λ_{t}^{e})} + \frac{1}{μ_{t}}) [1 + \frac{(λ_{d} + λ_{t}^{e}) [μ_{d} + μ_{t} - (λ_{d} + λ_{t}^{e})]}{μ_{t} μ_{d} - λ_{d} μ_{t} - (λ_{d} + λ_{t}^{e}) μ_{d}}] - \frac{λ_{d} + λ_{t}^{e}}{μ_{t} μ_{d}} = β_{t}

.

By summarizing the above analysis, we could obtain the results in Proposition 5. □

References

Wen, J.; Jiang, H.; Song, J. A Stochastic Queueing Model for Capacity Allocation in the Hierarchical Healthcare Delivery System. Asia Pacific J. Oper. Res. 2019, 36, 1950005. [Google Scholar] [CrossRef]
Tang, Y.; Guo, P.; Wang, Y. Equilibrium queueing strategies of two types of customers in a two-server queue. Oper. Res. Lett. 2018, 46, 99–102. [Google Scholar] [CrossRef]
Naor, P. The Regulation of Queue Size by Levying Tolls. Econometrica 1969, 37, 15. [Google Scholar] [CrossRef]
Edelson, N.M.; Hilderbrand, D.K. Congestion Tolls for Poisson Queuing Processes. Econometrica 1975, 43, 81. [Google Scholar] [CrossRef]
Hassin, R.; Haviv, M. To Queue or Not to Queue: Equilibrium Behavior in Queueing Systems; Kluwer Academic Publishers: Boston, MA, USA, 2003. [Google Scholar]
Hassin, R. Rational Queueing; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
Ibrahim, R. Sharing delay information in service systems: A literature survey. Queueing Syst. 2018, 89, 49–79. [Google Scholar] [CrossRef]
Economou, A. How much information should be given to the strategic customers of a queueing system? Queueing Syst. 2022, 100, 421–423. [Google Scholar] [CrossRef]
Ni, G.; Xu, Y.; Dong, Y. Price and speed decisions in customer-intensive services with two classes of customers. Eur. J. Oper. Res. 2013, 228, 427–436. [Google Scholar] [CrossRef]
Zhou, W.; Chao, X.; Gong, X. Optimal Uniform Pricing Strategy of a Service Firm When Facing Two Classes of Customers. Prod. Oper. Manag. 2014, 23, 676–688. [Google Scholar] [CrossRef]
Zhou, W.; Lian, Z.; Wu, J. When should service firms provide free experience service? Eur. J. Oper. Res. 2014, 234, 830–838. [Google Scholar] [CrossRef]
Hu, M.; Li, Y.; Wang, J. Efficient Ignorance: Information Heterogeneity in a Queue. Manag. Sci. 2018, 64, 2650–2671. [Google Scholar] [CrossRef]
Wang, Z.; Wang, J. Information heterogeneity in a retrial queue: Throughput and social welfare maximization. Queueing Syst. 2019, 92, 131–172. [Google Scholar] [CrossRef]
Wang, Z.; Fang, L. The effect of customer awareness on priority queues. Nav. Res. Logist. 2022, 69, 801–815. [Google Scholar] [CrossRef]
Wang, J.; Sun, K. Optimal pricing and capacity sizing for online service systems with free trials. OR Spectr. 2022, 44, 57–86. [Google Scholar] [CrossRef]
Hanukov, G. A queueing-inventory model with skeptical and trusting customers. Ann. Oper. Res. 2023, 331, 763–786. [Google Scholar] [CrossRef]
Guo, P.; Lindsey, R.; Zhang, Z.G. On the Downs–Thomson Paradox in a Self-Financing Two-Tier Queuing System. Manuf. Serv. Oper. Manag. 2014, 16, 315–322. [Google Scholar] [CrossRef]
Chen, W.; Zhang, Z.G.; Hua, Z. Analysis of two-tier public service systems under a government subsidy policy. Comput. Ind. Eng. 2015, 90, 146–157. [Google Scholar] [CrossRef]
Hua, Z.; Chen, W.; Zhang, Z.G. Competition and Coordination in Two-Tier Public Service Systems under Government Fiscal Policy. Prod. Oper. Manag. 2016, 25, 1430–1448. [Google Scholar] [CrossRef]
Qian, Q.; Zhuang, W. Tax/subsidy and capacity decisions in a two-tier health system with welfare redistributive objective. Eur. J. Oper. Res. 2017, 260, 140–151. [Google Scholar] [CrossRef]
Wan, G.; Wang, Q. Two-tier healthcare service systems and cost of waiting for patients. Appl. Stoch. Model. Bus. Ind. 2017, 33, 167–183. [Google Scholar] [CrossRef]
Qian, Q.; Guo, P.; Lindsey, R. Comparison of Subsidy Schemes for Reducing Waiting Times in Healthcare Systems. Prod. Oper. Manag. 2017, 26, 2033–2049. [Google Scholar] [CrossRef]
Zhang, Z.G.; Yin, X. Information and pricing effects in two-tier public service systems. Int. J. Prod. Econ. 2021, 231, 107897. [Google Scholar] [CrossRef]
Zhang, Z.G.; Yin, X. Designing a Sustainable Two-Tier Service System with Customer’s Asymmetric Preference for Servers. Prod. Oper. Manag. 2021, 30, 3856–3880. [Google Scholar] [CrossRef]
Zhou, W.; Huang, W.; Hsu, V.N.; Guo, P. On the benefit of privatization in a mixed duopoly service system. Manag. Sci. 2023, 69, 1486–1499. [Google Scholar] [CrossRef]
Chen, W. Uniform pricing and subsidy coordination mechanism in a two-tier healthcare system under a co-payment policy. Appl. Stoch. Model. Bus. Ind. 2023, 39, 498–519. [Google Scholar] [CrossRef]
Hu, M.; Huang, W.; Liu, C.; Zhou, W. Regulation of Privatized Public Service Systems. Prod. Oper. Manag. 2024; advance online publication. [Google Scholar] [CrossRef]
Li, N.; Kong, N.; Li, Q.; Jiang, Z. Evaluation of reverse referral partnership in a tiered hospital system—A queuing-based approach. Int. J. Prod. Res. 2017, 55, 5647–5663. [Google Scholar] [CrossRef]
Li, X.; Huang, W.; Chen, Y.; Zhou, W. The effects of an online inquiry service on gatekeeping systems with heterogeneous patients. J. Manag. Sci. Eng. 2019, 4, 211–227. [Google Scholar] [CrossRef]
Zhou, W.; Li, X.; Qian, Q. Comparison of Gatekeeping and Non-gatekeeping Designs in a Service System with Delay-sensitive Customers. J. Syst. Sci. Syst. Eng. 2021, 30, 125–150. [Google Scholar] [CrossRef]
Li, N.; Zhang, Y.; Teng, D.; Kong, N. Pareto optimization for control agreement in patient referral coordination. Omega 2021, 101, 102234. [Google Scholar] [CrossRef]
Li, Z.-P.; Wang, J.-J.; Chang, A.-C.; Shi, J. Capacity reallocation via sinking high-quality resource in a hierarchical healthcare system. Ann. Oper. Res. 2021, 300, 97–135. [Google Scholar] [CrossRef]
Wang, J.-J.; Li, Z.-P.; Shi, J.; Chang, A.-C. Hospital referral and capacity strategies in the two-tier healthcare systems. Omega 2021, 100, 102229. [Google Scholar] [CrossRef]
Rajan, B.; Tezcan, T.; Seidmann, A. Service Systems with Heterogeneous Customers: Investigating the Effect of Telemedicine on Chronic Care. Manag. Sci. 2019, 65, 1236–1267. [Google Scholar] [CrossRef]
Li, Z.; Chang, J.; Shi, J.; Wang, J. Coordination schemes for resource reallocation and patient transfer in hospital alliance models. Decis. Sci. 2023; advance online publication. [Google Scholar] [CrossRef]
Li, Z.-P.; Chang, A.; Zou, Z. Design mechanism to coordinate a hierarchical healthcare system: Patient subsidy vs. capacity investment. Omega 2023, 118, 102852. [Google Scholar] [CrossRef]
Kim, S.-C.; Horowitz, I.; Young, K.K.; Buckley, T.A. Analysis of capacity management of the intensive care unit in a hospital. Eur. J. Oper. Res. 1999, 115, 36–46. [Google Scholar] [CrossRef]
Neuts, M.F. Matrix-Geometric Solutions in Stochastic Models: Algorithmic Approach; Johns Hopkins University Press: Baltimore, MD, USA, 1981. [Google Scholar]

Figure 1. Queueing model settings for the comprehensive hospital.

Figure 2. Transition rate diagram of

\{(L (t), J (t)), t \geq 0\}

Figure 2. Transition rate diagram of

\{(L (t), J (t)), t \geq 0\}

Figure 3. The impact of arrival rates on the expected sojourn time (

μ_{d} = 4

,

μ_{t} = 5

).

Figure 3. The impact of arrival rates on the expected sojourn time (

μ_{d} = 4

,

μ_{t} = 5

).

Figure 4. The equilibrium joining probabilities of patients versus

λ_{t}

and

λ_{d}

(

β_{t} > β_{d}

).

Figure 4. The equilibrium joining probabilities of patients versus

λ_{t}

and

λ_{d}

(

β_{t} > β_{d}

).

Figure 5. The equilibrium joining probabilities of patients versus

λ_{t}

and

λ_{d}

(

β_{t} < β_{d}

).

Figure 5. The equilibrium joining probabilities of patients versus

λ_{t}

and

λ_{d}

(

β_{t} < β_{d}

).

Figure 6. The individual equilibrium strategies and social optimal strategies versus

λ_{t}

.

Figure 6. The individual equilibrium strategies and social optimal strategies versus

λ_{t}

.

Figure 7. The individual equilibrium strategies and social optimal strategies versus

R_{d}

.

Figure 7. The individual equilibrium strategies and social optimal strategies versus

R_{d}

.

Table 1. The established model parameters.

$λ_{d}$ ( $λ_{t}$ )	The arrival rate of first-visit patients (referred patients)
$λ$	The total arrival rate of patients
$γ$ ( $\bar{γ}$ )	The fraction of first-visit patients (referred patients)
$θ_{d}$ ( $θ_{t}$ )	The unit-time waiting cost of first-visit patients (referred patients)
$R_{d}$ ( $R_{t}$ )	The service reward received by a first-visit patient (referred patient) once the service is complete, or the perceived value for first-visit patients (referred patients)
$1 / μ_{d}$ ( $1 / μ_{t}$ )	The expected service time of a first-visit patient (referred patient)
$W (λ_{d}, λ_{t})$	The expected sojourn time of an arbitrary patient
$U_{d}$ ( $U_{t}$ )	The utility of an arriving first-visit patient (referred patient) who decides to join the queue
$β_{d}$ ( $β_{t}$ )	The patience level of first-visit patients (referred patients)

Table 2. Notation and system variables.

$q_{n, j}^{d}$ ( $q_{n, j}^{t}$ )	The unique individual optimal pure strategy of first-visit patients (referred patients) in the fully observable case
$λ_{d}^{e}$ ( $λ_{t}^{e}$ )	The equilibrium queueing behavior of first-visit patients (referred patients) in the unobservable case
$q_{d}^{e}$ ( $q_{t}^{e}$ )	The equilibrium joining probability of first-visit patients (referred patients) in the unobservable case
$q_{d}^{s o c}$ ( $q_{t}^{s o c}$ )	The social optimal strategies of first-visit patients (referred patients)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, Y.; Liu, L.; Jiang, T.; Chai, X. Strategic Queueing Behavior of Two Groups of Patients in a Healthcare System. Mathematics 2024, 12, 1579. https://doi.org/10.3390/math12101579

AMA Style

Liu Y, Liu L, Jiang T, Chai X. Strategic Queueing Behavior of Two Groups of Patients in a Healthcare System. Mathematics. 2024; 12(10):1579. https://doi.org/10.3390/math12101579

Chicago/Turabian Style

Liu, Youxin, Liwei Liu, Tao Jiang, and Xudong Chai. 2024. "Strategic Queueing Behavior of Two Groups of Patients in a Healthcare System" Mathematics 12, no. 10: 1579. https://doi.org/10.3390/math12101579

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Strategic Queueing Behavior of Two Groups of Patients in a Healthcare System

Abstract

1. Introduction

2. Model Formulation

3. Performance Analysis

3.1. Stability Condition

3.2. Expected Sojourn Time

4. Equilibrium Queueing Behavior of Two Groups of Patients

4.1. Analysis of Fully Observable Case

4.2. Analysis of Fully Unobservable Case

5. Numerical Examples

6. Research Findings

7. Concluding Remarks

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI