Next Article in Journal
Fuzzy Graph Learning Regularized Sparse Filtering for Visual Domain Adaptation
Previous Article in Journal
Study of Image Classification Accuracy with Fourier Ptychography
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Unconstrained Estimation of Multitype Car Rental Demand

1
College of Traffic & Transportation, Chongqing Jiaotong University, Chongqing 400074, China
2
Department of Civil and Environmental Engineering, Technion-Israel Institute of Technology, Haifa 32000, Israel
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(10), 4506; https://doi.org/10.3390/app11104506
Submission received: 24 April 2021 / Revised: 11 May 2021 / Accepted: 11 May 2021 / Published: 14 May 2021
(This article belongs to the Section Transportation and Future Mobility)

Abstract

:
The unconstrained demand forecast for car rentals has become a difficult problem for revenue management due to the need to cope with a variety of rental vehicles, the strong subjective desires and requests of customers, and the high probability of upgrading and downgrading circumstances. The unconstrained demand forecast mainly includes repairing of constrained historical demand and forecasting of future demand. In this work, a new methodology is developed based on multiple discrete choice models to obtain customer choice preference probabilities and improve a known spill model, including a repair process of the unconstrained demand. In addition, the linear Holt–Winters model and the nonlinear backpropagation neural network are combined to predict future demand and avoid excessive errors caused by a single method. In a case study, we take advantage of a stated preference and a revealed preference survey and use the variable precision rough set to obtain factors and weights that affect customer choices. In this case study and based on a numerical example, three forecasting methods are compared to determine the car rental demand of the next time cycle. The comparison with real demand verifies the feasibility and effectiveness of the hybrid forecasting model with a resulting average error of only 3.06%.

1. Introduction

The car rental industry plays a huge role globally. It can act as a lubricant for production and consumption to ease their mutual restraints. It can also expand the automotive consumer market and evaluate the popularity of new cars before they come into the market. At the same time, because the public transportation system is limited by operating time, departure frequency, accessible range, comfort, privacy, and other conditions, car rental, with its outstanding advantages such as high flexibility, ease of use, and privacy, has shown considerable growth over the years. The global car rental market was valued at $79.5 bn in 2018 and is expected to reach $125.4 bn by 2025, registering a CAGR of 5.1% from 2019 to 2025 [1]. In recent years, the demand for car rental services in the Asia-Pacific region has become the largest and fastest growing segment, especially in China and India, where there are high population density and rapidly growing demand.
Mainstream car rental companies started to use revenue management (RM) as a crucial tool for reducing operating costs and improving competitiveness in the 1990s. RM is an important interface between enterprises’ supply and market demand, and its objective is to maximize profits. The most widely known RM application occurs in the airline industry. It mainly includes four aspects, namely forecasting, overbooking, capacity control, and pricing [2]. Among them, forecasting is the foundation and provides vital input information for other work. A 12.5–25% reduction of forecast error could translate into 1–3% incremental revenue generated from the RM system [3,4]. Therefore, to achieve the desired level of accuracy, it is important and necessary to innovate the forecast method constantly [5]. In reality, influenced by booking and/or capacity limits, the customer’s present demand may not be satisfied, which will lead to it being lost (spilled) or recaptured by a more expensive (buyup) or cheaper (buydown) available product. The sample booking curves with three grades of products (the fare order is A > B > C) are shown in Figure 1 [6,7]. Therefore, in order to repair the constrained demand, it is necessary to carry out an unconstrained estimation of the customer’s actual demand and forecast the future demand on this basis [8,9].
Research into unconstrained estimation (sometimes also called detruncation, uncensoring) in the RM system started more than 40 years ago and mainly focused on the airline industry. Unconstrained estimation mainly uses observed “censored demand” to estimate and calculate the “uncensored demand”. The original methods were based on statistics, such as the expectation maximization (EM), projection detruncation (PD), booking curve (BC), and mean imputation. Since the beginning of this century, discrete customer choice models have grown in popularity to estimate unconstrained demand [8,10,11]. Forecasting usually yields the demands and no-show rate by booking class, the latter of which will not be discussed here. The common methods include time series, linear regression, exponential smoothing, double exponential smoothing, pickup, neural networks, principal component analysis, and adaptive models [6,12,13]. Since air transportation can meet the travel demand between two points, the prediction of OD demand is necessary [14,15,16]. Guo surveyed the history of research on unconstraining methods by reviewing over 130 references [17]. Weatherford provided two reviews of unconstrained estimation and forecast methods broadly used in different industries up to 2016 [18,19]. Recent discussions include forecast multipliers and hybrid forecasting [9], the effect of customers’ reference price on demand [20], Gaussian processes for unconstraining demand [21], and demand forecast accuracy [22]. Due to the important research and application value of revenue management theory, many advances have been made in the hotel, car or truck rental, cargo/freight, internet service and retailing, cruise line, rail, and other industries regarding demand estimation and forecasting [19].
Unlike the airline industry, the car rental industry is more concerned with the demand at each rental station, and OD forecasting does not need to be considered as much. Supply and demand are dynamically balanced, but there is still an overestimation or underestimation. Since demand determines the size of the leasing station inventory and fleet, demand estimation and forecasting techniques are worth studying [23]. The first reports of RM application to car rental were at Hertz and National Car Rental [24,25]. Jishan used a decomposition method to identify the actual latent demand from the total recorded turndowns [26]. Gordon simplified the problem by human–computer interaction, hoping to find the right level of detail at which to forecast and optimize [27]. Dhruval optimized the fleet with the help of a robust design methodology, considering fleet as a product and cars as an affecting variable [28]. Kourentzes addressed the frequently encountered situation of observing only a few sales events at the individual product level and proposed variants of small demand forecasting methods to be used for unconstraining [29]. In general, there is relatively little research on unconstrained estimation and prediction of car rental demand. By referring to research methods in other fields, relevant research in the car rental industry is worthy of a further discussion.
Due to the relative lack of articles that deal with car rental demand forecasting, this work develops an optimization methodology to estimate and forecast the demand for a car rental station; it is then applied to a case study in China. By using the case study, the work also enables comparison between the effects of different methods, especially for cases of small demand. The remainder of this work is organized as follows. Section 2 presents the problem statement and hypothesis, including an improved spill model for unconstrained estimation. In Section 3, we propose a hybrid forecasting method based on the Holt–Winters model and backpropagation neural network. Subsequently, Section 4 introduces the case study with numerical experiments and a discussion. Section 5 concludes the work.

2. Unconstrained Estimation

2.1. Hypothesis and Variable Definitions

Before studying the forecasting method, the conditional hypothesis and variable definitions should be explained. This work is mainly aimed at the car rental industry, and the method is also applicable to the hotel and aviation industries, etc.
Hypothesis 1.
Unconstrained demand X of car type i in presell lead time meets normal distribution, and the lead time is seven days.
Hypothesis 2.
Demands of various car types are random and independent, and the numbers of car rentals are counted by day; the deadline is midnight.
Hypothesis 3.
Inventory quantity is constant without car dispatching among stations, taking no account of batch demand, overdue return, or oversold behaviors.
Hypothesis 4.
The customer’s willingness to pay is from low to high.
The variables are defined as follows:
  • i : index of car types, i I . Indexes of I are listed in an increasing order of car rental price, corresponding to the gradually increased levels;
  • j : index of rental demand recording times, j = 1 , 2 , , J ;
  • t : review points of inventory control, t = 0 , 1 , , T ; presale system will open when t = 0 , and it will be closed when t = T .
  • Δ t : presale lead time interval between t and t 1 ;
  • B ( i , j , t ) : order quantity limit of car type i in Δ t in the j th documented circle, namely the quota ceiling for rent;
  • C ( i , j , t ) : cumulative order quantity of car type i at review point t in the j th documented circle, particularly C ( i , j , 0 ) = 0 ;
  • I ( i , j , t ) : presale state of car type i in Δ t in the j th documented circle, “1” indicates that demand is not restricted, and “0” indicates that demand is limited.
  • I B ( i , j , t ) : order quantity of car type i in Δ t in the j th documented circle, particularly I B ( i , j , 0 ) = 0 , I B ( i , j , t ) = C ( i , j , t ) C ( i , j , t 1 ) ;
  • U ( i , j , t ) : spillage of car type i in Δ t in the j th documented circle, particularly U ( i , j , 0 ) = 0 ;
  • C P ( i , j , t ) : complete spillage of car type i in Δ t in the j th documented circle, particularly C P ( i , j , 0 ) = 0 ;
  • S U ( i , j , t ) : cumulative spillage of car type 1 to i at review point t in the j th documented circle, S U ( i , j , 0 ) = 0 ;
  • I U ( i , j , t ) : unconstrained estimation of car type i in Δ t in the j th documented circle, particularly I U ( i , j , 0 ) = 0 ;
  • T U ( i , j , t ) : real value of car type i in Δ t in the j th documented circle.

2.2. Unconstrained Estimation Methods

For the revenue management system, the starting point of analysis is some set of assumptions regarding an underlying stochastic or deterministic demand process [30]. In fact, when the real customer demand exceeds the preset order quantity limit, the recorded demand data in the presale system would be censored. Using these “cutoff demand data” for demand forecasting is bound to reduce the accuracy of forecasts and directly impact the revenue. Therefore, deducing the real historical customer demand from the recorded demand data is the first problem. Some common unconstrained estimation methods are as follows.

2.2.1. Expectation Maximization

EM is one of the most common methods to restore incomplete data by using the iterative algorithm in statistics. The EM algorithm is mainly composed of two steps, E (expectation) and M (maximization)—that is, after initializing the parameters to be estimated through the observable incomplete data information, the conditional expectation of missing information in the incomplete data is calculated by parameter estimation, and the missing information is replaced accordingly. In step E, an expected log-likelihood function of complete data is obtained. In step M, the improved parameter values are obtained by using this function through the maximum likelihood estimation method. The whole process will be repeated until convergence [31].
Considering the unconstrained demand X in Δ t with the expectation μ and variance σ 2 , the initial values are then
μ ( i , t ) ( 0 ) = j = 1 J I B ( i , j , t ) I ( i , j , t ) j = 1 J I ( i , j , t )
σ 2 ( i , t ) ( 0 ) = t = 1 J { [ I B ( i , j , t ) μ ( i , t ) ( 0 ) ] I ( i , j , t ) } 2 t = 1 J I ( i , j , t ) 1
When all data is constrained, μ ( i , t ) and σ 2 ( i , t ) cannot be initialized properly, and the repairing process is abandoned. Let the unconstrained preliminary estimation value be equal to the increment of the observed value; then, the unconstrained estimation value after k iterations will be
I U ( i , j j , t ) ( k ) = I B ( i , j , t ) + I U ( i , j , t ) ( k 1 )
When only one set of data is constrained, σ 2 ( i , t ) initialization is abandoned, and the convergence is tested directly by μ ( i , t ) . The k th iteration is divided into two steps.
  • Step E: unconstrained estimation value is
    I U ( i , j , t ) ( k ) = E [ X | X I U ( i , j , t ) ( k 1 ) ] = I U ( i , j , t ) ( k 1 ) x f ( x ) d x I U ( i , j , t ) ( k 1 ) f ( x ) d x
    where f ( x ) is the normal probability density function, and I U ( i , j , t ) ( 0 ) = I B ( i , j , t ) when k = 0 .
  • Step M: find the partial derivative of the log-likelihood function Q ( μ , σ 2 ) versus μ ( i , t ) ( k 1 ) and σ 2 ( i , t ) ( k 1 ) , and let them equal 0 to calculate the re-estimated μ and σ 2 .
    Q ( μ , σ 2 ) = ln L ( μ , σ 2 ) = J 2 ln 2 π J ln σ j = 1 J [ I B ( i , j , t ) μ ] 2 I ( i , j , t ) + j = 1 J [ I B ( i , j , t ) μ ] 2 [ 1 I ( i , j , t ) ] 2 σ 2
    μ = μ ( i , t ) ( k ) = 1 J { j = 1 J I B ( i , j , t ) I ( i , j , t ) + j = 1 J I U ( i , j , t ) ( k 1 ) [ 1 I ( i , j , t ) ] }
    σ 2 = σ 2 ( i , t ) ( k ) = 1 J 1 { j = 1 J [ I B ( i , j , t ) μ ( i , t ) ( k 1 ) ] 2 I ( i , j , t ) + j = 1 J [ I U ( i , j , t ) ( k 1 ) μ ( i , t ) ( k 1 ) ] 2 [ 1 I ( i , j , t ) ] }
    Convergence condition: the operation should be stopped when | μ ( i , t ) ( k ) μ ( i , t ) ( k 1 ) | ε or | σ 2 ( i , t ) ( k ) σ 2 ( i , t ) ( k 1 ) | ε , where ε is any small given number. Otherwise, go back to step E and continue iterating.

2.2.2. Projection Detruncation

The initial values, convergence time, and iteration number of the EM algorithm in the iterative process are more susceptible to their own data in varying degrees. Based on the EM algorithm, Hopperstad proposed PD, which solves the limitation of EM in a large-scale unconstrained estimation [32,33]. The research proves that the PD algorithm is more flexible in practical applications.
PD and EM have the same principle, both of which include step E and M. Both algorithms use the iterative idea of statistics to estimate the parameters of constraint demand data, but the difference is that the parameter-related heuristic method is used in PD at step E. In PD, the unconstrained estimation value is gained without calculating the conditional expectation of the constraint data. In step M, the convergence conditions are the same, namely
I U ( i , j , t ) ( k ) = E [ X | X I U ( i , j , t ) ( k 1 ) ] = I U ( i , j , t ) ( k ) f [ x | μ ( i , t ) ( k 1 ) , σ 2 ( i , t ) ( k 1 ) ] d x = τ I B ( i , j , t ) ( k 1 ) f ( x | μ ( i , t ) ( k 1 ) , σ 2 ( i , t ) ( k 1 ) ) d x
When I U ( i , j , t ) ( k 1 ) > μ ( i , t ) ( k 1 ) ,
I U ( i , j , t ) ( k ) = μ ( i , t ) ( k 1 ) + 1.6 σ 2 ( i , t ) ( k 1 ) ln { 1 [ 1 2 τ + 2 τ Φ ( I U ( i , j , t ) ( k 1 ) μ ( i , t ) ( k 1 ) σ ( i , t ) ( k 1 ) ) ] 2 }
When I U ( i , j , t ) ( k 1 ) μ ( i , t ) ( k 1 ) ,
I U ( i , j , t ) ( k ) = μ ( i , t ) ( k 1 ) 1.6 σ 2 ( i , t ) ( k 1 ) ln { 1 [ 1 2 τ + 2 τ Φ ( I U ( i , j , t ) ( k 1 ) μ ( i , t ) ( k 1 ) σ ( i , t ) ( k 1 ) ) ] 2 }
where Φ ( x ) is the distribution function of normal distribution, and parameter τ is any given constant ( 0 τ 1 ) and stays the same throughout the whole process. τ represents the probability that the observed constrained demand has a true value greater than E [ X | X I U ( i , j , t ) ( k 1 ) ] . Instead of calculating a conditional expectation, in the probability distribution graph (Figure 2), PD balances two things. The first is the area of the probability distribution between the order quantity limit (that is, the original constrained value) and the new estimate or projected value (area M), and the other is the area between the new estimate and infinity (area N). The horizontal axis value corresponding to region N represents the portion of the real demand that is underestimated by the unconstrained estimation [34]. When τ = 0.5 , the PD algorithm can truly balance the values of M and N, yielding nearly similar results to the EM algorithm.

2.2.3. Multitype Spill Model

The multitype spill model (MSM) takes into account the vertical recapture of different price grade requirements, where the buyer’s behaviors of buyup and buydown indwell in different price levels. Therefore, the unconstrained estimation process based on MSM is closer to reality, which can effectively avoid the overestimation of real demand due to the repeated records of the revenue management system’s actual observable demand between different car type prices. The spill model is mainly divided into two parts, the first part being the calculation of spillage, and the second part being the calculation of unconstrained demand [35]. The specific calculation process is as follows:
  • Check if the rental demand is constrained in Δ t . When I ( i , j , t ) = 1 , the rental demand is not constrained and no repair is performed; otherwise, the repairing process is performed, go to next step.
    I ( i , j , t ) = { 1 , C ( i , j , t ) B ( i , j , t ) < 0 0 , C ( i , j , t ) B ( i , j , t ) 0
  • Parameters initialization. The distribution parameters of the demand in Δ t are initialized by the observable “unconstrained demand”, which is the same as the EM algorithm.
  • Calculation of spillage and cumulative spillage. According to the assumptions, the customer’s willingness to pay is characterized from low to high car type, so the spillage should be calculated from the lowest car type.
    • For a single car type, cumulative spillage can be determined by the following formula:
      S U ( 1 , j , t ) = { 0 , I ( 1 , j , t ) = 1 c f i , t ( x ) ( x c ) d x , I ( 1 , j , t ) = 0
      where c = I B ( i , j , t ) , f i , t ( x ) is the probability density function of unconstrained demand for car type i in Δ t . When I ( 1 , j , t ) = 0 , rental demand was constrained,
      S U ( 1 , j , t ) = c f i , t ( x ) ( x c ) d x = σ ϕ ( b ) + ( μ c ) [ 1 Φ ( b ) ]
      where b = c μ σ , ϕ ( x ) is the distribution function of standard normal distribution, and Φ ( x ) is the distribution function of standard normal distribution [36].
    • For a multitype situation, when i = 1 , considering cumulative spillage S U ( I , j , t 1 ) at review point t 1 from car type 1 to I .
      S U ( I , j , t ) = { S U ( I , j , t 1 ) ,   I ( i , j , t ) = 1 c f 1 , t ( x ) [ x c + S U ( I , j , t 1 ) ] d x ,   I ( i , j , t ) = 0
      When I ( i , T j , 1 ) = 0 ,
      S U ( I , j , t ) = c f 1 , t ( x ) [ x c + S U ( I , j , t 1 ) ] d x = σ ϕ ( b ) + [ S U ( I , j , t 1 ) σ b ] [ 1 Φ ( b ) ]
      When i > 1 , calculate S U ( i , j , t ) by considering the cumulative spillage S U ( i 1 , j , t ) at review point t from car type 1 to i 1 .
      S U ( i , j , t ) = { S U ( i 1 , j , t ) ,   I ( i , j , t ) = 1 c f i , t ( x ) [ x c + S U ( i 1 , j , t ) ] d x ,   I ( i , j , t ) = 0
      When I ( i , T j , 1 ) = 0 ,
      S U ( i , j , t ) = c f i , t ( x ) [ x c + S U ( i 1 , j , t ) ] d x = σ ϕ ( b ) + [ S U ( i 1 , j , t ) σ b ] [ 1 Φ ( b ) ]
  • Calculate spillage.
    U ( i , j , t ) = { S U ( 1 , j , t ) , i = 1 S U ( i , j , t ) S U ( i 1 , j , t ) , i > 1
  • Get the repaired result, namely unconstrained estimation.
    I U ( i , j , t ) = { I B ( i , j , t ) , I ( i , j , t ) = 1 I B ( i , j , t ) + U ( i , j , t ) , I ( i , j , t ) = 0

2.3. Improved Multitype Spill Model

Generally, traditional revenue management assumes that customer selection behavior is short sighted and ignores customer subjectivity. However, competition in the car rental market has intensified, and the “buyer market” has gradually expanded, so the subjective initiative of customers has been more fully reflected. The customer can choose target commodity according to his or her own preferences and subjective utility. These factors can be divided into three categories according to the differences in sources, namely accidental factors, objective factors, and subjective factors.
Accidental factors are not expected and have some randomness, such as concerts, sports events, and weather changes. Objective factors are external factors, which are generally not changeable in a short period of time. Objective factors are not directly related to the customers themselves, including policy and regulations, regional economic level, and rental vehicle attributes. Subjective factors are the customer’s own determinants, which are expressed in terms of customer preferences, trip purposes, age, and literacy. Subjective factors can be grasped by the analysis of the customer’s choice intention, usually in the form of a questionnaire.

2.3.1. SP/RP Survey

The basic data of customer selection preference were obtained by stated preference (SP) and revealed preference (RP) surveys. The SP survey preresearches the subjective preferences of customers when renting a car before the rental occurs. The RP survey analyzes the actual selection behavior of customers in the context of what has happened. SP data were more flexible, while RP data were more reliable.
The SP survey can obtain multiple data from a single interviewee, with the advantages of small sample and low survey cost. The basic principle of the SP survey is to predetermine the various attribute factors and their influence levels and then invite interviewees to score the plan in the set situation. The respondents’ choices may be different from their actual actions, but they can still find the main factors that can be calculated and have a large impact on the overall mean. Then, the secondary factors that are difficult to measure will be removed. The data obtained from the RP survey are the actual occurrence data, since the data records behaviors that have been taken by the respondent or the actual selection behavior observed. The basic principle of the RP survey is to describe the occurrence or existence of the situation with different attribute factors and influence levels and to invite the respondents to score the content.
Before the SP survey plan is designed, condition attributes, decision attributes, and corresponding level values should be determined. The survey content includes the importance of the brand effect, the satisfaction of car rental type, the sensitivity of the rental price, returning location, satisfaction of the vehicle status, and the satisfaction with a special offer, service attitude, procedure convenience, and accident handling. The RP survey is aimed at customers who have made choices, and the influencing factors are rated as very satisfied, satisfied, general, not satisfied, and very dissatisfied. The survey includes the type of rental vehicle and satisfaction with the rental price, return location restrictions, vehicle status, service attitude, procedure convenience, and accident handling when the preferred rental vehicle is rejected. See Appendix A and Appendix B for the SP/RP questionnaire of the case study.
Customers are affected by many attributes when selecting multiple models. In order to determine the degree of influence of each attribute, AHP is usually used. AHP (analytic hierarchy process) is an easy way to quantitatively and qualitatively deal with some fuzzy and complex subjective problems. The most critical step is to obtain a judgment matrix by pairwise comparison to calculate the weight of each factor. The specific calculation process is not given. However, due to the uncertainty of expert experience, the subjective constructed judgment matrix may not meet the conditions of complete consistency. In order to reduce the inconsistency of subjective judgment, the VPRS (variable precision rough set) can be used to simplify the factors affecting the customer’s choice behavior, and similar factors are described by several factors to construct the judgment matrix.
VPRS expands the Pawlak rough set by introducing a confidence level η ( 0.5 < η 1 ) , so that admissible classification error is allowed to a certain extent. It can improve the flexibility of decision rules and, at the same time, reflect the correlation in data analysis, which is beneficial to the discovery of relevant data from unrelated data, that is, the implicit patterns in the data can be more clearly expressed [37].
Let S = ( U , A , V , f ) be an information system, where U = { u 1 , u 2 , , u | U | } is a finite nonempty set called the domain object space. A = { a 1 , a 2 , , a | A | } is a finite nonempty subset of attributes. If the attribute in A can be further divided into two disjoint nonempty subsets, namely the conditional attribute set C Ø and decision attribute set D Ø , A = C D , X U , C D = Ø , then S is called the decision table. V is the union of the attribute domains, i.e., V = V a for a A , where V a denotes the domain of attribute a ; f is an information function which associates a unique value of each attribute with every object belonging to U .
For any subset P Ø and P C , R ( P ) = { ( x , y ) U × U : f ( x , p ) = f ( y , p ) , p P } is an equivalence relation on U , then the corresponding equivalence class is represented as [ x ] p . By replacing the inclusion relation with a majority inclusion relation in the original definition of lower approximation of a set, we obtain the following generalized notion η -lower approximation and η -upper approximation:
a p r _ p η ( X ) = { [ x ] p : | [ x ] p X | | [ x ] p | η }
a p r ¯ p η ( X ) = { [ x ] p : | [ x ] p X | | [ x ] p | 1 η }
Then, the classification accuracy can be defined as
γ η ( P , D ) = | { x U : | [ x ] p X | | [ x ] p | η } | | U |
where γ η ( P , D ) measures the percentage of knowledge that can be correctly classified in the existing knowledge given a certain η in the domain.

2.3.2. Intention Analysis

Nine factors affecting the customer’s car rental behavior are regarded as conditional attributes by the Delphi method (Table 1), and the level values of the conditional attribute are defined by three grades: 1 = satisfaction, 2 = something in between, and 3 = dissatisfaction. The level values of the decision attribute can also be defined by three grades: R = rent, B = book but not necessarily rent, and G = give up.
Two rental sites (International Convention & Exhibition Center and Jiangbei Airport) of China Auto Rental Inc. (Beijing, China) were selected in Chongqing, China. Interviewees were randomly selected near each site, and the investigators cooperated to fill out the questionnaire. A total of 50 questionnaires were distributed and 40 questionnaires were returned, of which 30 were valid. By sorting out and analyzing the survey data, customer selection decision information can be obtained, as shown in Table 2.
The following equivalence classes are available:
  • U / C = ( X 1 , X 2 , , X 9 ) , where X 1 = { u 1 , u 9 , u 16 , u 19 , u 30 } , X 2 = { u 2 , u 8 , u 12 , u 15 , u 18 , u 29 } , X 3 = { u 3 , u 20 , u 23 , u 28 } , X 4 = { u 4 , u 17 , u 22 , u 25 } , X 5 = { u 5 } , X 6 = { u 6 , u 10 , u 24 , u 26 } , X 7 = { u 7 , u 14 , u 21 , u 27 } , X 8 = { u 11 } , X 9 = { u 13 } .
  • U / D = ( D R , D B , D G ) , where D R = { u 3 , u 7 , u 8 , u 11 , u 12 , u 14 , u 20 , u 21 , u 23 , u 27 , u 28 } , D B =   { u 2 , u 5 , u 6 , u 10 , u 13 , u 18 , u 24 , u 26 } , D G = { u 1 , u 4 , u 9 , u 15 , u 16 , u 17 , u 19 , u 22 , u 25 , u 29 , u 30 } .
  • a p r _ p η ( D R ) = { X 3 , X 7 , X 8 } , a p r _ p η ( D B ) = { X 1 , X 4 } , a p r _ p η ( D G ) = { X 5 , X 6 , X 9 } .
Classification accuracy of VPRS is an evaluation of the ability to perform object classification with a confidence level. Let η = 0.8 . Similarly, the classification quality of each attribute division can be determined according to Equations (20)–(22).
γ η ( P , D ) = 4 / 5 , γ c 1 η ( P , D ) = γ c 2 η ( P , D ) = γ c 4 η ( P , D ) = 0 , γ c 3 η ( P , D ) = γ c 5 η ( P , D ) = 7 / 15 , γ c 6 η ( P , D ) = γ c 8 η ( P , D ) = 2 / 15 , γ c 7 η ( P , D ) = γ c 9 η ( P , D ) = 1 / 5 .
The case where there is zero in the above classification quality indicates that some interleaved repetitions or substitutions exist among the nine main attributes. Therefore, there are six main factors affecting customer choice, namely rental price c 3 , service attitude c 5 , returning location c 6 , vehicle status c 7 , procedure convenience c 8 , and accident handling c 9 . By comparing the six elements in pairs, the importance of each factor in the conditional attribute on the customer decision can be obtained. The judgment matrix Λ is
Λ = [ 1 1 7 / 2 7 / 3 7 / 2 7 / 3 1 1 7 / 2 7 / 3 7 / 2 7 / 3 2 / 7 2 / 7 1 2 / 3 1 2 / 3 3 / 7 3 / 7 3 / 2 1 3 / 2 1 2 / 7 2 / 7 1 2 / 3 1 2 / 3 3 / 7 3 / 7 3 / 2 1 3 / 2 1 ]
Obviously, the judgment matrix Λ is a positive and negative matrix, which fully satisfies the consistency condition, and its maximum eigenvalue is π max = 4 . The corresponding feature vector is ω = ( 0.629 , 0.629 , 0.180 , 0.269 , 0.180 , 0.269 ) T . After normalizing, the weight vector of Λ is ω ¯ = ( 0.292 , 0.292 , 0.083 , 0.125 , 0.083 , 0.125 ) T .

2.3.3. Customer Choice Probability

The multitype spill model is based on traditional revenue management. The assumption is that people in reality are irrational. When the product price is lower than the willingness to pay, the customer will buy it immediately. Obviously, this assumption ignores the initiative of the decision-making subject. In reality, individuals have various cognitive deviations. If we do not proceed from the customer’s choice behavior, we cannot find the customer’s car rental rules and the changing trend of demand, which is very unfavorable to the car rental company’s ability to set prices and inventory in the later stage.
Therefore, in order to describe the customer’s choice behavior, the Logit model is used to solve the above problems. The utility maximization theory is assumed, and the utility size is used to measure the probability that a customer selects a certain type of vehicle. The various attributes of different types constitute a comprehensive utility value. It is generally believed that the greater the utility of a customer to select a particular type of vehicle, the greater the probability that such a vehicle will be selected. Customers choose a variety of types and make a choice after evaluating the utility based on various attributes. The customer’s choice behavior can be described by a utility function U i = v i + ξ i , where v i and ξ i are the average utility and the random utility error, respectively, corresponding to the customer’s selecting.
Assuming that the customer is rational, then each customer will choose the product that will maximize utility. The probability that the customer chooses car type i is
P i = P ( v i v h , h I , h i ) = P ( v i + ξ i v h + ξ h , h I , h i ) = P ( v i + ξ i max h I , h i ( v h + ξ h ) )
Assuming that ξ i are mutually independent and obey Gumbel distribution, then the probability variable deviations of the two independent Gumbel distributions are subject to the Gumbel distribution, which yields the general form of the multinomial logit model:
P i = e v i i I e v i
Since the RP survey is based on actual conditions, the survey respondents are customers who return cars to the rental sites. In the design of the questionnaire, the actual upgrade or downgrade has been fully reflected. Therefore, the corresponding utility when the customer’s rental car type is upgraded or downgraded is
U i , l = 0.292 c 3 + 0.292 c 5 + 0.083 c 6 + 0.125 c 7 + 0.083 c 8 + 0.125 c 9
If the customer did not lease type i and then rented the type h , the probability of upgrading or downgrading is
P i , h = e v i , h l = 0 l h I e v i , l
where P i , h is the preference probability of buyup or buydown with car type h when the customer’s choice of car type i is rejected, and P i , 0 is the leaving probability when car type i is rejected.
The improved multitype spill model (IMSM) needs to calculate the complete spillage of all constrained data first and then calculate the demand transferring from other car types which are constrained according to the customer preference probability. Excluding this part is the spillage; then, the sum of cumulative order quantity and spillage will be the unconstrained estimation.
C P ( i , j , t ) = { 0 , I ( i , j , t ) = 1 c f i , t ( x ) ( x c ) d x , I ( i , j , t ) = 0
Excluding the transferring is the spillage:
U ( i , j , t ) = C P ( i , j , t ) [ C P ( 1 , j , t ) P 1 , i + C P ( 2 , j , t ) P 2 , i + + C P ( I , j , t ) P I , i ]
Then, the unconstrained estimation is
I U ( i , j , t ) = I B ( i , j , t ) + U ( i , j , t )

3. Hybrid Forecasting

3.1. Selection of Forecast Method

For demand forecasting, there are three methods: quantitative analysis, qualitative analysis, and decision analysis. Quantitative analysis relies on a large amount of historical data and can be divided into time series forecasting, which uses time to organize data, and the causal analysis method, which uses relationships to organize data. The time series method is the most commonly used, in which the Holt–Winters model and moving average method are representative. Time series forecasting can predict future demand, but it cannot explain the reason. Causal analysis commonly uses regression analysis and simulation methods. The causal analysis method uses the causal relationship between data to look for changes and is generally used for macro prediction.
Qualitative analysis mainly relies on expert knowledge and experience to evaluate and does not involve quantitative analysis. Delphi and judgments fall into this category. Although this kind of method operates simply, the subjectivity is too strong and the effect is also not favorable. The decision analysis method combines quantitative and qualitative methods. At present, the market survey and randomness method are more commonly used, but the research is not very mature. The quantitative analysis method is commonly used to compare the above three methods.
The Holt–Winters model is a kind of time series forecasting model that avoids the deficiencies of the moving average method. It uses a cubic smoothing equation to make different data have different weights, and the predicted value is the weighted sum of the previous data sequence. Larry Weatherford et al. deemed the availability of the basic neural network more useful than traditional forecasting methods (moving averages, exponential smoothing, linear regression, etc.) by comparing the mean absolute percentage error [38].
The backpropagation (BP) neural network is a widely used nonlinear forecast method that can simulate the neural structure of the human brain and solve more complicated problems. The BP neural network can use its own nonlinear characteristics to simulate the development trend of the data, without requiring an assumption function. When the prediction accuracy is reached, the future demand can be predicted according to the learning situation. However, the BP neural network relies on the initial conditions and is prone to fall into the local optimal solution. Therefore, the single prediction method has inevitable defects, and the error accuracy may not reach the conditions for actual use. This paper intends to use the hybrid forecasting to combine weights of multiple forecast methods and improve the forecasting accuracy.

3.2. Holt–Winters Model

The Holt–Winters forecasting model is observed to outperform other techniques for the time series, having changing seasonality, mean, and growth rate [39]. It is an adaptive model that automatically recognizes changes in data patterns. For example, if the deviation is caused by internal interference, it can be considered that the new observation has the same influence as the original data, and thus gives the same weight to the data of different periods. If the deviation is caused by external interference, then new observations and the original data have different influences on the prediction results, and the new observations have a higher impact on the prediction events. In order to show that the value of the data in different periods has a different influence on the forecasting results, different weights can be given to different periods.
The demand for car rental is a nonstationary time series with seasonal and cyclical trends. The Holt–Winters model can perform very accurate forecasting of this regular time series data, especially for trends and seasonal changes. It can decompose linear time series, seasonal variations, and random variation time series and properly filter the impact of random fluctuations. The Holt–Winters model is an improvement and development of the moving average method. It does not need to store much historical data but also considers the importance of each period of data and uses all historical data.
First, smoothing equations can be obtained by iteration:
T i , t = α F i , t U i , t L + ( 1 α ) ( T i , t 1 + S i , t 1 ) S i , t = β ( T i , t T i , t 1 ) + ( 1 β ) S i , t 1 U i , t = γ F i , t T i , t + ( 1 γ ) U i , t L
where T i , t is the smoothed value, S i , t is the trend value, and F i , t is the seasonal index of car type i at t after iterations. L is the length of the seasonal period, and α , β , γ are smoothing coefficients.
Then, the predicted value at the future time point k is
F ^ i , t + k = ( T i , t 1 + k S i , t 1 ) U i , t L + k
where U i , t L + k is the seasonal index of car type i , which is one cycle ahead of t , k { 1 , 2 , , k L } .
The way to determine the three smoothing coefficients α , β , and γ is to minimize the error between the forecasted and actual values. In order to obtain more accurate and objective parameters, the traditional method is the residual square sum minimum method. Smoothing coefficients are all located in the interval (0, 1) and increase with 0.1 step length. The squares of the prediction residuals are calculated separately and summed until the smoothing coefficients corresponding to the sum of squares of the smallest residuals are found.

3.3. BP Neural Network

The neural network generally includes an input layer, an output layer, and a hidden layer. The input layer is located in the first, and no neurons are connected at the front end. The output layer and the hidden layer are all connected with neurons, and the weights have a one-to-one correspondence. The impact factors of output data include input, weight, threshold, and excitation function. Because of the large number of neurons, a large amount of information is stored, giving the neural network powerful data processing capabilities. The neural network has the advantages of a strong parallel processing ability, strong nonlinear processing ability, strong self-adaptation and learning ability, and strong associative memory and fault tolerance.
The BP neural network has a simple structure, strong plasticity, clear learning steps, and mathematical meaning. It has been proved that the BP neural network can simulate any complex nonlinear mapping by selecting three layers. The BP neural network is a feedforward type network that utilizes an algorithm of error back propagation. There are only feedforward associations between neurons, and no feedback, intralayer, or interlayer correlation. Linear sigmoid-type functions are generally used as excitation functions. Since the excitation function is measurable everywhere, for a BP network, the divided area is no longer a linear partition but an area composed of a nonlinear hyperplane, which is a relatively smooth surface. Compared with linear partitioning, this classification is more accurate and has greater fault tolerance. The learning method of the BP neural network is to strictly adopt the gradient descent method, so that the analytical formula of weight correction is also very clear.

3.3.1. Network Structure

A full connection is achieved between the upper and lower layers of the BP neural network, and there is no connection among each neuron. Studies have shown that when the output layer and input layer use a linear activation function and the hidden layer uses the sigmoid activation function, a BP neural network with a hidden layer can map all continuous functions. Therefore, when constructing a BP neural network model, only one hidden layer is generally used, as shown in Figure 3.

3.3.2. Learning Process

The learning process of the BP neural network consists of two parts: forward and reverse propagation. When the propagation is positive, the sample is delivered to the input layer and then processed by the hidden layer and passed to the output layer. If the value obtained by the output layer does not satisfy the expectation, then it enters the backpropagation link, which uses the error back to the input layer and continuously corrects the weight between layers during the transfer process. After repeated propagation, when the error is small enough to be acceptable, the learning process then stops (shown in Figure 4).
The specific learning steps are as follows:
Step 1
Program initialization. Select sigmoid as the activation function, then determine the minimum error, learning rate, and momentum coefficient.
Step 2
Calculate output. Input the initial weight and calculate the output values of the hidden layer and the output layer processing unit.
Step 3
Calculate the error value. When error is less than the given minimum error, go to Step 5; otherwise, go to Step 4.
Step 4
Backpropagation to adjust the weight between hidden layers; then reuse Step 2.
Step 5
Acquire the optimal output value and the weight of each layer and end the algorithm.

3.3.3. Sample Selection

After the model is established, the sample needs to be trained. In general, the training sample needs to meet four characteristics. (1) There is a close functional relationship between the input and object variables, and the object variable will change obviously due to the change of input variables. (2) Input variables are independent of each other. It is impossible to accurately calculate other components by using the components of input variables. (3) The data to be predicted has a certain commonality with the sample data. (4) Sample size should have a certain scale, so that the combination of all samples can reflect the mapping relationship between output variables and target variables.
The BP neural network applies the data-driven idea, that is, using a nonlinear characteristic to approximate a time series, and then using the clear logical relationship and historical data to express future values. Suppose there is a time series { X i } , where the historical data is X p , X p + 1 , , X p + q ; to forecast the future value at time p + q + r ( r > 0 ) , some kind of nonlinear function relationship between X p , X p + 1 , , X p + q and X p + q + r needs to be found. Forecasting methods generally include single-step predictions, multistep predictions, and rolling predictions. Among them, the rolling prediction can reach a certain training sample and reflects the relationship between the output sample and predicted data. In fact, the rolling prediction starts from the single-step prediction, then feeds the output value back to the import as part of the input for predicting future values (see Table 3).

3.4. Weight Determination

Hybrid forecasting is to assign several kinds of single prediction methods to different weights to form a comprehensive forecasting model. It can accurately and reasonably use the valuable information of a single forecasting model, better adapt to future changes, and reduce forecasting risk.
The key to hybrid forecasting is to properly determine the weight of various forecasting methods, and reasonable weights will improve the prediction accuracy greatly. Common weight determination methods include the arithmetic average method, variance reciprocal method, mean square reciprocal method, simple weighting method, and linear programming method. Among them, the arithmetic averaging method is suitable for treating each individual model equally if it is not known to each model. Usually, it is not optimal, and the sum of squared errors is large. The variance reciprocal method, mean square reciprocal method, and simple weighting method all have higher prediction accuracy than the arithmetic average method, but it is necessary to have a certain understanding of the prediction requirements in advance. All three methods above have one thing in common, that is, the variance is used to calculate the weight. The variance is the degree of fluctuation of the response variable above and below the mean. When the prediction result is not ideal and the error fluctuation is not large, the weight will still be large, which is unreasonable.
Therefore, this paper adopts the linear programming method, which determines the optimal weighting coefficient by taking the minimum absolute value of the combined prediction value as the objective function.

4. Case Study and Discussion

4.1. Data Collection

This paper collected the operating data of the Huilongwan site of China Auto Rental Inc. in Chongqing for 4 consecutive weeks (28 days). Each set of data includes the date of pick up, vehicle brand, car rental price, and pick-up location. According to the previous method, the model is divided into three grades according to the rental price and is set as a “presale lead time interval” every week. The survey data is sorted out to obtain three “presale lead time intervals” for each price grade model. The observable order quantity q and the presale opening status of each grade model are shown in Table 4.

4.2. Applying IMSM

According to the rental price, the car type is classified as 1 under $40 per day, 2 within $40–70 per day, and 3 above $70 per day. Due to cost and time constraints, a random sampling method was used to conduct a questionnaire survey at some rental sites of China Auto Rental Inc. in Chongqing (see Figure 5).
The sample size of this RP survey was 400, and 363 questionnaires were collected, of which 320 were valid and 220 of which included buyup or buydown. Discretize the survey data, and let “very satisfied = 10”, “satisfied = 8”, “general = 5”, “not very satisfied = 3”, and “not satisfied = 0”. The influencing factor score, utilities, and possibilities corresponding to various choice behaviors can be obtained (Table 5).
Based on a week’s data from the International Convention & Exhibition Center, the biggest site of China Auto Rental Inc. in the Nan‘an district, EM, PD, MSM, and IMSM were used to repair the car rental demand, and the unconstrained repair results were compared with the real needs. The convergence conditions were θ = 0.1 and τ = 0.6 , set in EM and PD (see Table 6).
According to the results listed in the above table, when demand repair is not carried out, the average gap between constrained demand and actual demand is large, which is 10.1%. After various demand estimation methods are adopted, the average error of the obtained demand data is small, among which the average error of IMSM is the smallest, 3.06%.

4.3. Demand Estimation

Combined with the survey data, the abovementioned various algorithms are used to estimate the unconstrained demand of multiple car types. For EM and PD, let θ = 0.1 and τ = 0.6 . In the PD algorithm, the difference of parameter τ affects the result of demand estimation and maximum iteration. The smaller τ is, the larger the iteration time is, and the higher the estimation of unconstrained demand becomes. When τ 0.35 , the estimation result of PD starts to be larger than EM. τ = 0.6 is selected here. For MSM and IMSM, the specific values of the parameters μ and σ are listed in Table 7 for different vehicle types and cycles.

4.4. Demand Forecast

4.4.1. Holt–Winters Model Forecast

When making predictions, the first step is to calibrate the model. Due to the large amount of data, only the fitting process of car type 1 will be described in detail. Considering the characteristics of the Holt–Winters model, the initial value of cubic exponential smoothing is selected from the first value I B ( 1 , j , 1 ) . Therefore, these three sets of data are eliminated, and the remaining 28 sets of data are fitted. According to the minimum sum of squared residuals, the coefficients of α , β , and γ are determined to be 0.3, 0.9, and 0.1, respectively. The two curves obtained coincide with each other, which indicates that the exponential smoothing effect is good. The forecasting model is generally considered to be available when the error is within 10% (see Figure 6). The demands of different car types can be acquired. The specific prediction results are shown in Table 8.

4.4.2. BP Neural Network Forecast

The BP neural network used in this paper is divided into three layers: the first layer is the input layer, with input variables of i , j , and t . The second layer is a hidden layer. In this paper, the same sample is used to train the models of different neuron numbers. According to the test error, it is reasonable to determine 17 hidden neurons. The hidden layer activation function uses the sigmoid function. The third layer is the output layer, with the demand as the output element, a total of one neuron, and the activation function uses a linear activation function.
After the BP neural network model is established, samples need to be trained. In general, the training sample needs to meet four characteristics: (1) there is a close functional relationship between the input and object variable, and the latter will change significantly due to the change of the former; (2) input variables are independent of each other, and other components cannot be accurately calculated using the components of the input variables; (3) the data to be predicted has a certain commonality with the sample data; (4) the sample capacity should have a certain scale, so that all the samples can reflect the mapping relationship between output variables and target variables.
In order to train and test the neural network model more reasonably, the unconstrained historical data repaired by EM is divided into 42 groups, among which seven groups of prediction samples and 35 groups of training samples are randomly selected by scrolling to select data. The training samples are used to train the neural network model, while the prediction samples are only used to test the feasibility and reliability of the forecasting model. The specific grouping situation is shown in Table 9.
Using prediction samples and training samples, the calculated values of forecasting models are checked separately. The neural network forecasting model has a high degree of fitness for the training samples, and the average error is 0.066 (see Figure 7). The model is reasonable and can be applied to later predictions. Demands of different car types can be obtained, which are shown in Table 10.

4.4.3. Hybrid Model

The combination weight coefficients are selected by a linear programming method. When the weight is 0.7 for the Holt–Winters model and 0.3 for the BP neural network, the sum of the squared errors is the smallest. Using the hybrid forecasting model, the demands of different car types can be obtained, as shown in Table 11. Hybrid forecasting can neutralize both models, making the predictions more accurate, increasing the smoothness, and forecasting accuracy, so it is presented below.

4.5. Discussion

As described previously, the demand data was estimated by four unconstrained estimation methods, and the three forecast models were used to predict the demand in the fourth presale lead time interval; then, 12 sets of demand data were obtained. In contrast to the real demand in the fourth presale lead time interval (see Table 12), the relative errors of the various results can be calculated (see Table 13).
From the real demand and the prediction results (as shown in Table 12 and Table 13), the following can be seen.
For different car types, from the perspective of the predicted effect, comparing the Holt–Winters model with the BP neural network model, the former has a greater volatility, while the latter is more stable. From the perspective of the adaptability of the unconstrained estimation model, when the unconstrained estimation data obtained by MSM is used, the prediction results of the three methods are relatively stable, and the hybrid forecasting model can obtain the best results. The relative errors of the three models are respectively: 1.45%, 8.66%, and 5.00% (see data in the box of Table 13).
For the forecasted total, the Holt–Winters model is better than the BP neural network model in predicting the effect. From the perspective of the adaptability of the unconstrained estimation model, the Holt–Winters model has the best prediction effect when using MSM to repair the data, while the three methods can all obtain better prediction results when using IMSM, among which hybrid model has the smallest error.
Considered comprehensively, when using MSM for unconstraining repair data, the relative error of the Holt–Winters model for predicting total demand is 3.06%. However, looking at the corresponding subtype prediction results, we find that the relative error of the third type is 32.5%. When IMSM is used for unconstrained repair, the relative error of the total demand using the hybrid model is 4.76%. The relative errors of the predicted results for the three models are 1.45%, 8.66%, and 5.00%. Therefore, by using the IMSM to estimate historical constraint requirements, the hybrid forecasting is optimal for future demand prediction.

5. Conclusions

To fully use the information of historical car rental data, a two-stage joint approach was proposed for predicting multitype car rental demand. The method considers the customer’s choice behavior to improve the spill model, which can estimate the unconstrained demand effectively and uses the hybrid forecasting model to predict future short-term demand.
For historical car rental demand data, the repaired data can more accurately reflect customer demand. In order to prove the effectiveness of the method, EM, PD, and MSM were compared in the unconstrained demand estimation stage, and the Holt–Winters, BP neural network, and hybrid models were used for comparison in the future demand prediction stage. For different car types, different models can form different prediction results in two stages. The comprehensive calculation results show that the proposed method is superior to other combinations in terms of stability and effectiveness of prediction.
Specifically, using the case study, for the repair of the demand data of each car type, IMSM has obvious advantages compared with the other three methods, and the average error of the example test is 3.06%. For the prediction of future demand of each car type, based on the calculation results of different unconstrained estimation models, the performances of the Holt–Winters model, the BP neural network model, and the hybrid model are quite different. The hybrid forecasting model has the best effect, and the relative error of the predicted results is within the acceptable range for different car types or overall needs. Therefore, the case study shows that the proposed method outperforms other methods. It can be regarded as a new way to help car rental companies to predict customer demand. There are many factors affecting customers’ car rental behavior, and with the application of mobile apps, the way of car reservation is changing. In the future, more influencing factors, such as electric vehicles, urban traffic restriction policies, and road traffic conditions, should be taken into account. In addition, the different distribution characteristics of car rental demand and the combined optimization of forecasting methods will also be interesting research topics.

Author Contributions

Conceptualization, Y.Y. and A.C.; methodology, Y.Y., A.C. and W.Z.; software, W.Z.; validation, W.Z. and H.T.; formal analysis, Y.Y. and A.C.; investigation, W.Z. and H.T.; writing—original draft preparation, Y.Y. and W.Z.; writing—review and editing, Y.Y. and A.C.; visualization, H.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Humanities and Social Sciences Foundation of Ministry of Education of China (Grant Nos. 17YJCZH220, 17YJA630079).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Applsci 11 04506 i001

Appendix B

Applsci 11 04506 i002

References

  1. Markets, R.A. Global Car Rental Market Size, Share & Trends Analysis, 2014–2019 & 2025: Market Size is Expected to Reach USD 125.4 Billion. Available online: https://www.prnewswire.com/news-releases/global-car-rental-market-size-share--trends-analysis-2014-2019--2025-market-size-is-expected-to-reach-usd-125-4-billion-300908404.html (accessed on 1 January 2021).
  2. McGill, J.I.; Van Ryzin, G.J. Revenue management: Research overview and prospects. Transp. Sci. 1999, 33, 233–256. [Google Scholar] [CrossRef]
  3. Pölt, S. Forecasting Is Difficult–Especially if It Refers to the Future. In Proceedings of the AGIFORS Reservations and Yield Management Study Group, Melbourne, Australia, May 1998; pp. 61–91. [Google Scholar]
  4. Weatherford, L.; Belobaba, P. Revenue impacts of fare input and demand forecast accuracy in airline yield management. J. Oper. Res. Soc. 2002, 53, 811–821. [Google Scholar] [CrossRef]
  5. Zaki, H. Forecasting for airline revenue management. J. Bus. Forecast. Methods Syst. 2000, 19, 2–6. [Google Scholar]
  6. Queenan, C.C.; Ferguson, M.; Higbie, J.; Kapoor, R. A comparison of unconstraining methods to improve revenue management systems. Product. Oper. Manag. 2007, 16, 729–746. [Google Scholar] [CrossRef] [Green Version]
  7. Sharif Azadeh, S.; Marcotte, P.; Savard, G. A taxonomy of demand uncensoring methods in revenue management. J. Revenue Pricing Manag. 2014, 13, 440–456. [Google Scholar] [CrossRef]
  8. Vulcano, G.; van Ryzin, G.; Ratliff, R. Estimating primary demand for substitutable products from sales transaction data. Oper. Res. 2012, 60, 313–334. [Google Scholar] [CrossRef] [Green Version]
  9. Weatherford, L.R. Intelligent aggressiveness: Using forecast multipliers, hybrid forecasting, fare adjustment, and unconstraining methods to increase revenue. Decis. Sci. 2017, 48, 391–419. [Google Scholar] [CrossRef]
  10. Haensel, A.; Koole, G.; Erdman, J. Estimating unconstrained customer choice set demand: A case study on airline reservation data. J. Choice Model. 2011, 4, 75–87. [Google Scholar] [CrossRef] [Green Version]
  11. Sharif Azadeh, S.; Marcotte, P.; Savard, G. A non-parametric approach to demand forecasting in revenue management. Comput. Oper. Res. 2015, 63, 23–31. [Google Scholar] [CrossRef]
  12. Wickham, R.R. Evaluation of Forecasting Techniques for Short-Term Demand of Air Transportation; Massachusetts Institute of Technology: Cambridge, MA, USA, 1995. [Google Scholar]
  13. Viswanathan, V. Demand forecasting. In Proceedings of the AGIFORS Reservations and Yield Management Study Group, London, UK, 1999. [Google Scholar]
  14. Campbell, G.; Williams, L. O&D demand forecasting: Dealing with real-world complexities. In Proceedings of the AGIFORS Reservations and Yield Management Study Group, Bangkok, Thailand, 1992. [Google Scholar]
  15. Fiig, T. Odyssey: SAS’ O&D forecasting system. In Proceedings of the AGIFORS Reservations and Yield Management Study Group, Jeju, Korea, 14–17 May 2007. [Google Scholar]
  16. Oancea, O.; Bala, P.K. Measuring and monitoring granular forecast performance. J. Revenue Pricing Manag. 2013, 12, 551–564. [Google Scholar] [CrossRef]
  17. Guo, P.; Xiao, B.; Li, J. Unconstraining methods in revenue management systems: Research overview and prospects. Adv. Oper. Res. 2012, 2012, 1–23. [Google Scholar] [CrossRef]
  18. Weatherford, L. The history of unconstraining models in revenue management. J. Revenue Pricing Manag. 2016, 15, 222–228. [Google Scholar] [CrossRef]
  19. Weatherford, L. The history of forecasting models in revenue management. J. Revenue Pricing Manag. 2016, 15, 212–221. [Google Scholar] [CrossRef]
  20. Aslani, S.; Sibdari, S.; Modarres, M. Revenue management with customers’ reference price: Are the existing methods effective? Serv. Sci. 2018, 10, 195–214. [Google Scholar] [CrossRef]
  21. Price, I.; Fowkes, J.; Hopman, D. Gaussian processes for unconstraining demand. Eur. J. Oper. Res. 2019, 275, 621–634. [Google Scholar] [CrossRef] [Green Version]
  22. Fiig, T.; Weatherford, L.R.; Wittman, M.D. Can demand forecast accuracy be linked to airline revenue? J. Revenue Pricing Manag. 2019, 18, 291–305. [Google Scholar] [CrossRef]
  23. Oliveira, B.B.; Carravilla, M.A.; Oliveira, J.F. Fleet and revenue management in car rental companies: A literature review and an integrated conceptual framework. Omega 2017, 71, 11–26. [Google Scholar] [CrossRef] [Green Version]
  24. Carroll, W.J.; Grimes, R.C. Evolutionary change in product management: Experiences in the car rental industry. Interfaces 1995, 25, 84–104. [Google Scholar] [CrossRef] [Green Version]
  25. Geraghty, M.K.; Johnson, E. Revenue management saves national car rental. Interfaces 1997, 27, 107–127. [Google Scholar] [CrossRef] [Green Version]
  26. Zhu, J. Using turndowns to estimate the latent demand in a car rental unconstrained demand forecast. J. Revenue Pricing Manag. 2006, 4, 344–353. [Google Scholar] [CrossRef]
  27. Gordon, R.F. Why It Is Difficult to Apply Revenue Management Techniques to the Car Rental Business and What Can Be Done About It. In Proceedings of the Northeast Business & Economics Association, New York, NY, USA, 5–7 November 2015. [Google Scholar]
  28. Patel, D.; Patel, A.; Patel, D. Optimization of A Car Rental Fleet to Fulfill the Demand. Int. J. Trend Res. Dev. 2018, 5, 508–512. [Google Scholar]
  29. Kourentzes, N.; Li, D.; Strauss, A.K. Unconstraining methods for revenue management systems under small demand. J. Revenue Pricing Manag. 2019, 18, 27–41. [Google Scholar] [CrossRef] [Green Version]
  30. Cooper, W.L.; Homem-De-Mello, T.; Kleywegt, A.J. Models of the spiral-down effect in revenue management. Oper. Res. 2006, 54, 968–987. [Google Scholar] [CrossRef] [Green Version]
  31. McLachlan, G.J.; Krishnan, T. The EM Algorithm and Extensions. In Wiley Series in Probability and Statistics; John Wiley & Sons: New York, NY, USA, 1997. [Google Scholar]
  32. Hopperstad, C.A. Passenger O/D simulator (PODS) update. In Proceedings of the AGIFORS Reservations and Yield Management Study, Zurich, Switzerland, 1996; pp. 365–372. [Google Scholar]
  33. Hopperstad, C.A. PODS: Modeling update. In Proceedings of the AGIFORS Reservations and Yield Management Study, Montreal, QC, Canada, 1997. [Google Scholar]
  34. Weatherford, L.R.; Pölt, S. Better unconstraining of airline demand data in revenue management systems for improved forecast accuracy and greater revenues. J. Revenue Pricing Manag. 2002, 1, 234–254. [Google Scholar] [CrossRef]
  35. Peng, G.; Baichun, X.; Jun, L. Multi-class unconstraining estimation models based on multi-distribution of demand in revenue management systems. J. Syst. Sci. Math. Sci. Chin. Ser. 2013, 33, 1164–1177. [Google Scholar]
  36. Li, M.Z.; Oum, T.H. Airline spill analysis—Beyond the normal demand. Eur. J. Oper. Res. 2000, 125, 205–215. [Google Scholar] [CrossRef]
  37. Ziarko, W. Variable precision rough set model. J. Comput. Syst. Sci. 1993, 46, 39–59. [Google Scholar] [CrossRef] [Green Version]
  38. Weatherford, L.R.; Gentry, T.W.; Wilamowski, B. Neural network forecasting for airlines: A comparative analysis. J. Revenue Pricing Manag. 2003, 1, 319–331. [Google Scholar] [CrossRef]
  39. Kays, H.; Karim, A.; Daud, M.R.C.; Varela, M.L.; Putnik, G.D.; Machado, J.M. A Collaborative Multiplicative Holt-Winters Forecasting Approach with Dynamic Fuzzy-Level Component. Appl. Sci. 2018, 8, 530. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Booking curves for given streams of arrivals.
Figure 1. Booking curves for given streams of arrivals.
Applsci 11 04506 g001
Figure 2. Estimation process for constrained observations using PD.
Figure 2. Estimation process for constrained observations using PD.
Applsci 11 04506 g002
Figure 3. BP neural network topology.
Figure 3. BP neural network topology.
Applsci 11 04506 g003
Figure 4. Flow chart of learning process.
Figure 4. Flow chart of learning process.
Applsci 11 04506 g004
Figure 5. Investigation sites of China Auto Rental Inc. in Chongqing.
Figure 5. Investigation sites of China Auto Rental Inc. in Chongqing.
Applsci 11 04506 g005
Figure 6. Fitting effect and error analysis of Holt–Winters model.
Figure 6. Fitting effect and error analysis of Holt–Winters model.
Applsci 11 04506 g006
Figure 7. Fitting effect of BP neural network.
Figure 7. Fitting effect of BP neural network.
Applsci 11 04506 g007
Table 1. Factors affecting customer car rental choice behavior.
Table 1. Factors affecting customer car rental choice behavior.
FactorsSubjective FactorsObjective FactorsRandom Factors
Company brand c 1
Car rental type c 2
Rental price c 3
Special offer c 4
Service attitude c 5
Returning location c 6
Vehicle status c 7
Procedure convenience c 8
Accident handling c 9
Table 2. Customer selection decision information.
Table 2. Customer selection decision information.
ObjectCondition AttributeDecision Attribute
c 1 c 2 c 3 c 4 c 5 c 6 c 7 c 8 c 9
u 1 321212212B
u 2 311111122G
u 3 212222211R
u 4 111112111B
u 5 122322313G
u 6 233333333G
u 7 132321111R
u 8 311111122R
u 9 321212212B
u 10 233333333G
u 11 221112112R
u 12 311111122R
u 13 323332313G
u 14 132321111R
u 15 311111122B
u 16 321212212B
u 17 111112111B
u 18 311111122G
u 19 321212212B
u 20 212222211R
u 21 132321111R
u 22 111112111B
u 23 212222211R
u 24 233333333G
u 25 111112111B
u 26 233333333G
u 27 132321111R
u 28 212222211R
u 29 311111122B
u 30 321212212B
Table 3. Process of rolling prediction.
Table 3. Process of rolling prediction.
StepInputOutput (Prediction Result)
1 X p , X p + 1 , , X p + q X p + q + 1
2 X p + 1 , X p + 2 , , X p + q + 1 X p + q + 2
r X p + q 1 , X p + q , , X p + q + r 1 X p + q + r
Table 4. Car rental order data.
Table 4. Car rental order data.
Variable Δ 1 Δ 2 Δ 3
1 2 3 4 5 6 7 1 2 3 4 5 6 7 1 2 3 4 5 6 7
I B ( 1 , j , t ) 115713394635391251071410141214
I ( 1 , j , t ) 101101010101111100011
I B ( 2 , j , t ) 5984946337518211998788
I ( 2 , j , t ) 011010110110111010110
I B ( 3 , j , t ) 314231131235324221223
I ( 3 , j , t ) 101110110111111001101
Table 5. Influencing factor scores, utilities, and possibilities.
Table 5. Influencing factor scores, utilities, and possibilities.
Car Type ChoiceCondition AttributeUtilityPossibility
c 3 c 5 c 6 c 7 c 8 c 9
1-07.146.646.746.336.426.716.746040.6156
1-26.356.185.226.355.585.266.006140.2938
1-34.344.974.865.455.424.614.829260.0905
2-05.646.356.335.485.776.135.956630.3547
2-15.896.876.245.876.155.296.149290.4301
2-35.116.056.114.935.255.115.456600.2152
3-06.926.856.836.326.546.776.766800.5558
3-15.894.465.665.896.126.035.489900.1550
3-26.366.156.295.366.026.156.113400.2892
Table 6. Repair results and errors analysis of 1 week.
Table 6. Repair results and errors analysis of 1 week.
VariableEstimation Method1234567
I B ( 1 , j , t ) 45465351434947
T U ( 1 , j , t ) 49465351474949
I ( 1 , j , t ) 0111010
EM51.76 (5.63%) 51.59 (9.77%) 52.16 (6.45%)
PD50.11 (2.27%) 49.84 (6.04%) 50.7 (3.47%)
MSM49.82 (1.67%) 49.76 (5.87%) 50.04 (2.12%)
IMSM49.56 (1.14%) 47.68 (1.45%) 50.04 (2.12%)
I B ( 2 , j , t ) 24242423202026
T U ( 2 , j , t ) 24242423222026
I ( 2 , j , t ) 1111011
EM 24.38 (10.82%)
PD 23.5 (6.82%)
MSM 23.27 (5.77%)
IMSM 20.5 (6.82%)
I B ( 3 , j , t ) 12141212101515
T U ( 3 , j , t ) 13141212131515
I ( 3 , j , t ) 0111011
EM14.88 (14.46%) 14.64 (12.62%)
PD14 (10.82%) 13.63 (14.45%)
MSM12.85 (1.15%) 13.56 (4.31%)
IMSM13.12 (0.92%) 12.23 (5.92%)
Numbers in brackets are the errors of the estimated value relative to the real demands.
Table 7. Estimation results comparison of EM, PD, MSM, and IMSM.
Table 7. Estimation results comparison of EM, PD, MSM, and IMSM.
MethodVariable Δ 1 Δ 2 Δ 3
123456712345671234567
I B ( 1 , j , t ) 115713394635391251071410141214
EM I U ( 1 , j , t ) 11127131291261051091251071815181214
PD I U ( 1 , j , t ) 111071310911685891251071613161214
MSM μ = 10 , σ = 2.85 μ = 7.4 , σ = 3.05 μ = 10.75 , σ = 2.99
S U ( 1 , j , t ) 050070605050000012100
U ( 1 , j , t ) 050070605050000012100
I U ( 1 , j , t ) 111071310910685891251071512151214
IMSM C P ( 1 , j , t ) 050070605050000012100
U ( 1 , j , t ) 050070604050000011100
I U ( 1 , j , t ) 111071310910675891251071511151214
I B ( 2 , j , t ) 5984946337518211998788
EM I U ( 2 , j , t ) 1198119116387578211129117811
PD I U ( 2 , j , t ) 9988986367568211119107810
μ = 8 , σ = 1.41 μ = 5 , σ = 2.55 μ = 8.75 , σ = 1.71
MSM S U ( 2 , j , t ) 350474606055000112101
U ( 2 , j , t ) 300404002005000101001
I U ( 2 , j , t ) 89889863575682111099789
IMSM C P ( 2 , j , t ) 300404003005000102020
U ( 2 , j , t ) 300404001005000101010
I U ( 2 , j , t ) 89889863475682111099797
I B ( 3 , j , t ) 314231131235324221223
EM I U ( 3 , j , t ) 344234135235324441243
PD I U ( 3 , j , t ) 334233134235324441243
μ = 2.6 , σ = 1.14 μ = 3 , σ = 1.1 μ = 2.5 , σ = 1.29
MSM S U ( 3 , j , t ) 370476608054000212111
U ( 3 , j , t ) 020002002000000110010
I U ( 3 , j , t ) 334233133235324331233
IMSM C P ( 3 , j , t ) 020002003000000110010
U ( 3 , j , t ) 020001002000000110010
I U ( 3 , j , t ) 334232133235234331233
Table 8. Holt–Winters forecast results based on different estimation methods.
Table 8. Holt–Winters forecast results based on different estimation methods.
MethodCar Type Δ 4 SubtotalTotal
1 2 3 4 5 6 7
EM1712101411111479175
278878181571
3361355225
PD111310108131267168
21623121291569
3162365932
MSM11225711131363155
2119971081165
3535324527
IMSM1115891014360133
210998125154
3412136219
Table 9. Prediction samples and training samples.
Table 9. Prediction samples and training samples.
No.Car TypeInput SampleOut Sample
1113.0011.499.0011.496.009.745.009.74
2112.005.0010.007.0017.1315.0417.1312.00
329.008.0010.319.0010.306.003.007.38
423.007.387.005.006.968.002.0011.00
528.002.0011.0011.069.0010.617.008.00
622.0011.0011.069.0010.617.008.0010.61
733.004.262.003.005.003.002.004.00
8111.0011.577.0013.0011.499.0011.496.00
9111.577.0013.0011.499.0011.496.009.74
1017.0013.0011.499.0011.496.009.745.00
11111.499.0011.496.009.745.009.749.00
1219.0011.496.009.745.009.749.0012.00
13111.496.009.745.009.749.0012.005.00
1416.009.745.009.749.0012.005.0010.00
1519.745.009.749.0012.005.0010.007.00
1615.009.749.0012.005.0010.007.0017.13
1719.749.0012.005.0010.007.0017.1315.04
1819.0012.005.0010.007.0017.1315.0417.13
1915.0010.007.0017.1315.0417.1312.0014.00
20210.389.008.0010.319.0010.306.003.00
2128.0010.319.0010.306.003.007.387.00
22210.319.0010.306.003.007.387.005.00
2329.0010.306.003.007.387.005.006.96
24210.306.003.007.387.005.006.968.00
2526.003.007.387.005.006.968.002.00
2627.387.005.006.968.002.0011.0011.06
2727.005.006.968.002.0011.0011.069.00
2825.006.968.002.0011.0011.069.0010.61
2926.968.002.0011.0011.069.0010.617.00
3033.003.484.002.003.003.481.003.00
3133.484.002.003.003.481.003.004.26
3234.002.003.003.481.003.004.262.00
3332.003.003.481.003.004.262.003.00
3433.003.481.003.004.262.003.005.00
3533.481.003.004.262.003.005.003.00
3631.003.004.262.003.005.003.002.00
3734.262.003.005.003.002.004.003.89
3832.003.005.003.002.004.003.893.89
3933.005.003.002.004.003.893.891.00
4035.003.002.004.003.893.891.002.00
4133.002.004.003.893.891.002.003.89
4232.004.003.893.891.002.003.893.00
Table 10. BP neural network forecast results based on different estimation methods.
Table 10. BP neural network forecast results based on different estimation methods.
MethodCar Type Δ 4 SubtotalTotal
1 2 3 4 5 6 7
EM112141098161685189
212810910131375
3534325729
PD1131310108131784184
21112897141273
3532335627
MSM112131298131582174
21111695131368
3532234524
IMSM11112897111674163
21012685121467
3442133522
Table 11. Hybrid forecasting results based on different estimation methods.
Table 11. Hybrid forecasting results based on different estimation methods.
MethodCar Type Δ 4 SubtotalTotal
1 2 3 4 5 6 7
EM1101310119141582181
21011988151273
3442335526
PD112910108131577176
21386109121371
3342345728
MSM1129989131474165
21110787111266
3533234525
IMSM1111089812122154
21011787101063
3432134421
Table 12. Real demand in the fourth presale lead time interval.
Table 12. Real demand in the fourth presale lead time interval.
Car Type SubtotalTotal
1234567
11181198101269147
29778791158
3322234420
Table 13. Relative errors of forecast demand for each car type and all car types.
Table 13. Relative errors of forecast demand for each car type and all car types.
Forecast ModelCar TypeUnconstrained Estimation Method
EMPDMSMIMSM
Holt–Winters112.31%−3.62%−10.10%−12.10
220.69%17.24%8.62%7.47%
36.00%47.45%32.50%−6.67%
BP Neural Network123.19%21.74%18.84%7.27%
229.31%25.86%17.24%15.52%
336.00%35.00%20.00%10.00%
Hybrid Model118.84%11.59%7.25%1.45%
225.86%22.41%13.79%8.66%
324.00%40.00%25.00%5.00%
Holt–Winters
BP Neural Network
Hybrid Model
All types14.96%11.56%3.06%9.50%
28.57%25.17%18.37%10.89%
23.12%19.72%12.24%4.76%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yang, Y.; Ceder, A.; Zhang, W.; Tang, H. Unconstrained Estimation of Multitype Car Rental Demand. Appl. Sci. 2021, 11, 4506. https://doi.org/10.3390/app11104506

AMA Style

Yang Y, Ceder A, Zhang W, Tang H. Unconstrained Estimation of Multitype Car Rental Demand. Applied Sciences. 2021; 11(10):4506. https://doi.org/10.3390/app11104506

Chicago/Turabian Style

Yang, Yazao, Avishai (Avi) Ceder, Weiyong Zhang, and Haodong Tang. 2021. "Unconstrained Estimation of Multitype Car Rental Demand" Applied Sciences 11, no. 10: 4506. https://doi.org/10.3390/app11104506

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop