1. Introduction
Mobile grocery stores, also known as grocery trucks (GT) or pop-up grocery stores, have emerged as a new trend in the retail industry. GTs typically stock a wide variety of fresh fruits and vegetables, as well as other grocery items. They are usually equipped with refrigeration units to keep the food fresh and have a payment system in place for customers to buy the products.
These mobile stores bring convenience and accessibility to customers in under-served or rural areas, or in areas where traditional brick-and-mortar grocery stores are not present [
1,
2]. The trend of grocery trucks is expected to continue to grow in the coming years as more and more people look for ways to improve access to healthy food in their communities. In the U.S., the mobile food vendors market is valued at 1.16 billion dollars in 2021 and is predicted to grow at a rate of 6.4% annually from 2022 to 2030, driven by the increasing trend of culinary arts and the preference of young people for different dining experiences over the traditional dining in restaurants [
3].
Another application of GTs is to provide food to people in emergency situations, such as natural disasters or power outages [
4,
5], as well as address food insecurity by increasing access to healthy food options. GTs are valuable resources for alleviating food insecurity, and these trucks are often operated by non-profit organizations, local governments, and community groups that want to address food insecurity and promote healthy eating [
6].
Logistics for mobile grocery stores refer to the process of planning, coordinating, and controlling the movement and storage of goods, services and information from the point of origin to the point of consumption. This includes transportation, inventory management, warehousing, and distribution of products. In the case of mobile grocery stores, logistics also includes the planning and coordination of the routes and schedule of the mobile store, as well as the management of the supply chain and inventory. This includes sourcing products from suppliers, managing inventory levels and restocking the mobile store as needed, and coordinating delivery schedules [
7,
8]. Effective logistics management is crucial for the success of mobile grocery stores, as it can impact delivery times, cost, and overall customer satisfaction. By optimizing logistics, mobile grocery store owners and operators can improve their business by reducing costs and increasing efficiency, ultimately resulting in a better customer service [
9,
10].
The goal of this research is to address the challenges in the logistics management of mobile grocery stores by proposing a mixed-integer linear formulation for a Dynamic Modified Stochastic p-Median Problem (DMS-p-MP) in a supply chain. The aim is to enhance the robustness of the solution by considering the uncertainties and variabilities of the parameters in this problem. Additionally, this study aims to address the lack of consideration of the dynamic aspects in the existing literature of supply chain network design by incorporating the time-varying nature of the problem. The proposed mixed-integer linear programming (MILP) model will not only provide a robust solution to the mobile grocery store problem but also offer a framework that can be applied to other similar supply chain network design problems. In this regard, we first provide a comprehensive literature review of the proposed location-allocation models in the literature in
Section 2 and then provide a mathematical formulation for our model considering the uncertainty of demand in
Section 3.
Section 4 and
Section 5 show the application of our proposed model in the real-world mobile grocery location problem and provide managerial insights, respectively.
2. Literature Review
Supply chain management is the coordination and management of activities involved in the production and delivery of goods and services from the raw material stage to the end customer [
11,
12]. It involves the planning, design, execution, monitoring, and control of all processes involved in the flow of goods and services from the initial sourcing of raw materials to the delivery of the finished product to the end customer [
13]. The principles of supply chain management can be applied in various industries, including healthcare [
14,
15], retail, manufacturing, and transportation, to optimize the flow of goods, information, and resources, thereby improving efficiency and reducing costs [
16]. Location allocation is a critical aspect of supply chain management, particularly in distribution and service industries. The objective of location allocation is to determine the optimal number and placement of facilities to serve customers while minimizing costs. In this section, we discuss some of the most well-known location-allocation problems and characteristics of different models in the literature.
The maximal covering location problem (MCLP) is highly regarded as one of the most useful facility location models from both a theoretical and practical standpoint [
17,
18]. The goal of MCLP is to identify a set of facilities that maximizes the total weight of covered customers [
19], who are considered covered if they are located within a specified distance from their nearest facility [
20]. This problem was first introduced by [
21], in which the search for an optimal solution was limited to nodes. The study by [
22] later expanded the search to a dominant set of points and solved the problem through linear programming if the solution was an integer and through branch and bound if it was not.
The p-center problem is a well-studied optimization problem in the field of operations research and location allocation. It involves finding the optimal location of p facilities to serve a set of demand points in such a way as to minimize the maximum distance between a demand point and its nearest facility. A comprehensive survey of the p-center problem and its variants was conducted by [
23]. The authors provide an overview of the different formulations and solution methods for the p-center problem and discuss extensions to the problem such as the p-median and p-center with capacities. In recent years, there have been several advances in the solution of the p-center problem using approximation algorithms and metaheuristics. A paper by [
24] presents a memetic algorithm for the p-center problem and demonstrates its effectiveness through a set of computational experiments.
The p-median problem is another well-known location-allocation problem that involves determining the optimal number and location of facilities to minimize the total access cost [
25]. It is a popular choice for various applications, including the allocation of mobile grocery stores. This problem is particularly well-suited for this application due to its focus on minimizing the access cost, which is crucial in determining the most cost-effective locations for the stores. By optimizing the number and location of stores, the p-median problem can help ensure that the stores are located in the areas where they can provide the most accessible and convenient service to customers while minimizing the access costs. This can result in increased efficiency and cost savings, making it a good choice for the allocation of mobile grocery stores.
The p-median problem is formulated as an integer programming model and can be solved using various algorithms, such as the primal-dual method or branch-and-bound algorithm. The work of Berge [
26] initially explored the challenge of finding the minimum coverage in location problems. Later, Miehle et al. [
27] introduced the well-known p-median problem, in which the objective is to select the optimal p facilities to open with the goal of minimizing the total transportation cost. Subsequently, Hakimi’s node optimality theorem [
28,
29] was developed to show that, in a continuous graph, the optimal solution for absolute median problems always lies on the graph’s vertices. This implies that many network problems can be transformed into their discrete versions without loss of optimality.
In the quest to tackle increasingly complex and realistic optimization problems, mixed-integer linear programming (MILP) emerged as a widely used method in the location optimization domain [
30,
31]. Early works such as [
32,
33] utilized MILP to solve various location problems including the uncapacitated facility location problem, the discrete p-median problem, and the covering-location problem. The deterministic model assumes that all parameters are known with certainty, but real-life scenarios always involve unpredictable elements. To address this issue, researchers have been exploring the combination of location models with stochastic programming methods. For example, Refs. [
34,
35] incorporate stochastic variables, such as customer demand, the cost of serving a customer from a specific location, and the cost of opening a facility, into the uncapacitated facility location and p-median problems. Ref. [
36] considers the impact of stochastic customer demand on the capacitated facility location problem. Ref. [
37] studies the uncertainty surrounding the number of facilities to open. Ref. [
38] proposes a method to increase the number of facilities based on customer demand, while Ref. [
39] considers the opposite scenario of closing existing facilities. Ref. [
40] explores the impact of demand uncertainty on covering-type location problems. Zaferanieh et al. discuss a strategy to address traffic congestion caused by the high demand of clients in a bi-level p-facility network [
41].
Machine learning has demonstrated its potential for solving a range of challenges across various industries by leveraging the power of large datasets to develop optimization and predictive models [
42,
43]. Wang et al. [
44] introduce a novel solution to the uncapacitated p-median problem that utilizes reinforcement learning. The proposed method, which uses Multi-Talking-Heads Graph Attention Networks, outperforms traditional classical heuristics and meta-heuristics [
45] in terms of both solution quality and computational efficiency.
The incorporation of time into location problems is a common characteristic in practical applications. These problems are classified as either single-period or static, where decisions are based on fixed parameters, or multi-period [
46] or dynamic, where decisions vary over time [
47]. Early works in this field include [
48,
49], which deal with the location and relocation of a single facility over a period of time. The concept was then expanded to cover multiple facilities, as seen in [
50]. The integration of the Weber problem with the multi-period facility location problem was explored in [
51], which aimed to determine the best facility to open at each time period. Other studies, such as [
52,
53,
54,
55], focus on dynamic uncapacitated facility location problems, location problems on networks, network p-median problems, and network center problems, respectively. The impact of facility opening and closing costs was studied in [
51,
56,
57,
58]. Additionally, dynamic location problems under uncertain environments were addressed in [
59,
60,
61].
Despite the numerous studies on location allocation models, there remains a gap in the literature when it comes to considering both robustness and dynamic allocation. While robust optimization has been used to address uncertainty in the demand, cost, and other parameters, the majority of these models are static and do not take into consideration the dynamic aspect of the location problem. On the other hand, dynamic location models have been proposed to account for the changing nature of the location problem over time, but they do not address the uncertainty present in real-world scenarios. In light of these gaps, this study aims to fill the void by proposing a MILP model that incorporates both robustness and dynamic allocation. This model aims to provide a more comprehensive solution for real-world location problems that are subject to both uncertainties and changes over time.
4. Numerical Experiment
4.1. Case Description
In this section, we discuss a case in which a company wants to place some mobile grocery stores on the campus of the University of Waterloo. These self-service mobile stores can serve all students and staff.
Figure 1 shows a sample picture of these stores. Specifically, the company wants to propose a plan to specify the optimal locations of stores needed on campus dynamically (every day) based on the demand variation on campus. They update their plan every month, i.e., they decide on all days at the beginning of each month. In this research, we aim to find the optimal locations of stores over the time horizon (one month). Based on
Section 3,
,
, and
.
Figure 2 shows the University of Waterloo campus map. As per the legend associated with this figure, all buildings on campus are categorized into five groups: Service and Administrative Buildings, Academic Buildings, Residence Buildings, Research Park Buildings, and others. Considering the parking lots, the campus can be effectively divided into six distinct categories. The University of Waterloo asked the company to follow some rules regarding each group. Specifically, there are some limitations on the number of stores needed for each group for each day. More details and the list of buildings included in each group are provided in the following sections.
Each building has a specific demand on each day. Obviously, it is not (financially) feasible to place one store in each building. The company has only p stores, and in each period (day), they want to place all of them on campus. We consider a demand cost for students and staff in a building that does not have a store, and they need to go to other buildings. The goal is to place the stores to satisfy the demand of all buildings while minimizing the costs. There are two more costs in our problem: opening cost and closing cost. Although the stores are mobile, we need to consider an opening (closing) cost if we want to open (close) a store at the beginning of each day. In fact, we are capturing all transportation costs needed to change a store’s location. In our model, we assume that the cost of opening or closing a store is not related to the distance between different locations.
The demand of each building may change from day to day due to different reasons such as weekends, events, etc. Then, it is an excellent motivation for day-to-day decision-making following demand variation. In this research, we assume that our demand prediction is accurate. Then, we can finalize all decisions for each day of the month, once at the beginning of the month. This variability highlights the need for a more dynamic approach to decision-making, and robust optimization can provide a solution to this challenge.
We can model this problem as a DMS-p-MP. Generally, our problem has two modifications compared to the simple multi-period p-median problem. First, we are considering opening and closing costs over the time horizon. Second, there are some extra constraints raised from University of Waterloo policies.
4.2. Data Collection
We divide all of the University of Waterloo (UW)’s facilities according to their functionality into six different segments: Academic Buildings, Parking Spots, Student Residence Buildings, Research Park Buildings, Athletic Buildings, and University Plaza, respectively. The workspace is based on the UW’s official map, which includes 91 facilities, and we also make an assumption that the population in each facility within a segment is all equivalent. Segments indicated in
Table 2 are as follows,
We use Python’s OpenCV toolbox by [
76] to determine facilities’ specific locations by fixing pixels’ coordinates in the workspace. Students and staff allow going through buildings, and it is likely for us to use Euclidean distance measurement
to find the distance between the two pairs in the data set as follows in
Figure 3.
From the UW’s website, we obtained the rough number of students and staff for each faculty, the rough number of students and staff for each university college, and the capacities of each residence. A general assumption is that 10% of university students will use one of two gyms (CIF and PAC).
Table 3,
Table 4,
Table 5,
Table 6,
Table 7 and
Table 8 below are the population of all classifications for each segment.
It was given in the previous section that the time horizon
(approximately equivalent to a month), but it can also be separated into four periods (weeks). Based on the functionality of each segment, with time varying from Monday to Sunday, we may empirically define the facility in the different segments and the different days in utilization rate
. For instance, the utilization rates for the academic buildings are 100 out of 100, but there are 30 out of 100 during the weekend. The complete estimated utilization rates for different functional buildings for each day over a week are as follows in
Table 9:
Thus, the demand
of each building in segment
k on day
t, where
, in a week, is able to be defined as
As for having a unified standard, we calculate the lower bound of opening facilities in constraint (
8) by taking a floor of 70% of the number of facilities in segment
k over the total facilities in the workspace:
For example,
Similarly, the upper bound of opening facilities in constraint (
7) by taking a ceiling of 130% of the number of facilities in segment
k over the total facilities in the workspace,
The following
Table 10 is the maximum (minimum) number of Robomarts that are allowed in segment
k,
4.3. Computational Experiments
In this section, we use all parameters discussed in the
Section 4.2 Data Collection to solve our problem. All numerical experiments have been run on an Apple M1 processor, limited to 16 GB of RAM. Gurobi has access to 8 physical cores and 8 logical processors, using up to 8 threads. This model is MILP and has
M (=239,512) variables and
N (=239,694) constraints (excluding sign constraints). We use Gurobi version 9.5.1 by [
77] to solve this optimization model. Since default settings in Gurobi generally work well, we are keeping all settings as default. Specifically, we use Gurobi’s API embedded in Python 3.11.2.
MILP models are generally solved using a linear-programming based branch-and-bound algorithm. The Gurobi provides advanced implementations of the latest MILP algorithms including deterministic parallel, nontraditional search, heuristics, solution improvement, cutting planes, and symmetry breaking.
Based on
Section 5, the demand is weekly periodic. Then, we expect the model to make the same decision over the weeks.
Table 11 shows what buildings are open during the planning horizon. We show the days that we change our decisions. For instance, the buildings that are open between day 1 and day 6 are the same.
Table 11 shows that we only open new buildings and close the current buildings over the weekend. We again change our decision on weekdays. It is expected because the utilization rates of some of our segments are significantly different during the weekend.
Specifically, we close buildings 12, 55, and 83 (that, based on
Table 3, are Conrad Grebel university college, Parking lots, Research building 375, respectively) and open buildings 63, 77, and 81 (that are Parking P, Student Village, University of Waterloo Place, respectively). The interesting point is that all of the buildings 63, 77, and 81 are around students’ residences. Students are mostly in the residence area instead of the academic campus during the weekends. Then, it is worth closing some stores and opening new ones in students’ residences.
4.4. Sensitivity Analysis
In this section, we will change the parameters to see how variability can affect our results.
4.4.1. Time Horizon
First, we analyze the impact of the length of time horizon on our results.
Table 12 shows the results when we increase (decrease) the time horizon. For instance, as we increase the time horizon, it will take more time to find the optimal solution. In addition, our optimal objective value would increase since we are adding more positive terms to our cost.
Moreover, we see that the opened and closed buildings remain the same as we increase the time horizon. The reason is that our demand is periodic over the weeks. By increasing the number of weeks, the optimal decision to open and close some stores would be the same.
Figure 4 shows the results, in which red line represents the objective value and the blue line represents CPU run time.
4.4.2. Number of Facilities to Be Located (P)
Now, we change parameter P and see how it affects our results. Variability of P is important since it would help the company to decide the number of stores they want to buy and create on the campus.
Note that we could also consider a fixed cost of buying each store in our model (i.e., we could add the
of each facility to the objective function). However, since in our optimization model
P is fixed and given, we do not need to consider it.
Figure 5 shows the results when we change
P. First, it seems that the running time is not directly dependent on
P. However,
P is determining the complexity of the problem. In addition, the objective value is obviously decreasing as we increase
P since we did not consider the price of each store.
In addition,
Table 13 shows how our decision changes in different
P. There is one interesting point in our decisions. Compare
with
. In
, we do not use the same opened building we used in
This shows that, as we want to add one building, we may no longer need another building.
4.4.3. Cost Coefficients
Followed by two previous sections, we want to discuss the effects of changing the opening and closing costs together on our results.
Table 14 shows the results when we increase (decrease) the cost coefficients. Considering the case in which costs are both zero, the model tries to open (and close) any building in each period. In other words, it does not care how many building it wants to open or close each day. Another noteworthy point is that running time increases as we increase the costs. It shows that the trade-off between keeping current buildings and opening (closing) other buildings is becoming important and challenging to the model.
Figure 6 shows that the initially opened buildings remain the same and start to dynamically be changed as we increase the opening/closing cost. Moreover, the objective value increases as we increase the opening cost. In addition, in the case of high opening costs, we prefer not to open new buildings (and close the current open buildings) since it would be costly.
5. Discussion
In this paper, we propose a mixed-integer linear formulation for the Dynamic Modified Stochastic p-Median Problem in a Competitive Supply Chain Network Design. The presented model takes into account the robust optimization and time horizon as a novel approach, which enables the decision-maker to consider uncertainty and short-term changes in the supply chain network design. The robust optimization approach used in this study allows for the consideration of different scenarios and uncertainty in demand and supply, which is crucial in real-world applications. Additionally, the time horizon approach allows for the dynamic nature of the problem to be captured, which is important in today’s fast-paced business environment.
The proposed model was tested using computational experiments, and the results demonstrate the effectiveness of the proposed approach in handling the dynamic and stochastic nature of the problem. The results also provide valuable insights for practitioners and researchers in the field of supply chain network design. The proposed model can be extended and applied to other similar problems in the field, such as facility location, transportation and logistics, and inventory management.
One of the main contributions of this study is the integration of robust optimization and time horizon in the mixed-integer linear formulation for the Dynamic Modified Stochastic p-Median Problem. The robust optimization approach allows for the consideration of different scenarios and uncertainty, while the time horizon approach allows for the dynamic nature of the problem to be captured. This integration provides a more realistic and practical solution to the problem, which can be useful for practitioners and researchers in the field.
Another important contribution of this study is the application of the proposed model to a competitive supply chain network design problem. This application is relevant and valuable as it provides insights into how the proposed model can be used in a real-world context. The results of the computational experiments demonstrate the effectiveness of the proposed model in handling the dynamic and stochastic nature of the problem and provide valuable insights for practitioners and researchers in the field. For instance, we can change over the problem definition to solve any other location problems. Covering location problems are valuable to investigate, such as trying to find the optimal number of Robomarts [
75] that can serve all facilities in the workspace if the single Robomart can only serve facilities within a limited number of miles.
The results of the computational study demonstrate the effectiveness and robustness of the proposed formulation in solving the DMS-P-MP. The results show that the number of opened and closed buildings remains constant as the time horizon advances, even though the demand patterns change over time. This highlights the ability of the proposed formulation to adapt to changing demand patterns and ensure that the supply chain network is both cost-efficient and responsive to changes in demand. Moreover, the robustness of the solution is demonstrated by its ability to provide effective solutions even in the presence of uncertainty in demand patterns. The proposed formulation can be applied to real-world problems, providing decision-makers with an effective and robust tool to optimize their supply chain network design in a dynamic and uncertain environment. The ability of the formulation to take into account the periodic nature of demand and adapt to changing demand patterns makes it a valuable tool for decision-makers who are facing challenges in designing a cost-efficient and responsive supply chain network.
While the cost of opening or closing a store is currently considered as a constant parameter in the model, it is recognized that this cost could vary dynamically based on various factors such as the distance between different locations. This presents an exciting opportunity for further research, where the dynamic nature of opening and closing costs could be explored and incorporated into the model.
One limitation of this study is that the Robomart can only be located at specific facilities (nodes) in the proposed model. In reality, however, the Robomart can also be located somewhere on the route between two nodes (edges). This limitation may affect the validity and applicability of the proposed model in real-world scenarios. To address this limitation, it is possible to consider analogous absolute p-median problems to simulate reality. This approach would involve the inclusion of edge-based locations for the Robomart in the model, which would provide a more realistic representation of the problem. However, this would require additional mathematical development and computational resources, and would be a subject for future research.
Another potential area of future research is to make the problem an Adaptive Robust Optimization (ARO) problem [
64]. In this approach, the number of trucks
p would be determined in the first stage and other variables in the second stage. This would enable a more flexible and dynamic approach to supply chain network design, as the number of trucks can be adjusted in response to changes in demand and supply.
In summary, the proposed model in DMS-p-MP takes into account the robust optimization and time horizon as a novel approach, which enables the decision-maker to consider uncertainty and short-term changes in the supply chain network design. The results of the computational experiments demonstrate the effectiveness of the proposed model in handling the dynamic and stochastic nature of the problem and provide valuable insights for practitioners and researchers in the field. The proposed model can be extended and applied to other similar problems in the field of supply chain network design. The integration of robust optimization and time horizon in the proposed model provides a more realistic and practical solution to the problem, which can be useful for practitioners and researchers in the field.