1. Introduction
Video streaming is an increasingly important social and entertainment activity for users traveling at high speed, and with the rapid development of network technology these users expect ever higher video quality. However, in high-speed network environments, such as those encountered on high-speed railways, user experience may suffer from frequent base station handovers, severe channel quality fluctuations, and limited wireless resources along the line. As the user’s speed increases, the time spent connected to each base station decreases. The resulting frequent handovers cause rapid changes in wireless channel quality, which makes channel quality difficult to predict and in turn degrades the performance of rate selection algorithms that rely on it. Accurately predicting channel quality changes in high-speed networks is therefore a significant challenge.
In addition, retransmissions caused by base station handovers increase video transmission latency and severely affect user experience. Edge computing can alleviate this issue by caching data in advance at the edge or by converting media data formats in real time using its computing capabilities, thereby reducing transmission delay. In high-speed networks, however, choosing where to cache data and where to perform media format conversion is itself challenging.
Finally, wireless resources in high-speed network environments are unevenly distributed in both time and space. When multiple users access the wireless network at high speed, the wireless resource allocation algorithm must consider time, space, the user’s viewing environment, and the differing channel qualities of the users. Adequately allocating wireless resources to a large number of users in high-speed mobile networks is thus another significant challenge.
In short, improving the video experience of users in a high-speed network is a challenging problem.
In this paper, the term “high-speed network” refers to the scenario in which users access wireless networks while moving at high speed. In particular, when the user’s speed exceeds 250 km/h, as during high-speed train travel, this mobility significantly degrades the network experience. Conversely, a scenario in which the user’s speed is below 250 km/h is termed a “low-speed network”. This speed threshold is derived from our experimental tests conducted along high-speed rail lines and is related to the distribution of base stations along these routes.
Extensive research has been conducted on streaming media transmission technology. Variable bitrate technology adapts to fluctuating network bandwidth by sending videos at different bitrates [1]. One study [2] incorporates the subjective Quality of Experience (QoE) of users into the bitrate selection algorithm, choosing an appropriate video bitrate based on user preferences. Another study [3] proposes a dynamic block quality-aware adaptive bitrate algorithm that caters to diverse QoE requirements by selecting higher quality for dynamic blocks while not excessively reducing the quality of static blocks. A further study [4] suggests an intelligent variable-length video segment division strategy that makes decisions based on network and user viewing data, striking a balance between the accuracy and the cost of variable-length division. It also introduces a data-driven I-frame adaptive bitrate switching algorithm to enhance data transmission efficiency. However, these studies primarily focus on low-speed mobile scenarios, and their algorithms may not perform as well in high-speed mobile scenarios, where base station handovers occur more frequently, the packet loss rate is higher, and predicting user channel quality is more difficult. Our previous work [5] therefore proposed an optimization scheme for single-user video streaming in a high-speed mobile network. However, that work does not consider multi-user video streaming, and its optimization objectives also differ. The main objective of this paper is to enhance the overall video experience for all users.
In high-speed mobile scenarios, serving multiple users who request media data simultaneously raises the following challenges. First, the base station has limited wireless resources, and allocating suitable wireless resources to each user is difficult. Second, users frequently switch base stations, and predicting the channel quality of multiple users is challenging. Third, in Mobile Edge Computing (MEC), both the data caching method and the decision to take after a cache miss are open problems.
This paper proposes an environment-aware wireless resource allocation algorithm based on edge computing to address the above challenges. The algorithm considers the operational state of the high-speed rail, the video streaming environment of users, the limited wireless resources of the base station, and other constraints. This paper mathematically models the wireless resource allocation problem of the base station in the high-speed mobile network environment as a nonlinear, NP-hard problem. To address this problem, the paper proposes a user channel quality prediction model, updates the QoE model for multi-user video streaming, and introduces two algorithms to solve the NP-hard problem. Simulation experiments validate the effectiveness of the proposed algorithms.
The contributions of this paper can be summarized as follows:
A wireless resource allocation method based on spatio-temporal attributes is proposed. This method involves selecting suitable base stations and allocating appropriate wireless resources for each user, based on time and environmental parameters related to the user’s video-watching experience.
A mathematical formulation of the problem is provided, and it is demonstrated that the optimization problem addressed in this paper is an NP-hard problem.
The effectiveness of the proposed method is validated through simulation experiments.
The paper is organized as follows.
Section 2 reviews the related work on the research topic both at home and abroad.
Section 3 introduces the proposed system architecture and system model.
Section 4 presents the solution to the problem.
Section 5 provides the simulation results.
Section 6 analyzes the results and discusses the implications.
Section 7 discusses the results of the experiments, and
Section 8 provides a summary of the entire paper.
2. Related Works
In recent years, the continuous development and expansion of high-speed rail have increased the demand for network services onboard high-speed rail. As people also expect higher video quality, providing better media services for users in high-speed mobile networks is a challenge. Currently, the main methods of video transmission optimization in a high-speed network involve optimizing protocols such as the Transmission Control Protocol (TCP), User Datagram Protocol (UDP), and Hypertext Transfer Protocol (HTTP).
In high-speed mobile network environments, frequent base station handovers cause heavy packet loss. Because the TCP congestion control algorithm cannot determine the cause of packet loss, it may apply excessive congestion control, which reduces the user’s available bandwidth. Researchers have therefore proposed several TCP optimization schemes, detailed as follows. Binary increase congestion control (BIC): BIC is a congestion control algorithm for TCP that aims to provide improved performance in high-speed networks [6]. Explicit congestion notification (ECN): ECN is a congestion management mechanism that explicitly signals network congestion to the communicating endpoints [7]. Multipath TCP (MPTCP): MPTCP is an extension of TCP that allows a single connection to transmit data across multiple network paths [8]. High-speed TCP (HSTCP): HSTCP is a TCP variant designed specifically for high-speed network environments; it modifies TCP’s congestion control algorithm to accommodate the requirements of such networks. These schemes adapt to changes in network bandwidth and delay and improve the performance of TCP in high-speed mobile network environments.
UDP can improve transmission efficiency in high-speed mobile networks, but it does not guarantee reliable data delivery. Hence, researchers have proposed techniques such as the Real-time Transport Protocol (RTP)/RTP Control Protocol (RTCP) and UDP Lite to enhance the reliability and stability of UDP-based transmission. Additionally, the Stream Control Transmission Protocol (SCTP) [9], which combines features of both UDP and TCP, offers another approach to achieving reliable and efficient data transmission.
Dynamic Adaptive Streaming over HTTP (DASH) is a popular variable bitrate protocol that adapts to network bandwidth jitter and enhances the stability and quality of video transmission. As DASH is based on HTTP, scholars have optimized HTTP-based delivery to better support it, for example, through HTTP-FLV (HTTP-Flash Video) and WebSocket [10]. These optimizations have greatly improved video transmission efficiency.
Researchers have conducted extensive research to improve the quality of high-speed rail streaming. However, in high-speed vehicular networks, network congestion and reliability remain challenging due to factors such as link failures and user behavior that prioritizes individual performance. In [11], the authors propose an Enhanced Congestion Game with Link Failure (E-CGF) scheme to cope with these challenges and achieve optimal network selection. E-CGF employs a hidden Markov model to estimate the probability of link failure. In [12], the authors propose a user-assisted base station (BS) caching and cooperative prefetching scheme for high-speed rail (HSR) communication, in which adjacent BSs periodically exchange information, such as coverage area and communication rate, to facilitate content caching and prefetching. Additionally, users can cache varying content received from BSs, augmenting the caching capabilities. The scheme formulates a content caching and prefetching optimization problem to minimize the overall transmission delay. The authors in [13] propose a handover algorithm for the high-speed rail wireless communication environment based on the combination of train running direction and speed. This algorithm analyzes the measurement, filtering, and control parameters involved in the Time Division-Long Term Evolution (TD-LTE) handover process.
Some scholars apply adaptive bitrate selection algorithms and video data caching strategies to improve video transmission. In [14], the authors explore the challenges of scalable video with varying content popularity and viewing demands, suggesting various cache content types and transmission schemes. The authors in [15] introduce a Multi-Rate Cache (MRC) scheme for video offloading in Device-to-Device (D2D) networks, that is, wireless networks in which devices exchange data directly without relying on central communication towers or servers. MRC stores and retrieves data at varying rates based on access frequency and importance, so that frequently demanded or high-priority data can be accessed quickly while less-used data are handled efficiently. This strategy suits environments that require efficient data handling, such as high-speed mobile networks, and aims to improve data accessibility and make good use of storage capacity. The study in [16] introduces a collaborative caching architecture for scalable video coding, tailored for use in drones and client devices. This architecture allows for efficient content distribution at high transmission rates and personalized video quality in hotspot regions.
Wireless resource allocation at the base station directly affects the user’s network bandwidth and is a current research hotspot. The research in [17] studies network energy efficiency in downlink scenarios, incorporating user cooperation and quality of service guarantees. It jointly considers relay selection, power allocation, and network selection to maximize the energy efficiency of mobile users while maintaining high Quality of Service (QoS). Another study [18] jointly optimizes resource block allocation and power control, and proposes a multi-agent deep reinforcement learning algorithm to improve the efficiency of resource block allocation. Additionally, the research in [19] studies a multi-band wireless network in which orthogonal and non-orthogonal multiple-access techniques coexist. It investigates the joint optimization of user association, transmit power allocation, subchannel assignment, and multiple-access technique selection to maximize the downlink sum rate under minimum user rate and power constraints. Furthermore, [20] proposes a time-varying demand resource allocation method for QoE-oriented wireless communication networks. The method predicts the time-varying demand, operates the network under a random blocking probability constraint, and considers queuing delay requirements as a practical design for QoE. Using a time-varying queuing model and an approximation method based on the Continuous-Time Markov Chain (CTMC), the authors formulate the design as a convex stochastic optimization problem and propose a dynamic capacity allocation method based on it.
Wireless resource allocation is an important research area in high-speed mobile scenarios. It involves allocating a certain number of Resource Blocks (RBs) to users. An RB is the smallest data transmission unit in the 5G NR (5G New Radio) system, consisting of a group of adjacent subcarriers that are capable of transmitting one symbol of data. The RB allocation directly affects the communication quality and network bandwidth of users on high-speed trains. Traditional RB allocation algorithms allocate RBs based on user channel quality, but this method is not adaptive to the network environment in high-speed mobile scenarios.
Edge computing technology has been widely used to enhance streaming media transmission, notably by reducing the latency of video transmission, and it has attracted considerable attention from scholars. In [21], the researchers study serverless edge computing and propose a solution based on a dual deep Q-network that makes scheduling decisions under dynamic system changes, improving the effectiveness and adaptability of decision-making in volatile environments. In [22], the authors study a collective deep reinforcement learning approach for intelligent sharing in the Internet landscape, grounded in edge computing. They propose a collective deep reinforcement learning algorithm that optimizes resource deployment and enhances user experience in edge computing environments; its collaborative learning paradigm improves the efficiency and effectiveness of learning in distributed computing contexts. The study in [23] introduces a resource allocation and management mechanism for 5G networks that uses Mobile Edge Computing (MEC) and straightforward mathematical approaches to reduce model complexity. This mechanism focuses on allocating resources within MEC to fulfill user request requirements. In [24], the authors investigate offloading decisions, collaborative decisions, and the allocation of computing and communication resources in the context of Cooperative Mobile Edge Computing (C-MEC). They propose a strategy in which users’ latency-sensitive tasks are processed locally and offloaded to collaborative devices or MEC servers. The primary goal is to minimize the total energy consumption of all mobile users under latency constraints. The aforementioned studies explore wireless resource allocation methods in edge computing, but they were conducted in low-mobility or stationary scenarios. In high-speed mobility scenarios, wireless resource allocation faces new challenges. As different users request various media data and frequent base station handovers occur due to high-speed movement, it becomes crucial to consider how base stations, whose conditions vary over time and space, can allocate appropriate wireless resources to different users. Addressing these issues is one of the contributions of this paper.
Many scholars have researched Vehicular Ad Hoc Networks (VANETs). Comprising highly mobile and self-organizing nodes, VANETs enable wireless communication among these nodes, facilitating the transmission of various information, including media data. The study in [25] explores VANET routing protocols, proposing a Trust-based Geographical Routing Protocol for VANETs (TGRV) that limits the participation of malicious vehicles in routing. The research in [26] introduces a secure access control protocol for VANETs, which uses a pseudonym mechanism to provide conditional privacy, allowing legitimate vehicles to remain anonymous while tracking malicious ones. The work in [27] addresses location privacy and reliability in VANET routing protocols, proposing DARVAN, a fully decentralized, anonymous, and reliable routing protocol for VANETs that maximizes privacy and reliability using distributed databases and collective consensus. Furthermore, the authors in [28] propose an Intelligent Real-time Multimedia Traffic Shaping scheme based on Reinforcement Multi-Distributed Learning (RMDRL) to control the flow and rate of traffic sent to 5G-VANETs. While VANETs can transmit media data and perform well in some scenarios, they are not suitable for media data transmission in high-speed train scenarios or high-speed networks.
The above methods investigate various optimization schemes for video transmission in high-speed mobile scenarios, but some problems remain unresolved. For example, the network parameters in high-speed mobile scenarios are highly uncertain, and the network bandwidth, channel quality, and transmission delay vary significantly. Hence, this paper proposes an environment-aware wireless resource allocation algorithm based on edge computing, designed to enhance the video experience for all users in high-speed mobile scenarios.
3. Optimization Model
This section provides a detailed introduction to a streaming transmission architecture specifically designed for high-speed railway scenarios, as shown in Figure 1. In this architecture, high-speed railway stations not only serve as communication nodes but also establish an edge computing module responsible for executing the optimization algorithm proposed in this paper. The high-speed railway stations collect data, including the Channel Quality Indicator (CQI) reported periodically by mobile users, and transmit these data to the edge computing module. The module executes an optimization algorithm that determines the optimal caching locations for video data along the high-speed railway line, taking into account factors such as the user’s moving speed, the distribution of high-speed railway stations, and the available Resource Blocks (RBs) at each station. According to the results of the algorithm, video data can be dynamically cached across different high-speed railway stations along the line. Furthermore, the algorithm allocates suitable RBs to all users connected to the high-speed railway stations. Users select the most appropriate video bitrate based on the allocated RBs and the actual CQI, optimizing their viewing experience. The method proposed in this paper is broadly applicable and suitable for various transmission protocols, such as Dynamic Adaptive Streaming over HTTP (DASH), Vehicle-to-Everything (V2X), and HTTP Live Streaming (HLS).
3.1. User Channel Quality Prediction Method
The media server stores the video data encoded into different quality levels. Let
denote video segments with different quality levels, where
represents the resolution of the video segment
for each
. Let
, where
, denote the
jth user. During the high-speed rail movement, the user
needs to connect (switch) to
N base stations. Let
, where
, denote the
nth base station. Let
denote the number of radio resources that the user
allocates at time
t and
denote the network bandwidth of the user at time
t. Let
denote the edge buffer when the user
connects to base station
, and
(bit/s) denote the media bitrate requested by the edge buffer.
Let
denote the bandwidth calculation function,
represent the network bandwidth predicted at time
t, and
indicate the level of the video segment requested by the user. Let
U denote the environmental parameters of the user watching the video, such as the user’s moving speed.
An RB represents the smallest wireless resource allocation unit. In the time domain, it encompasses a time slot denoted as , which represents the number of time slots within 1 ms, amounting to 2. Additionally, it comprises seven orthogonal frequency-division multiplexing (OFDM) symbols, i.e., , and includes 12 consecutive subcarriers in the frequency domain, indicated as . The base station is responsible for determining the number of to be allocated. The modulation and coding scheme (MCS) represents the data encoding method, which correlates with the channel quality indicator (CQI) value provided by the user. The symbol denotes the maximum bit rate achievable within the CQI index under 5G network conditions.
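As a rough illustration of how the RB structure described above translates into user bandwidth, the following sketch multiplies the number of resource elements in the allocated RBs by a spectral efficiency looked up from the reported CQI. The function name, the efficiency table (the commonly cited 4-bit CQI table with modulation up to 64QAM), and the omission of control and reference-signal overhead are assumptions made for illustration; they are not taken from the equations of this paper.

```python
# Illustrative sketch only: estimate user bandwidth from allocated RBs and CQI.
SUBCARRIERS_PER_RB = 12  # consecutive subcarriers per RB (from the text)
SYMBOLS_PER_SLOT = 7     # OFDM symbols per time slot (from the text)
SLOTS_PER_MS = 2         # time slots within 1 ms (from the text)

# Assumed spectral efficiency in bits per resource element for CQI 1..15.
CQI_EFFICIENCY = {
    1: 0.1523, 2: 0.2344, 3: 0.3770, 4: 0.6016, 5: 0.8770,
    6: 1.1758, 7: 1.4766, 8: 1.9141, 9: 2.4063, 10: 2.7305,
    11: 3.3223, 12: 3.9023, 13: 4.5234, 14: 5.1152, 15: 5.5547,
}


def estimate_bandwidth_bps(num_rbs: int, cqi: int) -> float:
    """Return an upper-bound bandwidth estimate in bit/s for one user."""
    if num_rbs < 0:
        raise ValueError("num_rbs must be non-negative")
    if not 0 <= cqi <= 15:
        raise ValueError("cqi must be in the range 0..15")
    if cqi == 0:
        return 0.0  # CQI 0 is treated here as 'out of range': no usable rate
    resource_elements_per_ms = (
        num_rbs * SUBCARRIERS_PER_RB * SYMBOLS_PER_SLOT * SLOTS_PER_MS
    )
    return resource_elements_per_ms * CQI_EFFICIENCY[cqi] * 1000.0
```

For example, `estimate_bandwidth_bps(10, 15)` gives roughly 9.3 Mbit/s under these assumptions; a real deployment would subtract signaling overhead.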
Equation (
3) defines the prior value of CQI.
and
denote the predicted and detected value of CQI at time
, respectively. Let A and B denote different weight parameters. Let
denote the deviation between prediction and actual value, and
denote the prior error covariance matrix, as Equation (
4) shows.
The weight equation can be calculated by Equation (
5)
where
R denotes the average value of measurement noise and H denotes the scaling factor. Equation (
6) represents the calculation of the posterior estimate
based on the prior
.
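The prior estimate, prior error covariance, weight, and posterior estimate described above appear to follow the familiar predict/update pattern of a Kalman filter. The sketch below shows a scalar version of that pattern applied to a CQI series. The class name, the default values, the process-noise term `q`, and the reading of A as a state-transition weight are assumptions for illustration, since the exact forms of Equations (3) to (6) are not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class ScalarCqiFilter:
    """Scalar Kalman-style CQI tracker (illustrative, not the paper's exact model)."""
    a: float = 1.0  # state-transition weight (assumed role of A)
    h: float = 1.0  # scaling factor H between state and measurement
    q: float = 0.5  # process-noise variance (hypothetical value)
    r: float = 2.0  # measurement-noise term R
    x: float = 7.0  # current posterior CQI estimate
    p: float = 1.0  # current posterior error covariance

    def step(self, measured_cqi: float) -> float:
        # Prediction: prior estimate and prior error covariance.
        x_prior = self.a * self.x
        p_prior = self.a * self.p * self.a + self.q
        # Weight (Kalman gain) balancing prediction against measurement.
        gain = p_prior * self.h / (self.h * p_prior * self.h + self.r)
        # Update: posterior estimate and posterior error covariance.
        self.x = x_prior + gain * (measured_cqi - self.h * x_prior)
        self.p = (1.0 - gain * self.h) * p_prior
        return self.x
```

Calling `step` once per reporting period with the CQI reported by the user yields a smoothed estimate; the next prior `a * x` can then serve as the predicted CQI for the following time slot.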
3.2. User-Subjective Video Experience Model
The network state in high-speed mobile scenarios is complex and unstable because of the high-speed movement of the train and the rapid change of base stations. Users may experience video buffering and image blur when watching videos, which degrades the user experience [29]. Figure 2 shows some of the QoE factors used in this paper. Excessive speed causes wireless channel jitter, which seriously reduces the user’s viewing experience in high-speed mobile scenarios. To establish a QoE model for users in these scenarios, we consider various factors, such as video objective quality, video smoothness, extra overhead from video cache misses, and video quality jitter. These factors affect the overall QoE differently, and their relative weights differ from user to user. Thus, the model has trade-off parameters that the user can define.
User QoE depends mainly on video objective quality, which is determined by the resolution and bitrate of the video. Higher-quality videos offer users a better subjective experience. The function represents the quality of the video and its mapping relationship with the QoE of the user. It can be expressed by peak signal-to-noise ratio (PSNR) or structural similarity index (SSIM). PSNR is a measurement used to assess the quality of an image or video by comparing it to a reference (original) image. It quantifies the level of distortion or noise introduced during the compression or transmission of media. SSIM is a metric used to measure the structural similarity between two images, where one is typically the reference (original) image and the other a distorted or processed version of it. It assesses not only changes in pixel values but also considers structural information, such as textures, edges, and patterns. This paper considers various factors, such as the video quality and the user’s viewing environment. It proposes a QoE model that suits this transmission architecture.
The objective quality of the video, which can be measured using PSNR or SSIM, is the most important factor for the user viewing experience. Equation (7) states the relationship between the user’s objective quality and the video supervisor’s assessment of quality. The logarithm in Equation (7) is the natural logarithm, that is, the logarithm to the base e.
Rebuffering is one of the main factors that affect the subjective experience of users when watching videos, and it occurs more frequently in high-speed mobile scenarios. This paper models the impact of rebuffering on the subjective quality of users as an exponential function, as Equation (8) shows. Following the IQX hypothesis, the inputs of this function are the rebuffering number N, that is, the number of rebuffering occurrences, and the stalling duration L, that is, the average duration of each rebuffering.
Let denote the mapping relationship between video quality jitter and user . Let q denote the jitter amplitude and s denote the number of jitters in the video. Let denote the value of video level switching between and . Let denote the impact of jitter amplitude on .
The parameter
represents the video transcoding cost. The cost is 0 when the user obtains the video from the server. Otherwise, if the video is transcoded on the computing module, the cost is calculated as described in Equation (
10).
Let
be the playback time of the video segment and
be the total data amount of the transcoded video segment. The parameter
c denotes the cost of transcoding 1 bit of data. In this paper, we aim to maximize the
of all users, which can be computed by Equation (
11).
The parameters
,
, and
are the weight parameters for video quality, smoothness, and jitter, respectively. The parameter
is a binary trade-off parameter of cost. In this paper, we normalize the value of
.
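To make the structure of the QoE model more concrete, the sketch below combines the four factors discussed above into one weighted score: a logarithmic quality term, an exponential rebuffering term, a jitter term based on level switches, and a per-bit transcoding cost gated by a binary flag. All function names, default weights, the decay constant, and the specific functional forms are assumptions chosen to match the verbal description; they should not be read as the exact Equations (7) to (11).

```python
import math
from typing import Sequence


def quality_term(bitrate_bps: float, lowest_bitrate_bps: float) -> float:
    """Logarithmic quality utility (assumed form); 0 at the lowest bitrate."""
    return math.log(bitrate_bps / lowest_bitrate_bps)


def smoothness_term(rebuffer_count: int, avg_stall_s: float, decay: float = 0.15) -> float:
    """Exponential decay in total stalling time (assumed IQX-style form)."""
    return math.exp(-decay * rebuffer_count * avg_stall_s)


def jitter_term(quality_levels: Sequence[int]) -> float:
    """Sum of absolute level switches between consecutive segments (assumed form)."""
    return float(sum(abs(b - a) for a, b in zip(quality_levels, quality_levels[1:])))


def transcoding_cost(transcoded_bits: float, cost_per_bit: float) -> float:
    """Cost of transcoding, with c interpreted as the cost of one bit."""
    return cost_per_bit * transcoded_bits


def session_qoe(bitrate_bps, lowest_bitrate_bps, rebuffer_count, avg_stall_s,
                quality_levels, transcoded_bits, cost_per_bit,
                w_quality=1.0, w_smooth=1.0, w_jitter=0.5, charge_cost=1):
    """Weighted QoE: reward quality and smoothness, penalize jitter and cost."""
    return (w_quality * quality_term(bitrate_bps, lowest_bitrate_bps)
            + w_smooth * smoothness_term(rebuffer_count, avg_stall_s)
            - w_jitter * jitter_term(quality_levels)
            - charge_cost * transcoding_cost(transcoded_bits, cost_per_bit))
```

Setting `charge_cost=0` corresponds to the case in which the segment comes directly from the server and no transcoding cost applies.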
3.3. Wireless Resource Allocation Model
We consider a high-speed mobile scenario with n users passing through N base stations. In the 5G NR network environment, each user reports their channel quality to the current base station in time slot t. Figure 3 shows the system sequence diagram proposed in this paper. The base station allocates wireless resources to the user and sends the video data as a carrier through the wireless channel. The streaming server provides video streaming service to the user. We propose a channel prediction model to forecast the future channel quality of users. Based on the predicted value, the number of available RBs at the base station, and the user’s video viewing environment (e.g., moving speed), we select the suitable bit rate video data to cache in different base station caches. Let S be the remaining available RB number in the base station and 1 RB be the minimum allocation unit of wireless resources. One RB provides a carrier for user video transmission in time slot t. The base station allocates RBs in time slot t using the predicted value of wireless channel quality and user viewing environment parameters. Our goal is to maximize the total QoE of all users by allocating an optimal number of RBs for each user.
Given the user’s wireless channel quality predicted at time t, we map the CQI prediction value to the optimal MCS value for the current user channel. For a given MCS value, we determine the corresponding modulation method and coding rate according to the international standard, as well as the required channel quality and bit error rate. We then determine the achievable data rate from the bit error rate and modulation method. The base station allocates wireless resources to each user using the algorithm proposed in this paper. Each user calculates the current available bandwidth from their wireless channel quality and the number of RBs allocated by the base station. The base station then selects a video segment with a bit rate less than or equal to the available bandwidth for caching. The data volume of the nth video segment is the media bit rate multiplied by the segment duration. The number of RBs allocated by the base station at time t must not exceed the number of RBs available at that time.
Let
be the available bandwidth at time
t. The selected video segment has a bitrate no greater than
.
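The selection rule stated above, choosing a segment whose bitrate does not exceed the available bandwidth, can be sketched as follows. The bitrate ladder, the function names, and the fallback to the lowest bitrate when nothing fits are assumptions for illustration.

```python
from typing import Sequence


def select_bitrate(ladder_bps: Sequence[float], available_bandwidth_bps: float) -> float:
    """Pick the highest bitrate that does not exceed the available bandwidth.

    Falls back to the lowest bitrate when no rung fits (an assumed policy).
    """
    if not ladder_bps:
        raise ValueError("bitrate ladder must not be empty")
    feasible = [b for b in ladder_bps if b <= available_bandwidth_bps]
    return max(feasible) if feasible else min(ladder_bps)


def segment_size_bits(bitrate_bps: float, duration_s: float) -> float:
    """Data volume of a segment: media bitrate multiplied by segment duration."""
    return bitrate_bps * duration_s
```

With a hypothetical ladder of 1, 2.5, 5, and 8 Mbit/s and 6 Mbit/s of available bandwidth, the rule selects 5 Mbit/s, and a 2 s segment then occupies 10 Mbit of cache.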
In a high-mobility scenario, the base station performs frequent handovers, leading to significant fluctuations in the user’s communication channel quality. When a user requests a video, the current base station searches the edge cache. If the cache contains the requested media data, it is immediately delivered to the user. Otherwise, there are two options. One is to request media data from the server, which incurs a retransmission delay of
.
Let
be the user’s moving distance during the retransmission time and
V be the current speed of the high-speed rail. We compare the distance between the user and the two adjacent base stations and determine the base station that will receive the video data based on the above data. Alternatively, the video data can be transcoded in the edge computing module of the base station, but this will consume the base station’s computing resources and incur costs.
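The base station choice after a cache miss can be illustrated with a one-dimensional model of the track: the user advances by the train speed multiplied by the retransmission delay, and the data are sent to the base station nearest to that future position. The coordinate model, the function name, and the nearest-station rule are simplifying assumptions, not the exact decision rule of this paper.

```python
from typing import Sequence


def target_base_station(user_pos_m: float, speed_mps: float,
                        retransmission_delay_s: float,
                        station_positions_m: Sequence[float]) -> int:
    """Return the index of the base station that should receive the data.

    The user is assumed to move along a straight track; the distance covered
    during the retransmission is speed multiplied by delay.
    """
    if not station_positions_m:
        raise ValueError("at least one base station is required")
    moved_m = speed_mps * retransmission_delay_s
    future_pos_m = user_pos_m + moved_m
    return min(range(len(station_positions_m)),
               key=lambda i: abs(station_positions_m[i] - future_pos_m))
```

For instance, at 300 km/h (about 83.3 m/s) a 2 s retransmission moves the user roughly 167 m, which can be enough to shift the nearest station from the current one to the next one along the line.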
The function indicates whether the segment of the video needs to be transcoded. If , the segment is transcoded. If , the media server retransmits the segment.
The function
denotes the transcoding time of the
nth segment of the video. The function
denotes the retransmission time of segment
n.
We set
to be the server CPU’s processing capacity (bits/second). Given the above constraints, we formulate the optimization problem as:
3.4. Complexity Analysis
Our problem is to select the optimal bit rate for each user and allocate wireless resources in the high-speed rail scenario to maximize user QoE. We first show that the problem is NP-hard.
Proof. Consider a special case of Equation (
20) with fixed parameter values in the high-speed rail scenario. Let
,
,
,
,
,
(assuming the same CQI value for each user), and bandwidth be constant. The base station determines the RB allocation according to Equation (
19).
where
is the number of RBs allocated by the base station to user
r.
To show the equivalence between Equation (20) and the exact cover problem, which is a well-known NP-hard problem, consider an arbitrary family G of p sets, each with a cardinality of 3, over a ground set Z. Represent every set in G by a binary sequence that has one digit per element of Z, where the rth digit is 1 if the set contains the rth element of Z and 0 otherwise. For example, 101001 and 011010 represent the sets that contain the first, third, and sixth elements and the second, third, and fifth elements of Z, respectively. Read each sequence as an integer in a positional number system whose base is large enough that adding these integers digit by digit never produces a carry.
Construct one such integer for each set in G, together with a target integer S, such that the constructed integers have a subset sum of S if and only if a subfamily of G forms an exact cover of Z. The value of S is the sequence that consists entirely of ones, with one digit per element of Z.
We will prove the equivalence by showing the sufficiency and the necessity.
Sufficiency: Suppose that there is a non-empty subset Y of the constructed integers whose sum is S. Every digit of every addend is either 0 or 1, and the base is large enough that there is no “carry” in this addition. Because every digit of S is 1, each element of Z belongs to exactly one of the sets that correspond to Y. Hence, these sets are pairwise disjoint, and their union is Z. Therefore, they form an exact cover of Z.
Necessity: Suppose that a subfamily of G is an exact cover of Z. Then, each element of Z belongs to exactly one set in the subfamily. Let Y be the set of integers that correspond to the sets in the subfamily. Then, Y is a non-empty subset of the constructed integers, and the sum of Y is S.
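The correspondence between exact covers and subset sums can be checked on a small instance. The sketch below is an illustration under stated assumptions, not the construction used in the proof verbatim: it assumes a base of p + 1 (one common choice that rules out carries), numbers the ground set 1 to 6, and uses a made-up four-set family whose first two members match the example sequences 101001 and 011010. The helper names are hypothetical.

```python
from itertools import combinations


def encode(subset, universe_size, base):
    """Read a set's membership sequence as an integer in the given base.

    Element 1 maps to the most significant digit, so {1, 3, 6} over six
    elements corresponds to the digit sequence 101001.
    """
    value = 0
    for element in range(1, universe_size + 1):
        value = value * base + (1 if element in subset else 0)
    return value


def subset_sum_solutions(values, target):
    """Brute force: index tuples of non-empty selections that sum to target."""
    hits = []
    for k in range(1, len(values) + 1):
        for idx in combinations(range(len(values)), k):
            if sum(values[i] for i in idx) == target:
                hits.append(idx)
    return hits


def is_exact_cover(family, idx, universe_size):
    """True if the selected sets contain every element exactly once."""
    covered = [e for i in idx for e in family[i]]
    return sorted(covered) == list(range(1, universe_size + 1))


# Made-up instance: four 3-element sets over the ground set {1, ..., 6}.
family = [{1, 3, 6}, {2, 3, 5}, {2, 4, 5}, {1, 2, 4}]
universe_size = 6
base = len(family) + 1  # assumed base p + 1, so digit sums never carry
values = [encode(s, universe_size, base) for s in family]
target = encode(set(range(1, universe_size + 1)), universe_size, base)  # 111111
solutions = subset_sum_solutions(values, target)
```

In this instance the only selection that sums to the all-ones target is the pair of sets {1, 3, 6} and {2, 4, 5}, which is also the only exact cover. The brute-force search is exponential and serves only to verify the equivalence on toy inputs.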
This shows that Equation (19) defines an NP-hard problem. A globally optimal solution is therefore hard to find, so we solve the problem with a heuristic algorithm, which provides an approximate solution. □
6. Results
This section presents the simulation results of the proposed algorithm and compares them with those of other algorithms, using the following metrics.
QoE: The performance of different algorithms is measured by the QoE of users in high-speed mobile state, which is defined and normalized in this paper.
SSIM: The structural similarity index (SSIM) is an indicator of image quality. It measures the structural similarity between images before and after transmission. In practice, the SSIM value lies between 0 and 1, with higher values indicating higher similarity [35].
CDF: The cumulative distribution function (CDF), or simply the distribution function, is the integral of the probability density function (PDF) and fully characterizes the probability distribution of a real random variable X. This paper analyzes the CDF of user QoE.
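The SSIM and CDF metrics defined above can be sketched in a few lines. This is a simplified illustration, not the evaluation code behind the figures: `global_ssim` applies the standard SSIM formula once over a whole image given as a flat list of 8-bit pixel values, whereas practical implementations average SSIM over local windows, and `empirical_cdf` builds the step-function CDF from a finite sample of QoE values. All sample values are made up.

```python
from bisect import bisect_right


def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM of two equally sized images given as flat lists.

    Uses the usual stabilising constants C1 = (0.01 L)^2 and C2 = (0.03 L)^2,
    where L is the dynamic range of the pixel values.
    """
    if len(x) != len(y) or not x:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    return ((2 * mu_x * mu_y + c1) * (2 * cov + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    )


def empirical_cdf(samples):
    """Return F(x) = fraction of samples that are less than or equal to x."""
    ordered = sorted(samples)
    n = len(ordered)
    if n == 0:
        raise ValueError("need at least one sample")

    def cdf(x):
        return bisect_right(ordered, x) / n

    return cdf


# Made-up pixel values before and after transmission.
original = [52, 55, 61, 59, 79, 61, 76, 61]
received = [54, 50, 65, 57, 70, 66, 71, 64]
ssim_same = global_ssim(original, original)
ssim_noisy = global_ssim(original, received)

# Made-up normalized QoE values for eight users.
qoe = [0.62, 0.71, 0.75, 0.80, 0.84, 0.88, 0.90, 0.93]
qoe_cdf = empirical_cdf(qoe)
share_at_most_0_8 = qoe_cdf(0.8)  # fraction of users with QoE <= 0.8
```

An identical pair of images yields an SSIM of 1, and any distortion lowers it. For the QoE sample, four of the eight users have a QoE of at most 0.8, so the empirical CDF at 0.8 is 0.5.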
Figure 6, Figure 7 and Figure 8 display, respectively, the QoE, the CDF of user QoE, and the SSIM for all users in a high-speed train carriage when the EA-WRAA, MDP-A, QBA, and ADMM/SA algorithms are used to transmit three different videos.
Figure 6 shows the QoE for the EA-WRAA, MDP-A, QBA, and ADMM/SA algorithms transmitting three videos. The EA-WRAA algorithm proposed in this paper surpasses the others in both QoE and stability. For instance, during the interval between 5 and 10 s, there is a significant drop in the user’s channel quality, potentially due to the train entering a tunnel or crossing an arch bridge. Despite this, the EA-WRAA algorithm maintains a more stable QoE. The network state of the train during the ‘blue sky’ segment is more stable than during the ‘50 mobcal ter’ segment. As Figure 6a shows, the EA-WRAA algorithm provides the highest QoE when users watch the ‘old town cross’ video. Notably, during periods of significant network quality fluctuation (between seconds 10 and 30 and between seconds 50 and 60), the QoE of the EA-WRAA algorithm does not decrease significantly, illustrating its robustness and adaptability. In contrast, the QoE of the MDP-A and QBA algorithms drops noticeably during these intervals, indicating that they adapt less effectively to network fluctuations.
Figure 7 presents the cumulative distribution function (CDF) of user QoE for the same video under different algorithms. The EA-WRAA algorithm proposed in this paper achieves a higher and more consistent QoE for a greater number of users than the other three algorithms. In Figure 7c, the CDF for the ‘blue sky’ video demonstrates the statistical advantage of the EA-WRAA algorithm. Its CDF curve rises rapidly and approaches a value of 1, indicating that almost all users enjoy a high-quality video experience. In contrast, the CDF curve for comparative algorithms such as MDP-A begins to rise around a QoE of 0.6, suggesting that a significant proportion of users are subjected to a lower QoE.
Figure 8 displays the SSIM values for three videos before and after transmission using four algorithms: EA-WRAA, MDP-A, QBA, and ADMM/SA. The ‘old town cross’ video segment exhibits a lower SSIM value than the other two segments, which is attributable to its higher complexity and greater susceptibility to noise during transmission. Figure 8b shows the SSIM performance of the EA-WRAA algorithm for the ‘50 mobcal ter’ video segment. The SSIM value for the EA-WRAA algorithm remains consistently high throughout the transmission process. This is particularly notable within the 30–40 s interval, where the SSIM values of the other algorithms fluctuate significantly. This highlights the EA-WRAA algorithm’s ability to preserve video details and maintain video quality, even in a high-speed mobile network environment.
Overall, the EA-WRAA algorithm improves QoE by adapting to network fluctuations and stabilizing the user experience. Its SSIM results show that it maintains high image quality, and its CDF results show that it provides a consistently high QoE to a broader group of users. Together, these advantages make the EA-WRAA algorithm particularly suitable for high-speed mobile network scenarios, where network quality varies because of high mobility and environmental changes.
QoE (Quality of Experience): The EA-WRAA algorithm maintains a higher QoE when network conditions fluctuate, for example when the train enters a tunnel (the 5–10 s interval in Figure 6). This demonstrates the algorithm’s stability and adaptability in response to network fluctuations. During the ‘blue sky’ segment, the EA-WRAA algorithm provides a more stable QoE, demonstrating that it can adjust to different network states to optimize the overall user experience. Furthermore, the EA-WRAA algorithm achieves a higher QoE in the CDF shown in Figure 7, indicating an improvement in user satisfaction compared to the other algorithms.
SSIM (Structural Similarity Index): The EA-WRAA algorithm maintains a high SSIM value even in segments of high video complexity, such as ‘old town cross’, reducing the quality loss during transmission. In the SSIM comparison across the three video segments, the EA-WRAA algorithm achieves higher image quality, reflecting its encoding and transmission strategy.
CDF (Cumulative Distribution Function): The CDF curve of the EA-WRAA algorithm rises quickly and approaches a value of 1, meaning that most users enjoy a high level of QoE, providing a more consistent and broad high-quality experience compared to other algorithms. The CDF analysis shows that the EA-WRAA algorithm provides a more stable and higher QoE level across the entire user group, demonstrating its statistical advantage and overall improvement in user experience.
7. Discussion
Compared to the three other algorithms, the EA-WRAA algorithm delivers higher and more consistent user QoE in high-speed mobile scenarios. This algorithm considers factors such as objective video quality, subjective user perception, network state, and switching costs, and dynamically adjusts video resolution and bitrate to adapt to high-speed mobile network fluctuations. User QoE is significantly affected by video complexity and noise susceptibility. For example, the ‘old town cross’ video segment has a lower SSIM value than the other two segments, indicating poorer user QoE. This video has higher spatial and temporal complexity, with various colors, textures, motions, and details, which are prone to noise interference during transmission. This reduces the structural similarity and increases the distortion of the video, which diminishes the viewing experience for users.
The EA-WRAA algorithm is tailored to the unique features of high-speed mobile scenarios, such as frequent base station handovers, rapid rail velocities, and various environmental factors, rendering it more suitable for these network environments. Network conditions in high-speed mobile scenarios vary with user location, velocity, and direction, leading to instability in network parameters such as bandwidth, latency, and packet loss. Merely adjusting video parameters based on the current network state can result in fluctuations in video quality and a reduction in user QoE. The EA-WRAA algorithm enhances video quality and user QoE by proactively adjusting videos based on predicted user trajectories and network states.
8. Conclusions
This study introduces a novel environment-aware wireless resource allocation algorithm, specifically designed to enhance the user video experience in high-speed mobile network scenarios. The algorithm takes into account the network fluctuations, user mobility speed, and environmental factors prevalent in high-speed mobile environments. It dynamically adjusts video resolution and bitrate to adapt to network changes. Experimental results indicate that the EA-WRAA algorithm outperforms other algorithms in terms of overall user QoE, SSIM, and the CDF of user QoE. Compared to traditional video transmission optimization methods, the EA-WRAA algorithm adapts better to the distinct characteristics of high-speed mobile networks, thereby providing a more stable and higher quality video viewing experience. In high-speed mobile network environments, frequent base station transitions, the velocity of high-speed railways, and other environmental factors significantly impact network parameters such as bandwidth, latency, and packet loss rate. The EA-WRAA algorithm proactively adjusts video parameters based on user trajectory predictions and current network status, with the goal of optimizing video quality and stability. In the experiments, the EA-WRAA algorithm’s performance was compared with three other algorithms (MDP-A, QBA, and ADMM/SA), and it maintained higher SSIM values and provided a high QoE to a broader group of users.
Overall, the EA-WRAA algorithm demonstrates significant advantages in enhancing video transmission quality and user experience in high-speed mobile network environments. Future research will delve into various video encoding and transmission schemes, such as semantic transmission, to further augment video quality and stability.