Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (64)

Search Parameters:
Keywords = HJB equation

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
21 pages, 2842 KB  
Article
Robust Optimal Reinsurance and Investment Problem Under Markov Switching via Actor–Critic Reinforcement Learning
by Fang Jin, Kangyong Cheng, Xiaoliang Xie and Shubo Chen
Mathematics 2025, 13(21), 3502; https://doi.org/10.3390/math13213502 - 2 Nov 2025
Viewed by 242
Abstract
This paper investigates a robust optimal reinsurance and investment problem for an insurance company operating in a Markov-modulated financial market. The insurer’s surplus process is modeled by a diffusion process with jumps, which is correlated with financial risky assets through a common shock [...] Read more.
This paper investigates a robust optimal reinsurance and investment problem for an insurance company operating in a Markov-modulated financial market. The insurer’s surplus process is modeled by a diffusion process with jumps, which is correlated with financial risky assets through a common shock structure. The economic regime switches according to a continuous-time Markov chain. To address model uncertainty concerning both diffusion and jump components, we formulate the problem within a robust optimal control framework. By applying the Girsanov theorem for semimartingales, we derive the dynamics of the wealth process under an equivalent martingale measure. We then establish the associated Hamilton–Jacobi–Bellman (HJB) equation, which constitutes a coupled system of nonlinear second-order integro-differential equations. An explicit form of the relative entropy penalty function is provided to quantify the cost of deviating from the reference model. The theoretical results furnish a foundation for numerical solutions using actor–critic reinforcement learning algorithms. Full article
Show Figures

Figure 1

26 pages, 1442 KB  
Article
A Tripartite Differential Game Approach to Understanding Intelligent Transformation in the Wastewater Treatment Industry
by Renmin Liao, Linbin Wang and Feng Deng
Systems 2025, 13(11), 960; https://doi.org/10.3390/systems13110960 - 28 Oct 2025
Viewed by 248
Abstract
The intelligent transformation of the wastewater treatment industry, as a core component of the modern environmental governance system, is of decisive significance for achieving sustainable development goals. This study focuses on the issue of multi-stakeholder collaborative governance in the intelligent transformation of the [...] Read more.
The intelligent transformation of the wastewater treatment industry, as a core component of the modern environmental governance system, is of decisive significance for achieving sustainable development goals. This study focuses on the issue of multi-stakeholder collaborative governance in the intelligent transformation of the wastewater treatment industry, with differential game theory as the core framework. A tripartite game model involving the government, wastewater treatment enterprises, and digital twin platforms is developed to depict the dynamic interrelations and mutual influences of strategy choices, thereby capturing the coordination mechanisms among government regulation, enterprise technology adoption, and platform support in the transformation process. Based on the dynamic optimization properties of differential games, the Hamilton–Jacobi–Bellman (HJB) equation is employed to derive the long-term equilibrium strategies of the three parties, presenting the evolutionary paths under Nash non-cooperative games, Stackelberg games, and tripartite cooperative games. Furthermore, the Sobol global sensitivity analysis is applied to identify key parameters influencing system performance, while the response surface method (RSM) with central composite design (CCD) is used to quantify parameter interaction effects. The findings are as follows: (1) compared with Nash non-cooperative and Stackelberg games, the tripartite cooperative strategy based on the differential game model achieves global optimization of system performance, demonstrating the efficiency-enhancing effect of dynamic collaboration; (2) the most sensitive parameters are β, α, μ3, and η3, with β having the highest sensitivity index (STi = 0.459), indicating its dominant role in system performance; (3) significant synergistic enhancement effects are observed among αβ, αμ3, and βμ3, corresponding, respectively, to the “technology stability–benefit conversion” gain effect, the “technology decay–platform compensation” dynamic balance mechanism, and the “benefit conversion–platform empowerment” performance threshold rule. Full article
Show Figures

Figure 1

18 pages, 868 KB  
Article
Stochastic Production Planning in Manufacturing Systems
by Dragos-Patru Covei
Axioms 2025, 14(10), 766; https://doi.org/10.3390/axioms14100766 - 16 Oct 2025
Viewed by 289
Abstract
We study stochastic production planning in capacity-constrained manufacturing systems, where feasible operating states are restricted to a convex safe-operating region. The objective is to minimize the total cost that combines a quadratic production effort with an inventory holding cost, while automatically halting production [...] Read more.
We study stochastic production planning in capacity-constrained manufacturing systems, where feasible operating states are restricted to a convex safe-operating region. The objective is to minimize the total cost that combines a quadratic production effort with an inventory holding cost, while automatically halting production when the state leaves the safe region. We derive the associated Hamilton–Jacobi–Bellman (HJB) equation, establish the existence and uniqueness of the value function under broad conditions, and prove a concavity property of the transformed value function that yields a robust gradient-based optimal feedback policy. From an operations perspective, the stopping mechanism encodes hard capacity and safety limits, ensuring bounded risk and finite expected costs. We complement the analysis with numerical methods based on finite differences and illustrate how the resulting policies inform real-time decisions through two application-inspired examples: a single-product case calibrated with typical process-industry parameters and a two-dimensional example motivated by semiconductor fabrication, where interacting production variables must satisfy joint safety constraints. The results bridge rigorous stochastic control with practical production planning and provide actionable guidance for operating under uncertainty and capacity limits. Full article
Show Figures

Figure 1

28 pages, 6227 KB  
Article
Image Restoration via the Integration of Optimal Control Techniques and the Hamilton–Jacobi–Bellman Equation
by Dragos-Patru Covei
Mathematics 2025, 13(19), 3137; https://doi.org/10.3390/math13193137 - 1 Oct 2025
Cited by 1 | Viewed by 362
Abstract
In this paper, we propose a novel image restoration framework that integrates optimal control techniques with the Hamilton–Jacobi–Bellman (HJB) equation. Motivated by models from production planning, our method restores degraded images by balancing an intervention cost against a state-dependent penalty that quantifies the [...] Read more.
In this paper, we propose a novel image restoration framework that integrates optimal control techniques with the Hamilton–Jacobi–Bellman (HJB) equation. Motivated by models from production planning, our method restores degraded images by balancing an intervention cost against a state-dependent penalty that quantifies the loss of critical image information. Under the assumption of radial symmetry, the HJB equation is reduced to an ordinary differential equation and solved via a shooting method, from which the optimal feedback control is derived. Numerical experiments, supported by extensive parameter tuning and quality metrics such as PSNR and SSIM, demonstrate that the proposed framework achieves significant improvement in image quality. The results not only validate the theoretical model but also suggest promising directions for future research in adaptive and hybrid image restoration techniques. Full article
Show Figures

Figure 1

36 pages, 2144 KB  
Article
Dynamic Portfolio Optimization Using Information from a Crisis Indicator
by Victor Gonzalo, Markus Wahl and Rudi Zagst
Mathematics 2025, 13(16), 2664; https://doi.org/10.3390/math13162664 - 19 Aug 2025
Viewed by 782
Abstract
Investors face the challenge of how to incorporate economic and financial forecasts into their investment strategy, especially in times of financial crisis. To model this situation, we consider a financial market consisting of a risk-free asset with a constant interest rate as well [...] Read more.
Investors face the challenge of how to incorporate economic and financial forecasts into their investment strategy, especially in times of financial crisis. To model this situation, we consider a financial market consisting of a risk-free asset with a constant interest rate as well as a risky asset whose drift and volatility is influenced by a stochastic process indicating the probability of potential market downturns. We use a dynamic portfolio optimization approach in continuous time to maximize the expected utility of terminal wealth and solve the corresponding HJB equations for the general class of HARA utility functions. The resulting optimal strategy can be obtained in closed form. It corresponds to a CPPI strategy with a stochastic multiplier that depends on the information from the crisis indicator. In addition to the theoretical results, a performance analysis of the derived strategy is implemented. The specified model is fitted using historic market data and the performance is compared to the optimal portfolio strategy obtained in a Black–Scholes framework without crisis information. The new strategy clearly dominates the BS-based CPPI strategy with respect to the Sharpe Ratio and Adjusted Sharpe Ratio. Full article
(This article belongs to the Special Issue Latest Advances in Mathematical Economics)
Show Figures

Figure 1

19 pages, 4228 KB  
Article
Data-Driven Optimal Bipartite Containment Tracking for Multi-UAV Systems with Compound Uncertainties
by Bowen Chen, Mengji Shi, Zhiqiang Li and Kaiyu Qin
Drones 2025, 9(8), 573; https://doi.org/10.3390/drones9080573 - 13 Aug 2025
Viewed by 405
Abstract
With the increasing deployment of Unmanned Aerial Vehicle (UAV) swarms in uncertain and dynamically changing environments, optimal cooperative control has become essential for ensuring robust and efficient system coordination. To this end, this paper designs a data-driven optimal bipartite containment tracking control scheme [...] Read more.
With the increasing deployment of Unmanned Aerial Vehicle (UAV) swarms in uncertain and dynamically changing environments, optimal cooperative control has become essential for ensuring robust and efficient system coordination. To this end, this paper designs a data-driven optimal bipartite containment tracking control scheme for multi-UAV systems under compound uncertainties. A novel Dynamic Iteration Regulation Strategy (DIRS) is proposed, which enables real-time adjustment of the learning iteration step according to the task-specific demands. Compared with conventional fixed-step data-driven algorithms, the DIRS provides greater flexibility and computational efficiency, allowing for better trade-offs between the performance and cost. First, the optimal bipartite containment tracking control problem is formulated, and the associated coupled Hamilton–Jacobi–Bellman (HJB) equations are established. Then, a data-driven iterative policy learning algorithm equipped with the DIRS is developed to solve the optimal control law online. The stability and convergence of the proposed control scheme are rigorously analyzed. Furthermore, the control law is approximated via the neural network framework without requiring full knowledge of the model. Finally, numerical simulations are provided to demonstrate the effectiveness and robustness of the proposed DIRS-based optimal containment tracking scheme for multi-UAV systems, which can reduce the number of iterations by 88.27% compared to that for the conventional algorithm. Full article
Show Figures

Figure 1

23 pages, 811 KB  
Article
Backstepping-Based Finite-Horizon Optimization for Pitching Attitude Control of Aircraft
by Ang Li, Yaohua Shen and Bin Du
Aerospace 2025, 12(8), 653; https://doi.org/10.3390/aerospace12080653 - 23 Jul 2025
Viewed by 305
Abstract
In this paper, the problem of pitching attitude finite-horizon optimization for aircraft is posed with system uncertainties, external disturbances, and input constraints. First, a neural network (NN) and a nonlinear disturbance observer (NDO) are employed to estimate the value of system uncertainties and [...] Read more.
In this paper, the problem of pitching attitude finite-horizon optimization for aircraft is posed with system uncertainties, external disturbances, and input constraints. First, a neural network (NN) and a nonlinear disturbance observer (NDO) are employed to estimate the value of system uncertainties and external disturbances. Taking input constraints into account, an auxiliary system is designed to compensate for the constrained input. Subsequently, the backstepping control containing NN and NDO is used to ensure the stability of systems and suppress the adverse effects caused by the system uncertainties and external disturbances. In order to avoid the derivation operation in the process of backstepping, a dynamic surface control (DSC) technique is utilized. Simultaneously, the estimations of the NN and NDO are applied to derive the backstepping control law. For the purpose of achieving finite-horizon optimization for pitching attitude control, an adaptive method termed adaptive dynamic programming (ADP) with a single NN-termed critic is applied to obtain the optimal control. Time-varying feature functions are applied to construct the critic NN in order to approximate the value function in the Hamilton–Jacobi–Bellman (HJB) equation. Furthermore, a supplementary term is added to the weight update law to minimize the terminal constraint. Lyapunov stability theory is used to prove that the signals in the control system are uniformly ultimately bounded (UUB). Finally, simulation results illustrate the effectiveness of the proposed finite-horizon optimal attitude control method. Full article
(This article belongs to the Section Aeronautics)
Show Figures

Figure 1

18 pages, 4415 KB  
Article
Ultra-Dense Uplink UAV Lossy Communications: Trajectory Optimization Based on Mean Field Game
by Yibo Ma and Shen Qian
Electronics 2025, 14(11), 2219; https://doi.org/10.3390/electronics14112219 - 29 May 2025
Viewed by 633
Abstract
This paper investigates a multiple unmanned aerial vehicle (UAV) enabled network for supporting emergency communication services, where each drone acts as a base station (also called the drone small cell (DSC)). The novelty of this paper is that a mean field game (MFG)-based [...] Read more.
This paper investigates a multiple unmanned aerial vehicle (UAV) enabled network for supporting emergency communication services, where each drone acts as a base station (also called the drone small cell (DSC)). The novelty of this paper is that a mean field game (MFG)-based strategy is conceived for jointly controlling the three-dimensional (3D) locations of these drones to guarantee the distortion requirement of lossy communications, while considering the inter-cell interference and the flight energy consumption of drones. More explicitly, we derive the Hamilton–Jacobi–Bellman (HJB) and Fokker–Planck–Kolmogorov (FPK) equations, and propose an algorithm where both the Lax–Friedrichs scheme and the Lagrange relaxation are invoked for solving the HJB and FPK equations with 3D control vectors and state vectors. The numerical results show that the proposed algorithm can achieve a higher access rate with a similar flight energy consumption. Full article
Show Figures

Figure 1

13 pages, 247 KB  
Article
Stochastic Optimal Control of Averaged SDDE with Semi-Markov Switching and with Application in Economics
by Mariya Svishchuk and Anatoliy V. Swishchuk
Mathematics 2025, 13(9), 1440; https://doi.org/10.3390/math13091440 - 28 Apr 2025
Viewed by 710
Abstract
This paper is devoted to the study of stochastic optimal control of averaged stochastic differential delay equations (SDDEs) with semi-Markov switchings and their applications in economics. By using the Dynkin formula and solution of the Dirichlet–Poisson problem, the Hamilton–Jacobi–Bellman (HJB) equation and the [...] Read more.
This paper is devoted to the study of stochastic optimal control of averaged stochastic differential delay equations (SDDEs) with semi-Markov switchings and their applications in economics. By using the Dynkin formula and solution of the Dirichlet–Poisson problem, the Hamilton–Jacobi–Bellman (HJB) equation and the inverse HJB equation are derived. Applications are given to a new Ramsey stochastic models in economics, namely the averaged Ramsey diffusion model with semi-Markov switchings. A numerical example is presented as well. Full article
(This article belongs to the Special Issue Stochastic Models with Applications, 2nd Edition)
27 pages, 3808 KB  
Article
Dynamic Modeling of Limit Order Book and Market Maker Strategy Optimization Based on Markov Queue Theory
by Fei Xie, Yang Liu, Changlong Hu and Shenbao Liang
Mathematics 2025, 13(5), 778; https://doi.org/10.3390/math13050778 - 26 Feb 2025
Viewed by 6510
Abstract
In recent years, high-frequency trading has become increasingly popular in financial markets, making the dynamic modeling of the limit book and the optimization of market maker strategies become key topics. However, existing studies often lacked detailed descriptions of order books and failed to [...] Read more.
In recent years, high-frequency trading has become increasingly popular in financial markets, making the dynamic modeling of the limit book and the optimization of market maker strategies become key topics. However, existing studies often lacked detailed descriptions of order books and failed to fully characterize the optimal decisions of market makers in complex market environments, especially in China’s A-share market. Based on Markov queue theory, this paper proposes the dynamic model of the limit order and the optimal strategy of the market maker. The model uses a state transition probability matrix to refine the market diffusion state, order generation, and trading process and incorporates indicators such as optimal quote deviation and restricted order trading probability. Then, the optimal control model is constructed and the reference strategy is derived using the Hamilton–Jacobi–Bellman (HJB) equation. Then, the key parameters are estimated using the high-frequency data of Ping An Bank for a single trading day. In the empirical aspect, the six-month high-frequency trading data of 114 representative stocks in different market states such as the bull market and bear market in China’s A-share market were selected for strategy verification. The results showed that the proposed strategy had robust returns and stable profits in the bull market and that frequent capture of market fluctuations in the bear market can earn relatively high returns while maintaining 50% of the order coverage rate and 66% of the stable order winning rate. Our study used Markov queuing theory to describe the state and price dynamics of the limit order book in detail and used optimization methods to construct and solve the optimal market maker strategy. The empirical aspect broadens the empirical scope of market maker strategies in the Chinese market and studies the stability and effectiveness of market makers in different market states. Full article
(This article belongs to the Section E: Applied Mathematics)
Show Figures

Figure 1

15 pages, 646 KB  
Article
An Optimal Investment Decision Problem Under the HARA Utility Framework
by Aiyin Wang, Xiao Ji, Lu Zhang, Guodong Li and Wenjie Li
Symmetry 2025, 17(2), 311; https://doi.org/10.3390/sym17020311 - 19 Feb 2025
Viewed by 797
Abstract
This paper is dedicated to studying the optimal investment proportions of three types of assets with symmetry, namely, risky assets, risk-free assets, and wealth management products, when the stochastic expenditure process follows a jump-diffusion model. The stochastic expenditure process is treated as an [...] Read more.
This paper is dedicated to studying the optimal investment proportions of three types of assets with symmetry, namely, risky assets, risk-free assets, and wealth management products, when the stochastic expenditure process follows a jump-diffusion model. The stochastic expenditure process is treated as an exogenous cash flow and is assumed to follow a stochastic differential process with jumps. Under the Cox–Ingersoll–Ross interest rate term structure, it is presumed that the prices of multiple risky assets evolve according to a multi-dimensional geometric Brownian motion. By employing stochastic control theory, the Hamilton–Jacobi–Bellman (HJB) equation for the household portfolio problem is formulated. Considering various risk-preference functions, particularly the Hyperbolic Absolute Risk Aversion (HARA) function, and given the algebraic form of the objective function through the terminal-value maximization condition, an explicit solution for the optimal investment strategy is derived. The findings indicate that when household investment behavior is characterized by random expenditures and symmetry, as the risk-free interest rate rises, the optimal proportion of investment in wealth-management products also increases, whereas the proportion of investment in risky assets continually declines. As the expected future expenditure increases, households will decrease their acquisition of risky assets, and the proportion of risky-asset purchases is sensitive to changes in the expectation of unexpected expenditures. Full article
(This article belongs to the Section Mathematics)
Show Figures

Figure 1

22 pages, 5681 KB  
Article
The Research on Strategic Choices of Food Supply Chain Considering Information Symmetry and Cost Sharing
by Jianhua Wang and Ji Xu
Symmetry 2025, 17(1), 142; https://doi.org/10.3390/sym17010142 - 18 Jan 2025
Viewed by 1062
Abstract
In the digital economy era, information symmetry, transparency, and traceability in food supply chains have increasingly garnered consumer attention. To motivate supply chain members to engage in product traceability, this paper examines the competitive and cooperative dynamics among participants in the food supply [...] Read more.
In the digital economy era, information symmetry, transparency, and traceability in food supply chains have increasingly garnered consumer attention. To motivate supply chain members to engage in product traceability, this paper examines the competitive and cooperative dynamics among participants in the food supply chain over continuous time. By developing a differential game model involving manufacturers and retailers with three decision-making modes, we solve the model using the Hamilton–Jacobi–Bellman (HJB) equation and perform a simulation analysis to assess the impact of different modes on overall supply chain profits. Additionally, we analyze how various parameters affect the profits of manufacturers and retailers. The key findings of this study indicate that centralized decision-making enhances the overall benefits of the food supply chain. Among the three decision-making models, the cost-sharing model proves to be the optimal approach, as it leads to a Pareto improvement in the profits of both manufacturers and retailers. These conclusions provide valuable insights for supply chain members seeking to optimize product traceability and enhance supply chain efficiency, as well as for government authorities involved in traceable supply chain governance. Full article
(This article belongs to the Section Mathematics)
Show Figures

Figure 1

14 pages, 429 KB  
Article
Optimal Consumption and Investment with Income Adjustment and Borrowing Constraints
by Geonwoo Kim and Junkee Jeon
Mathematics 2024, 12(22), 3536; https://doi.org/10.3390/math12223536 - 12 Nov 2024
Viewed by 1163
Abstract
In this paper, we address the utility maximization problem of an infinitely lived agent who has the option to increase their income. The agent can increase their income at any time, but doing so incurs a wealth cost proportional to the amount of [...] Read more.
In this paper, we address the utility maximization problem of an infinitely lived agent who has the option to increase their income. The agent can increase their income at any time, but doing so incurs a wealth cost proportional to the amount of the increase. To prevent the agent from infinitely increasing their income and borrowing against future income, we additionally consider a non-negative wealth constraint that prohibits borrowing based on future income. This utility maximization problem is a mixture of stochastic control, where the agent chooses consumption and investment, and singular control, where the agent chooses a non-decreasing income process. To solve this non-trivial and challenging problem, we derive the Hamilton–Jacobi–Bellman (HJB) equation with a gradient constraint using the dynamic programming principle (DPP). Then, using the guess-and-verify method and a linearization technique, we obtain a closed-form solution to the HJB equation and, based on this, find the optimal strategy. Full article
(This article belongs to the Special Issue Financial Mathematics and Applications)
Show Figures

Figure 1

22 pages, 645 KB  
Article
A Preconditioned Policy–Krylov Subspace Method for Fractional Partial Integro-Differential HJB Equations in Finance
by Xu Chen, Xin-Xin Gong, Youfa Sun and Siu-Long Lei
Fractal Fract. 2024, 8(6), 316; https://doi.org/10.3390/fractalfract8060316 - 27 May 2024
Cited by 1 | Viewed by 1540
Abstract
To better simulate the prices of underlying assets and improve the accuracy of pricing financial derivatives, an increasing number of new models are being proposed. Among them, the Lévy process with jumps has received increasing attention because of its capacity to model sudden [...] Read more.
To better simulate the prices of underlying assets and improve the accuracy of pricing financial derivatives, an increasing number of new models are being proposed. Among them, the Lévy process with jumps has received increasing attention because of its capacity to model sudden movements in asset prices. This paper explores the Hamilton–Jacobi–Bellman (HJB) equation with a fractional derivative and an integro-differential operator, which arise in the valuation of American options and stock loans based on the Lévy-α-stable process with jumps model. We design a fast solution strategy that includes the policy iteration method, Krylov subspace method, and banded preconditioner, aiming to solve this equation rapidly. To solve the resulting HJB equation, a finite difference method including an upwind scheme, shifted Grünwald approximation, and trapezoidal method is developed with stability and convergence analysis. Then, an algorithmic framework involving the policy iteration method and the Krylov subspace method is employed. To improve the performance of the above solver, a banded preconditioner is proposed with condition number analysis. Finally, two examples, sugar option pricing and stock loan valuation, are provided to illustrate the effectiveness of the considered model and the efficiency of the proposed preconditioned policy–Krylov subspace method. Full article
(This article belongs to the Topic Advances in Nonlinear Dynamics: Methods and Applications)
Show Figures

Figure 1

18 pages, 413 KB  
Article
Near-Optimal Tracking Control of Partially Unknown Discrete-Time Nonlinear Systems Based on Radial Basis Function Neural Network
by Jiashun Huang, Dengguo Xu, Yahui Li and Yan Ma
Mathematics 2024, 12(8), 1146; https://doi.org/10.3390/math12081146 - 10 Apr 2024
Cited by 7 | Viewed by 1363
Abstract
This paper proposes an optimal tracking control scheme through adaptive dynamic programming (ADP) for a class of partially unknown discrete-time (DT) nonlinear systems based on a radial basis function neural network (RBF-NN). In order to acquire the unknown system dynamics, we use two [...] Read more.
This paper proposes an optimal tracking control scheme through adaptive dynamic programming (ADP) for a class of partially unknown discrete-time (DT) nonlinear systems based on a radial basis function neural network (RBF-NN). In order to acquire the unknown system dynamics, we use two RBF-NNs; the first one is used to construct the identifier, and the other one is used to directly approximate the steady-state control input, where a novel adaptive law is proposed to update neural network weights. The optimal feedback control and the cost function are derived via feedforward neural network approximation, and a means of regulating the tracking error is proposed. The critic network and the actor network were trained online to obtain the solution of the associated Hamilton–Jacobi–Bellman (HJB) equation within the ADP framework. Simulations were carried out to verify the effectiveness of the optimal tracking control technique using the neural networks. Full article
(This article belongs to the Special Issue Advances in Nonlinear Analysis and Control)
Show Figures

Figure 1

Back to TopTop