1. Introduction
The Fifth Plenary Session of the 19th Central Committee of the Communist Party of China pointed out that we must adhere to the core position of innovation in the overall modernization construction of our country, deeply implement the strategy of rejuvenating the country through science and education, the strategy of innovation driven development, improve the national innovation system, enhance the technological innovation capability of enterprises, and improve the system and mechanism of scientific and technological innovation [
1]. China should take the creation of an innovative country as the starting point, promote scientific and technological innovation from the aspects of institutional mechanisms, resource elements, etc., and rely on scientific and technological innovation to provide adjustments for the high-quality development of the Chinese economy. Scientific and technological innovation will become an inherent force to promote sustainable social and economic development [
2].
The 14th Five Year Plan for National Economic and Social Development of the People’s Republic of China and the 2035 Long Range Objectives Outline propose to accelerate the green transformation of development mode, adhere to ecological priority and green development, promote total resource management, scientific allocation, comprehensive conservation, and circular utilization, and work together to promote high-quality economic development and high-level ecological environment protection. As knowledge and technology-intensive industries, high-tech industries are important carriers for promoting high-quality economic development and leading technological innovation. Due to factors such as resource endowment, knowledge spillover, and government policies, the development of high-tech industries will form industrial agglomeration phenomena at a certain stage, specifically manifested as the agglomeration of high-tech enterprises, talents, technology, and other resources. The positive and negative externalities caused by agglomeration effects significantly affect changes in other factors, such as corporate innovation behavior, capital investment, and human resources [
3].
Due to their knowledge- and technology-intensive attributes, high-tech industries are an important force in China’s economic development. Developing high-tech industries is one of the important ways to accelerate the development of China’s modern industrial system and consolidate and strengthen the foundation of the real economy. The high-tech industry is an emerging industry based on high-tech, supported by innovation, and led by the development and production of high-tech products. The Beijing-Tianjin-Hebei region is rich in technological innovation resources and has a strong industrial foundation. It is one of the most dynamic and promising regions in China and also one of the three regions with great potential for economic growth. Beijing, located in the Beijing-Tianjin-Hebei region, serves as the political center, cultural center, international communication center, and science and technology innovation center of China, possessing abundant scientific and technological innovation resources and high-tech industries.
Therefore, exploring the relationship between the added value of the high-tech industry and its influencing factors in depth is crucial for optimizing investment allocation and maximizing its value. Most scholars use multiple linear regression [
4], a combination of correlation testing and linear regression [
5], and social network cohesion subgroup algorithms when studying the impact on the added value of high-tech industries. Feng Qing and Shi Xiongtian applied static and dynamic efficiency measurement models to study the overall and regional changes in innovation efficiency of China’s high-tech industries from 2013 to 2021 and analyzed the factors affecting innovation efficiency of high-tech industries [
6]. Yang Dandan and Luo Feng used the Trade Competitive Advantage (TC) index to evaluate the international competitiveness of high-tech industries in the Guangdong Province and analyzed the influencing factors of the international competitiveness of high-tech industries in the Guangdong Province by constructing a vector autoregression model based on the diamond model theory [
7]. Tang Bo, Chen Deguan, and others used POI data as a basis to explore the spatial evolution and influencing factors of high-tech industries in Foshan, Guangdong Province, using kernel density analysis, spatial autocorrelation, multiple linear regression, and other methods [
8]. Zhang Xiao and Ge Yuhui took high-tech industries in 28 provinces, municipalities, and autonomous regions in China as research objects, constructed a DEA-BCC model and a Malmquist index to conduct dynamic and static analysis of innovation efficiency, and established a Tobit panel model to deeply analyze the influencing factors of innovation efficiency [
9]. Zeng Wujia et al. conducted a study on the innovation efficiency and influencing factors of 103 national high-tech industrial development zones in China using panel data and a three-stage DEA Tobit analysis method [
10]. Li Jian et al. used the Super SBM model that considers unexpected output to measure the green innovation efficiency of high-tech industries in panel data from 30 provinces in China. Based on this, ArcGIS visualization tools were used to analyze their spatiotemporal differentiation characteristics, and the GMM dynamic panel model was used to test the influencing factors of green innovation efficiency [
11]. Zhang et al. used a two-stage DEA Tobit model to empirically analyze the five factors that affect the two-stage efficiency of industry university research collaborative innovation in different industries of high-tech industries [
12].
A review of the literature on the added value of the high-tech industry and its influencing factors reveals that scholars have rarely applied relevant models from grey system theory for analysis. Grey system theory, pioneered by Professor Deng Julong, is a novel approach to addressing uncertainty in small-data and information-deficient scenarios. It focuses on systems where some information is known while other parts remain unknown, extracting valuable insights from partial data to enable accurate description and analysis [
13]. Therefore, this study employs the ‘grey Lotka–Volterra model’ and the ‘grey differential dynamic multivariate model’, which are designed for analyzing multifactor relationships, to examine the interaction between the added value of Beijing’s high-tech industry and its influencing factors, as well as its development trends over the next five years. By leveraging limited short-term data, these models provide more precise predictive analysis.
The Lotka–Volterra model was originally used to analyze the interrelationships between biological populations. Scholars have introduced it into the grey system model to broaden the research field of the model, forming the grey Lotka–Volterra model [
14]. In terms of the application of the grey Lotka–Volterra model, Guo et al. used the grey Lotka–Volterra model to analyze the relationship between water resources, energy, industry, and technological innovation in the Beijing-Tianjin-Hebei region and proposed suggestions based on the analysis results [
15]. Mao et al. analyzed the relationship between China’s third-party payment system and online banking using the grey Lotka–Volterra model [
16]. Yuan Zhihua et al. applied the grey Lotka–Volterra model to the study of microbial synergy in biological uranium leaching, providing a theoretical basis for improving the efficiency of biological uranium leaching [
17]. Han Ruiwen et al. applied the Lotka–Volterra model to study and analyze the stability of data among large, medium, and small enterprises [
18]. Anindya S. Chakrabarti applied the Lotka–Volterra model equation to simulate the evolution of technological frontiers and demonstrated that the diffusion of technology has had a ripple effect in partner economies [
19]. Xia et al. introduced the Lotka–Volterra model to explore the internal structure of technological innovation in China’s high-tech industry from six indicators: independent innovation, technology introduction, research and development, technological innovation, external technology acquisition, and domestic technology procurement [
20]. Zhang et al. studied the competition and cooperation between technological innovation, resource consumption, environmental quality, and industrial development quality in the Shaanxi Province by constructing a four-dimensional grey Lotka–Volterra model and discussed the equilibrium point and stability of the four-dimensional grey Lotka–Volterra model [
21].
The fractional order cumulative grey Lotka–Volterra model (FGLV) is a combination of the grey Lotka–Volterra model and the fractional order cumulative generation operator. Compared with the original grey Lotka–Volterra model, the fractional order cumulative grey Lotka–Volterra model is more in line with the new information first principle and is more suitable for solving practical problems.
Recent research on fractional-order grey models has significantly advanced predictive accuracy in various domains. Duman and Kongar (2025) introduced the Hausdorff fractional NGBM (r,1), enhancing flexibility and computational efficiency by eliminating the Gamma function, and demonstrated superior predictive accuracy in e-waste forecasting [
22]. Li and Xie (2024) proposed a reduced-order discrete grey forecasting model, refining its internal mechanisms and simplifying modeling steps while ensuring high prediction accuracy, particularly in battery capacity degradation [
23]. Wu et al. (2024) developed a conformable fractional-order grey Bernoulli model optimized for energy consumption forecasting in Chongqing, outperforming traditional models [
24]. These advancements align with broader applications of fractional-order models, such as smoothed functional algorithms for Hammerstein models, reaction curve-based chemical process modeling, and fractional calculus in separation processes, reinforcing their efficacy in small-data prediction. The fractional-order Hammerstein model for identifying continuous time proposed by Mok, R., and Ahmad, M. A. [
25] and the general recognition method of fractional-order process based on process response curve proposed by Gude, J. J., and García Bringas, P. effectively demonstrate that the application of fractional-order accumulation operator has unique advantages in the prediction of low-data problems [
26].
The grey differential dynamic multivariate model is a prediction model in the grey multivariate model, among which the grey multivariate GM(1,
N) model is proposed to address the problem that the traditional GM(1,1) model cannot achieve accurate prediction of various influencing factors in real life. Xie Naiming and Liu Sifeng found that the GM(1,
N) model has low accuracy in long-term development prediction and thus proposed a discrete grey multivariate model [
27]. Wang et al. proposed a multivariate grey GM(1,
N) power model and its derivative models to improve the modeling performance of feature sequences in multivariate nonlinear systems [
28]. The Grey Differential Dynamic Model [
29] was initially proposed by Academician Xia Jun using Professor Deng Julong’s Grey System Theory as a hydrological system model, with the general form being DHGM(
n + 1,
l + 1). The grey differential dynamic multivariate model was proposed by Duan Huiming et al. based on the grey differential dynamic prediction model, considering the many factors that affect the main sequence, and extended it to apply multiple variables, greatly enriching the types of grey multivariate prediction models [
30]. Under this context, fractional grey forecasting model will be applied to optimize investment allocation for maximum value addition in Beijing’s high-tech industries.
This study makes the following key contributions:
Innovative Application of Grey System Models—This research applies the fractional-order cumulative grey Lotka–Volterra model to analyze the historical influence of four related systems on the added value of Beijing’s high-tech industry.
Enhanced Predictive Accuracy—By employing the grey differential dynamic multivariate model, this study provides a more precise forecast of future relationships between the high-tech industry and its influencing factors.
Strategic Investment Optimization—Based on the analytical results, this study offers strategic recommendations for optimizing investment allocation to maximize the future value of Beijing’s high-tech industry.
These contributions provide valuable insights for policymakers and investors in fostering the sustainable growth of high-tech industries.
4. Prediction Model
4.1. Grey Differential Dynamic Multivariate Model
The grey differential dynamic model was initially proposed by Xia Jun using Professor Deng Julong’s grey system idea [
29] as a hydrological system model [
32], and then improved by Duan Huiming et al. [
30] to a grey differential dynamic model that studies multiple variables. The main modeling process is as follows:
Step 1: Assuming
is a system feature
sequence and a sequence of related factors. Using a first-order accumulation generation operator to accumulate the original sequence, further weakening the randomness, and obtaining a stable first-order accumulation generation sequence:
Among them, the first-order accumulation generation operator is denoted as 1-AGO, and the accumulation formula is
The differential equation form of the grey differential dynamic model DHGM(2,M) obtained from the sequence generated by first-order accumulation is
Among them .
Step 2: Next,
compare the adjacent mean values
Among them
, the grey differential dynamic multivariate prediction model can be referred to as
Step 3: The parameters can be obtained by the least squares method. Let
be a parameter column, satisfying
,
Step 4: Based on the known
time response function of the obtained model,
The (20) integral in the formula is discretized using the trapezoidal formula to obtain the basic prediction formula of the model
Step 5: Restore the time response equation of the DHGM(2, M) model, and the predicted value obtained is
Step 6: Use the mean relative error (MAPE) and relative error (APE) values to test the prediction accuracy of the model. The calculation formulas are as follows:
When the simulation error and prediction error of the model are both below 5%, it indicates that the prediction performance of the model is good. To achieve the best prediction results, the parameter intervals are continuously and iteratively adjusted through the Particle Swarm Optimization (PSO) algorithm, searching for the optimal parameter solution to minimize the interval width of predicted values.
The specific modeling steps and parameter optimization process of the model are as follows:
Step 1: Data preprocessing and sequence generation
- (1)
Original sequence division: Divide the raw data into a training set and a testing set (e.g., training from 2016 to 2020, testing in 2021).
Define the relationship between the independent variable (Y) and the dependent variable (X).
- (2)
Accumulated generation sequence (1-AGO): Perform first-order accumulation generation (1-AGO) on the original sequence to eliminate data volatility and enhance sequence regularity through accumulation.
Step 2: Model parameter estimation
- (1)
Constructing differential equations.
- (2)
Using MAPE as the loss function, adjust parameters through PSO optimization method to minimize MAPE.
Step 3: Fitted values and error calculation
Generate fitting sequence: Substitute estimated parameters into the model and obtain a fitting value sequence through iterative reduction and restoration (1-AGO)
Then, calculate the error point by point, and calculate the average error of each point to obtain MAPE,
Step 4: Parameter iteration optimization, adjust parameter combinations through cross validation or grid search, observe the trend of MAPE changes, and select the parameter that minimizes MAPE.
4.2. Model Solving and Data Analysis
This article predicts the added value of high-tech industries based on the human capital system, taking the solution between total labor productivity and the added value of high-tech industries in the human capital system as an example. The relevant data are shown in
Table 16.
Based on the raw data in
Table 16, establish the DHGM(2,M) model. The specific process is as follows:
- (1)
The original data of added value of high-tech industries in Beijing from 2016 to 2021 are
- (2)
The original data of labor productivity for all employees in Beijing from 2016 to 2021 are
Then, the first-order accumulated sequence is obtained as
utilize
Then, the time response sequence can be obtained as
According to the time response equation, the first-order cumulative prediction sequence can be obtained as
Then, perform a first-order accumulation inverse operation to obtain the restored predicted value
Using data from 2016 to 2020 as the training set and 2021 as the validation set, obtain the training error of the model, MAPE = 0.003%, MAE = 0.28, RMSE = 0.28. The validation error of the model, MAPE = 0.016%, MAE = 1.7, MRSE = 1.7.
To visually highlight the advantages of the grey dynamic multivariate differential model in predicting high-tech industry added value, comparative predictions were conducted using multiple grey and non-grey models on the given dataset. The outcomes are presented below.
As evidenced by the comparative results in
Table 17, the DHGM(2,M) model exhibited superior performance, achieving a remarkably low prediction MAPE of 0.016%. By leveraging second-order derivatives to capture growth acceleration, this model aligns more closely with the exponential growth dynamics inherent to high-tech industries. Other grey models have slightly larger errors than the DHGM(2,M) model, while non-grey models have larger errors than most grey models. However, non grey models have certain problems in predicting short data sequences.
ARIMA relies entirely on the autocorrelation of historical data for modeling and does not incorporate external variables. When data are influenced by multiple factors, it fails to capture external shocks (e.g., policy changes, technological breakthroughs). Multivariate regression and machine learning models face limitations due to insufficient data—only six years of observations are inadequate to support the complex parameter estimation required for multivariate linear regression (MLR) or neural networks (e.g., BP, LSTM), leaving these models vulnerable to noise interference. Support Vector Machines (SVRs) and neural networks demand large datasets to learn nonlinear relationships. Under small-sample conditions, they cannot be adequately trained, resulting in poor generalization ability and an inability to adapt to the complex growth patterns of high-tech industries. Therefore, the DHGM(2,M) model was selected to forecast the added value of high-tech industries.
Similarly, the predicted value added of high-tech industries under alternative systems is illustrated in the table. The human capital system selected the labor productivity of all employees, a closely related factor to the added value of high-tech industries, as the influencing factor. The funding system selected R&D internal expenditure and government funds as the influencing factors, and the technology system selected the number of patent applications as the influencing factor. The trade system selected the import quantity of high-tech products as the influencing factor and used a grey differential dynamic multivariate prediction model for prediction. The influencing factor sequence used the GM(1,1) model to predict the data from 2022 to 2026, and then used the DHGM(2,M) model to predict the added value of high-tech industries in the next 5 years.
The total labor productivity of high-tech industries is defined as the value added of high-tech industrial enterprises divided by the average number of employees in the same period, reflecting the productivity of the enterprise, and thus the level of economic development and productivity of a region.
Government investment funds include government subsidies, equity investments, and other fiscal appropriations, representing the government’s emphasis on a certain industry and encouraging future development directions.
The import volume of high-tech products is indicative of demand for such goods, as well as the development speed and degree of openness of the high-tech industry.
As illustrated in
Table 18, the added value of high-tech industries in Beijing is influenced by multiple systems. The grey differential dynamic multivariate prediction model has been demonstrated to achieve relatively accurate predictions of the added value of high-tech industries. It can be expected that the added value of high-tech industries will continue to grow in the future under the influence of different systems.