1. Introduction
Heat conduction problems are prevalent across various engineering applications, including mechanical engineering, architectural engineering, and the thermal management of electronic devices. The inverse problem of temperature fields [
1,
2], which involves deducing the distribution of heat sources and boundary conditions from known temperature data, is a critical technique for addressing these challenges. Traditional numerical simulation methods, particularly those based on finite difference methods (FDMs) [
3] and finite element methods (FEMs) [
4], often rely on comprehensive sensor networks and detailed boundary conditions. However, in practical scenarios, sensor deployment frequently suffers from sparsity, making it difficult to accurately capture the complete temperature field information, thereby complicating the solution of the temperature field inverse problem.
Classical methods for solving thermal inverse problems primarily encompass analytical methods [
5], regularization techniques [
6], optimization strategies [
7], Bayesian approaches [
8], and iterative inversion methods [
9]. Analytical methods yield precise solutions through mathematical derivation; in the fields of fluid flow and heat transfer, Fourier transforms, Green’s functions, and Laplace inversion [
10] are widely utilized. These methods typically apply to linear problems with simple models and regular boundary conditions [
11,
12]. For instance, in steady-state conduction problems where thermal conductivity varies linearly with temperature, inverse analysis can be employed for parameter estimation. Although analytical solutions can offer accurate results with clear physical interpretations, they are mainly constrained by stringent requirements regarding the linearity of the problem and geometric regularity, making them inadequate for complex nonlinear problems and irregular geometries.
Ill-posedness is an inherent challenge of inverse problems, where even minor data noise can lead to significant deviations in solutions. Regularization methods aim to stabilize solutions by introducing additional constraints, thereby enhancing robustness. Tikhonov regularization is one of the most established techniques, which smooths solutions by adding a penalty term to the least-squares objective function, achieving a balance between the data fitting error and the “smoothness” or “simplicity” of the solution [
13]. In the context of heat transfer, regularization methods are extensively employed to identify unknown parameters within mathematical models. For instance, when dealing with ill-posed problems, various regularization techniques can be selected with appropriately adjusted regularization parameters [
14]. However, the selection of these parameters often depends on empirical judgment, and inappropriate choices may lead to erroneous results.
Optimization methods convert inverse problems into optimization tasks, estimating unknown parameters by minimizing the residuals between observed data and model predictions. Prominent approaches include nonlinear least squares, constrained optimization, and variational methods [
15]. These methods are versatile and theoretically sound, finding broad applications in parameter estimation, imaging, and system identification. For example, in intricate heat transfer scenarios, they can optimize thermal performance designs [
16]. Nevertheless, these methods are susceptible to local optima and may struggle with multimodal problems, while computational efficiency is restricted by the problem’s scale and complexity [
17]. Bayesian methods, rooted in probability theory, treat unknown parameters as random variables, updating posterior probability distributions by synthesizing prior knowledge with observational data to quantify uncertainty. Techniques such as Markov Chain Monte Carlo (MCMC) and variational Bayes are commonly employed [
18]. This approach excels in managing complex models and quantifying uncertainties, as exemplified by predicting the energy consumption of HVAC systems using Bayesian networks [
19]. The primary challenges arise from the computational demands of high-dimensional parameter spaces and the intricacies involved in model construction. Iterative inversion methods approach the true solution through successive iterations, with prevalent algorithms including Landweber iteration, the Conjugate Gradient Method (CGM), and Gauss–Newton methods [
20]. These techniques are straightforward to implement, capable of addressing nonlinear problems, and are widely used in image reconstruction, bioelectrical impedance tomography (BIT), and seismic imaging. For example, when solving ill-posed linear inverse problems, a recently proposed second-order dynamic method (SODM) integrates Tikhonov regularization with second-order asymptotic regularization and employs a dual-parameter strategy, demonstrating superiority over conventional methods under noisy conditions [
21]. However, it is essential to note that this approach has slower convergence rates, sensitivity to initial values, and demands meticulous parameter tuning [
22].
With the advancement of machine learning and data-driven methodologies, data-based field reconstruction techniques have increasingly emerged as effective tools for addressing related challenges. These approaches effectively reconstruct accurate temperature field distributions by leveraging limited sensor data, integrated with dimensionality reduction techniques and model simplification strategies. Specifically, the Proper Orthogonal Decomposition (POD) method [
23] facilitates dimensionality reduction in temperature field data, extracting principal feature modes and thereby simplifying complex heat transfer problems. The Gappy POD method [
24] is an extension of POD designed to handle incomplete data. In heat transfer scenarios, complete temperature field data may not be obtainable due to sensor malfunctions or data transmission errors. Gappy POD allows for the reconstruction of the full temperature field from available partial data. For example, research conducted by Wang et al. [
25] involved deploying a limited number of sensors, combined with Computational Fluid Dynamics (CFD) simulations and the Gappy POD methodology, to reconstruct the temperature field distribution across an entire indoor space. Moreover, Xiao et al. [
26] addressed the severe missing data issues in meteorological observations by generating a POD basis using the WRF model and subsequently employing Gappy POD to fill in the gaps, successfully reconstructing the temperature field in the Tibetan region. The POD-RBF (Proper Orthogonal Decomposition–Radial Basis Function) method effectively reconstructs temperature fields with a small number of measurement points. This technique utilizes POD to extract key characteristics of the temperature field and employs RBF interpolation for complete temperature reconstruction, thus striking a balance between computational efficiency and accuracy. Research by Wang et al. [
27] demonstrated the use of the POD-RBF method to rapidly reconstruct the temperature field of printed circuit boards (PCBs) based on minimal sensor data, facilitating monitoring and control of thermal behavior in devices. The Multilayer Perceptron (MLP) represents a type of artificial neural network capable of learning nonlinear relationships between inputs and outputs. The POD-MLP method integrates POD dimensionality reduction with MLP neural networks, aiding in the estimation of physical system states, including thermal and flow conditions. Qi et al. [
28] leveraged the POD-MLP technique for real-time monitoring and reconstruction of the surface temperature field of batteries. The accuracy of data-driven field reconstruction methods significantly depends on the sparse sensor layout. A strategically designed sensor arrangement can not only enhance the precision of temperature field reconstruction but also effectively reduce data acquisition costs and computational burdens. However, many existing reconstruction methods overlook the influence of sparse sensor configurations, often resorting to uniform or random placement techniques, which can lead to considerable variability in reconstruction accuracy. Willcox et al. [
29] optimized the sparse sensor placement for the Gappy POD method using matrix stability criteria, specifically condition numbers, achieving commendable results; however, this approach demands iterative and repeated matrix computations, resulting in lower time efficiency. It also necessitates integration with optimization methods to accelerate the selection process in complex scenarios. Lauzon et al. [
30] introduced the S-OPT method, which quickly determines optimal sensor configurations by scanning orthogonal basis matrices to maximize an S-metric. Yet, this method proves sensitive to the dimensionality of the basis, requiring higher-order modes for accurate results in complex fields such as turbulence. Yuan et al. [
31] proposed a method utilizing the Pearson correlation coefficients of matrices to gauge the interrelationships between sensors and employed a threshold-based strategy to swiftly identify optimal sensor placement locations, known as the Correlation Coefficient Filtering Method (CCFM). This method enables sparse sensor layout optimization under low-modal and confined regional conditions.
Building upon the clustering-based dimensionality reduction method guided by pod structures proposed (C-POD) by Yuan et al. [
31], this paper introduces the Gappy C-POD approach, incorporating principles from inverse problem solving, and develops a multi-scale sample database alongside various sensor layout optimization strategies to facilitate its implementation. This study systematically compares the reconstruction pathways, accuracy, and robustness of four data-driven reconstruction methods—Gappy POD, Gappy C-POD, POD-RBF, and POD-MLP—across different modal truncation numbers, database sizes, and selection strategies, focusing on steady-state and transient heat conduction problems with multiple internal heat sources in two dimensions. The findings aim to provide theoretical foundations and methodological guidance for the engineering modeling and optimization design of temperature field inverse problems.