1. Introduction
According to the latest China Mobile Source Environmental Management Annual Report (2024) [
1] issued by the Ministry of Ecology and Environment, the total pollutant emissions from mobile sources reached 19.246 million metric tons in 2023, with carbon monoxide (CO: 7.249 million tons), nitrogen oxides (NOx: 9.265 million tons), and particulate matter (PM: 0.268 million tons) constituting major components. Notably, vehicular emissions accounted for 72.2% of mobile source emissions (13.896 million tons), where diesel vehicles dominated critical pollutant profiles—contributing 87.8% of total automotive NOx emissions and over 99% of PM emissions. These emissions not only exacerbate climate change, but also pose serious public health risks—the World Health Organization (WHO) classifies diesel exhaust as a Group 1 carcinogen. Therefore, ambient air pollution from mobile sources has emerged as a critical challenge in China’s environmental governance. Reducing diesel-related pollutants is crucial for achieving green transportation and sustainable development goals.
Controlling pollution from diesel trucks is a key focus for the sustainable management of vehicle emissions. Although modern diesel vehicles are equipped with advanced emission control technologies at the manufacturing stage, sustaining their environmental performance requires continuous technical oversight during operation. The implementation of an Inspection and Maintenance (I/M) system is widely recognized as the most effective approach for regulating emissions from in-use vehicles [
2,
3]. Unlike typical malfunctions, diesel vehicle emission faults that exceed acceptable limits during routine emission tests often do not manifest noticeable anomalies in performance, fuel consumption rates, urea consumption rates, or other operational indicators. Specifically, these vehicles do not emit visible blue or black smoke from the exhaust, nor do they display any diagnostic fault codes. However, these faults are revealed when vehicles are subjected to testing under loaded deceleration conditions, where excessive exhaust emissions or insufficient power output that fails to meet regulatory standards become evident. Diagnosing such faults is particularly challenging, representing a critical aspect of the maintenance and regulation of vehicles that exceed emission limits [
4].
As the complexity of industrial systems continues to increase, fault diagnosis technology has become essential for ensuring equipment reliability, optimizing maintenance strategies, and mitigating production risks. This field has evolved from early methods grounded in physical models to contemporary data-driven intelligent diagnostic systems, gradually establishing a technological framework characterized by interdisciplinary integration and demonstrating substantial application value across various industrial scenarios [
5]. In manufacturing, fault diagnosis technology is extensively utilized for the precise identification of bearing faults. For example, Cheng, Y et al. [
6] developed an IESCFFOgram method that leverages candidate fault frequency optimization diagrams to effectively reveal hidden fault information within SCoh. This method adaptively identifies information-rich spectral frequency bands based on the determined candidate fault frequencies for bearing fault diagnosis. Additionally, Cheng, Y et al. [
7] introduced a candidate blind deconvolution method grounded in fault frequency (CFF) to enhance the fault features in bearings, wherein new indicators developed using CFF substitute the cyclic frequency in ICS2, thereby serving as a standard for guiding the deconvolution filter. The fault diagnosis of critical components in aircraft engines and spacecraft is vital for safety. Zhao, Y.P et al. [
8] introduced the following two cross-domain aircraft engine fault diagnosis methods: the One-Stage Transfer Learning ELM (OSTL-ELM) and the Two-Stage Transfer Learning ELM (TSTL-ELM). These approaches are based on Extreme Learning Machine (ELM), where network weights are computed through direct calculations rather than iterative processes, requiring only a limited amount of target domain data to achieve a high diagnostic accuracy. Research into aircraft engine fault detection is essential for ensuring safe and reliable operation. Furthermore, Wang, H.F et al. [
9] proposed a dynamic threshold method for aircraft engine fault detection utilizing Isolation Forest (iota Forest), which develops a fault detection model based solely on normal aircraft engine data for training. This approach addresses the challenge of limited access to comprehensive fault data for training, a common constraint in the field of aircraft engine fault detection. Wang, Z.Y et al. [
10] proposed a comprehensive fault diagnosis framework for rotating machinery that utilizes phase entropy. This framework integrates phase entropy as a feature extraction technique in conjunction with Twin Support Vector Machines (TSVMs) for classification purposes. The efficacy of the framework is demonstrated through its application to aircraft engine bearings, train drive systems, and aviation bearings.
Existing research on fault diagnosis for diesel vehicle emission control has predominantly focused on enhancing the diagnostic capabilities of on-board diagnostics (OBD) systems. OBD systems play a crucial role in tracking the emission performance of vehicles in service, acting as a valuable tool for managing and overseeing vehicle emissions [
11]. The inception of OBD can be traced back to its adoption in California, USA, for regulating automotive emissions. This initial system is now recognized as the OBD I version. Its capabilities were somewhat limited due to the absence of standardized protocols and monitoring limitations, primarily focusing on identifying faults associated with emissions [
12]. To address these limitations, the OBD II system was introduced in the United States [
13], which broadened the scope of component monitoring and established uniform standards for both hardware and software. Both the OBD I and OBD II systems facilitate wireless data transmission, utilizing this information to alert vehicle owners about the condition of their vehicles. However, it remains possible for owners to operate their vehicles despite the potential for high emissions. The OBD III system, also known as the remote OBD system, integrates OBD technology with wireless communication technology to transmit the operational data, emission status, and fault information of vehicles to a central database managed by regulatory authorities, enabling the real-time monitoring of in-service vehicles [
14]. The OBD II system continues to be a primary instrument for vehicle emission surveillance and fault diagnosis [
15], with numerous scholars conducting research on online vehicle diagnostics centered around the OBD II system. For instance, Zhao Xi [
16] developed a simulation model of an integrated post-treatment system using the GT-Power platform. This model employs parameters such as temperature differential, pressure differential, and NOx conversion efficiency to diagnose the following degradation modes: damage to the Selective Catalytic Reduction (SCR) carrier, the aging of SCR catalysts, and blockage of the SCR urea nozzle, utilizing information entropy fusion and neural network techniques. Similarly, Chen Renxiang et al. [
17] collected data from on-board sensors, including the engine speed and pressure differential from the Diesel Particulate Filter (DPF), yielding five data points. Through data fusion, they categorized the sample data into the following three types: pressure differential, temperature differential, and a combination of both, applying deep learning methodologies to facilitate DPF fault diagnosis. Zhu et al. [
18] further highlighted that variations in OBD monitoring conditions significantly affect the measurement of NOx conversion efficiency. They quantitatively analyzed NH
3 storage, NOx emissions, urea injection quantities, and OBD diagnostic conditions under World Harmonized Transient Cycle (WHTC) conditions, offering effective strategies for diagnosing degraded SCR systems. However, OBD diagnostic information primarily relies on signals from electronic control system sensors rather than direct measurements of particulate matter (PM) and NOx emissions. Due to the computational and storage limitations of Engine Control Unit (ECU) chips, diagnostic models are often relatively simplistic [
19]. Furthermore, the complexity of vehicle operating conditions tends to result in lenient thresholds for certain diagnostic parameters, thereby limiting the ability to identify only critically excessive emissions faults [
20]. Consequently, many existing OBD systems in National V diesel trucks fail to effectively diagnose issues such as injector blockage, cylinder carbon buildup, and the performance degradation of post-treatment systems [
21,
22].
As modern technology evolves, the complexity of diagnostic equipment and system structures continues to increase, with functionalities being upgraded and a growing diversity of application environments. Consequently, traditional on-board diagnostics (OBD) online fault diagnosis techniques are far from meeting the diagnostic needs of modern systems [
15]. With the advancement and application of signal acquisition and processing technologies, optimal control, and modern control theories, researchers have been progressively integrating the outcomes of artificial intelligence research with traditional OBD data stream analysis techniques for comprehensive application in the field of fault diagnosis [
23]. Fault diagnosis technology is evolving from its traditional, simplistic, and singular form towards an intelligent, efficient, and integrated direction in OBD data analysis [
24]. Currently, there are numerous successful application cases. Zhang, D.F et al. [
25] conducted research on the fault diagnosis of marine diesel engines. Utilizing the simulation software GT-Suite, a marine diesel engine diagnostic framework based on a self-adaptive genetic algorithm was constructed. An improved genetic algorithm was employed to optimize the weights and biases of Elman neural networks, thereby achieving the efficient and accurate classification of diesel engine faults, effectively diagnosing malfunctions occurring during the operation of marine diesel engines. Cho, I et al. [
26] experimentally characterized the particulate number (PN) emissions and on-board diagnostic (OBD) signals of diesel fuel vehicles resulting from three types of Diesel Particulate Filter (DPF) faults (cracking, melting, and hollowing). By utilizing OBD to read parameters such as air flow rate and boost pressure and combining these with PM emission values measured by an opacity meter, the study aimed to diagnose the following three types of damage modes of the DPF system: cracking, melting, and destruction. When employed for fault diagnosis, the application of on-board diagnostics (OBD) is constrained by its inherent monitoring strategy, which often fails to detect many deteriorating faults that exceed the diagnostic capabilities of OBD tools alone. Consequently, the integration of additional parameters and advanced diagnostic techniques is essential to achieve accurate and comprehensive fault diagnosis. Zawisa M [
27] proposed a novel diagnostic approach for mechanical damage in diesel engines by leveraging vibration signals, which are undetectable by conventional OBD diagnostic tools. This method specifically targets the issue of damage severity exceeding the threshold values defined within the OBD system. By incorporating engine vibration acoustic signals—an additional diagnostic parameter not available in OBD tools—a diagnostic system based on vibration acoustic signals was established. This system effectively supplements OBD in diagnosing mechanical damage to the engine, thereby enhancing the overall diagnostic capability. Furthermore, Hernández J C M et al. [
28] introduced an advanced diagnostic method for vibration signals based on multiscale permutation entropy and variability correlation analysis. This method integrates multiple sensor signals, including vibration signals and engine speed, to diagnose bearing faults in diesel engines. The K-Nearest Neighbor (KNN) classifier was employed for signal classification, ensuring robust and accurate fault detection. The effectiveness of this diagnostic method was rigorously validated using a comprehensive database of vibration signals from single-cylinder engine bearings under various operating conditions, demonstrating its potential for practical application in engine fault diagnosis.
In the face of the challenges that machine learning encounters in diagnostics [
29], the current common practice leverages the professional experience of domain experts and a theoretical framework to establish a theoretical foundation, upon which corresponding information systems are constructed [
30,
31]. Gharib, H et al. [
32] introduced a newly developed expert system that analyzes the various patterns and degrees of response of each parameter to changes in engine structural parameters, selecting diagnostic parameters aimed at diagnosing the technical condition and performance of marine diesel engines, which holds significant practical value. Lv, X. [
33] explored and studied a gasoline engine exhaust emission prediction system based on OBD data streams. By establishing an emission assessment system and utilizing vehicle status parameters to forecast the levels of pollutant emissions, it provides a reference for diagnosing vehicle faults. Additionally, this research offers rational advice for vehicle owners and service entities in vehicle maintenance and usage, and can serve as an effective supplement to the implementation of an Inspection and Maintenance (I/M) system. Kilagiz, Y et al. [
34] introduced a fuzzy expert system specifically developed to identify potential failures in fuel systems, faults in ignition systems, and malfunctions in intake and exhaust valves, while also offering recommended solutions for these issues. The system employs measurements of CO, HC, CO
2, O
2, and the air–fuel ratio (lambda) to provide decisions and recommendations grounded in a rule base consistent with expert opinions. Vinsonneau, J.A.F et al. [
35] applied a fault detection strategy to a Jaguar automotive engine, utilizing real data at multiple engine speeds to obtain an improved model of the subsystem, which comprised an intake system, manifold dynamics, and engine pumping. Additionally, the modeling of air leakage was considered, and three practical fault scenarios were evaluated.
In the absence of on-board diagnostics (OBD) fault codes, repair technicians often depend on data stream analysis for diagnostic purposes [
36]. However, maintenance stations (M stations) frequently lack the requisite equipment for conducting loaded deceleration testing, which complicates the precise control of testing conditions and hinders the replication of excessive emissions events. Moreover, data streams deliver processed information from the OBD system; therefore, the information obtained does not completely reflect the actual dynamic conditions of the vehicle. Consequently, data stream analysis is insufficient for diagnosing excessive emissions [
37].
The above analysis indicates that OBD (on-board diagnostics) systems monitor vehicle performance by evaluating the output signals generated by the ECU (Engine Control Unit) across different operating conditions. When the variation in these signals surpasses a predetermined threshold, the OBD system detects a fault within the vehicle’s systems. Although OBD has been demonstrated as an effective and established approach for diagnosing faults in critical engine components, including high-pressure common rail systems, SCR (Selective Catalytic Reduction) systems, and the intake and exhaust systems in diesel engines, it possesses significant limitations. Specifically, OBD may be ineffective in detecting faults when multiple engine components experience simultaneous degradation that does not reach the threshold required to activate an OBD alert. Furthermore, the monitoring scope of OBD systems is limited and does not encompass all components and systems within an engine. In light of these limitations, it is essential to leverage multi-source information and incorporate additional diagnostic parameters.
In tackling the issue of excessive emissions from diesel trucks, current research predominantly depends on a single data source, such as OBD systems or simulation models, and lacks the comprehensive integration of information from I-station loading deceleration tests, OBD data, and manual measurements conducted at M stations. Moreover, the inability to effectively integrate multidimensional information from various scenarios has led to an incomplete coverage of diagnostic parameters, complicating efforts to identify latent faults. Furthermore, fuzzy inference models that integrate multi-source information have not been extensively investigated. Existing diagnostic models primarily employ traditional threshold determinations or singular algorithms, which fail to adequately represent the complex fuzzy relationships between fault phenomena and their underlying causes. Additionally, previous studies have often concentrated on simulations or limited sample experiments, lacking validation against large-scale empirical maintenance data. Furthermore, no diagnostic tools have been developed for direct application within M stations, thereby limiting their practical utility. Therefore, this study adopts a tri-source information approach that integrates loaded deceleration emission testing conducted at inspection stations (I stations), data from the OBD system, and manual measurements collected at maintenance stations (M stations). By utilizing fuzzy diagnostic theory and techniques, we propose a fault diagnosis methodology for excessive emissions in operational National V diesel trucks (total mass ≥ 45,000 kg). This methodology significantly enhances diagnostic efficiency, thereby bolstering the industry’s capacity to repair and manage diesel trucks that exceed emission standards. Ultimately, the aim is to facilitate the effective implementation of Inspection and Maintenance (I/M) systems for in-use vehicles.
4. Results
4.1. Input Parameter Fuzzification
Given that the system input parameters consist of specific numerical values while the diagnostic parameter values in fuzzy diagnosis are represented as fuzzy sets with ambiguous boundaries, it is essential to establish appropriate membership functions to convey the degree of correspondence for these diagnostic parameters. In light of the evaluation criteria derived from the “In-Use Vehicle Inspection Report”, the NOx emission value, PM emission value, and power ratio are subject to the stipulations of the China GB 3847-2018 standard, which mandates that the NOx concentration must not exceed 1500 ppm, the opacity (light absorption coefficient) must remain below 1.2 m
−1, and the measured wheel-side power-to-rated power ratio must not fall below 0.4. Consequently, if the NOx emission level exceeds 1500 ppm, the PM emission level exceeds 1.2 m
−1, and the power ratio falls below 0.4, a high fuzzy membership degree is anticipated. Accordingly, the fuzzy set for the NOx and PM emission levels in this system is defined as {very high, high, relatively high, medium, relatively low, low, very low}, with associated fuzzy membership degrees of {1, 0.9, 0.7, 0.5, 0.3, 0.1, 0}. The fuzzy set for the power ratio is defined as {very high, high, medium, low, very low}, with corresponding membership degrees of {1, 0.7, 0.5, 0.3, 0}. The fuzzy sets for the intake volume ratio, exhaust back pressure, and common rail pressure ratio are derived from Equations (10)–(12) in
Section 3.3. Our study indicates that the affiliation function for each feature parameter is a segmented function, and its expression is detailed in
Appendix A, as shown in
Figure 7.
4.2. Fuzzy Relational Matrix
The precise establishment of the fuzzy relationship matrix is critical, as the degree of correspondence between the diagnostic parameter values and fault categories significantly influences the accuracy of the diagnostic outcomes. It is imperative to reference a substantial number of fault case examples pertaining to the target vehicle models, alongside the consolidated insights of maintenance experts.
Let the expert weight be
and the statistical weight be
. The diagnostic matrix can then be established as shown in Equation (13), as follows:
.
Based on the analysis and organization of actual diagnostic and maintenance data, combined with expert reviews from the maintenance industry, the fuzzy diagnostic matrix R is obtained. The system’s reasoning model is then formulated as shown in Equation (14).
In this example, the measured values of the diagnostic parameters are as follows:
The concentration of nitrogen oxides (NOx) at the 80% maximum power speed point is 367 ppm.
The highest of the smoke opacity values at the 100% and 80% maximum power speed points is 1.47 m−1.
The ratio of the measured wheel-side power to the rated engine power at the 100% maximum power speed point is 0.43.
The ratio of the actual intake air flow (or pressure) to the lower limit value of the intake air flow (or pressure) of a well-performing diesel engine of the same model under the same operating conditions is 0.97.
The exhaust back pressure is 6.8 bar.
The ratio of the actual common rail pressure to the lower limit value of the common rail pressure of a well-performing diesel engine of the same model under the same operating conditions is 0.51.
Based on the membership function , the vector is derived as [0, 1, 0.91, 0, 0, 0.97]. After normalization, the vector becomes [0, 0.3742, 0.3160, 0, 0, 0.3368]. Then, according to , the calculation result is = [0.2119, 0.4091, 0.1374, 0, 0.0887, 0.1630, 0].
4.3. Verification of Diagnostic Input Parameter Thresholds and Valid Values
Due to the fact that the NOx and PM emission values, as well as the ratio of measured wheel power to rated power, are derived from actual measurement results obtained through the load deceleration emission testing method and subsequently transmitted to the M station via the in-use motor vehicle emission inspection information system, errors in data collection during the inspection process, data transmission errors, or even incorrect input by users of the diagnostic system could potentially affect the accuracy of fault parameter values. Therefore, before conducting fuzzy diagnostics, it is essential to verify the system’s input data to ensure the validity of the information.
Parameters such as NOx and PM must fall within the thresholds specified by GB-3847. Data exceeding these thresholds are considered erroneous. The ratio of measured wheel power to rated power, taking into account the transmission losses of the dynamometer and normal mechanical transmission, should not exceed 0.8. The specific thresholds for each parameter are detailed in
Table 6.
- 2.
Verification of the Correlation of Input Parameters
Considering the mechanisms of particulate and nitrogen oxide (NOx) formation, achieving a good emission performance requires enhancing diesel spray quality and matching it with an appropriate gas flow to create a reasonable mixture velocity and spatial distribution of diesel and air. This facilitates well-organized premixed and diffusive combustion, optimizing the rate of temperature rise, peak value, and spatial distribution within the cylinder. Under these technical conditions, the engine burns sufficiently, providing a higher power output and maintaining good emission control levels. Consequently, if a vehicle’s emission test shows good PM and NOx emission values, the ratio of the engine’s measured wheel power to its rated power will also be relatively high.
Statistical analysis indicates that when the NOx emission level is below 400 ppm and the PM emission level is below 0.2 m−1, 99.5% of the samples demonstrate a power ratio greater than 0.52. Consequently, adhering to the 3σ principle, we establish the threshold for the power ratio at 0.5. Specifically, if the NOx emission level is below 400 ppm and the PM emission level is below 0.2 m−1, the power ratio should exceed 0.5; otherwise, an error in the input parameters is indicated.
For instance, if the system inputs are a NOx emission value of 150 ppm, PM emission value of 0.01 m−1, and power ratio of 0.37, the system will determine that the correlation verification between the parameters is non-compliant.