Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (19)

Search Parameters:
Keywords = neural network sampler

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
25 pages, 4052 KB  
Article
Leveraging Neural Networks Trained with Scaled Conjugate Gradient for Enhanced VANET Performance in High-Mobility Environments
by Etienne Alain Feukeu
Network 2026, 6(2), 36; https://doi.org/10.3390/network6020036 - 27 May 2026
Viewed by 122
Abstract
Vehicular Ad Hoc Networks (VANETs) face significant challenges in high-mobility environments, where dynamic channel conditions, particularly Doppler Shift (DS), degrade communication reliability and increase latency, thereby undermining safety-critical applications. To address these limitations, this paper proposes a neural network (NN)-based link adaptation strategy [...] Read more.
Vehicular Ad Hoc Networks (VANETs) face significant challenges in high-mobility environments, where dynamic channel conditions, particularly Doppler Shift (DS), degrade communication reliability and increase latency, thereby undermining safety-critical applications. To address these limitations, this paper proposes a neural network (NN)-based link adaptation strategy trained using the Scaled Conjugate Gradient (SCG) algorithm. SCG is selected as a second-order approximation optimizer that leverages curvature information to produce well-conditioned weight updates particularly suited to the small, physics-constrained training dataset. The SCG-optimized model dynamically adjusts transmission parameters to mitigate DS effects, improving real-time adaptability by explicitly incorporating Doppler Shift as a key input feature. Simulation results demonstrate that the proposed approach outperforms both the conventional Auto Rate Fallback (ARF) method and the SampleRate baseline. Specifically, the SCG-based strategy achieves an overall throughput improvement of +34.6% relative to ARF (1.77 Mbps vs. 1.32 Mbps) across all tested conditions, with condition-specific gains of +16.1% at 5 Hz Doppler (0.9 km/h), +21.7% at 750 Hz (137.3 km/h), and +35.2% at 1500 Hz (274.6 km/h), while consistently reducing transmission duration. A formal ablation study confirms that the Doppler Shift feature alone contributes +67% to +78% throughput gain at high mobility (DS > 900 Hz) compared to an SNR-only model. The main contributions of this work are threefold: (i) the explicit integration of Doppler Shift as a first-class input feature for link adaptation; (ii) the application of SCG optimization for fast, stable training of a lightweight feedforward neural network on a compact, physics-constrained dataset; and (iii) the formal ablation study that isolates and quantifies the Doppler feature’s contribution, establishing that the performance gain is attributable to feature engineering rather than the neural network architecture alone. This approach offers a scalable, real-time solution for Doppler-resilient VANET link adaptation. Full article
Show Figures

Figure 1

31 pages, 2165 KB  
Article
Class Imbalance in IoMT Datasets: Evaluating Balancing Strategies for Learning-Based Attack Detection
by Eren Gencturk, Beste Ustubioglu, Guzin Ulutas and Iraklis Symeonidis
Appl. Sci. 2026, 16(10), 4921; https://doi.org/10.3390/app16104921 - 15 May 2026
Viewed by 465
Abstract
Internet of Medical Things (IoMT) devices are inherently vulnerable to cyberattacks, typically due to their limited processing power and memory capacity. Their widespread use in healthcare poses a significant security risk, threatening patient data privacy and the continuity of services. This study examines [...] Read more.
Internet of Medical Things (IoMT) devices are inherently vulnerable to cyberattacks, typically due to their limited processing power and memory capacity. Their widespread use in healthcare poses a significant security risk, threatening patient data privacy and the continuity of services. This study examines the effects of data imbalance correction and balancing strategies on the performance of machine and deep learning models using openly available IoMT datasets. In this context, four different balancing methods—RandomUnderSampler, SMOTE, Borderline-SMOTE, and ADASYN—were applied to three open-access IoMT datasets: ECU-IoHT, WUSTL, and CICIoMT2024. Performance analyses were conducted using five machine learning algorithms (AdaBoost, Logistic Regression, Random Forest, XGBoost, and K-Nearest Neighbor (KNN)) and two deep learning algorithms (Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN)). In the highly imbalanced binary setting of the CICIoMT2024 dataset, the combination of RandomUnderSampler and SMOTE under the balanced-training/original-testing scenario produced the strongest improvement in the binary CICIoMT2024 setting, increasing the F1-Score from the unbalanced baseline to 99.87% for Random Forest and 99.86% for XGBoost across repeated runs. However, the benefit of balancing was not universal. In datasets with stronger class separability, such as ECU-IoHT, and in several multi-class settings, the effect of balancing was limited or, in some cases, inferior to the unbalanced baseline. These findings indicate that balancing is most effective under specific conditions, particularly in highly imbalanced binary tasks, and should be validated using class-sensitive metrics rather than overall performance alone. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

28 pages, 6379 KB  
Article
Research on Lithium-Ion Battery Diaphragm Defect Detection Based on Transfer Learning-Integrated Modeling
by Lihua Ye, Xu Zhao, Zhou He, Zixing Zhang, Qinglong Zhao and Aiping Shi
Electronics 2025, 14(9), 1699; https://doi.org/10.3390/electronics14091699 - 22 Apr 2025
Cited by 1 | Viewed by 1347
Abstract
Ensuring the security and reliability of lithium-ion batteries necessitates the development of a robust methodology for detecting defects in battery separators during production. This study initially uses data augmentation techniques in the data processing phase, followed by the utilization of the weighted random [...] Read more.
Ensuring the security and reliability of lithium-ion batteries necessitates the development of a robust methodology for detecting defects in battery separators during production. This study initially uses data augmentation techniques in the data processing phase, followed by the utilization of the weighted random sampler method for sampling. Additionally, the dataset is partitioned using the Stratified K-Fold cross-validation method to tackle imbalanced sample data. Subsequently, an ensemble of object detection algorithms involving Faster Region Convolutional Neural Network and RetinaNet is developed. The ensemble method employs a voting mechanism to ascertain the most accurate predictions and utilizes the Adaptive Delta optimization algorithm with adaptive learning rates. This algorithm adjusts the learning rate based on parameter change rates, eliminating the requirement for setting an initial learning rate to ensure result convergence. Finally, a model fine-tuning technique using pre-training transfer learning is applied to improve the detection performance of the ensemble model. Experimental results show that the improved methodology demonstrates a 16.26% increase in recall, a 7.05% improvement in precision, an 11.83% rise in balanced F Score, and a 0.23 increase in the area under the Receiver Operating Characteristic curve. The study results indicate that the proposed method is an effective and accurate approach to detecting defects in lithium-ion battery separators. Full article
(This article belongs to the Special Issue 2D/3D Industrial Visual Inspection and Intelligent Image Processing)
Show Figures

Figure 1

20 pages, 8936 KB  
Article
A Minority Sample Enhanced Sampler for Crop Classification in Unmanned Aerial Vehicle Remote Sensing Images with Class Imbalance
by Jiapei Cheng, Liang Huang, Bohui Tang, Qiang Wu, Meiqi Wang and Zixuan Zhang
Agriculture 2025, 15(4), 388; https://doi.org/10.3390/agriculture15040388 - 12 Feb 2025
Cited by 4 | Viewed by 1420
Abstract
Deep learning techniques have become the mainstream approach for fine-grained crop classification in unmanned aerial vehicle (UAV) remote sensing imagery. However, a significant challenge lies in the long-tailed distribution of crop samples. This imbalance causes neural networks to focus disproportionately on majority class [...] Read more.
Deep learning techniques have become the mainstream approach for fine-grained crop classification in unmanned aerial vehicle (UAV) remote sensing imagery. However, a significant challenge lies in the long-tailed distribution of crop samples. This imbalance causes neural networks to focus disproportionately on majority class features during training, leading to biased decision boundaries and weakening model performance. We designed a minority sample enhanced sampling (MES) method with the goal of addressing the performance limitations that are caused by class imbalance in many crop classification models. The main principle of MES is to relate the re-sampling probability of each class to the sample pixel frequency, thereby achieving intensive re-sampling of minority classes and balancing the training sample distribution. Meanwhile, during re-sampling, data augmentation is performed on the sampled images to improve the generalization. MES is simple to implement, is highly adaptable, and can serve as a general-purpose sampler for semantic segmentation tasks, functioning as a plug-and-play component within network models. To validate the applicability of MES, experiments were conducted on four classic semantic segmentation networks. The results showed that MES achieved mIoU improvements of +1.54%, +4.14%, +2.44%, and +7.08% on the Dali dataset and +2.36%, +0.86%, +4.26%, and +2.75% on the Barley Remote Sensing Dataset compared with the respective benchmark models. Additionally, our hyperparameter sensitivity analysis confirmed the stability and reliability of the method. MES mitigates the impact of class imbalance on network performance, which facilitates the practical application of deep learning in fine-grained crop classification. Full article
(This article belongs to the Special Issue Applications of Remote Sensing in Agricultural Soil and Crop Mapping)
Show Figures

Figure 1

20 pages, 5331 KB  
Article
Visual Servoing for Aerial Vegetation Sampling Systems
by Zahra Samadikhoshkho and Michael G. Lipsett
Drones 2024, 8(11), 605; https://doi.org/10.3390/drones8110605 - 22 Oct 2024
Cited by 7 | Viewed by 2296
Abstract
This research describes a vision-based control strategy that employs deep learning for an aerial manipulation system developed for vegetation sampling in remote, dangerous environments. Vegetation sampling in such places presents considerable technical challenges such as equipment failures and exposure to hazardous elements. Controlling [...] Read more.
This research describes a vision-based control strategy that employs deep learning for an aerial manipulation system developed for vegetation sampling in remote, dangerous environments. Vegetation sampling in such places presents considerable technical challenges such as equipment failures and exposure to hazardous elements. Controlling aerial manipulation in unstructured areas such as forests remains a significant challenge because of uncertainty, complex dynamics, and the possibility of collisions. To overcome these issues, we offer a new image-based visual servoing (IBVS) method that uses knowledge distillation to provide robust, accurate, and adaptive control of the aerial vegetation sampler. A convolutional neural network (CNN) from a previous study is used to detect the grasp point, giving critical feedback for the visual servoing process. The suggested method improves the precision of visual servoing for sampling by using a learning-based approach to grip point selection and camera calibration error handling. Simulation results indicate the system can track and sample tree branches with minimum error, demonstrating that it has the potential to improve the safety and efficiency of aerial vegetation sampling. Full article
Show Figures

Figure 1

11 pages, 488 KB  
Article
A Deep Learning Approach to Investigating Clandestine Laboratories Using a GC-QEPAS Sensor
by Giorgio Felizzato, Nicola Liberatore, Sandro Mengali, Roberto Viola, Vittorio Moriggia and Francesco Saverio Romolo
Chemosensors 2024, 12(8), 152; https://doi.org/10.3390/chemosensors12080152 - 5 Aug 2024
Cited by 11 | Viewed by 3617
Abstract
Illicit drug production in clandestine laboratories involves the use of large quantities of different chemicals that can be obtained for legitimate purposes. The identification of these chemicals, including reagents, catalyzers and solvents, is crucial for forensic investigations. From a legal point of view, [...] Read more.
Illicit drug production in clandestine laboratories involves the use of large quantities of different chemicals that can be obtained for legitimate purposes. The identification of these chemicals, including reagents, catalyzers and solvents, is crucial for forensic investigations. From a legal point of view, a drug precursor is a material that is specific and critical to the production of a finished chemical and that constitutes a significant portion of the final molecular structure of the drug. In this study, a gas chromatography quartz-enhanced photoacoustic spectroscopy (GC-QEPAS) sensor—in conjunction with a deep learning model—was evaluated for its effectiveness in the detection and identification of interesting compounds for the production of amphetamine, methamphetamine, 3,4-methylenedioxymethamphetamine (MDMA), phenylcyclohexyl piperidine (PCP), and cocaine. The GC-QEPAS sensor includes a gas sampler, a fast GC for separation, and a QEPAS detector, which excites molecules exiting the GC column using a quantum cascade laser to provide the infra-red (IR) spectrum. The on-site capability of the GC-QEPAS system offers significant advantages over the current instruments employed in this field, including rapid analysis, which is crucial in field operations. This allows law enforcement to quickly identify specimens of interest on site. The system’s performance was validated by taking into account the limit of detection, repeatability, and within-laboratory reproducibility. The results showed excellent repeatability and reproducibility for both the GC and QEPAS modules. The deep learning model, a multilayer perceptron neural network, was trained using IR spectra and retention times, achieving very high classification accuracy in the testing conditions. This study demonstrated the efficacy of the GC-QEPAS sensor combined with a deep learning model for the reliable identification of drug precursors, providing a robust tool for law enforcement during criminal investigations in clandestine laboratories. Full article
(This article belongs to the Special Issue Chemical Sensing and Analytical Methods for Forensic Applications)
Show Figures

Figure 1

15 pages, 570 KB  
Article
LMD-DARTS: Low-Memory, Densely Connected, Differentiable Architecture Search
by Zhongnian Li, Yixin Xu, Peng Ying, Hu Chen, Renke Sun and Xinzheng Xu
Electronics 2024, 13(14), 2743; https://doi.org/10.3390/electronics13142743 - 12 Jul 2024
Cited by 2 | Viewed by 2269
Abstract
Neural network architecture search (NAS) technology is pivotal for designing lightweight convolutional neural networks (CNNs), facilitating the automatic search for network structures without requiring extensive prior knowledge. However, NAS is resource-intensive, consuming significant computational power and time due to the evaluation of numerous [...] Read more.
Neural network architecture search (NAS) technology is pivotal for designing lightweight convolutional neural networks (CNNs), facilitating the automatic search for network structures without requiring extensive prior knowledge. However, NAS is resource-intensive, consuming significant computational power and time due to the evaluation of numerous candidate architectures. To address the issues of high memory usage and slow search speed in traditional NAS algorithms, we propose the Low-Memory, Densely Connected, Differentiable Architecture Search (LMD-DARTS) algorithm. To expedite the updating speed of the optional operation weights during the search process, LMD-DARTS introduces a continuous strategy based on weight redistribution. Furthermore, to mitigate the influence of low-weight operations on classification results and reduce the number of searches, LMD-DARTS employs a dynamic sampler to prune underperforming operations during the search process, thereby lowering memory consumption and simplifying the complexity of individual searches. Additionally, to sparsify the dense connection matrix and mitigate redundant connections while maintaining optimal network performance, we introduce an adaptive downsampling search algorithm. Our experimental results show that the proposed LMD-DARTS achieves a remarkable 20% reduction in search time, along with a significant decrease in memory utilization within NAS process. Notably, the lightweight CNNs derived through this algorithm exhibit commendable classification accuracy, underscoring their effectiveness and efficiency for practical applications. Full article
Show Figures

Figure 1

19 pages, 4290 KB  
Article
Recurrent Neural Network with Finite Time Sampling for Dynamics Identification in Rehabilitation Robots
by Ahmed Alotaibi and Hajid Alsubaie
Mathematics 2023, 11(17), 3731; https://doi.org/10.3390/math11173731 - 30 Aug 2023
Cited by 1 | Viewed by 1733
Abstract
Rehabilitation robots can establish a direct connection between the user’s nerve signals and the robot’s actuators by integrating with the human nervous system. However, uncertainties in these systems limit their performance and accuracy. To address this challenge, the current study introduces an algorithm [...] Read more.
Rehabilitation robots can establish a direct connection between the user’s nerve signals and the robot’s actuators by integrating with the human nervous system. However, uncertainties in these systems limit their performance and accuracy. To address this challenge, the current study introduces an algorithm that effectively identifies and predicts unfamiliar dynamics in lower-limb rehabilitation robots. To accomplish this, the current study initially presents the dynamic model of a knee rehabilitation robot. Then, a finite time sampler is developed and the algorithm is proposed. In the proposed algorithm, the electromyographic signals are input into the rehabilitation robot. Via the use of a guaranteed stable sampler, samples from the unknown dynamics are extracted. By training the recurrent neural network with the acquired samples, the algorithm effectively learns and captures the underlying patterns of the unknown dynamics. The proposed recurrent neural network is enhanced with a self-attention mechanism, which plays a vital role in devising effective strategies for practical applications. Numerical simulation demonstrates the algorithm’s effectiveness, highlighting its excellent performance in identifying the system’s unknown dynamics. Full article
(This article belongs to the Special Issue Control Problem of Nonlinear Systems with Applications, 2nd Edition)
Show Figures

Figure 1

27 pages, 1590 KB  
Article
The Seven-League Scheme: Deep Learning for Large Time Step Monte Carlo Simulations of Stochastic Differential Equations
by Shuaiqiang Liu, Lech A. Grzelak and Cornelis W. Oosterlee
Risks 2022, 10(3), 47; https://doi.org/10.3390/risks10030047 - 23 Feb 2022
Cited by 7 | Viewed by 4707
Abstract
We propose an accurate data-driven numerical scheme to solve stochastic differential equations (SDEs), by taking large time steps. The SDE discretization is built up by means of the polynomial chaos expansion method, on the basis of accurately determined stochastic collocation (SC) points. By [...] Read more.
We propose an accurate data-driven numerical scheme to solve stochastic differential equations (SDEs), by taking large time steps. The SDE discretization is built up by means of the polynomial chaos expansion method, on the basis of accurately determined stochastic collocation (SC) points. By employing an artificial neural network to learn these SC points, we can perform Monte Carlo simulations with large time steps. Basic error analysis indicates that this data-driven scheme results in accurate SDE solutions in the sense of strong convergence, provided the learning methodology is robust and accurate. With a method variant called the compression–decompression collocation and interpolation technique, we can drastically reduce the number of neural network functions that have to be learned, so that computational speed is enhanced. As a proof of concept, 1D numerical experiments confirm a high-quality strong convergence error when using large time steps, and the novel scheme outperforms some classical numerical SDE discretizations. Some applications, here in financial option valuation, are also presented. Full article
(This article belongs to the Special Issue Risks: Feature Papers 2022)
Show Figures

Figure 1

22 pages, 5395 KB  
Article
Building Extraction from Remote Sensing Images with Sparse Token Transformers
by Keyan Chen, Zhengxia Zou and Zhenwei Shi
Remote Sens. 2021, 13(21), 4441; https://doi.org/10.3390/rs13214441 - 5 Nov 2021
Cited by 179 | Viewed by 11744
Abstract
Deep learning methods have achieved considerable progress in remote sensing image building extraction. Most building extraction methods are based on Convolutional Neural Networks (CNN). Recently, vision transformers have provided a better perspective for modeling long-range context in images, but usually suffer from high [...] Read more.
Deep learning methods have achieved considerable progress in remote sensing image building extraction. Most building extraction methods are based on Convolutional Neural Networks (CNN). Recently, vision transformers have provided a better perspective for modeling long-range context in images, but usually suffer from high computational complexity and memory usage. In this paper, we explored the potential of using transformers for efficient building extraction. We design an efficient dual-pathway transformer structure that learns the long-term dependency of tokens in both their spatial and channel dimensions and achieves state-of-the-art accuracy on benchmark building extraction datasets. Since single buildings in remote sensing images usually only occupy a very small part of the image pixels, we represent buildings as a set of “sparse” feature vectors in their feature space by introducing a new module called “sparse token sampler”. With such a design, the computational complexity in transformers can be greatly reduced over an order of magnitude. We refer to our method as Sparse Token Transformers (STT). Experiments conducted on the Wuhan University Aerial Building Dataset (WHU) and the Inria Aerial Image Labeling Dataset (INRIA) suggest the effectiveness and efficiency of our method. Compared with some widely used segmentation methods and some state-of-the-art building extraction methods, STT has achieved the best performance with low time cost. Full article
Show Figures

Graphical abstract

23 pages, 1894 KB  
Article
Conditional Variational Autoencoder for Learned Image Reconstruction
by Chen Zhang, Riccardo Barbano and Bangti Jin
Computation 2021, 9(11), 114; https://doi.org/10.3390/computation9110114 - 28 Oct 2021
Cited by 27 | Viewed by 9416
Abstract
Learned image reconstruction techniques using deep neural networks have recently gained popularity and have delivered promising empirical results. However, most approaches focus on one single recovery for each observation, and thus neglect information uncertainty. In this work, we develop a novel computational framework [...] Read more.
Learned image reconstruction techniques using deep neural networks have recently gained popularity and have delivered promising empirical results. However, most approaches focus on one single recovery for each observation, and thus neglect information uncertainty. In this work, we develop a novel computational framework that approximates the posterior distribution of the unknown image at each query observation. The proposed framework is very flexible: it handles implicit noise models and priors, it incorporates the data formation process (i.e., the forward operator), and the learned reconstructive properties are transferable between different datasets. Once the network is trained using the conditional variational autoencoder loss, it provides a computationally efficient sampler for the approximate posterior distribution via feed-forward propagation, and the summarizing statistics of the generated samples are used for both point-estimation and uncertainty quantification. We illustrate the proposed framework with extensive numerical experiments on positron emission tomography (with both moderate and low-count levels) showing that the framework generates high-quality samples when compared with state-of-the-art methods. Full article
(This article belongs to the Special Issue Inverse Problems with Partial Data)
Show Figures

Figure 1

13 pages, 25446 KB  
Article
Fully Learnable Model for Task-Driven Image Compressed Sensing
by Bowen Zheng, Jianping Zhang, Guiling Sun and Xiangnan Ren
Sensors 2021, 21(14), 4662; https://doi.org/10.3390/s21144662 - 7 Jul 2021
Cited by 5 | Viewed by 3130
Abstract
This study primarily investigates image sensing at low sampling rates with convolutional neural networks (CNN) for specific applications. To improve the image acquisition efficiency in energy-limited systems, this study, inspired by compressed sensing, proposes a fully learnable model for task-driven image-compressed sensing (FLCS). [...] Read more.
This study primarily investigates image sensing at low sampling rates with convolutional neural networks (CNN) for specific applications. To improve the image acquisition efficiency in energy-limited systems, this study, inspired by compressed sensing, proposes a fully learnable model for task-driven image-compressed sensing (FLCS). The FLCS, based on Deep Convolution Generative Adversarial Networks (DCGAN) and Variational Auto-encoder (VAE), divides the image-compressed sensing model into three learnable parts, i.e., the Sampler, the Solver and the Rebuilder. To be specific, a measurement matrix suitable for a type of image is obtained by training the Sampler. The Solver calculates the image’s low-dimensional representation with the measurements. The Rebuilder learns a mapping from the low-dimensional latent space to the image space. All the mentioned could be trained jointly or individually for a range of application scenarios. The pre-trained FLCS reconstructs images with few iterations for task-driven compressed sensing. As indicated from the experimental results, compared with existing approaches, the proposed method could significantly improve the reconstructed images’ quality while decreasing the running time. This study is of great significance for the application of image-compressed sensing at low sampling rates. Full article
(This article belongs to the Special Issue Neural Networks and Deep Learning in Image Sensing)
Show Figures

Figure 1

10 pages, 1891 KB  
Article
Deep Learning Methods for Improving Pollen Monitoring
by Elżbieta Kubera, Agnieszka Kubik-Komar, Krystyna Piotrowska-Weryszko and Magdalena Skrzypiec
Sensors 2021, 21(10), 3526; https://doi.org/10.3390/s21103526 - 19 May 2021
Cited by 22 | Viewed by 5931
Abstract
The risk of pollen-induced allergies can be determined and predicted based on data derived from pollen monitoring. Hirst-type samplers are sensors that allow airborne pollen grains to be detected and their number to be determined. Airborne pollen grains are deposited on adhesive-coated tape, [...] Read more.
The risk of pollen-induced allergies can be determined and predicted based on data derived from pollen monitoring. Hirst-type samplers are sensors that allow airborne pollen grains to be detected and their number to be determined. Airborne pollen grains are deposited on adhesive-coated tape, and slides are then prepared, which require further analysis by specialized personnel. Deep learning can be used to recognize pollen taxa based on microscopic images. This paper presents a method for recognizing a taxon based on microscopic images of pollen grains, allowing the pollen monitoring process to be automated. In this research, a deep CNN (convolutional neural network) model was built from scratch. Publicly available deep neural network models, pre-trained on image data (not including microscopic pictures), were also used. The results show that even a simple deep learning model produces quite good results when the classification of pollen grain taxa is performed directly from the images. The best deep learning model achieved 97.88% accuracy in the difficult task of recognizing three types of pollen grains (birch, alder, and hazel) with similar structures. The derived models can be used to build a system to support pollen monitoring experts in their work. Full article
(This article belongs to the Special Issue Analytics and Applications of Audio and Image Sensing Techniques)
Show Figures

Figure 1

18 pages, 1871 KB  
Article
A Neural Network MCMC Sampler That Maximizes Proposal Entropy
by Zengyi Li, Yubei Chen and Friedrich T. Sommer
Entropy 2021, 23(3), 269; https://doi.org/10.3390/e23030269 - 25 Feb 2021
Cited by 7 | Viewed by 4858
Abstract
Markov Chain Monte Carlo (MCMC) methods sample from unnormalized probability distributions and offer guarantees of exact sampling. However, in the continuous case, unfavorable geometry of the target distribution can greatly limit the efficiency of MCMC methods. Augmenting samplers with neural networks can potentially [...] Read more.
Markov Chain Monte Carlo (MCMC) methods sample from unnormalized probability distributions and offer guarantees of exact sampling. However, in the continuous case, unfavorable geometry of the target distribution can greatly limit the efficiency of MCMC methods. Augmenting samplers with neural networks can potentially improve their efficiency. Previous neural network-based samplers were trained with objectives that either did not explicitly encourage exploration, or contained a term that encouraged exploration but only for well structured distributions. Here we propose to maximize proposal entropy for adapting the proposal to distributions of any shape. To optimize proposal entropy directly, we devised a neural network MCMC sampler that has a flexible and tractable proposal distribution. Specifically, our network architecture utilizes the gradient of the target distribution for generating proposals. Our model achieved significantly higher efficiency than previous neural network MCMC techniques in a variety of sampling tasks, sometimes by more than an order magnitude. Further, the sampler was demonstrated through the training of a convergent energy-based model of natural images. The adaptive sampler achieved unbiased sampling with significantly higher proposal entropy than a Langevin dynamics sample. The trained sampler also achieved better sample quality. Full article
Show Figures

Figure 1

17 pages, 9463 KB  
Article
Visual Tracking Using Wang–Landau Reinforcement Sampler
by Dokyeong Kwon and Junseok Kwon
Appl. Sci. 2020, 10(21), 7780; https://doi.org/10.3390/app10217780 - 3 Nov 2020
Viewed by 2283
Abstract
In this study, we present a novel tracking system, in which the tracking accuracy can be considerably enhanced by state prediction. Accordingly, we present a new Q-learning-based reinforcement method, augmented by Wang–Landau sampling. In the proposed method, reinforcement learning is used to predict [...] Read more.
In this study, we present a novel tracking system, in which the tracking accuracy can be considerably enhanced by state prediction. Accordingly, we present a new Q-learning-based reinforcement method, augmented by Wang–Landau sampling. In the proposed method, reinforcement learning is used to predict a target configuration for the subsequent frame, while Wang–Landau sampler balances the exploitation and exploration degrees of the prediction. Our method can adapt to control the randomness of policy, using statistics on the number of visits in a particular state. Thus, our method considerably enhances conventional Q-learning algorithm performance, which also enhances visual tracking performance. Numerical results demonstrate that our method substantially outperforms other state-of-the-art visual trackers and runs in realtime because our method contains no complicated deep neural network architectures. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Back to TopTop