Journal Description
AI is an international, peer-reviewed, open access journal on artificial intelligence (AI), including broad aspects of cognition and reasoning, perception and planning, machine learning, intelligent robotics, and applications of AI, published monthly online by MDPI.
- Open Access: free for readers, with article processing charges (APC) paid by authors or their institutions.
- High Visibility: indexed within ESCI (Web of Science), Scopus, EBSCO, and other databases.
- Journal Rank: JCR - Q1 (Computer Science, Interdisciplinary Applications) / CiteScore - Q2 (Artificial Intelligence)
- Rapid Publication: manuscripts are peer-reviewed and a first decision is provided to authors approximately 20.7 days after submission; acceptance to publication takes 3.9 days (median values for papers published in this journal in the first half of 2025).
- Recognition of Reviewers: APC discount vouchers, optional signed peer review, and reviewer names published annually in the journal.
Impact Factor: 5.0 (2024); 5-Year Impact Factor: 4.6 (2024)
Latest Articles
Intelligent Decision-Making Analytics Model Based on MAML and Actor–Critic Algorithms
AI 2025, 6(9), 231; https://doi.org/10.3390/ai6090231 - 14 Sep 2025
Abstract
Traditional Reinforcement Learning (RL) struggles in dynamic decision-making due to data dependence, limited generalization, and imbalanced subjective/objective factors. This paper proposes an intelligent model combining the Model-Agnostic Meta-Learning (MAML) framework with the Actor–Critic algorithm to address these limitations. The model integrates the AHP-CRITIC weighting method to quantify strategic weights from both subjective expert experience and objective data, achieving balanced decision rationality. The MAML mechanism enables rapid generalization with minimal samples in dynamic environments via cross-task parameter optimization, drastically reducing retraining costs upon environmental changes. Evaluated on enterprise indicator anomaly decision-making, the model achieves significantly higher task reward values than traditional Actor–Critic, PG, and DQN using only 10–20 samples. It improves time efficiency by up to 97.23%. A proposed Balanced Performance Index confirms superior stability and adaptability. Currently integrated into an enterprise platform, the model provides efficient support for dynamic, complex scenarios. This research offers an innovative solution for intelligent decision-making under data scarcity and subjective-objective conflicts, demonstrating both theoretical value and practical potential.
Full article
(This article belongs to the Section AI Systems: Theory and Applications)
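As a rough illustration of the meta-learning mechanism described in the abstract, the sketch below shows a generic MAML inner/outer loop on a toy task family; the AHP-CRITIC weighting, the Actor–Critic policy, and the enterprise decision tasks from the paper are not reproduced, and all hyperparameters are hypothetical.
```python
# Minimal MAML-style meta-learning sketch (illustrative only; not the paper's model).
import torch

def init_params():
    # A tiny two-layer network stored as explicit tensors, so per-task adapted
    # copies of the weights can be taken in the inner loop.
    return [(torch.randn(4, 32) * 0.1).requires_grad_(),
            torch.zeros(32, requires_grad=True),
            (torch.randn(32, 1) * 0.1).requires_grad_(),
            torch.zeros(1, requires_grad=True)]

def forward(params, X):
    W1, b1, W2, b2 = params
    return (torch.relu(X @ W1 + b1) @ W2 + b2).squeeze(-1)

def sample_task():
    # Hypothetical task family: each task is a random linear target.
    w = torch.randn(4)
    X_s, X_q = torch.randn(16, 4), torch.randn(16, 4)
    return (X_s, X_s @ w), (X_q, X_q @ w)

params = init_params()
meta_opt = torch.optim.Adam(params, lr=1e-3)
inner_lr, mse = 0.05, torch.nn.functional.mse_loss

for step in range(200):
    meta_loss = 0.0
    for _ in range(4):                        # small batch of tasks per meta-step
        (Xs, ys), (Xq, yq) = sample_task()
        # Inner loop: one adaptation step on the task's support set.
        grads = torch.autograd.grad(mse(forward(params, Xs), ys),
                                    params, create_graph=True)
        adapted = [p - inner_lr * g for p, g in zip(params, grads)]
        # Outer objective: loss of the adapted parameters on the query set.
        meta_loss = meta_loss + mse(forward(adapted, Xq), yq)
    meta_opt.zero_grad()
    meta_loss.backward()
    meta_opt.step()
```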
Open Access Article
Toward Reliable Models for Distinguishing Epileptic High-Frequency Oscillations (HFOs) from Non-HFO Events Using LSTM and Pre-Trained OWL-ViT Vision–Language Framework
by Sahbi Chaibi and Abdennaceur Kachouri
AI 2025, 6(9), 230; https://doi.org/10.3390/ai6090230 - 14 Sep 2025
Abstract
Background: Over the past two decades, high-frequency oscillations (HFOs) between 80 and 500 Hz have emerged as valuable biomarkers for delineating and tracking epileptogenic brain networks. However, inspecting HFO events in lengthy EEG recordings remains a time-consuming visual process and mainly relies on experienced clinicians. Extensive recent research has emphasized the value of introducing deep learning (DL) and generative AI (GenAI) methods to automatically identify epileptic HFOs in iEEG signals. Owing to the ongoing issue of the noticeable incidence of spurious or false HFOs, a key question remains: which model is better able to distinguish epileptic HFOs from non-HFO events, such as artifacts and background noise? Methods: In this regard, our study addresses two main objectives: (i) proposing a novel HFO classification approach using a prompt engineering framework with OWL-ViT, a state-of-the-art large vision–language model designed for multimodal image understanding guided by optimized natural language prompts; and (ii) comparing a range of existing deep learning and generative models, including our proposed one. Main results: Notably, our quantitative and qualitative analysis demonstrated that the LSTM model achieved the highest classification accuracy of 99.16% among the time-series methods considered, while our proposed method consistently performed best among the different approaches based on time–frequency representation, achieving an accuracy of 99.07%. Conclusions and significance: The present study highlights the effectiveness of LSTM and prompted OWL-ViT models in distinguishing genuine HFOs from spurious non-HFO oscillations with respect to the gold-standard benchmark. These advancements constitute a promising step toward more reliable and efficient diagnostic tools for epilepsy.
Full article
(This article belongs to the Section Medical & Healthcare AI)
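The time-series branch of the comparison can be pictured with a minimal LSTM classifier over fixed-length signal windows, as sketched below; the layer sizes, window length, and preprocessing are assumptions, not the authors' configuration.
```python
# Minimal LSTM classifier sketch for fixed-length signal windows (hypothetical
# hyperparameters; not the authors' exact architecture or data pipeline).
import torch
import torch.nn as nn

class HFOClassifier(nn.Module):
    def __init__(self, hidden=64):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, 2)   # HFO vs. non-HFO (artifact/background)

    def forward(self, x):                  # x: (batch, time, 1)
        _, (h_n, _) = self.lstm(x)
        return self.head(h_n[-1])          # logits from the last hidden state

# Toy usage with random data standing in for band-pass-filtered iEEG windows.
model = HFOClassifier()
windows = torch.randn(8, 512, 1)           # e.g., 8 windows of 512 samples
probs = torch.softmax(model(windows), dim=-1)
```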
Open Access Article
GLNet-YOLO: Multimodal Feature Fusion for Pedestrian Detection
by Yi Zhang, Qing Zhao, Xurui Xie, Yang Shen, Jinhe Ran, Shu Gui, Haiyan Zhang, Xiuhe Li and Zhen Zhang
AI 2025, 6(9), 229; https://doi.org/10.3390/ai6090229 - 12 Sep 2025
Abstract
In the field of modern computer vision, pedestrian detection technology holds significant importance in applications such as intelligent surveillance, autonomous driving, and robot navigation. However, single-modal images struggle to achieve high-precision detection in complex environments. To address this, this study proposes a GLNet-YOLO framework based on cross-modal deep feature fusion, aiming to improve pedestrian detection performance in complex environments by fusing feature information from visible light and infrared images. By extending the YOLOv11 architecture, the framework adopts a dual-branch network structure to process visible light and infrared modal inputs, respectively, and introduces the FM module to realize global feature fusion and enhancement, as well as the DMR module to accomplish local feature separation and interaction. Experimental results show that on the LLVIP dataset, compared to the single-modal YOLOv11 baseline, our fused model improves the mAP@50 by 9.2% over the visible-light-only model and 0.7% over the infrared-only model. This significantly improves the detection accuracy under low-light and complex background conditions and enhances the robustness of the algorithm, and its effectiveness is further verified on the KAIST dataset.
Full article
(This article belongs to the Special Issue Deep Learning Technologies and Their Applications in Image Processing, Computer Vision, and Computational Intelligence)
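A generic dual-branch fusion of visible and infrared feature maps, in the spirit of the framework above, might look like the sketch below; the paper's FM and DMR modules are replaced here by simple concatenation and a 1x1 convolution, so this is only a stand-in for the described architecture.
```python
# Generic dual-branch (visible + infrared) feature fusion sketch, not GLNet-YOLO itself.
import torch
import torch.nn as nn

class DualBranchFusion(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        def branch(cin):
            return nn.Sequential(nn.Conv2d(cin, channels, 3, stride=2, padding=1),
                                 nn.BatchNorm2d(channels), nn.SiLU())
        self.rgb_branch = branch(3)   # visible-light input
        self.ir_branch = branch(1)    # infrared input
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, rgb, ir):
        f = torch.cat([self.rgb_branch(rgb), self.ir_branch(ir)], dim=1)
        return self.fuse(f)           # fused features would feed a detection head

rgb = torch.randn(2, 3, 256, 256)
ir = torch.randn(2, 1, 256, 256)
features = DualBranchFusion()(rgb, ir)   # -> (2, 64, 128, 128)
```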
Open Access Article
Beyond DOM: Unlocking Web Page Structure from Source Code with Neural Networks
by Irfan Prazina, Damir Pozderac and Vensada Okanović
AI 2025, 6(9), 228; https://doi.org/10.3390/ai6090228 - 12 Sep 2025
Abstract
We introduce a code-only approach for modeling web page layouts directly from their source code (HTML and CSS only), bypassing rendering. Our method employs a neural architecture with specialized encoders for style rules, CSS selectors, and HTML attributes. These encodings are then aggregated in another neural network that integrates hierarchical context (sibling and ancestor information) to form rich representational vectors for each web page’s element. Using these vectors, our model predicts eight spatial relationships between pairs of elements, focusing on edge-based proximity in a multilabel classification setup. For scalable training, labels are automatically derived from the Document Object Model (DOM) data for each web page, but the model operates independently of the DOM during inference. During inference, the model does not use bounding boxes or any information found in the DOM; instead, it relies solely on the source code as input. This approach facilitates structure-aware visual analysis in a lightweight and fully code-based way. Our model demonstrates alignment with human judgment in the evaluation of web page similarity, suggesting that code-only layout modeling offers a promising direction for scalable, interpretable, and efficient web interface analysis. The evaluation metrics show our method yields similar performance despite relying on less information.
Full article
Open Access Review
Artificial Intelligence in Medical Education: A Narrative Review on Implementation, Evaluation, and Methodological Challenges
by Annalisa Roveta, Luigi Mario Castello, Costanza Massarino, Alessia Francese, Francesca Ugo and Antonio Maconi
AI 2025, 6(9), 227; https://doi.org/10.3390/ai6090227 - 11 Sep 2025
Abstract
Artificial Intelligence (AI) is rapidly transforming medical education by enabling adaptive tutoring, interactive simulation, diagnostic enhancement, and competency-based assessment. This narrative review explores how AI has influenced learning processes in undergraduate and postgraduate medical training, focusing on methodological rigor, educational impact, and implementation challenges. The literature reveals promising results: large language models can generate didactic content and foster academic writing; AI-driven simulations enhance decision-making, procedural skills, and interprofessional communication; and deep learning systems improve diagnostic accuracy in visually intensive tasks such as radiology and histology. Despite promising findings, the existing literature is methodologically heterogeneous. A minority of studies use controlled designs, while the majority focus on short-term effects or are confined to small, simulated cohorts. Critical limitations include algorithmic opacity, generalizability concerns, ethical risks (e.g., GDPR compliance, data bias), and infrastructural barriers, especially in low-resource contexts. Additionally, the unregulated use of AI may undermine critical thinking, foster cognitive outsourcing, and compromise pedagogical depth if not properly supervised. In conclusion, AI holds substantial potential to enhance medical education, but its integration requires methodological robustness, human oversight, and ethical safeguards. Future research should prioritize multicenter validation, longitudinal evaluation, and AI literacy for learners and educators to ensure responsible and sustainable adoption.
Full article
(This article belongs to the Special Issue Exploring the Use of Artificial Intelligence in Education)
Open Access Systematic Review
Retrieval-Augmented Generation (RAG) in Healthcare: A Comprehensive Review
by Fnu Neha, Deepshikha Bhati and Deepak Kumar Shukla
AI 2025, 6(9), 226; https://doi.org/10.3390/ai6090226 - 11 Sep 2025
Abstract
Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by integrating external knowledge retrieval to improve factual consistency and reduce hallucinations. Despite growing interest, its use in healthcare remains fragmented. This paper presents a Systematic Literature Review (SLR) following PRISMA guidelines, synthesizing 30 peer-reviewed studies on RAG in clinical domains and focusing on three of its most prevalent and promising applications: diagnostic support, electronic health record (EHR) summarization, and medical question answering. We synthesize the existing architectural variants (naïve, advanced, and modular) and examine their deployment across these applications. Persistent challenges are identified, including retrieval noise (irrelevant or low-quality retrieved information), domain shift (performance degradation when models are applied to data distributions different from their training set), generation latency, and limited explainability. Evaluation strategies are compared using both standard metrics and clinical-specific metrics (FactScore, RadGraph-F1, and MED-F1), the latter being particularly critical for ensuring factual accuracy, medical validity, and clinical relevance. This synthesis offers a domain-focused perspective to guide researchers, healthcare providers, and policymakers in developing reliable, interpretable, and clinically aligned AI systems, laying the groundwork for future innovation in RAG-based healthcare solutions.
Full article
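The retrieve-then-generate pattern surveyed above can be pictured with a minimal sketch: TF-IDF retrieval over a toy document store feeding a placeholder generation call. The documents, the generate stub, and the prompt format are all hypothetical.
```python
# Minimal retrieval-augmented generation loop (illustrative only; the clinical
# corpora and LLMs reviewed in the paper are not reproduced here).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = [
    "Metformin is a first-line therapy for type 2 diabetes.",
    "ACE inhibitors are commonly used to treat hypertension.",
    "Warfarin dosing requires regular INR monitoring.",
]

vectorizer = TfidfVectorizer()
doc_matrix = vectorizer.fit_transform(documents)

def retrieve(query, k=2):
    # Rank documents by cosine similarity to the query in TF-IDF space.
    scores = cosine_similarity(vectorizer.transform([query]), doc_matrix)[0]
    return [documents[i] for i in scores.argsort()[::-1][:k]]

def generate(prompt):
    # Placeholder standing in for any LLM call (hosted or local).
    return f"[LLM answer conditioned on a prompt of {len(prompt)} characters]"

query = "How should warfarin therapy be monitored?"
context = "\n".join(retrieve(query))
answer = generate(f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")
print(answer)
```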
Open Access Systematic Review
Advances and Optimization Trends in Photovoltaic Systems: A Systematic Review
by Luis Angel Iturralde Carrera, Gendry Alfonso-Francia, Carlos D. Constantino-Robles, Juan Terven, Edgar A. Chávez-Urbiola and Juvenal Rodríguez-Reséndiz
AI 2025, 6(9), 225; https://doi.org/10.3390/ai6090225 - 10 Sep 2025
Abstract
This article presents a systematic review of optimization methods applied to enhance the performance of photovoltaic (PV) systems, with a focus on critical challenges such as system design and spatial layout, maximum power point tracking (MPPT), energy forecasting, fault diagnosis, and energy management. The emphasis is on the integration of classical and algorithmic approaches. Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines (PRISMA) methodology, 314 relevant publications from 2020 to 2025 were analyzed to identify current trends, methodological advances, and practical applications in the optimization of PV performance. The principal novelty of this review lies in its integrative critical analysis, which systematically contrasts the applicability, performance, and limitations of deterministic classical methods with emerging stochastic metaheuristic and data-driven artificial intelligence (AI) techniques, highlighting the growing dominance of hybrid models that synergize their strengths. Traditional techniques such as analytical modeling, numerical simulation, linear and dynamic programming, and gradient-based methods are examined in terms of their efficiency and scope. In parallel, the study evaluates the growing adoption of metaheuristic algorithms, including particle swarm optimization, genetic algorithms, and ant colony optimization, as well as machine learning (ML) and deep learning (DL) models applied to tasks such as MPPT, spatial layout optimization, energy forecasting, and fault diagnosis. A key contribution of this review is the identification of hybrid methodologies that combine metaheuristics with ML/DL models, demonstrating superior results in energy yield, robustness, and adaptability under dynamic conditions. The analysis highlights both the strengths and limitations of each paradigm, emphasizing challenges related to data availability, computational cost, and model interpretability. Finally, the study proposes future research directions focused on explainable AI, real-time control via edge computing, and the development of standardized benchmarks for performance evaluation. The findings contribute to a deeper understanding of current capabilities and opportunities in PV system optimization, offering a strategic framework for advancing intelligent and sustainable solar energy technologies.
Full article
(This article belongs to the Special Issue The Application of Machine Learning and AI Technology Towards the Sustainable Development Goals)
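As one concrete example of the metaheuristics discussed for MPPT, the sketch below runs a basic particle swarm optimization over a toy power–voltage curve; the curve and the PSO coefficients are illustrative assumptions rather than a real module model.
```python
# Basic particle swarm optimization (PSO) sketch for maximum power point
# tracking on a hypothetical P-V curve (illustrative only).
import numpy as np

def pv_power(v):
    # Toy unimodal P-V curve peaking near 30 V (stand-in for a real module model).
    return np.maximum(0.0, 150.0 - (v - 30.0) ** 2 / 5.0)

rng = np.random.default_rng(0)
n, w, c1, c2 = 10, 0.7, 1.5, 1.5            # swarm size and PSO coefficients
pos = rng.uniform(0.0, 40.0, n)             # candidate operating voltages
vel = np.zeros(n)
pbest = pos.copy()
pbest_val = pv_power(pbest)
gbest = pbest[np.argmax(pbest_val)]

for _ in range(50):
    r1, r2 = rng.random(n), rng.random(n)
    vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
    pos = np.clip(pos + vel, 0.0, 40.0)
    val = pv_power(pos)
    improved = val > pbest_val
    pbest[improved], pbest_val[improved] = pos[improved], val[improved]
    gbest = pbest[np.argmax(pbest_val)]

print(f"Estimated MPP voltage: {gbest:.2f} V, power: {pv_power(gbest):.1f} W")
```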
Open Access Article
A Markerless Vision-Based Physical Frailty Assessment System for the Older Adults
by Muhammad Huzaifa, Wajiha Ali, Khawaja Fahad Iqbal, Ishtiaq Ahmad, Yasar Ayaz, Hira Taimur, Yoshihisa Shirayama and Motoyuki Yuasa
AI 2025, 6(9), 224; https://doi.org/10.3390/ai6090224 - 10 Sep 2025
Abstract
The geriatric syndrome known as frailty is characterized by diminished physiological reserves and heightened susceptibility to unfavorable health consequences. As the world’s population ages, it is crucial to detect frailty early and accurately in order to reduce hazards, including falls, hospitalization, and death. In particular, functional tests are frequently used to evaluate physical frailty. However, current evaluation techniques are limited in their scalability and are prone to inconsistency due to their heavy reliance on subjective interpretation and manual observation. In this paper, we provide a completely automated, impartial, and comprehensive system that employs computer vision techniques to assess physical frailty tests. Machine learning models have been specifically designed to analyze each clinical test. In order to extract significant features, our system analyzes depth and joint coordinate data for important physical performance tests such as the Walking Speed Test, Timed Up and Go (TUG) Test, Functional Reach Test, Seated Forward Bend Test, Standing on One Leg Test, and Grip Strength Test. In contrast to current systems, which lack real-time analysis and standardization, the proposed system offers consistent measurements, intelligent decision-making, and real-time feedback. Strong model accuracy and conformity to clinical benchmarks are demonstrated by the experimental outcomes. By eliminating observer dependency and improving accessibility, the proposed system can be considered a scalable and useful tool for frailty screening in clinical and remote care settings.
Full article
(This article belongs to the Special Issue Multimodal Artificial Intelligence in Healthcare)
Open Access Article
Transformer Models Enhance Explainable Risk Categorization of Incidents Compared to TF-IDF Baselines
by Carlos Ramon Hölzing, Patrick Meybohm, Oliver Happel, Peter Kranke and Charlotte Meynhardt
AI 2025, 6(9), 223; https://doi.org/10.3390/ai6090223 - 9 Sep 2025
Abstract
Background: Critical Incident Reporting Systems (CIRS) play a key role in improving patient safety but face limitations due to the unstructured nature of narrative data. Systematic analysis of such data to identify latent risk patterns remains challenging. While artificial intelligence (AI) shows promise in healthcare, its application to CIRS analysis is still underexplored. Methods: This study presents a transformer-based approach to classify incident reports into predefined risk categories and support clinical risk managers in identifying safety hazards. We compared a traditional TF-IDF/logistic regression model with a transformer-based German BERT (GBERT) model using 617 anonymized CIRS reports. Reports were categorized manually into four classes: Organization, Treatment, Documentation, and Consent/Communication. Models were evaluated using stratified 5-fold cross-validation. Interpretability was ensured via Shapley Additive Explanations (SHAP). Results: GBERT outperformed the baseline across all metrics, achieving a macro-averaged F1 of 0.44 and a weighted F1 of 0.75, versus 0.35 and 0.71 for the baseline. SHAP analysis revealed clinically plausible feature attributions. Conclusions: Transformer-based models such as GBERT improve classification of incident report data and enable interpretable, systematic risk stratification. These findings highlight the potential of explainable AI to enhance learning from critical incidents.
Full article
(This article belongs to the Special Issue Adversarial Learning and Its Applications in Healthcare)
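The classical baseline used in this study (TF-IDF features with logistic regression, evaluated by stratified 5-fold cross-validation on macro- and weighted-F1) can be sketched as follows; the toy texts and labels below merely stand in for the anonymized CIRS reports.
```python
# TF-IDF + logistic regression baseline with stratified 5-fold cross-validation
# (toy data; illustrative of the baseline described in the abstract).
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_validate, StratifiedKFold

texts = ["medication mix-up on the ward", "missing consent form before surgery",
         "incomplete discharge documentation", "handover delayed between shifts"] * 10
labels = ["Treatment", "Consent/Communication", "Documentation", "Organization"] * 10

pipeline = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                         LogisticRegression(max_iter=1000, class_weight="balanced"))

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
scores = cross_validate(pipeline, texts, labels, cv=cv,
                        scoring=["f1_macro", "f1_weighted"])
print(scores["test_f1_macro"].mean(), scores["test_f1_weighted"].mean())
```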
Open Access Article
Dual-Stream Former: A Dual-Branch Transformer Architecture for Visual Speech Recognition
by Sanghun Jeon, Jieun Lee and Yong-Ju Lee
AI 2025, 6(9), 222; https://doi.org/10.3390/ai6090222 - 9 Sep 2025
Abstract
This study proposes Dual-Stream Former, a novel architecture that integrates a Video Swin Transformer and Conformer designed to address the challenges of visual speech recognition (VSR). The model captures spatiotemporal dependencies, achieving a state-of-the-art character error rate (CER) of 3.46%, surpassing traditional convolutional neural network (CNN)-based models, such as 3D-CNN + DenseNet-121 (CER: 5.31%), and transformer-based alternatives, such as vision transformers (CER: 4.05%). The Video Swin Transformer captures multiscale spatial representations with high computational efficiency, whereas the Conformer back-end enhances temporal modeling across diverse phoneme categories. Evaluation of a high-resolution dataset comprising 740,000 utterances across 185 classes highlighted the effectiveness of the model in addressing visually confusing phonemes, such as diphthongs (/ai/, /au/) and labio-dental sounds (/f/, /v/). Dual-Stream Former achieved phoneme recognition error rates of 10.39% for diphthongs and 9.25% for labiodental sounds, surpassing those of CNN-based architectures by more than 6%. Although the model’s large parameter count (168.6 M) poses resource challenges, its hierarchical design ensures scalability. Future work will explore lightweight adaptations and multimodal extensions to increase deployment feasibility. These findings underscore the transformative potential of Dual-Stream Former for advancing VSR applications such as silent communication and assistive technologies by achieving unparalleled precision and robustness in diverse settings.
Full article
Open Access Article
Optimizing NFL Draft Selections with Machine Learning Classification
by Akshaj Enaganti and George Pappas
AI 2025, 6(9), 221; https://doi.org/10.3390/ai6090221 - 9 Sep 2025
Abstract
The National Football League draft is one of the most important events in the creation of a successful franchise in professional American football. Selecting players as part of the draft process, however, is difficult, as a multitude of factors affect decisions to opt for one player over another; a few of these include collegiate statistics, team need and fit, and physical potential. In this paper, we utilize a machine learning approach, with various types of models, to optimize the NFL draft and, in turn, enhance team performances. We compare the selections made by the system to the real athletes selected, and assess which of the picks would have been more impactful for the respective franchise. The specific investigation allows for further research by altering the weighting of specific factors and their significance in this decision-making process to land on the ideal player based on what a specific team desires. Using artificial intelligence in this process can produce more consistent results than high-risk traditional methods. Our approach extends beyond a basic Random Forest classifier by simulating complete draft scenarios with player attributes and team needs weighted. This allows comparison of different draft strategies (best-player-available vs. need-based) and demonstrates improved prediction accuracy over conventional methods.
Full article
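A bare-bones version of the Random Forest component mentioned in the abstract is sketched below; the prospect features, the "hit" label, and the synthetic data are hypothetical, and the paper's full draft simulation with weighted team needs is not reproduced.
```python
# Random Forest sketch for draft-selection classification on synthetic data
# (hypothetical features; not the paper's dataset or draft simulator).
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
# Hypothetical prospect features: college production, combine metrics, positional need, etc.
X = rng.normal(size=(500, 5))
y = (X[:, 0] + 0.5 * X[:, 2] + rng.normal(scale=0.5, size=500) > 0).astype(int)  # 1 = "hit"

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)
print("held-out accuracy:", accuracy_score(y_test, model.predict(X_test)))
print("feature importances:", model.feature_importances_)
```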
Open Access Article
Self-Emotion-Mediated Exploration in Artificial Intelligence Mirrors: Findings from Cognitive Psychology
by Gustavo Assuncao, Miguel Castelo-Branco and Paulo Menezes
AI 2025, 6(9), 220; https://doi.org/10.3390/ai6090220 - 9 Sep 2025
Abstract
Background: Exploration of the physical environment is an indispensable precursor to information acquisition and knowledge consolidation for living organisms. Yet, current artificial intelligence models lack these autonomy capabilities during training, hindering their adaptability. This work proposes a learning framework for artificial agents to obtain an intrinsic exploratory drive, based on epistemic and achievement emotions triggered during data observation. Methods: This study proposes a dual-module reinforcement framework, where data analysis scores dictate pride or surprise, in accordance with psychological studies on humans. A correlation between these states and exploration is then optimized for agents to meet their learning goals. Results: Causal relationships between states and exploration are demonstrated by the majority of agents. A mean increase is noted for surprise, with a mean decrease for pride. The resulting correlations mirror previously reported human behavior. Conclusions: These findings lead to the conclusion that bio-inspiration for AI development can be of great use. It can confer benefits typically found in living beings, such as autonomy. Further, the work empirically shows how AI methodologies can corroborate human behavioral findings, showcasing major interdisciplinary importance. Ramifications are discussed.
Full article
Open Access Article
MST-DGCN: Multi-Scale Temporal–Dynamic Graph Convolutional with Orthogonal Gate for Imbalanced Multi-Label ECG Arrhythmia Classification
by Jie Chen, Mingfeng Jiang, Xiaoyu He, Yang Li, Jucheng Zhang, Juan Li, Yongquan Wu and Wei Ke
AI 2025, 6(9), 219; https://doi.org/10.3390/ai6090219 - 8 Sep 2025
Abstract
Multi-label arrhythmia classification from 12-lead ECG signals is a challenging problem, involving spatiotemporal feature extraction, feature fusion, and class imbalance. To address these issues, a multi-scale temporal–dynamic graph convolutional method with orthogonal gates, termed MST-DGCN, is proposed for ECG arrhythmia classification. In this method, a temporal–dynamic graph convolution with dynamic adjacency matrices is used to learn spatiotemporal patterns jointly, and an orthogonal gated fusion mechanism is used to eliminate redundancy, so as to strengthen the complementarity and independence of features by adjusting their significance dynamically. Moreover, a multi-instance learning strategy is proposed to alleviate class imbalance by adjusting the proportion of minority arrhythmia samples through adaptive label allocation. After validation on the St Petersburg INCART dataset under stringent inter-patient settings, the experimental results show that the proposed MST-DGCN method achieves the best classification performance with an F1-score of 73.66% (+6.2% over prior baseline methods), with concurrent improvements in AUC (70.92%) and mAP (85.24%), while maintaining computational efficiency.
Full article
(This article belongs to the Special Issue Artificial Intelligence in Biomedical Engineering: Challenges and Developments)
Open Access Article
Conv-ScaleNet: A Multiscale Convolutional Model for Federated Human Activity Recognition
by Xian Wu Ting, Ying Han Pang, Zheng You Lim, Shih Yin Ooi and Fu San Hiew
AI 2025, 6(9), 218; https://doi.org/10.3390/ai6090218 - 8 Sep 2025
Abstract
Background: Artificial Intelligence (AI) techniques have been extensively deployed in sensor-based Human Activity Recognition (HAR) systems. Recent advances in deep learning, especially Convolutional Neural Networks (CNNs), have advanced HAR by enabling automatic feature extraction from raw sensor data. However, these models often struggle to capture multiscale patterns in human activity, limiting recognition accuracy. Additionally, traditional centralized learning approaches raise data privacy concerns, as personal sensor data must be transmitted to a central server, increasing the risk of privacy breaches. Methods: To address these challenges, this paper introduces Conv-ScaleNet, a CNN-based model designed for multiscale feature learning and compatibility with federated learning (FL) environments. Conv-ScaleNet integrates a Pyramid Pooling Module to extract both fine-grained and coarse-grained features and employs sequential Global Average Pooling layers to progressively capture abstract global representations from inertial sensor data. The model supports federated learning by training locally on user devices, sharing only model updates rather than raw data, thus preserving user privacy. Results: Experimental results demonstrate that the proposed Conv-ScaleNet achieves approximately 98% and 96% F1-scores on the WISDM and UCI-HAR datasets, respectively, confirming its competitiveness in FL environments for activity recognition. Conclusions: The proposed Conv-ScaleNet model addresses key limitations of existing HAR systems by combining multiscale feature learning with privacy-preserving training. Its strong performance, data protection capability, and adaptability to decentralized environments make it a robust and scalable solution for real-world HAR applications.
Full article
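The federated training loop described above follows the usual pattern of local updates plus server-side weight averaging (FedAvg); the sketch below illustrates that pattern with a small stand-in network rather than the Conv-ScaleNet architecture itself.
```python
# FedAvg-style aggregation sketch: clients train local copies and only model
# weights are averaged on the server (illustrative; not Conv-ScaleNet).
import copy
import torch
import torch.nn as nn

def local_update(model, data, targets, epochs=1, lr=0.01):
    model = copy.deepcopy(model)                 # train a local copy only
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(model(data), targets).backward()
        opt.step()
    return model.state_dict()

def fed_avg(state_dicts):
    # Element-wise average of the clients' weight tensors.
    avg = copy.deepcopy(state_dicts[0])
    for key in avg:
        avg[key] = torch.stack([sd[key].float() for sd in state_dicts]).mean(dim=0)
    return avg

global_model = nn.Sequential(nn.Linear(9, 64), nn.ReLU(), nn.Linear(64, 6))  # 6 activities
clients = [(torch.randn(32, 9), torch.randint(0, 6, (32,))) for _ in range(3)]  # toy client data

for rnd in range(5):                             # communication rounds
    updates = [local_update(global_model, x, y) for x, y in clients]
    global_model.load_state_dict(fed_avg(updates))
```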
Open Access Article
Unplugged Activities for Teaching Decision Trees to Secondary Students—A Case Study Analysis Using the SOLO Taxonomy
by Konstantinos Karapanos, Vassilis Komis, Georgios Fesakis, Konstantinos Lavidas, Stavroula Prantsoudi and Stamatios Papadakis
AI 2025, 6(9), 217; https://doi.org/10.3390/ai6090217 - 5 Sep 2025
Abstract
The integration of Artificial Intelligence (AI) technologies in students’ lives necessitates the systematic incorporation of foundational AI literacy into educational curricula. Students are challenged to develop conceptual understanding of computational frameworks such as Machine Learning (ML) algorithms and Decision Trees (DTs). In this context, unplugged (i.e., computer-free) pedagogical approaches have emerged as complementary to traditional coding-based instruction in AI education. This study examines the pedagogical effectiveness of an instructional intervention employing unplugged activities to facilitate conceptual understanding of DT algorithms among 47 9th-grade students within a Computer Science (CS) curriculum in Greece. The study employed a quasi-experimental design, utilizing the Structure of Observed Learning Outcomes (SOLO) taxonomy as the theoretical framework for assessing cognitive development and conceptual mastery of DT principles. Quantitative analysis of pre- and post-intervention assessments demonstrated statistically significant improvements in student performance across all evaluated SOLO taxonomy levels. The findings provide empirical support for the hypothesis that unplugged pedagogical interventions constitute an effective and efficient approach for introducing AI concepts to secondary education students. Based on these outcomes, the authors recommend the systematic implementation of developmentally appropriate unplugged instructional interventions for DTs and broader AI concepts across all educational levels, to optimize AI literacy acquisition.
Full article
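For readers unfamiliar with the underlying algorithm, a decision tree of the kind students construct by hand in the unplugged activities can be reproduced in a few lines; the animal-classification attributes below are a hypothetical example, not the study's classroom material.
```python
# Tiny decision tree example with yes/no attribute splits (illustrative only).
from sklearn.tree import DecisionTreeClassifier, export_text

# Attributes: [has_fur, lays_eggs, can_fly]; label: 1 = mammal, 0 = not a mammal
X = [[1, 0, 0], [1, 0, 1], [0, 1, 1], [0, 1, 0], [1, 0, 0], [0, 1, 1]]
y = [1, 1, 0, 0, 1, 0]

tree = DecisionTreeClassifier(max_depth=2).fit(X, y)
print(export_text(tree, feature_names=["has_fur", "lays_eggs", "can_fly"]))
```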
Open Access Review
Large Language Models in Cybersecurity: A Survey of Applications, Vulnerabilities, and Defense Techniques
by Niveen O. Jaffal, Mohammed Alkhanafseh and David Mohaisen
AI 2025, 6(9), 216; https://doi.org/10.3390/ai6090216 - 5 Sep 2025
Abstract
Large Language Models (LLMs) are transforming cybersecurity by enabling intelligent, adaptive, and automated approaches to threat detection, vulnerability assessment, and incident response. With their advanced language understanding and contextual reasoning, LLMs surpass traditional methods in tackling challenges across domains such as the Internet of Things (IoT), blockchain, and hardware security. This survey provides a comprehensive overview of LLM applications in cybersecurity, focusing on two core areas: (1) the integration of LLMs into key cybersecurity domains, and (2) the vulnerabilities of LLMs themselves, along with mitigation strategies. By synthesizing recent advancements and identifying key limitations, this work offers practical insights and strategic recommendations for leveraging LLMs to build secure, scalable, and future-ready cyber defense systems.
Full article
Open Access Review
Long Short-Term Memory Networks: A Comprehensive Survey
by Moez Krichen and Alaeddine Mihoub
AI 2025, 6(9), 215; https://doi.org/10.3390/ai6090215 - 5 Sep 2025
Abstract
Long Short-Term Memory (LSTM) networks have revolutionized the field of deep learning, particularly in applications that require the modeling of sequential data. Originally designed to overcome the limitations of traditional recurrent neural networks (RNNs), LSTMs effectively capture long-range dependencies in sequences, making them suitable for a wide array of tasks. This survey aims to provide a comprehensive overview of LSTM architectures, detailing their unique components, such as cell states and gating mechanisms, which facilitate the retention and modulation of information over time. We delve into the various applications of LSTMs across multiple domains, including the following: natural language processing (NLP), where they are employed for language modeling, machine translation, and sentiment analysis; time series analysis, where they play a critical role in forecasting tasks; and speech recognition, significantly enhancing the accuracy of automated systems. By examining these applications, we illustrate the versatility and robustness of LSTMs in handling complex data types. Additionally, we explore several notable variants and improvements of the standard LSTM architecture, such as Bidirectional LSTMs, which enhance context understanding, and Stacked LSTMs, which increase model capacity. We also discuss the integration of Attention Mechanisms with LSTMs, which have further advanced their performance in various tasks. Despite their strengths, LSTMs face several challenges, including high Computational Complexity, extensive Data Requirements, and difficulties in training, which can hinder their practical implementation. This survey addresses these limitations and provides insights into ongoing research aimed at mitigating these issues. In conclusion, we highlight recent advances in LSTM research and propose potential future directions that could lead to enhanced performance and broader applicability of LSTM networks. This survey serves as a foundational resource for researchers and practitioners seeking to understand the current landscape of LSTM technology and its future trajectory.
Full article
(This article belongs to the Special Issue The Application of Machine Learning and AI Technology Towards the Sustainable Development Goals)
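For reference, the cell states and gating mechanisms discussed in the survey follow the standard LSTM update, with forget, input, and output gates controlling the cell state:
```latex
% Standard LSTM cell equations (forget, input, and output gates with cell state).
\begin{aligned}
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) \\
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) \\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o) \\
\tilde{c}_t &= \tanh(W_c x_t + U_c h_{t-1} + b_c) \\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t \\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}
```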
Open Access Article
From Detection to Decision: Transforming Cybersecurity with Deep Learning and Visual Analytics
by Saurabh Chavan and George Pappas
AI 2025, 6(9), 214; https://doi.org/10.3390/ai6090214 - 4 Sep 2025
Abstract
Objectives: The persistent evolution of software vulnerabilities—spanning novel zero-day exploits to logic-level flaws—continues to challenge conventional cybersecurity mechanisms. Static rule-based scanners and opaque deep learning models often lack the precision and contextual understanding required for both accurate detection and analyst interpretability. This paper presents a hybrid framework for real-time vulnerability detection that improves both robustness and explainability. Methods: The framework integrates semantic encoding via Bidirectional Encoder Representations from Transformers (BERTs), structural analysis using Deep Graph Convolutional Neural Networks (DGCNNs), and lightweight prioritization through Kernel Extreme Learning Machines (KELMs). The architecture incorporates Minimum Intermediate Representation (MIR) learning to reduce false positives and fuses multi-modal data (source code, execution traces, textual metadata) for robust, scalable performance. Explainable Artificial Intelligence (XAI) visualizations—combining SHAP-based attributions and CVSS-aligned pair plots—serve as an analyst-facing interpretability layer. The framework is evaluated on benchmark datasets, including VulnDetect and the NIST Software Reference Library (NSRL, version 2024.12.1, used strictly as a benign baseline for false positive estimation). Results: Evaluated with precision, recall, AUPRC, MCC, and calibration (ECE/Brier score), the framework demonstrated improved robustness and fewer false positives than the baselines. An internal interpretability validation was conducted to align SHAP/GNNExplainer outputs with known vulnerability features; formal usability testing with practitioners is left as future work. Conclusions: Designed with DevSecOps integration in mind, the framework is packaged in containerized modules (Docker/Kubernetes) and outputs SIEM-compatible alerts, enabling potential compatibility with Splunk, GitLab CI/CD, and similar tools. While full enterprise deployment was not performed, these deployment-oriented design choices support scalability and practical adoption.
Full article
Open Access Review
A Survey of Traditional and Emerging Deep Learning Techniques for Non-Intrusive Load Monitoring
by Annysha Huzzat, Ahmed S. Khwaja, Ali A. Alnoman, Bhagawat Adhikari, Alagan Anpalagan and Isaac Woungang
AI 2025, 6(9), 213; https://doi.org/10.3390/ai6090213 - 3 Sep 2025
Abstract
To cope with the increasing global demand for energy and the significant energy wastage caused by the use of different home appliances, smart load monitoring is considered a promising solution to promote proper activation and scheduling of devices and reduce electricity bills. Instead of installing a sensing device on each electric appliance, non-intrusive load monitoring (NILM) enables the monitoring of each individual device using the total power reading of the home smart meter. However, high-accuracy load monitoring requires efficient artificial intelligence (AI) and deep learning (DL) approaches. To that end, this paper thoroughly reviews traditional AI and DL approaches, as well as emerging AI models proposed for NILM. Unlike existing surveys that are usually limited to a specific approach or a subset of approaches, this review presents a comprehensive survey of an ensemble of topics and models, including deep learning, generative AI (GAI), emerging attention-enhanced GAI, and hybrid AI approaches. Another distinctive feature of this work compared to existing surveys is that it also reviews actual cases of NILM system design and implementation, covering a wide range of technical enablers including hardware, software, and AI models. Furthermore, a range of future research directions and challenges is discussed, such as the heterogeneity of energy sources, data uncertainty, privacy and safety, cost and complexity reduction, and the need for a standardized comparison.
Full article
(This article belongs to the Section AI Systems: Theory and Applications)
Open Access Article
Post-Heuristic Cancer Segmentation Refinement over MRI Images and Deep Learning Models
by Panagiotis Christakakis and Eftychios Protopapadakis
AI 2025, 6(9), 212; https://doi.org/10.3390/ai6090212 - 2 Sep 2025
Abstract
Lately, deep learning methods have greatly improved the accuracy of brain-tumor segmentation, yet slice-wise inconsistencies still limit reliable use in clinical practice. While volume-aware 3D convolutional networks achieve high accuracy, their memory footprint and inference time may limit clinical adoption. This study proposes a resource-conscious pipeline for lower-grade-glioma delineation in axial FLAIR MRI that combines a 2D Attention U-Net with a guided post-processing refinement step. Two segmentation backbones, a vanilla U-Net and an Attention U-Net, are trained on 110 TCGA-LGG axial FLAIR patient volumes under various loss functions and activation functions. The Attention U-Net, optimized with Dice loss, delivers the strongest baseline, achieving a mean Intersection-over-Union (mIoU) of 0.857. To mitigate slice-wise inconsistencies inherent to 2D models, a White-Area Overlap (WAO) voting mechanism quantifies the tumor footprint shared by neighboring slices. The WAO curve is smoothed with a Gaussian filter to locate its peak, after which a percentile-based heuristic selectively relabels the most ambiguous softmax pixels. Cohort-level analysis shows that removing merely 0.1–0.3% of ambiguous low-confidence pixels lifts the post-processing mIoU above the baseline while improving segmentation for two-thirds of patients. The proposed refinement strategy holds great potential for further improvement, offering a practical route for integrating deep learning segmentation into routine clinical workflows with minimal computational overhead.
Full article
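One way to read the described refinement step is sketched below: a per-slice White-Area Overlap (WAO) curve is computed from neighbouring predicted masks, smoothed with a Gaussian filter to locate its peak, and a small percentile of the least confident softmax pixels is relabeled. The neighbourhood definition and relabeling rule here are simplifying assumptions, not the authors' exact procedure.
```python
# Rough interpretation of a WAO-guided refinement step (assumptions noted above).
import numpy as np
from scipy.ndimage import gaussian_filter1d

def refine(softmax_fg, threshold=0.5, drop_fraction=0.002):
    """softmax_fg: (num_slices, H, W) foreground probabilities for one patient."""
    masks = softmax_fg > threshold
    # WAO: overlap of each slice's predicted tumour area with its neighbours.
    wao = np.array([np.logical_and(masks[i], masks[max(i - 1, 0)]).sum() +
                    np.logical_and(masks[i], masks[min(i + 1, len(masks) - 1)]).sum()
                    for i in range(len(masks))], dtype=float)
    peak = int(np.argmax(gaussian_filter1d(wao, sigma=2)))   # smoothed WAO peak
    # Relabel only the most ambiguous pixels (probabilities closest to 0.5);
    # here they are conservatively flipped to background as an illustration.
    ambiguity = np.abs(softmax_fg - 0.5)
    cutoff = np.quantile(ambiguity, drop_fraction)            # ~0.2% of pixels
    refined = masks.copy()
    refined[ambiguity <= cutoff] = False
    return refined, peak

refined, peak_slice = refine(np.random.rand(24, 128, 128))
```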
News
3 September 2025
Join Us at the MDPI at the University of Toronto Career Fair, 23 September 2025, Toronto, ON, Canada
1 September 2025
MDPI INSIGHTS: The CEO’s Letter #26 – CUJS, Head of Ethics, Open Peer Review, AIS 2025, Reviewer Recognition
Topics
Topic in AI, Data, Economies, Mathematics, Risks
Advanced Techniques and Modeling in Business and Economics
Topic Editors: José Manuel Santos-Jaén, Ana León-Gomez, María del Carmen Valls Martínez
Deadline: 30 September 2025
Topic in AI, Energies, Entropy, Sustainability
Game Theory and Artificial Intelligence Methods in Sustainable and Renewable Energy Power Systems
Topic Editors: Lefeng Cheng, Pei Zhang, Anbo Meng
Deadline: 31 October 2025
Topic in AI, Algorithms, Diagnostics, Emergency Care and Medicine
Trends of Artificial Intelligence in Emergency and Critical Care Medicine
Topic Editors: Zhongheng Zhang, Yucai Hong, Wei Shao
Deadline: 30 November 2025
Topic in AI, Drones, Electronics, Mathematics, Sensors
AI and Data-Driven Advancements in Industry 4.0, 2nd Edition
Topic Editors: Teng Huang, Yan Pang, Qiong Wang, Jianjun Li, Jin Liu, Jia Wang
Deadline: 15 December 2025
Special Issues
Special Issue in AI
AI and the Evolution of Work: Redefining Project Management across Disciplines
Guest Editor: Jose Berengueres
Deadline: 30 September 2025
Special Issue in AI
Artificial Intelligence for Network Management
Guest Editors: Stephen Ojo, Agbotiname Lucky Imoize, Lateef Adesola Akinyemi
Deadline: 30 September 2025
Special Issue in AI
Development and Design of Autonomous Robot
Guest Editors: Tayab Din Memon, Kamran Shaukat, Sufyan Ali Memon
Deadline: 24 October 2025
Special Issue in AI
Adversarial Learning and Its Applications in Healthcare
Guest Editors: Min Xian, Aleksandar Vakanski
Deadline: 27 October 2025