Skip Content
You are currently on the new version of our website. Access the old version .

Machine Learning and Knowledge Extraction, Volume 7, Issue 3

2025 September - 50 articles

Cover Story: The rise of LLMs in user-facing settings—such as chatbots—makes it critical to ensure that these systems are safeguarded against prompt attacks. If not properly protected, such attacks could lead to data breaches, malware transmission, or reputational damage. Even more concerning, publicly available and computationally lightweight generative models can be leveraged to mass-produce prompt attacks. This paper assesses the generative capabilities of such models, evaluating two LSTM-based GAN architectures—SeqGAN and RelGAN—alongside a small language model (SLM). By systematically analyzing their effectiveness against current LLM defense systems and revealing distinct attack patterns, this work offers actionable steps to strengthen future LLM defense systems. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (50)

  • Article
  • Open Access
1,244 Views
18 Pages

Saliency-Guided Local Semantic Mixing for Long-Tailed Image Classification

  • Jiahui Lv,
  • Jun Lei,
  • Jun Zhang,
  • Chao Chen and
  • Shuohao Li

In real-world visual recognition tasks, long-tailed distributions pose a widespread challenge, with extreme class imbalance severely limiting the representational learning capability of deep models. In practice, due to this imbalance, deep models oft...

  • Article
  • Open Access
1 Citations
1,375 Views
24 Pages

Background: Liquidity crises pose significant risks to financial stability, and missing data in predictive models increase the uncertainty in decision-making. This study aims to develop a robust Bayesian Model Averaging (BMA) framework using decision...

  • Systematic Review
  • Open Access
7 Citations
15,487 Views
38 Pages

Background: Customer churn significantly impacts business revenues. Machine Learning (ML) and Deep Learning (DL) methods are increasingly adopted to predict churn, yet a systematic synthesis of recent advancements is lacking. Objectives: This systema...

  • Article
  • Open Access
1 Citations
1,364 Views
32 Pages

In the technology-assisted review (TAR) area, most research has focused on ranking effectiveness and active learning strategies within individual topics, often assuming unconstrained review effort. However, real-world applications such as legal disco...

  • Article
  • Open Access
2 Citations
2,622 Views
34 Pages

Leveraging LLMs for Automated Extraction and Structuring of Educational Concepts and Relationships

  • Tianyuan Yang,
  • Baofeng Ren,
  • Chenghao Gu,
  • Tianjia He,
  • Boxuan Ma and
  • Shin’ichi Konomi

Students must navigate large catalogs of courses and make appropriate enrollment decisions in many online learning environments. In this context, identifying key concepts and their relationships is essential for understanding course content and infor...

  • Article
  • Open Access
993 Views
21 Pages

Exploiting the Feature Space Structures of KNN and OPF Algorithms for Identification of Incipient Faults in Power Transformers

  • André Gifalli,
  • Marco Akio Ikeshoji,
  • Danilo Sinkiti Gastaldello,
  • Victor Hideki Saito Yamaguchi,
  • Welson Bassi,
  • Talita Mazon,
  • Floriano Torres Neto,
  • Pedro da Costa Junior and
  • André Nunes de Souza

Power transformers represent critical assets within the electrical power system, and their unexpected failures may result in substantial financial losses for both utilities and consumers. Dissolved Gas Analysis (DGA) is a well-established diagnostic...

  • Article
  • Open Access
2,312 Views
38 Pages

CRISP-NET: Integration of the CRISP-DM Model with Network Analysis

  • Héctor Alejandro Acuña-Cid,
  • Eduardo Ahumada-Tello,
  • Óscar Omar Ovalle-Osuna,
  • Richard Evans,
  • Julia Elena Hernández-Ríos and
  • Miriam Alondra Zambrano-Soto

To carry out data analysis, it is necessary to implement a model that guides the process in an orderly and sequential manner, with the aim of maintaining control over software development and its documentation. One of the most widely used tools in th...

  • Article
  • Open Access
1,428 Views
27 Pages

Learnable Petri Net Neural Network Using Max-Plus Algebra

  • Mohammed Sharafath Abdul Hameed,
  • Sofiene Lassoued and
  • Andreas Schwung

Interpretable decision-making algorithms are important when used in the context of production optimization. While concepts like Petri nets are inherently interpretable, they are not straightforwardly learnable. This paper presents a novel approach to...

  • Article
  • Open Access
1 Citations
1,103 Views
17 Pages

Dynamic Graph Analysis: A Hybrid Structural–Spatial Approach for Brain Shape Correspondence

  • Jonnatan Arias-García,
  • Hernán Felipe García,
  • Andrés Escobar-Mejía,
  • David Cárdenas-Peña and
  • Álvaro A. Orozco

Accurate correspondence of complex neuroanatomical surfaces under non-rigid deformations remains a formidable challenge in computational neuroimaging, owing to inter-subject topological variability, partial occlusions, and non-isometric distortions....

  • Article
  • Open Access
2,065 Views
14 Pages

MCTS-Based Policy Improvement for Reinforcement Learning

  • György Csippán,
  • István Péter,
  • Bálint Kővári and
  • Tamás Bécsi

Curriculum Learning (CL) is a potent field in Machine Learning that provides several excellent techniques for enhancing the performance of the training process given the same data points, regardless of the training method used. In this research, we p...

  • Review
  • Open Access
2 Citations
10,303 Views
26 Pages

Automated test case generation aims to improve software testing by reducing the manual effort required to create test cases. Recent advancements in large language models (LLMs), with their ability to understand natural language and generate code, hav...

  • Article
  • Open Access
983 Views
35 Pages

Bioinspired computing methods, such as Artificial Neural Networks (ANNs), play a significant role in machine learning. This is particularly evident in smart manufacturing, where ANNs and their derivatives, like deep learning, are widely used for patt...

  • Article
  • Open Access
1 Citations
1,571 Views
25 Pages

Deep neural networks (DNNs) are highly effective across many domains but are sensitive to noisy or corrupted training data. Existing noise mitigation strategies often rely on strong assumptions about noise distributions or require costly retraining,...

  • Article
  • Open Access
1,504 Views
37 Pages

Knee osteoarthritis (KOA) is a most prevalent chronic muscoloskeletal disorder causing pain and functional impairment. Accurate predictions of KOA evolution are important for early interventions and preventive treatment planning. In this paper, we pr...

  • Article
  • Open Access
2,156 Views
34 Pages

Geometric Reasoning in the Embedding Space

  • David Mojžíšek,
  • Jan Hůla,
  • Jiří Janeček,
  • David Herel and
  • Mikoláš Janota

While neural networks can solve complex geometric problems, as demonstrated by systems like AlphaGeometry, we have limited understanding of how they internally represent and reason about spatial relationships. In this work, we investigate how neural...

  • Article
  • Open Access
2 Citations
3,483 Views
17 Pages

A Novel Prediction Model for Multimodal Medical Data Based on Graph Neural Networks

  • Lifeng Zhang,
  • Teng Li,
  • Hongyan Cui,
  • Quan Zhang,
  • Zijie Jiang,
  • Jiadong Li,
  • Roy E. Welsch and
  • Zhongwei Jia

Multimodal medical data provides a wide and real basis for disease diagnosis. Computer-aided diagnosis (CAD) powered by artificial intelligence (AI) is becoming increasingly prominent in disease diagnosis. CAD for multimodal medical data requires add...

  • Article
  • Open Access
1 Citations
1,917 Views
28 Pages

Machine Learning Prediction Models of Beneficial and Toxicological Effects of Zinc Oxide Nanoparticles in Rat Feed

  • Leonid Legashev,
  • Ivan Khokhlov,
  • Irina Bolodurina,
  • Alexander Shukhman and
  • Svetlana Kolesnik

Nanoparticles have found widespread application across diverse fields, including agriculture and animal husbandry. However, a persistent challenge in laboratory-based studies involving nanoparticle exposure is the limited availability of experimental...

  • Article
  • Open Access
3,315 Views
30 Pages

This paper introduces a novel framework for Distributionally Robust Bayesian Optimization (DRBO) with continuous context that integrates optimal transport theory and entropic regularization. We propose the sampling from the Wasserstein Barycenter Bay...

  • Article
  • Open Access
2 Citations
2,573 Views
29 Pages

Accurate and high-resolution spatio-temporal prediction of PM2.5 concentrations remains a significant challenge for air pollution early warning and prevention. Advanced artificial intelligence (AI) technologies, however, offer promising solutions to...

  • Article
  • Open Access
2 Citations
2,921 Views
22 Pages

Recent advancements in generative AI have fostered the development of highly adept Large Language Models (LLMs) that integrate diverse data types to empower decision-making. Among these, multimodal retrieval-augmented generation (RAG) applications ar...

  • Article
  • Open Access
1 Citations
1,790 Views
33 Pages

AML4S: An AutoML Pipeline for Data Streams

  • Eleftherios Kalaitzidis,
  • Themistoklis Diamantopoulos,
  • Athanasios Michailoudis and
  • Andreas L. Symeonidis

The data landscape has changed, as more and more information is produced in the form of continuous data streams instead of stationary datasets. In this context, several online machine learning techniques have been proposed with the aim of automatical...

  • Article
  • Open Access
1,474 Views
27 Pages

Full Domain Analysis in Fluid Dynamics

  • Alexander Hagg,
  • Adam Gaier,
  • Dominik Wilde,
  • Alexander Asteroth,
  • Holger Foysi and
  • Dirk Reith

Novel techniques in evolutionary optimization, simulation, and machine learning enable a broad analysis of domains like fluid dynamics, in which computation is expensive and flow behavior is complex. This paper introduces the concept of full domain a...

  • Article
  • Open Access
9,838 Views
45 Pages

Machine Learning Applied to Professional Football: Performance Improvement and Results Prediction

  • Diego Moya,
  • Christian Tipantuña,
  • Génesis Villa,
  • Xavier Calderón-Hinojosa,
  • Belén Rivadeneira and
  • Robin Álvarez

This paper examines the integration of machine learning (ML) techniques in professional football, focusing on two key areas: (i) player and team performance, and (ii) match outcome prediction. Using a systematic methodology, this study reviews 172 pa...

  • Article
  • Open Access
2 Citations
2,450 Views
24 Pages

A Comparison Between Unimodal and Multimodal Segmentation Models for Deep Brain Structures from T1- and T2-Weighted MRI

  • Nicola Altini,
  • Erica Lasaracina,
  • Francesca Galeone,
  • Michela Prunella,
  • Vladimiro Suglia,
  • Leonarda Carnimeo,
  • Vito Triggiani,
  • Daniele Ranieri,
  • Gioacchino Brunetti and
  • Vitoantonio Bevilacqua

Accurate segmentation of deep brain structures is critical for preoperative planning in such neurosurgical procedures as Deep Brain Stimulation (DBS). Previous research has showcased successful pipelines for segmentation from T1-weighted (T1w) Magnet...

  • Article
  • Open Access
1,151 Views
15 Pages

Interactions between human behavior, legal regulations, and monitoring technology in road traffic systems provide an everyday example of complex biosocial–technical systems. In this paper, a study is reported that investigated the potential for...

  • Article
  • Open Access
1,182 Views
30 Pages

According to a recent experimental phenomenology–information processing theory, the sensory strength, or vividness, of visual mental images self-reported by human observers reflects the intensive variation in subjective time duration during the...

  • Article
  • Open Access
1,191 Views
25 Pages

Vision-based navigation is a common solution for the critical challenge of GPS-denied Unmanned Aerial Vehicle (UAV) operation, but a research gap remains in the autonomous discovery of robust landmarks from aerial survey imagery needed for such syste...

  • Article
  • Open Access
970 Views
20 Pages

Predicting undergraduate student success is critical for informing timely interventions and improving outcomes in higher education. This study leverages over a decade of historical data from Louisiana State University (LSU) to forecast graduation out...

  • Article
  • Open Access
3 Citations
2,787 Views
40 Pages

In this study, we introduce an enhanced hybrid Autoencoder–Dense–Transformer Neural Network (AE-DTNN) model for developing an effective intrusion detection system (IDS) aimed at improving the performance and robustness of threat detection...

  • Article
  • Open Access
1 Citations
3,759 Views
18 Pages

Machine Learning-Based Vulnerability Detection in Rust Code Using LLVM IR and Transformer Model

  • Young Lee,
  • Syeda Jannatul Boshra,
  • Jeong Yang,
  • Zechun Cao and
  • Gongbo Liang

Rust’s growing popularity in high-integrity systems requires automated vulnerability detection in order to maintain its strong safety guarantees. Although Rust’s ownership model and compile-time checks prevent many errors, sometimes unexp...

  • Article
  • Open Access
2 Citations
8,780 Views
24 Pages

Evaluating Prompt Injection Attacks with LSTM-Based Generative Adversarial Networks: A Lightweight Alternative to Large Language Models

  • Sharaf Rashid,
  • Edson Bollis,
  • Lucas Pellicer,
  • Darian Rabbani,
  • Rafael Palacios,
  • Aneesh Gupta and
  • Amar Gupta

Generative Adversarial Networks (GANs) using Long Short-Term Memory (LSTM) provide a computationally cheaper approach for text generation compared to large language models (LLMs). The low hardware barrier of training GANs poses a threat because it me...

  • Article
  • Open Access
1 Citations
1,231 Views
22 Pages

Machine learning modeling is a valuable tool for gap-filling or prediction, and its performance is typically evaluated using standard metrics. To enable more precise assessments for time-series data, this study emphasizes the importance of considerin...

  • Review
  • Open Access
1 Citations
4,686 Views
65 Pages

Quantum computing, with its foundational principles of superposition and entanglement, has the potential to provide significant quantum advantages, addressing challenges that classical computing may struggle to overcome. As data generation continues...

  • Article
  • Open Access
3,860 Views
24 Pages

Large language models (LLMs) often tend to hallucinate, especially in domain-specific tasks and tasks that require reasoning. Previously, we introduced SubGraph Retrieval Augmented Generation (SG-RAG) as a novel Graph RAG method for multi-hop questio...

  • Article
  • Open Access
2,057 Views
19 Pages

MERA: Medical Electronic Records Assistant

  • Ahmed Ibrahim,
  • Abdullah Khalili,
  • Maryam Arabi,
  • Aamenah Sattar,
  • Abdullah Hosseini and
  • Ahmed Serag

The increasing complexity and scale of electronic health records (EHRs) demand advanced tools for efficient data retrieval, summarization, and comparative analysis in clinical practice. MERA (Medical Electronic Records Assistant) is a Retrieval-Augme...

  • Article
  • Open Access
1,184 Views
23 Pages

VisRep (Visualisation Report) is an AI-powered system for capturing and structuring the early stages of the visualisation design process. It addresses a critical gap in predesign: the lack of tools that can naturally record, organise, and transform r...

  • Systematic Review
  • Open Access
3,064 Views
26 Pages

Harnessing Language Models for Studying the Ancient Greek Language: A Systematic Review

  • Diamanto Tzanoulinou,
  • Loukas Triantafyllopoulos and
  • Vassilios S. Verykios

Applying language models (LMs) and generative artificial intelligence (GenAI) to the study of Ancient Greek offers promising opportunities. However, it faces substantial challenges due to the language’s morphological complexity and lack of anno...

  • Article
  • Open Access
1,328 Views
26 Pages

Random Forests are powerful machine learning models widely applied in classification and regression tasks due to their robust predictive performance. Nevertheless, traditional Random Forests face computational challenges during tree construction, par...

  • Article
  • Open Access
1 Citations
1,848 Views
23 Pages

In earthquake-prone areas such as Tokyo, accurate estimation of bearing stratum depth is crucial for foundation design, liquefaction assessment, and urban disaster mitigation. However, traditional methods such as the standard penetration test (SPT),...

  • Article
  • Open Access
1,754 Views
22 Pages

We present a pipeline for synthetic simplification of text in French that combines large language models with structured semantic guidance. Our approach enhances data generation by integrating contextual knowledge from Wikipedia and Vikidia articles...

  • Article
  • Open Access
2,056 Views
24 Pages

Fault detection and remaining useful life (RUL) prediction are critical tasks in self-healing network (SHN) environments and industrial cyber–physical systems. These domains demand intelligent systems capable of handling dynamic, high-dimension...

  • Article
  • Open Access
1,047 Views
16 Pages

Generalising Stock Detection in Retail Cabinets with Minimal Data Using a DenseNet and Vision Transformer Ensemble

  • Babak Rahi,
  • Deniz Sagmanli,
  • Felix Oppong,
  • Direnc Pekaslan and
  • Isaac Triguero

Generalising deep-learning models to perform well on unseen data domains with minimal retraining remains a significant challenge in computer vision. Even when the target task—such as quantifying the number of elements in an image—stays th...

  • Article
  • Open Access
1 Citations
1,964 Views
24 Pages

This study investigates the application of the Dueling Double Deep Q-Network (3DQN) algorithm to optimize traffic signal control at a major urban intersection in Danang City, Vietnam. The objective is to enhance signal timing efficiency in response t...

  • Article
  • Open Access
1 Citations
1,413 Views
22 Pages

Helmet Detection in Underground Coal Mines via Dynamic Background Perception with Limited Valid Samples

  • Guangfu Wang,
  • Dazhi Sun,
  • Hao Li,
  • Jian Cheng,
  • Pengpeng Yan and
  • Heping Li

The underground coal mine environment is complex and dynamic, making the application of visual algorithms for object detection a crucial component of underground safety management as well as a key factor in ensuring the safe operation of workers. We...

  • Article
  • Open Access
1 Citations
2,177 Views
19 Pages

Predicting company bankruptcy is a critical task in financial risk assessment. This study introduces a novel approach using Convolutional Neural Networks (CNNs) and Generative Adversarial Networks (GANs) to enhance bankruptcy prediction accuracy. By...

  • Review
  • Open Access
2,377 Views
22 Pages

A Review of Methods for Unobtrusive Measurement of Work-Related Well-Being

  • Zoja Anžur,
  • Klara Žinkovič,
  • Junoš Lukan,
  • Pietro Barbiero,
  • Gašper Slapničar,
  • Mohan Li,
  • Martin Gjoreski,
  • Maike E. Debus,
  • Sebastijan Trojer and
  • Marc Langheinrich
  • + 1 author

Work-related well-being is an important research topic, as it is linked to various aspects of individuals’ lives, including job performance. To measure it effectively, unobtrusive sensors are desirable to minimize the burden on employees. Becau...

  • Article
  • Open Access
5 Citations
6,982 Views
13 Pages

The integration of artificial intelligence (AI) and advanced deep learning techniques is reshaping intelligent financial forecasting and decision-support systems. This study presents a comprehensive comparative analysis of advanced deep learning mode...

  • Article
  • Open Access
1 Citations
1,843 Views
13 Pages

High-dimensional experimental spaces and resource constraints challenge modern science. We introduce a hybrid machine-learning (ML) framework that combines Ordinary Least Squares (OLS) for global surface estimation, Gaussian Process (GP) regression f...

  • Article
  • Open Access
1 Citations
1,976 Views
24 Pages

Stitching History into Semantics: LLM-Supported Knowledge Graph Engineering for 19th-Century Greek Bookbinding

  • Dimitrios Doumanas,
  • Efthalia Ntalouka,
  • Costas Vassilakis,
  • Manolis Wallace and
  • Konstantinos Kotis

Preserving cultural heritage can be efficiently supported by structured and semantic representation of historical artifacts. Bookbinding, a critical aspect of book history, provides valuable insights into past craftsmanship, material use, and conserv...

  • Article
  • Open Access
2,179 Views
26 Pages

Machine Learning (ML) is increasingly applied across various domains, addressing tasks such as predictive analytics, anomaly detection, and decision-making. Many of these applications share similar underlying tasks, offering potential for systematic...

Get Alerted

Add your email address to receive forthcoming issues of this journal.

XFacebookLinkedIn
Mach. Learn. Knowl. Extr. - ISSN 2504-4990