Due to scheduled maintenance work on our servers, there may be short service disruptions on this website between 11:00 and 12:00 CEST on March 28th.

Submit to Symmetry Review for Symmetry Propose a Special Issue

Journal Menu

Journal Browser

► Journal Browser

Machine Learning and Data Analysis II

Special Issue Editors
Special Issue Information
Keywords
Benefits of Publishing in a Special Issue
Related Special Issues
Published Papers

A special issue of Symmetry (ISSN 2073-8994). This special issue belongs to the section "Computer".

Deadline for manuscript submissions: closed (31 March 2025) | Viewed by 20405

Share This Special Issue

Special Issue Editor

Dr. Marcin Michalak

E-Mail Website
Guest Editor

Department of Computer Networks and Systems, Silesian University of Technology, 44-100 Gliwice, Poland
Interests: image processing; data mining; machine learning; pattern recognition; rough set theory; biclustering
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Due to the great success of our Special Issue "Machine Learning and Data Analysis" we decided to set up a second volume.

There is no need to convince anyone about the huge influence of theoretical models of machine learning or data analysis techniques on our present way of living. They influence many science disciplines including industry, medicine, transport, and many others. We may observe how different approaches are mixed to become a new and complete model: classifiers for image analysis as well as image pattern recognition algorithms for classification; neural networks for clustering, classification or time series prediction; feature selection and extraction algorithms for preprocessing step of many of above mentioned applications.

The topics of the Special Issue include but are not limited to the following:

Supervised learning;
Unsupervised learning;
Time series analysis;
Descriptive analysis;
Biclustering;
Genetic algorithms;
ML & DM applications;
Artificial neural networks;
Deep learning;
Decision support systems;
Anomaly detection;
Image analysis;
Pattern recognition.

Welcome to read the publications in "Machine Learning and Data Analysis" at https://www.mdpi.com/journal/symmetry/special_issues/Machine_Learning_Data_Analysis.

Dr. Marcin Michalak
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 250 words) can be sent to the Editorial Office for assessment.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Symmetry is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

machine learning
data analysis
process modelling
time series prediction

Benefits of Publishing in a Special Issue

Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Related Special Issues

Machine Learning and Data Analysis in Symmetry (20 articles)
Machine Learning and Data Analysis III in Symmetry (3 articles)

Published Papers (9 papers)

Download All Papers

Order results

Result details

Show export options Show export options

Select all

Export citation of selected articles as:

Editorial

Jump to: Research

4 pages, 159 KB

Open AccessEditorial

Special Issue: Machine Learning and Data Analysis II

by Marcin Michalak

Symmetry 2025, 17(8), 1199; https://doi.org/10.3390/sym17081199 - 28 Jul 2025

Viewed by 599

Abstract

In an age defined by massive data generation, machine learning has emerged as a transformative tool to uncover meaningful patterns and drive intelligent decision-making [...] Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

Research

Jump to: Editorial

22 pages, 2528 KB

Open AccessArticle

Multi-View Utility-Based Clustering: A Mutually Supervised Perspective

by Zhibin Jiang, Jie Zhou and Shitong Wang

Symmetry 2025, 17(6), 924; https://doi.org/10.3390/sym17060924 - 11 Jun 2025

Viewed by 956

Abstract

Information in multiple views is typically symmetrical and distinctive. Multi-view clustering generally attempts to address two key issues: (1) how to mine/leverage additional useful information between different views for current view clustering, and (2) how to combine multiple clustering results from multiple views by using compatible and complementary information hidden in multiple views. In order to achieve this, we propose a multi-view clustering method, namely the multi-view utility-based clustering method (MUC), from the novel perspective of utility-based mutual supervision between different views. The proposed method, MUC, has notable merits: (1) It moves multi-view clustering from the level of feature and/or sample side to partition side information. That is, in order to handle the first issue, MUC considers how to use utility-based partition-level side information from all the other views for current view clustering. Such partition-level side information is consistent with human thinking; it is high-caliber and instructional for clustering. (2) The utility-based partition-level side information provided by other views complements current view information in a mutually supervised way. Then, we conduct multi-view clustering via a mutual supervision mode to circumvent the second issue. As a result, using an alternating optimization strategy, the objective function of MUC can be solved in a K-means-like way. Moreover, we leverage multi-view weight learning based on maximum entropy to integrate multi-view clustering results and further improve performance. The extensive experimental results on various multi-view datasets indicate that the proposed method is better than—or at least comparable to—the existing commonly used single- and multi-view clustering methods, in terms of both clustering performance and running speed. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

27 pages, 4410 KB

Open AccessArticle

QoS-Effective and Resilient Service Deployment and Traffic Management in MEC-Based Crowd Sensing

by Zhengzhe Xiang, Fuli Ying, Hao Yan, Zengwei Zheng, Yufei Zhang and Yueshen Xu

Symmetry 2025, 17(5), 718; https://doi.org/10.3390/sym17050718 - 8 May 2025

Viewed by 1446

Abstract

The rapid development of Multi-access Edge Computing (MEC) technology is transforming traditional platform services and enabling realization of the mMTC vision of 5G networks. MEC allows computational tasks to be offloaded to devices at the network edge, enabling real-time crowd sensing. However, due to the limited resources in MEC environments, it is essential to propose efficient service deployment and traffic management strategies that balance Quality of Service (QoS) with costs. This paper addresses the challenge by modeling the QoS-effective joint service deployment and traffic management problem (QST) as a nonlinear integer optimization problem. We propose a customized genetic algorithm called GA4QST, which aims to minimize cost-performance ratios. In the experimental section, GA4QST is compared with baseline algorithms in terms of efficiency and effectiveness. Although GA4QST exhibits slightly increased complexity compared to the original genetic algorithm, it performs exceptionally well in balancing benefits and average time costs. GA4QST demonstrates strong capabilities in finding optimal solutions, consistently outperforming baseline algorithms. This further confirms the effectiveness and potential applicability of GA4QST in real-world scenarios. Finally, we also explore the impact on optimization outcomes of system configurations such as service diversity, data volume, processing power, and network characteristics. The results indicate that GA4QST has broad applicability and represents a feasible solution for practical MEC applications. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

17 pages, 875 KB

Open AccessArticle

Public Opinion Evolution Based on the Two-Dimensional Theory of Emotion and Top2Vec-RoBERTa

by Shaowen Wang, Qingyang Liu, Yanrong Hu and Hongjiu Liu

Symmetry 2025, 17(2), 190; https://doi.org/10.3390/sym17020190 - 26 Jan 2025

Cited by 2 | Viewed by 2638

Abstract

This paper applies the concept of symmetry to the design of a research methodology for public opinion evolution, emphasizing that both the construction and analysis processes of the method embody symmetrical principles. In today’s information age, dominated by social media, online platforms have become crucial venues for information dissemination. While the free flow of information promotes public participation, it also introduces certain challenges. Therefore, analyzing the evolution of public opinion and extracting public sentiment holds significant practical value for managing online public sentiment. This study takes the Zibo barbecue incident as a case study, utilizing the two-dimensional theory of emotion and Top2Vec for thematic analysis of public opinion comments. By combining sentiment dictionary methods with the RoBERTa model, we conduct a sentiment polarity analysis of public opinion comments. The results show that the RoBERTa model achieved an accuracy of 98.46% on the test set. The proposed method effectively uncovers public sentiment biases and the influencing factors on public emotions during the evolution of public opinion events, providing a more comprehensive understanding of the emotional dynamics throughout the development of public sentiment. This deeper insight aids in addressing issues related to public opinion more effectively. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

23 pages, 5188 KB

Open AccessFeature PaperEditor’s ChoiceArticle

Comparison of Affine and Rational Quadratic Spline Coupling and Autoregressive Flows through Robust Statistical Tests

by Andrea Coccaro, Marco Letizia, Humberto Reyes-González and Riccardo Torre

Symmetry 2024, 16(8), 942; https://doi.org/10.3390/sym16080942 - 23 Jul 2024

Cited by 15 | Viewed by 4062

Abstract

Normalizing flows have emerged as a powerful brand of generative models, as they not only allow for efficient sampling of complicated target distributions but also deliver density estimation by construction. We propose here an in-depth comparison of coupling and autoregressive flows, both based on symmetric (affine) and non-symmetric (rational quadratic spline) bijectors, considering four different architectures: real-valued non-Volume preserving (RealNVP), masked autoregressive flow (MAF), coupling rational quadratic spline (C-RQS), and autoregressive rational quadratic spline (A-RQS). We focus on a set of multimodal target distributions of increasing dimensionality ranging from 4 to 400. The performances were compared by means of different test statistics for two-sample tests, built from known distance measures: the sliced Wasserstein distance, the dimension-averaged one-dimensional Kolmogorov–Smirnov test, and the Frobenius norm of the difference between correlation matrices. Furthermore, we included estimations of the variance of both the metrics and the trained models. Our results indicate that the A-RQS algorithm stands out both in terms of accuracy and training speed. Nonetheless, all the algorithms are generally able, without too much fine-tuning, to learn complicated distributions with limited training data and in a reasonable time of the order of hours on a Tesla A40 GPU. The only exception is the C-RQS, which takes significantly longer to train, does not always provide good accuracy, and becomes unstable for large dimensionalities. All algorithms were implemented using TensorFlow2 and TensorFlow Probability and have been made available on GitHub. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

21 pages, 3192 KB

Open AccessArticle

Cutting-Edge Machine Learning Techniques for Accurate Prediction of Agglomeration Size in Water–Alumina Nanofluids

by Behzad Vaferi, Mohsen Dehbashi and Ali Hosin Alibak

Symmetry 2024, 16(7), 804; https://doi.org/10.3390/sym16070804 - 27 Jun 2024

Cited by 4 | Viewed by 2001

Abstract

Nanoparticle agglomeration is one of the most problematic phenomena during nanofluid synthesis by a two-step procedure. Understanding and accurately estimating agglomeration size is crucial, as it significantly affects nanofluids’ properties, behavior, and successful applications. To the best of our knowledge, the literature has not yet applied machine learning methods to estimate alumina agglomeration size in water-based nanofluids. So, this research employs a range of machine learning models—Random Forest, Adaptive Boosting, Extra Trees, Categorical Boosting, and Multilayer Perceptron Neural Networks—to predict alumina agglomeration sizes in water-based nanofluids. To this end, a comprehensive experimental database, including 345 alumina agglomeration sizes in water-based nanofluids, compiled from 29 various sources from the literature, is utilized to train these models and monitor their generalization ability in the testing stage. The models estimate agglomeration size based on multiple factors: alumina concentration, ultrasonic time, power, frequency, temperature, surfactant type and concentration, and pH levels. The relevancy test based on the Pearson method clarifies that Al₂O₃ agglomeration size in water primarily depends on ultrasonic frequency, ultrasonic power, alumina concentration in water, and surfactant concentration. Comparative analyses based on numerical and graphical techniques reveal that the Categorical Boosting model surpasses others in accurately simulating this complex phenomenon. It effectively captures the intricate relationships between key features and alumina agglomeration size, achieving an average absolute relative deviation of 6.75%, a relative absolute error of 12.83%, and a correlation coefficient of 0.9762. Furthermore, applying the leverage method to the experimental data helps identify two problematic measurements within the database. These results validate the effectiveness of the Categorical Boosting model and contribute to the broader goal of enhancing our understanding and control of nanofluid properties, thereby aiding in improving their practical applications. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

21 pages, 5171 KB

Open AccessArticle

Discharge Capacity Estimation for Li-Ion Batteries: A Comparative Study

by Saadin Oyucu, Sezer Dümen, İremnur Duru, Ahmet Aksöz and Emre Biçer

Symmetry 2024, 16(4), 436; https://doi.org/10.3390/sym16040436 - 5 Apr 2024

Cited by 17 | Viewed by 3212

Abstract

Li-ion batteries are integral to various applications, ranging from electric vehicles to mobile devices, because of their high energy density and user friendliness. The assessment of the Li-ion state of heath stands as a crucial research domain, aiming to innovate safer and more effective battery management systems that can predict and promptly report any operational discrepancies. To achieve this, an array of machine learning (ML) and artificial intelligence (AI) methodologies have been employed to analyze data from Li-ion batteries, facilitating the estimation of critical parameters like state of charge (SoC) and state of health (SoH). The continuous enhancement of ML and AI algorithm efficiency remains a pivotal focus of scholarly inquiry. Our study distinguishes itself by separately evaluating traditional machine learning frameworks and advanced deep learning paradigms to determine their respective efficacy in predictive modeling. We dissected the performances of an assortment of models, spanning from conventional ML techniques to sophisticated, hybrid deep learning constructs. Our investigation provides a granular analysis of each model’s utility, promoting an informed and strategic integration of ML and AI in Li-ion battery state of health prognostics. Specifically, a utilization of machine learning algorithms such as Random Forests (RFs) and eXtreme Gradient Boosting (XGBoost), alongside regression models like Elastic Net and foundational neural network approaches including Multilayer Perceptron (MLP) were studied. Furthermore, our research investigated the enhancement of time series analysis using intricate models like Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) and their outcomes with those of hybrid models, including a RNN-long short-term memory (LSTM), CNN-LSTM, CNN-Gated Recurrent Unit (GRU) and RNN-GRU. Comparative evaluations reveal that the RNN-LSTM configuration achieved a Mean Squared Error (MSE) of 0.043, R-Squared of 0.758, Root Mean Square Error (RMSE) of 0.208, and Mean Absolute Error (MAE) of 0.124, whereas the CNN-LSTM framework reported an MSE of 0.039, R-Squared of 0.782, RMSE of 0.197, and MAE of 0.122, underscoring the potential of deep learning-based hybrid models in advancing the accuracy of battery state of health assessments. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

21 pages, 4286 KB

Open AccessArticle

Velocity Estimations in Blood Microflows via Machine Learning Symmetries

by Gerardo Alfonso Perez and Jaime Virgilio Colchero Paetz

Symmetry 2024, 16(4), 428; https://doi.org/10.3390/sym16040428 - 4 Apr 2024

Cited by 2 | Viewed by 1540

Abstract

Improving velocity forecasts of blood microflows could be useful in biomedical applications. We focus on estimating the velocity of the blood in capillaries. Modeling blood microflow in capillaries is a complex process. In this paper, we use artificial intelligence techniques for this modeling: [...] Read more.

R^{2}

of 0.8992 comparing the forecast with the actual velocity. A key part of ANN model creation is selecting the appropriate parameters for the ANN, such as the number of neurons, the number of layers and the type of training algorithm used. A grid approach with 327,600 simulations was used. It is shown that there are substantial, statistically significant differences when different types of ANN structures are used. It is also shown that the proposed model is robust regarding the initial random initialization of weights in the ANN. Additionally, the sensitivity of the selected models to additional noise was also tested. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Figure 1

19 pages, 532 KB

Open AccessEditor’s ChoiceArticle

Shifting Pattern Biclustering and Boolean Reasoning Symmetry

by Marcin Michalak and Jesús S. Aguilar-Ruiz

Symmetry 2023, 15(11), 1977; https://doi.org/10.3390/sym15111977 - 26 Oct 2023

Cited by 2 | Viewed by 2514

Abstract

There are several goals of the two-dimensional data analysis: one may be interested in searching for groups of similar objects (clustering), another one may be focused on searching for some dependencies between a specified one and other variables (classification, regression, associate rules induction), and finally, some may be interested in serching for well-defined patterns in the data called biclusters. It was already proved that there exists a mathematically proven symmetry between some patterns in the matrix and implicants of data-defined Boolean function. This paper provides the new look for a specific pattern search—the pattern named the

δ

-shifting pattern. The shifting pattern is interesting, as it accounts for constant fluctuations in data, i.e., it captures situations in which all the values in the pattern move up or down for one dimension, maintaining the range amplitude for all the dimensions. Such a behavior is very common in real data, e.g., in the analysis of gene expression data. In such a domain, a subset of genes might go up or down for a subset of patients or experimental conditions, identifying functionally coherent categories. A

δ

-shifting pattern meets the necessity of shifting pattern induction together with the bias of the real values acquisition where the original shifts may be disturbed with some outer conditions. Experiments with a real dataset show the potential of our approach at finding biclusters with

δ

-shifting patterns, providing excellent performance. It was possible to find the

12 \times 9

pattern in the

112 \times 9

input data with

M S R = 0.00653

. The experiments also revealed that

δ

-shifting patterns are quite difficult to be found by some well-known methods of biclustering, as these are not designed to focus on shifting patterns—results comparable due to

M S R

had much more variability (in terms of

δ

) than patterns found with Boolean reasoning. Full article

(This article belongs to the Special Issue Machine Learning and Data Analysis II)

► Show Figures

Journal Menu

Journal Browser

Machine Learning and Data Analysis II

Share This Special Issue

Special Issue Editor

Special Issue Information

Keywords

Benefits of Publishing in a Special Issue

Related Special Issues

Published Papers (9 papers)

Editorial

Research

Further Information

Guidelines

MDPI Initiatives

Follow MDPI