applsci-logo

Journal Browser

Journal Browser

Applications of Data Science and Artificial Intelligence

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: 20 December 2024 | Viewed by 13867

Special Issue Editors


E-Mail Website
Guest Editor
ISEG – Lisbon School of Economics and Management, Universidade de Lisboa, 1200-781 Lisboa, Portugal
Interests: data science; data science and management; machine learning in finance; gamification; information systems
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
NOVA IMS Information Management School, Universidade Nova de Lisboa Campus de Campolide, 1070-312 Lisboa, Portugal
Interests: data science; artificial intelligence; information systems; e-learning; digital transformation; gamification; e-commerce
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Significant advances in artificial intelligence (AI) have led to new challenges and opportunities in the field. Data science is a rapidly growing area of study and professional discipline. It is thus critical to investigate this new reality from a social and corporate standpoint. Abundant information about data science and AI and how they may be used to solve economic and societal problems exists. However, in order to realize the widespread use of data science and AI in business and everyday life, their efficacy must be objectively assessed. This Special Issue aims to gather contributions from academics investigating a variety of subjects and viewpoints, including AI-related management, social sciences, and engineering. Given the present level of AI and data science, three forms are of particular interest: machine learning, natural language processing, and robotics. Submissions considering other relevant topics will also be considered.

Dr. Carlos J. Costa
Dr. Manuela Aparicio
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • data science applications
  • AI applications
  • machine learning applications
  • NLP applications
  • AI trends
  • data science trends

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue polices can be found here.

Published Papers (8 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Editorial

Jump to: Research

3 pages, 196 KiB  
Editorial
Applications of Data Science and Artificial Intelligence
by Carlos J. Costa and Manuela Aparicio
Appl. Sci. 2023, 13(15), 9015; https://doi.org/10.3390/app13159015 - 7 Aug 2023
Cited by 4 | Viewed by 2512
Abstract
A series of waves have marked the history of artificial intelligence (AI) [...] Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)

Research

Jump to: Editorial

19 pages, 3351 KiB  
Article
Automatizing Automatic Controller Design Process: Designing Robust Automatic Controller under High-Amplitude Disturbances Using Particle Swarm Optimized Neural Network Controller
by Celal Onur Gökçe
Appl. Sci. 2024, 14(17), 7859; https://doi.org/10.3390/app14177859 - 4 Sep 2024
Viewed by 461
Abstract
In this study, a novel approach of designing automatic control systems with the help of AI tools is proposed. Given plant dynamics, expected references, and expected disturbances, the design of an optimal neural network-based controller is performed automatically. Several common reference types are [...] Read more.
In this study, a novel approach of designing automatic control systems with the help of AI tools is proposed. Given plant dynamics, expected references, and expected disturbances, the design of an optimal neural network-based controller is performed automatically. Several common reference types are studied including step, square, sine, sawtooth, and trapezoid functions. Expected reference–disturbance pairs are used to train the system for finding optimal neural network controller parameters. A separate test set is used to test the system for unexpected reference–disturbance pairs to show the generalization performance of the proposed system. Parameters of a real DC motor are used to test the proposed approach. The real DC motor’s parameters are estimated using a particle swarm optimization (PSO) algorithm. Initially, a proportional–integral (PI) controller is designed using a PSO algorithm to find the simple controller’s parameters optimally and automatically. Starting with the neural network equivalent of the optimal PI controller, the optimal neural network controller is designed using a PSO algorithm for training again. Simulations are conducted with estimated parameters for a diverse set of training and test patterns. The results are compared with the optimal PI controller’s performance and reported in the corresponding section. Encouraging results are obtained, suggesting further research in the proposed direction. For low-disturbance scenarios, even simple controllers can have acceptable performance, but the real quality of a proposed controller should be shown under high-amplitude and difficult disturbances, which is the case in this study. The proposed controller shows higher performance, especially under high disturbances, with an 8.6% reduction in error rate on average compared with the optimal PI controller, and under high-amplitude disturbances, the performance difference is of more than 2.5 folds. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

14 pages, 766 KiB  
Article
ULYSSES: Automated FreqUentLY ASked QueStions for KnowlEdge GraphS
by Giannis Vassiliou, Georgia Eirini Trouli, Georgia Troullinou, Nikolaos Spyridakis, George Bitzarakis, Fotini Droumalia, Antonis Karagiannakis, Georgia Skouteli, Nikolaos Oikonomou, Dimitra Deka, Emmanouil Makaronas, Georgios Pronoitis, Konstantinos Alexandris, Stamatios Kostopoulos, Yiannis Kazantzakis, Nikolaos Vlassis, Eleftheria Sfinarolaki, Vardis Daskalakis, Iakovos Giannakos, Argyro Stamatoukou, Nikolaos Papadakis and Haridimos Kondylakisadd Show full author list remove Hide full author list
Appl. Sci. 2024, 14(17), 7640; https://doi.org/10.3390/app14177640 - 29 Aug 2024
Viewed by 725
Abstract
The exponential growth of Knowledge Graphs necessitates effective and efficient methods for their exploration and understanding. Frequently Asked Questions (FAQ) is a service that typically presents a list of questions and answers related to a specific topic, and which is intended to help [...] Read more.
The exponential growth of Knowledge Graphs necessitates effective and efficient methods for their exploration and understanding. Frequently Asked Questions (FAQ) is a service that typically presents a list of questions and answers related to a specific topic, and which is intended to help people understand that topic. Although FAQ has already shown its value on large websites and is widely used, to the best of our knowledge it has not yet been exploited for Knowledge Graphs. In this paper, we present ULYSSES, the first system for automatically constructing FAQ lists for large Knowledge Graphs. Our method consists of three key steps. First, we select the most frequent queries by exploiting the available query logs. Next, we answer the selected queries, using the original graph. Finally, we construct textual descriptions of both the queries and the corresponding answers, exploring state-of-the-art transformer models, i.e., ChatGPT 3.5 and Gemini 1.5 Pro. We evaluate the results of each model, using a human-constructed FAQ list, contributing a unique dataset to the domain and showing the benefits of our approach. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

16 pages, 1570 KiB  
Article
A Classification Method for Incomplete Mixed Data Using Imputation and Feature Selection
by Gengsong Li, Qibin Zheng, Yi Liu, Xiang Li, Wei Qin and Xingchun Diao
Appl. Sci. 2024, 14(14), 5993; https://doi.org/10.3390/app14145993 - 9 Jul 2024
Viewed by 932
Abstract
Data missing is a ubiquitous problem in real-world systems that adversely affects the performance of machine learning algorithms. Although many useful imputation methods are available to address this issue, they often fail to consider the information provided by both features and labels. As [...] Read more.
Data missing is a ubiquitous problem in real-world systems that adversely affects the performance of machine learning algorithms. Although many useful imputation methods are available to address this issue, they often fail to consider the information provided by both features and labels. As a result, the performance of these methods might be constrained. Furthermore, feature selection as a data quality improvement technique has been widely used and has demonstrated its efficiency. To overcome the limitation of imputation methods, we propose a novel algorithm that combines data imputation and feature selection to tackle classification problems for mixed data. Based on the mean and standard deviation of quantitative features and the selecting probabilities of unique values of categorical features, our algorithm constructs different imputation models for quantitative and categorical features. Particle swarm optimization is used to optimize the parameters of the imputation models and select feature subsets simultaneously. Additionally, we introduce a legacy learning mechanism to enhance the optimization capability of our method. To evaluate the performance of the proposed method, seven algorithms and twelve datasets are used for comparison. The results show that our algorithm outperforms other algorithms in terms of accuracy and F1 score and has reasonable time overhead. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

24 pages, 3487 KiB  
Article
A New Hybrid Approach for Product Management in E-Commerce
by Hacire Oya Yüregir, Metin Özşahin and Serap Akcan Yetgin
Appl. Sci. 2024, 14(13), 5735; https://doi.org/10.3390/app14135735 - 1 Jul 2024
Viewed by 786
Abstract
Nowadays, due to the developments in technology and the effects of the pandemic, people have largely switched to e-commerce instead of traditional face-to-face commerce. In this sector, the product variety reaches tens of thousands, which has made it difficult to manage and to [...] Read more.
Nowadays, due to the developments in technology and the effects of the pandemic, people have largely switched to e-commerce instead of traditional face-to-face commerce. In this sector, the product variety reaches tens of thousands, which has made it difficult to manage and to make quick decisions on inventory, promotion, pricing, and logistics. Therefore, it is thought that obtaining accurate and fast forecasting for the future will provide significant benefits to such companies in every respect. This study was built on the proposal of creating a cluster-based–genetic algorithm hybrid forecasting model including genetic algorithm (GA), cluster analysis, and some forecasting models as a new approach. In this study, unlike the literature, an attempt was made to create a more successful forecasting model for many products at the same time inside of single product forecasting. The proposed CBGA model success was compared separately to both the single prediction method successes and only genetic algorithm-based hybrid model successes by using real values from a popular B2C company. As a result, it has been observed that the forecasting success of the model proposed in this study is more successful than the forecasting made using single models or only the genetic algorithm. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

16 pages, 15347 KiB  
Article
Transforming Customer Digital Footprints into Decision Enablers in Hospitality
by Achini Adikari, Su Nguyen, Rashmika Nawaratne, Daswin De Silva and Damminda Alahakoon
Appl. Sci. 2024, 14(7), 3114; https://doi.org/10.3390/app14073114 - 8 Apr 2024
Viewed by 924
Abstract
The proliferation of online hotel review platforms has prompted decision-makers in the hospitality sector to acknowledge the significance of extracting valuable information from this vast source. While contemporary research has primarily focused on extracting sentiment and discussion topics from online reviews, the transformative [...] Read more.
The proliferation of online hotel review platforms has prompted decision-makers in the hospitality sector to acknowledge the significance of extracting valuable information from this vast source. While contemporary research has primarily focused on extracting sentiment and discussion topics from online reviews, the transformative potential of such insights remains largely untapped. In this paper, we propose an approach that leverages Natural Language Processing (NLP) techniques to convert unstructured textual reviews into a quantifiable and structured representation of emotions and hotel aspects. Building upon this derived representation, we conducted a segmentation analysis to gauge distinct emotion and concern-based profiles of customers, as well as profiles of hotels with similar customer emotions using a self-organizing unsupervised algorithm. We demonstrated the practicality of our approach using 22,450 online reviews collected from 44 hotels. The insights garnered from emotion analysis and review segmentation facilitate the development of targeted customer management strategies and informed decision-making. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

40 pages, 5710 KiB  
Article
The Impacts of Open Data and eXplainable AI on Real Estate Price Predictions in Smart Cities
by Fátima Trindade Neves, Manuela Aparicio and Miguel de Castro Neto
Appl. Sci. 2024, 14(5), 2209; https://doi.org/10.3390/app14052209 - 6 Mar 2024
Cited by 1 | Viewed by 4545
Abstract
In the rapidly evolving landscape of urban development, where smart cities increasingly rely on artificial intelligence (AI) solutions to address complex challenges, using AI to accurately predict real estate prices becomes a multifaceted and crucial task integral to urban planning and economic development. [...] Read more.
In the rapidly evolving landscape of urban development, where smart cities increasingly rely on artificial intelligence (AI) solutions to address complex challenges, using AI to accurately predict real estate prices becomes a multifaceted and crucial task integral to urban planning and economic development. This paper delves into this endeavor, highlighting the transformative impact of specifically chosen contextual open data and recent advances in eXplainable AI (XAI) to improve the accuracy and transparency of real estate price predictions within smart cities. Focusing on Lisbon’s dynamic housing market from 2018 to 2021, we integrate diverse open data sources into an eXtreme Gradient Boosting (XGBoost) machine learning model optimized with the Optuna hyperparameter framework to enhance its predictive precision. Our initial model achieved a Mean Absolute Error (MAE) of EUR 51,733.88, which was significantly reduced by 8.24% upon incorporating open data features. This substantial improvement underscores open data’s potential to boost real estate price predictions. Additionally, we employed SHapley Additive exPlanations (SHAP) to address the transparency of our model. This approach clarifies the influence of each predictor on price estimates and fosters enhanced accountability and trust in AI-driven real estate analytics. The findings of this study emphasize the role of XAI and the value of open data in enhancing the transparency and efficacy of AI-driven urban development, explicitly demonstrating how they contribute to more accurate and insightful real estate analytics, thereby informing and improving policy decisions for the sustainable development of smart cities. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

16 pages, 541 KiB  
Article
Research on Ensemble Learning-Based Feature Selection Method for Time-Series Prediction
by Da Huang, Zhaoguo Liu and Dan Wu
Appl. Sci. 2024, 14(1), 40; https://doi.org/10.3390/app14010040 - 20 Dec 2023
Cited by 1 | Viewed by 1672
Abstract
Feature selection has perennially stood as a pivotal concern in the realm of time-series forecasting due to its direct influence on the efficacy of predictive models. Conventional approaches to feature selection predominantly rely on domain knowledge and experiential insights and are, therefore, susceptible [...] Read more.
Feature selection has perennially stood as a pivotal concern in the realm of time-series forecasting due to its direct influence on the efficacy of predictive models. Conventional approaches to feature selection predominantly rely on domain knowledge and experiential insights and are, therefore, susceptible to individual subjectivity and the resultant inconsistencies in the outcomes. Particularly in domains such as financial markets, and within datasets comprising time-series information, an abundance of features adds complexity, necessitating adept handling of high-dimensional data. The computational expenses associated with traditional methodologies in managing such data dimensions, coupled with vulnerability to the curse of dimensionality, further compound the challenges at hand. In response to these challenges, this paper advocates for an innovative approach—a feature selection method grounded in ensemble learning. The paper explicitly delineates the formal integration of ensemble learning into feature selection, guided by the overarching principle of “good but different”. To operationalize this concept, five feature selection methods that are well suited to ensemble learning were identified, and their respective weights were determined through K-fold cross-validation when applied to specific datasets. This ensemble method amalgamates the outcomes of diverse feature selection techniques into a numeric composite, thereby mitigating potential biases inherent in traditional methods and elevating the precision and comprehensiveness of feature selection. Consequently, this method improves the performance of time-series prediction models. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

Back to TopTop