Topic Editors

Dr. Jiahui Yu
Binjiang Institute, Zhejiang University, Hangzhou 310053, China
Department of Computer Science, University of Liverpool, Liverpool L69 3BX, UK
Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou 310014, China
School of Computing, University of Portsmouth, Portsmouth, UK
School of Computing, University of Portsmouth, Portsmouth PO1 3HE, UK
Binjiang Institute, Zhejiang University, Hangzhou 310053, China

Theoretical and Applied Problems in Human-Computer Intelligent Systems

Abstract submission deadline
closed (31 March 2025)
Manuscript submission deadline
closed (30 June 2025)
Viewed by
53488

Topic Information

Dear Colleagues,

The advancement in human–computer interaction technology has been a driving force in developing robotics, intelligent systems, and medical applications. The ability to understand human–computer interaction through visual and text analyses, as well as ensuring security, is critical for the success of intelligent systems. To achieve this, it is necessary to extract valuable insights from visual and text information and to implement robust security measures to protect against potential attacks. Despite these advancements, many challenges remain to be addressed in this field. Recently, deep learning, AI defense and attack, real-time learning, robot control, and other cutting-edge technologies have been explored and applied to intelligent systems. The increasing demand and complex real-world problems have encouraged the growth of academic research in human–computer interaction. This Topic brings together experts, engineers, and researchers worldwide to present their latest findings and advancements in human–computer interaction. The focus is on innovative theories related to intelligent systems and machine learning, emphasizing applications involving human–machine interaction through visual and text information.

Dr. Jiahui Yu
Prof. Dr. Charlie Yang
Prof. Dr. Zhenyu Wen
Dr. Dalin Zhou
Dr. Dongxu Gao
Dr. Changting Lin
Topic Editors

Keywords

  • visual-based interaction
  • data analysis
  • robustness and security
  • machine learning
  • computer-aided medical analysis
  • robotic control and design
  • machine learning in signal transmission

Participating Journals

Journal Name Impact Factor CiteScore Launched Year First Decision (median) APC
Applied Sciences
applsci
2.5 5.5 2011 19.8 Days CHF 2400
Electronics
electronics
2.6 6.1 2012 16.8 Days CHF 2400
Future Internet
futureinternet
3.6 8.3 2009 17 Days CHF 1600
Machines
machines
2.5 4.7 2013 16.9 Days CHF 2400
Systems
systems
3.1 4.1 2013 18.8 Days CHF 2400
Technologies
technologies
3.6 8.5 2013 21.8 Days CHF 1600
Biomimetics
biomimetics
3.9 4.2 2016 17.1 Days CHF 2200

Preprints.org is a multidisciplinary platform offering a preprint service designed to facilitate the early sharing of your research. It supports and empowers your research journey from the very beginning.

MDPI Topics is collaborating with Preprints.org and has established a direct connection between MDPI journals and the platform. Authors are encouraged to take advantage of this opportunity by posting their preprints at Preprints.org prior to publication:

  1. Share your research immediately: disseminate your ideas prior to publication and establish priority for your work.
  2. Safeguard your intellectual contribution: Protect your ideas with a time-stamped preprint that serves as proof of your research timeline.
  3. Boost visibility and impact: Increase the reach and influence of your research by making it accessible to a global audience.
  4. Gain early feedback: Receive valuable input and insights from peers before submitting to a journal.
  5. Ensure broad indexing: Web of Science (Preprint Citation Index), Google Scholar, Crossref, SHARE, PrePubMed, Scilit and Europe PMC.

Published Papers (16 papers)

Order results
Result details
Journals
Select all
Export citation of selected articles as:
26 pages, 10083 KB  
Article
Product Image Generation Method Based on Morphological Optimization and Image Style Transfer
by Aimin Zhou, Xinle Wang, Yujin Huang, Weitang Wang, Shutao Zhang and Jinyan Ouyang
Appl. Sci. 2025, 15(13), 7330; https://doi.org/10.3390/app15137330 - 30 Jun 2025
Viewed by 517
Abstract
In order to improve the controllability and esthetics of product image generation, from the perspective of design, this study proposes a product image generation method based on morphological optimization, esthetic evaluation, and style transfer. Firstly, based on computational esthetics and principles of visual [...] Read more.
In order to improve the controllability and esthetics of product image generation, from the perspective of design, this study proposes a product image generation method based on morphological optimization, esthetic evaluation, and style transfer. Firstly, based on computational esthetics and principles of visual perception, an esthetic comprehensive evaluation model is constructed and used as the fitness function. The genetic algorithm is employed to build a product morphological optimization design system, obtaining product form schemes with higher esthetic quality. Then, an automobile front-end image dataset is constructed, and a generative adversarial network model is trained. Using the aforementioned product form scheme as the content image and selecting automobile front-end images from the market as the target style image, the content features and style features are extracted by the encoder and input into the generator to generate style-transferred images. The discriminator is utilized for judgment, and through iterative optimization, product image schemes that meet the target style are obtained. Experimental results demonstrate that the model generates product images with good effects, showcasing the feasibility of the method and providing robust technical support for intelligent product image design. Full article
Show Figures

Figure 1

18 pages, 4002 KB  
Article
The Spatio-Temporal Equalization Sliding-Window Distribution Distance Maximization Based on Unsupervised Learning for Online Event-Related Potential-Based Brain–Computer Interfaces
by Haoye Wang, Jing Jin, Xinjie He, Shurui Li and Andrzej Cichocki
Machines 2025, 13(4), 282; https://doi.org/10.3390/machines13040282 - 29 Mar 2025
Viewed by 872
Abstract
Brain–computer interfaces (BCIs) provide a direct communication pathway between the central nervous system and external environments, enabling human–machine interaction control. Among them, event-related potential (ERP)-based BCIs are among the most accurate and reliable BCI systems. However, current mainstream classification algorithms struggle to eliminate [...] Read more.
Brain–computer interfaces (BCIs) provide a direct communication pathway between the central nervous system and external environments, enabling human–machine interaction control. Among them, event-related potential (ERP)-based BCIs are among the most accurate and reliable BCI systems. However, current mainstream classification algorithms struggle to eliminate calibration requirements and rely heavily on costly labeled data, limiting the practical usability of ERP-based BCIs. To address this, the development of unsupervised algorithms is critical for advancing real-world BCI applications. In this study, we propose the spatio-temporal equalization sliding-window distribution distance maximization (STE-sDDM) algorithm, which introduces spatio-temporal equalization (STE) to unsupervised ERP classification for the first time and integrates it with a novel unsupervised classification method, sliding-window distribution distance maximization (sDDM). STE estimates and removes colored noise interference in background noise to enhance the signal-to-noise ratio of inputs for sDDM. Meanwhile, sDDM leverages an enhanced inter-class divergence metric based on the ergodic hypothesis theory, utilizing sliding windows to emphasize temporally discriminative features, thereby improving unsupervised classification accuracy. The experimental results demonstrate that the integration of STE and sDDM significantly enhances ERP feature separability, outperforming state-of-the-art unsupervised online classification algorithms in spelling accuracy and the information transfer rate (ITR), facilitating more accurate and faster plug-and-play real-time control for BCI systems. Additionally, static spatio-temporal equalizer architectures were found to outperform dynamic architectures when combined with this framework. Full article
Show Figures

Figure 1

31 pages, 8901 KB  
Article
CORES: Context-Aware Emotion-Driven Recommendation System-Based LLM to Improve Virtual Shopping Experiences
by Abderrahim Lakehal, Adel Alti and Boubakeur Annane
Future Internet 2025, 17(2), 94; https://doi.org/10.3390/fi17020094 - 18 Feb 2025
Viewed by 2865
Abstract
In today’s business landscape, artificial intelligence (AI) plays a pivotal role in shopping processes and customization. As the demand for customization grows, virtual reality (VR) emerges as an innovative solution to improve users’ perception and decision making in virtual shopping experiences (VSEs). Despite [...] Read more.
In today’s business landscape, artificial intelligence (AI) plays a pivotal role in shopping processes and customization. As the demand for customization grows, virtual reality (VR) emerges as an innovative solution to improve users’ perception and decision making in virtual shopping experiences (VSEs). Despite its potential, limited research has explored the integration of contextual information and emotions in VR to deliver effective product recommendations. This paper presents CORES (context-aware emotion-driven recommendation system), a novel approach designed to enrich users’ experiences and to support decision making in VR. CORES combines advanced large language models (LLMs) and embedding-based context-aware recommendation strategies to provide customized products. Therefore, emotions are collected from social platforms, and relevant contextual information is matched to enable effective recommendation. Additionally, CORES leverages transformers and retrieval-augmented generation (RAG) capabilities to explain recommended items, facilitate VR visualization, and generate insights using various prompt templates. CORES is applied to a VR shop of different items. An empirical study validates the efficiency and accuracy of this approach, achieving a significant average accuracy of 97% and an acceptable response time of 0.3267s in dynamic shopping scenarios. Full article
Show Figures

Graphical abstract

28 pages, 9930 KB  
Article
Evolving the Whale Optimization Algorithm: The Development and Analysis of MISWOA
by Chunfang Li, Yuqi Yao, Mingyi Jiang, Xinming Zhang, Linsen Song, Yiwen Zhang, Baoyan Zhao, Jingru Liu, Zhenglei Yu, Xinyang Du and Shouxin Ruan
Biomimetics 2024, 9(10), 639; https://doi.org/10.3390/biomimetics9100639 - 18 Oct 2024
Cited by 4 | Viewed by 2449
Abstract
This paper introduces an enhanced Whale Optimization Algorithm, named the Multi-Swarm Improved Spiral Whale Optimization Algorithm (MISWOA), designed to address the shortcomings of the traditional Whale Optimization Algorithm (WOA) in terms of global search capability and convergence velocity. The MISWOA combines an adaptive [...] Read more.
This paper introduces an enhanced Whale Optimization Algorithm, named the Multi-Swarm Improved Spiral Whale Optimization Algorithm (MISWOA), designed to address the shortcomings of the traditional Whale Optimization Algorithm (WOA) in terms of global search capability and convergence velocity. The MISWOA combines an adaptive nonlinear convergence factor with a variable gain compensation mechanism, adaptive weights, and an advanced spiral convergence strategy, resulting in a significant enhancement in the algorithm’s global search capability, convergence velocity, and precision. Moreover, MISWOA incorporates a multi-population mechanism, further bolstering the algorithm’s efficiency and robustness. Ultimately, an extensive validation of MISWOA through “simulation + experimentation” approaches has been conducted, demonstrating that MISWOA surpasses other algorithms and the Whale Optimization Algorithm (WOA) and its variants in terms of convergence accuracy and algorithmic efficiency. This validates the effectiveness of the improvement method and the exceptional performance of MISWOA, while also highlighting its substantial potential for application in practical engineering scenarios. This study not only presents an improved optimization algorithm but also constructs a systematic framework for analysis and research, offering novel insights for the comprehension and refinement of swarm intelligence algorithms. Full article
Show Figures

Figure 1

22 pages, 12633 KB  
Article
MediaPipe Frame and Convolutional Neural Networks-Based Fingerspelling Detection in Mexican Sign Language
by Tzeico J. Sánchez-Vicinaiz, Enrique Camacho-Pérez, Alejandro A. Castillo-Atoche, Mayra Cruz-Fernandez, José R. García-Martínez and Juvenal Rodríguez-Reséndiz
Technologies 2024, 12(8), 124; https://doi.org/10.3390/technologies12080124 - 1 Aug 2024
Cited by 8 | Viewed by 4486
Abstract
This research proposes implementing a system to recognize the static signs of the Mexican Sign Language (MSL) dactylological alphabet using the MediaPipe frame and Convolutional Neural Network (CNN) models to correctly interpret the letters that represent the manual signals coming from a camera. [...] Read more.
This research proposes implementing a system to recognize the static signs of the Mexican Sign Language (MSL) dactylological alphabet using the MediaPipe frame and Convolutional Neural Network (CNN) models to correctly interpret the letters that represent the manual signals coming from a camera. The development of these types of studies allows the implementation of technological advances in artificial intelligence and computer vision in teaching Mexican Sign Language (MSL). The best CNN model achieved an accuracy of 83.63% over the sets of 336 test images. In addition, considering samples of each letter, the following results are obtained: an accuracy of 84.57%, a sensitivity of 83.33%, and a specificity of 99.17%. The advantage of this system is that it could be implemented on low-consumption equipment, carrying out the classification in real-time, contributing to the accessibility of its use. Full article
Show Figures

Figure 1

23 pages, 3648 KB  
Article
Probabilistic Confusion Matrix: A Novel Method for Machine Learning Algorithm Generalized Performance Analysis
by Ioannis Markoulidakis and Georgios Markoulidakis
Technologies 2024, 12(7), 113; https://doi.org/10.3390/technologies12070113 - 13 Jul 2024
Cited by 21 | Viewed by 6746
Abstract
The paper addresses the issue of classification machine learning algorithm performance based on a novel probabilistic confusion matrix concept. The paper develops a theoretical framework which associates the proposed confusion matrix and the resulting performance metrics with the regular confusion matrix. The theoretical [...] Read more.
The paper addresses the issue of classification machine learning algorithm performance based on a novel probabilistic confusion matrix concept. The paper develops a theoretical framework which associates the proposed confusion matrix and the resulting performance metrics with the regular confusion matrix. The theoretical results are verified based on a wide variety of real-world classification problems and state-of-the-art machine learning algorithms. Based on the properties of the probabilistic confusion matrix, the paper then highlights the benefits of using the proposed concept both during the training phase and the application phase of a classification machine learning algorithm. Full article
Show Figures

Figure 1

17 pages, 9779 KB  
Article
Optimizing Speech Emotion Recognition with Machine Learning Based Advanced Audio Cue Analysis
by Nuwan Pallewela, Damminda Alahakoon, Achini Adikari, John E. Pierce and Miranda L. Rose
Technologies 2024, 12(7), 111; https://doi.org/10.3390/technologies12070111 - 11 Jul 2024
Cited by 3 | Viewed by 4567
Abstract
In today’s fast-paced and interconnected world, where human–computer interaction is an integral component of daily life, the ability to recognize and understand human emotions has emerged as a crucial facet of technological advancement. However, human emotion, a complex interplay of physiological, psychological, and [...] Read more.
In today’s fast-paced and interconnected world, where human–computer interaction is an integral component of daily life, the ability to recognize and understand human emotions has emerged as a crucial facet of technological advancement. However, human emotion, a complex interplay of physiological, psychological, and social factors, poses a formidable challenge even for other humans to comprehend accurately. With the emergence of voice assistants and other speech-based applications, it has become essential to improve audio-based emotion expression. However, there is a lack of specificity and agreement in current emotion annotation practice, as evidenced by conflicting labels in many human-annotated emotional datasets for the same speech segments. Previous studies have had to filter out these conflicts and, therefore, a large portion of the collected data has been considered unusable. In this study, we aimed to improve the accuracy of computational prediction of uncertain emotion labels by utilizing high-confidence emotion labelled speech segments from the IEMOCAP emotion dataset. We implemented an audio-based emotion recognition model using bag of audio word encoding (BoAW) to obtain a representation of audio aspects of emotion in speech with state-of-the-art recurrent neural network models. Our approach improved the state-of-the-art audio-based emotion recognition with a 61.09% accuracy rate, an improvement of 1.02% over the BiDialogueRNN model and 1.72% over the EmoCaps multi-modal emotion recognition models. In comparison to human annotation, our approach achieved similar results in identifying positive and negative emotions. Furthermore, it has proven effective in accurately recognizing the sentiment of uncertain emotion segments that were previously considered unusable in other studies. Improvements in audio emotion recognition could have implications in voice-based assistants, healthcare, and other industrial applications that benefit from automated communication. Full article
Show Figures

Figure 1

32 pages, 7283 KB  
Technical Note
Research on the Training and Application Methods of a Lightweight Agricultural Domain-Specific Large Language Model Supporting Mandarin Chinese and Uyghur
by Kun Pan, Xiaogang Zhang and Liping Chen
Appl. Sci. 2024, 14(13), 5764; https://doi.org/10.3390/app14135764 - 1 Jul 2024
Cited by 4 | Viewed by 2944
Abstract
In the field of Natural Language Processing (NLP), the lack of support for minority languages, especially Uyghur, the scarcity of Uyghur language corpora in the agricultural domain, and the lightweight nature of large language models remain prominent issues. This study proposes a method [...] Read more.
In the field of Natural Language Processing (NLP), the lack of support for minority languages, especially Uyghur, the scarcity of Uyghur language corpora in the agricultural domain, and the lightweight nature of large language models remain prominent issues. This study proposes a method for constructing a bilingual (Uyghur and Chinese) lightweight specialized large language model for the agricultural domain. By utilizing a mixed training approach of Uyghur and Chinese, we extracted Chinese corpus text from agricultural-themed books in PDF format using OCR (Optical Character Recognition) technology, converted the Chinese text corpus into a Uyghur corpus using a rapid translation API, and constructed a bilingual mixed vocabulary. We applied the parameterized Transformer model algorithm to train the model for the agricultural domain in both Chinese and Uyghur. Furthermore, we introduced a context detection and fail-safe mechanism for the generated text. The constructed model possesses the ability to support bilingual reasoning in Uyghur and Chinese in the agricultural domain, with higher accuracy and a smaller size that requires less hardware. It (our work) addresses issues such as the scarcity of Uyghur corpora in the agricultural domain, mixed word segmentation and word vector modeling in Uyghur for widespread agricultural languages, model lightweighting and deployment, and the fragmentation of non-relevant texts during knowledge extraction from small-scale corpora. The lightweight design of the model reduces hardware requirements, facilitating deployment in resource-constrained environments. This advancement promotes agricultural intelligence, aids in the development of specific applications and minority languages (such as agriculture and Uyghur), and contributes to rural revitalization. Full article
Show Figures

Figure 1

24 pages, 5092 KB  
Article
A Computational Framework for Enhancing Industrial Operations and Electric Network Management: A Case Study
by André F. V. Pedroso, Francisco J. G. Silva, Raul D. S. G. Campilho, Rita C. M. Sales-Contini, Arnaldo G. Pinto and Renato R. Moreira
Technologies 2024, 12(6), 91; https://doi.org/10.3390/technologies12060091 - 19 Jun 2024
Viewed by 2282
Abstract
Automotive industries require constant technological development and the capacity to adapt to market needs. Hence, component suppliers must be able to adapt to persistent trend changes and technical improvements, acting in response to customers’ expectations and developing their manufacturing methods to be as [...] Read more.
Automotive industries require constant technological development and the capacity to adapt to market needs. Hence, component suppliers must be able to adapt to persistent trend changes and technical improvements, acting in response to customers’ expectations and developing their manufacturing methods to be as flexible as possible. Concepts such as layout flexibility, management of industrial facilities, and building information modeling (BIM) are becoming ever more addressed within the automotive industry in order to envision and select the necessary information exchanges. Given this question and based on the gap in the literature regarding this subject, this work proposes a solution, developing a novel tool that allows the monitoring and assignment of newer/relocated equipment to the switchboards within a given industrial plant. The solution intends to increase the flexibility of production lines through the assessment, analysis, improvement, and reorganization of the electrical load distribution to develop projects accurately implying layout changes. The tool is validated with an automotive manufacturer. With the implementation of this open-source tool, a detailed electrical flow management system is accomplished, and it has proven successful and essential in raising levels of organizational flexibility. This has guaranteed the company’s competitiveness with effective integrated administration methods and tools, such as a much easier study upon inserting new/relocated equipment without production line breaks. Full article
Show Figures

Figure 1

14 pages, 7066 KB  
Article
Improved Particle Filter in Machine Learning-Based BLE Fingerprinting Method to Reduce Indoor Location Estimation Errors
by Jingshi Qian, Jiahe Li, Nobuyoshi Komuro, Won-Suk Kim and Younghwan Yoo
Future Internet 2024, 16(6), 211; https://doi.org/10.3390/fi16060211 - 15 Jun 2024
Viewed by 2201
Abstract
Indoor position fingerprint-based location estimation methods have been widely used by applications on smartphones. In these localization estimation methods, it is very popular to use the RSSI (Received Signal Strength Indication) of signals to represent the position fingerprint. This paper proposes the design [...] Read more.
Indoor position fingerprint-based location estimation methods have been widely used by applications on smartphones. In these localization estimation methods, it is very popular to use the RSSI (Received Signal Strength Indication) of signals to represent the position fingerprint. This paper proposes the design of a particle filter for reducing the estimation error of the machine learning-based indoor BLE location fingerprinting method. Unlike the general particle filter, taking into account the distance, the proposed system designs improved likelihood functions, considering the coordinates based on fingerprint points using mean and variance of RSSI values, combining the particle filter with the k-NN (k-Nearest Neighbor) algorithm to realize the reduction in indoor positioning error. The initial position is estimated by the position fingerprinting method based on the machine learning method. By comparing the fingerprint method based on k-NN with general particle filter processing, and the fingerprint estimation method based on only k-NN or SVM (Support Vector Machine), experiment results showed that the proposed method has a smaller minimum error and a better average error than the conventional method. Full article
Show Figures

Figure 1

19 pages, 16794 KB  
Article
A Benchmark for UAV-View Natural Language-Guided Tracking
by Hengyou Li, Xinyan Liu and Guorong Li
Electronics 2024, 13(9), 1706; https://doi.org/10.3390/electronics13091706 - 28 Apr 2024
Cited by 1 | Viewed by 3124
Abstract
We propose a new benchmark, UAVNLT (Unmanned Aerial Vehicle Natural Language Tracking), for the UAV-view natural language-guided tracking task. UAVNLT consists of videos taken from UAV cameras from four cities for vehicles on city roads. For each video, vehicles’ bounding boxes, trajectories, and [...] Read more.
We propose a new benchmark, UAVNLT (Unmanned Aerial Vehicle Natural Language Tracking), for the UAV-view natural language-guided tracking task. UAVNLT consists of videos taken from UAV cameras from four cities for vehicles on city roads. For each video, vehicles’ bounding boxes, trajectories, and natural language are carefully annotated. Compared to the existing data sets, which are only annotated with bounding boxes, the natural language sentences in our data set can be more suitable for many application fields where humans take part in the system for that language, being not only more friendly for human–computer interaction but also capable of overcoming the appearance features’ low uniqueness for tracking. We tested several existing methods on our new benchmarks and found that the performance of the existing methods was not satisfactory. To pave the way for future work, we propose a baseline method suitable for this task, achieving state-of-the-art performance. We believe our new data set and proposed baseline method will be helpful in many fields, such as smart city, smart transportation, vehicle management, etc. Full article
Show Figures

Figure 1

20 pages, 1541 KB  
Article
Developing a Performance Evaluation Framework Structural Model for Educational Metaverse
by Elena Tsappi, Ioannis Deliyannis and George Nathaniel Papageorgiou
Technologies 2024, 12(4), 53; https://doi.org/10.3390/technologies12040053 - 16 Apr 2024
Cited by 9 | Viewed by 3775
Abstract
In response to the transformative impact of digital technology on education, this study introduces a novel performance management framework for virtual learning environments suitable for the metaverse era. Based on the Structural Equation Modeling (SEM) approach, this paper proposes a comprehensive evaluative model, [...] Read more.
In response to the transformative impact of digital technology on education, this study introduces a novel performance management framework for virtual learning environments suitable for the metaverse era. Based on the Structural Equation Modeling (SEM) approach, this paper proposes a comprehensive evaluative model, anchored on the integration of the Theory of Planned Behavior (TPB), the Unified Theory of Acceptance and Use of Technology (UTAUT), and the Community of Inquiry Framework (CoI). The model synthesizes five Key Performance Indicators (KPIs)—content delivery, student engagement, metaverse tool utilization, student performance, and adaptability—to intricately assess academic avatar performances in virtual educational settings. This theoretical approach marks a significant stride in understanding and enhancing avatar efficacy in the metaverse environment. It enriches the discourse on performance management in digital education and sets a foundation for future empirical studies. As virtual online environments gain prominence in education and training, this research study establishes the basic principles and highlights the key points for further empirical research in the new era of the metaverse educational environment. Full article
Show Figures

Figure 1

32 pages, 24494 KB  
Article
Interdisciplinary Dynamics in COVID-19 Research: Examining the Role of Computer Science and Collaboration Patterns
by Yunfan Li, Shiyong Liu, An Zeng, Jun Wu, Jiayu Zhang, Weiwei Zhang and Sheng Li
Systems 2024, 12(4), 113; https://doi.org/10.3390/systems12040113 - 28 Mar 2024
Cited by 2 | Viewed by 3750
Abstract
In academia, it is rare for an event or issue to foster the extensive participation of multiple disciplines. Research related to COVID-19 has undeniably yielded a wealth of valuable insights and impetus for the progress of interdisciplinary research, encompassing concepts, methodologies, intellectual approaches, [...] Read more.
In academia, it is rare for an event or issue to foster the extensive participation of multiple disciplines. Research related to COVID-19 has undeniably yielded a wealth of valuable insights and impetus for the progress of interdisciplinary research, encompassing concepts, methodologies, intellectual approaches, theories, frameworks, data integration and analysis, and pertinent considerations. In the academic community, there is a widespread expectation that as science and technology continue to progress, the convergence of medicine with various other fields will gain momentum. Fields like computer science are anticipated to see expanded applications in domains such as medicine, vaccine research, disease diagnosis, and more. This study aims to examine interdisciplinary approaches in health-related research, particularly in the context of COVID-19. The goal is to analyze and comprehend the involvement and collaboration patterns of various disciplines in pandemic research, with a specific emphasis on the role and integration level of computer science. This study analyzed 240,509 COVID-19 related articles published from December 2019 to September 2022 using methods such as chord diagrams, modularity analysis, and eigenvector centrality analysis in Social Networking Analysis (SNA). The findings revealed an emerging trend of integration trend between Humanities & Social Sciences and Natural Sciences. Expectations that computer science would prominently feature in pandemic research during this technology-driven era haven’t materialized. While it maintains links with engineering, it hasn’t formed strong connections with medicine. This indicates a gap between computer science and core medical research in large-scale health crises, where COVID-19 research remains centered on medicine with varying interdisciplinary collaboration, and high-tech disciplines like computer science have not achieved their expected influence in these studies. Full article
Show Figures

Figure 1

20 pages, 4632 KB  
Article
Predicting Maps Using In-Vehicle Cameras for Data-Driven Intelligent Transport
by Zhiguo Ma, Yutong Zhang and Meng Han
Electronics 2023, 12(24), 5017; https://doi.org/10.3390/electronics12245017 - 15 Dec 2023
Cited by 1 | Viewed by 2178
Abstract
Bird’s eye view (BEV) semantic maps have evolved into a crucial element of urban intelligent traffic management and monitoring, offering invaluable visual and significant data representations for informed intelligent city decision making. Nevertheless, current methodologies continue underutilizing the temporal information embedded within dynamic [...] Read more.
Bird’s eye view (BEV) semantic maps have evolved into a crucial element of urban intelligent traffic management and monitoring, offering invaluable visual and significant data representations for informed intelligent city decision making. Nevertheless, current methodologies continue underutilizing the temporal information embedded within dynamic frames throughout the BEV feature transformation process. This limitation results in decreased accuracy when mapping high-speed moving objects, particularly in capturing their shape and dynamic trajectory. A framework is proposed for cross-view semantic segmentation to address this challenge, leveraging simulated environments as a starting point before applying it to real-life urban imaginative transportation scenarios. The view converter module is thoughtfully designed to collate information from multiple initial view observations captured from various angles and modes. This module outputs a top-down view semantic graph characterized by its object space layout to preserve beneficial temporal information in BEV transformation. The NuScenes dataset is used to evaluate model effectiveness. A novel application is also devised that harnesses transformer networks to map images and video sequences into top-down or comprehensive bird’s-eye views. By combining physics-based and constraint-based formulations and conducting ablation studies, the approach has been substantiated, highlighting the significance of context above and below a given point in generating these maps. This innovative method has been thoroughly validated on the NuScenes dataset. Notably, it has yielded state-of-the-art instantaneous mapping results, with particular benefits observed for smaller dynamic category displays. The experimental findings include comparing axial attention with the state-of-the-art (SOTA) model, demonstrating the performance enhancement associated with temporal awareness. Full article
Show Figures

Figure 1

30 pages, 17684 KB  
Article
Designing for Intergenerational Communication among Older Adults: A Systematic Inquiry in Old Residential Communities of China’s Yangtze River Delta
by Cun Li and Ming Cao
Systems 2023, 11(11), 528; https://doi.org/10.3390/systems11110528 - 29 Oct 2023
Cited by 1 | Viewed by 4824
Abstract
Presently, a substantial majority of older individuals in urban regions of China prefer to inhabit older residential communities over newer counterparts. Within these aging communities, the intricate matter of intergenerational communication among older adults presents a complex and multifaceted issue that warrants comprehensive [...] Read more.
Presently, a substantial majority of older individuals in urban regions of China prefer to inhabit older residential communities over newer counterparts. Within these aging communities, the intricate matter of intergenerational communication among older adults presents a complex and multifaceted issue that warrants comprehensive investigation from a systematic perspective. This paper first employs the observational method to study multiple old residential communities in a city in the Yangtze River Delta region of China. The POEMS framework and the AEIOU framework are applied, focusing on the analysis of individuals and the interaction between individuals and objects, respectively. Semistructured interviews are then conducted with three groups of people, emphasizing community participation by older adults, intergenerational interaction from the perspective of older adults, and intergenerational interaction from the perspective of young people. Finally, the paper categorizes the types and characteristics of individuals in the old communities, identifying the intersections between these groups. The current social situation of older adults and young people is summarized, including behavioral and psychological characteristics and social interaction challenges. Based on these findings, ten system design directions to enhance intergenerational interaction in old communities are proposed, and three of these system design directions are further developed. Full article
Show Figures

Figure 1

20 pages, 1017 KB  
Article
RAdam-DA-NLSTM: A Nested LSTM-Based Time Series Prediction Method for Human–Computer Intelligent Systems
by Banteng Liu, Wei Chen, Zhangquan Wang, Seyedamin Pouriyeh and Meng Han
Electronics 2023, 12(14), 3084; https://doi.org/10.3390/electronics12143084 - 16 Jul 2023
Cited by 8 | Viewed by 2182
Abstract
At present, time series prediction methods are widely applied for Human–Computer Intelligent Systems in various fields such as Finance, Meteorology, and Medicine. To enhance the accuracy and stability of the prediction model, this paper proposes a time series prediction method called RAdam-Dual stage [...] Read more.
At present, time series prediction methods are widely applied for Human–Computer Intelligent Systems in various fields such as Finance, Meteorology, and Medicine. To enhance the accuracy and stability of the prediction model, this paper proposes a time series prediction method called RAdam-Dual stage Attention mechanism-Nested Long Short-Term Memory (RAdam-DA-NLSTM). First, we design a Nested LSTM (NLSTM), which adopts a new internal LSTM unit structure as the memory cell of LSTM to guide memory forgetting and memory selection. Then, we design a self-encoder network based on the Dual stage Attention mechanism (DA-NLSTM), which uses the NLSTM encoder based on the input attention mechanism, and uses the NLSTM decoder based on the time attention mechanism. Additionally, we adopt the RAdam optimizer to solve the objective function, which dynamically selects Adam and SGD optimizers according to the variance dispersion and constructs the rectifier term to fully express the adaptive momentum. Finally, we use multiple datasets, such as PM2.5 data set, stock data set, traffic data set, and biological signals, to analyze and test this method, and the experimental results show that RAdam-DA-NLSTM has higher prediction accuracy and stability compared with other traditional methods. Full article
Show Figures

Figure 1

Back to TopTop