MDPI - Publisher of Open Access Journals

22 pages, 2008 KB

Open AccessArticle

Charting the Development of Robot-Assisted Social–Emotional Learning: Mapping Its Intellectual Foundations, Thematic Foci, and Evolution

by Wenjia Cui, Kejun Zhang, Zaipeng Zhang, Haoran Cui, Cixian Lv, Taghreed Ali Alsudais and Xinghua Wang

Behav. Sci. 2026, 16(5), 746; https://doi.org/10.3390/bs16050746 - 11 May 2026

Viewed by 298

Abstract

Social and emotional learning (SEL) has become increasingly central to educational policy and lifelong development, while advances in robotics have opened new possibilities for supporting socio-emotional competencies through human–robot interaction. Despite the rapid growth of robot-assisted SEL research, this field remains fragmented, with [...] Read more.

Social and emotional learning (SEL) has become increasingly central to educational policy and lifelong development, while advances in robotics have opened new possibilities for supporting socio-emotional competencies through human–robot interaction. Despite the rapid growth of robot-assisted SEL research, this field remains fragmented, with limited understanding of its intellectual structure, thematic foci, and evolution. To address this gap, this study conducted a scientometric analysis of 241 publications indexed in Web of Science using bibliometric methods. Results indicate a steady growth trajectory, with research concentrated in a small number of core countries driving international collaboration. Influential publications and co-citation patterns reveal a strong foundation in autism-related interventions and child-centered social skill development. Thematic mapping shows that motor themes are dominated by soft skills, autism, and interaction design, while emotion recognition and affective computing form technically mature but specialized streams. Foundational concepts such as human–robot interaction and artificial intelligence remain central yet theoretically evolving. Emerging links between robotics, STEM, and project-based learning suggest ongoing pedagogical expansion. This study maps the intellectual and thematic structure of robot-assisted SEL, showing how robots are emerging as mediational agents in hybrid learning systems while revealing uneven integration and misalignments between technological capabilities and pedagogical foundations. Full article

(This article belongs to the Topic Personality, Emotions, and Emotional Intelligence Assessments: New Applications and Instruments)

► Show Figures

Figure 1

11 pages, 1329 KB

Open AccessProceeding Paper

Neuromorphic AI-Based e-Skin for Emotion-Sensitive Humanoid Robots

by Shubham Gupta and Suhaib Ahmed

Eng. Proc. 2026, 124(1), 114; https://doi.org/10.3390/engproc2026124114 - 7 May 2026

Viewed by 548

Abstract

Humanoid robots operating in proximity to humans require the ability to perceive and interpret emotional cues conveyed through touch to achieve safe, natural, and socially intelligent interaction. Conventional tactile sensing systems primarily focus on force or pressure detection and cannot infer affective intent, [...] Read more.

Humanoid robots operating in proximity to humans require the ability to perceive and interpret emotional cues conveyed through touch to achieve safe, natural, and socially intelligent interaction. Conventional tactile sensing systems primarily focus on force or pressure detection and cannot infer affective intent, while frame-based deep learning models often suffer from high latency and energy consumption when deployed on embedded platforms. To address these limitations, this paper presents a neuromorphic AI-based multimodal electronic skin (e-skin) framework for emotion-sensitive touch perception in humanoid robots. The proposed system integrates pressure, temperature, and electrostatic sensing with a bio-inspired signal conditioning pipeline and a Spiking Neural Network (SNN) for event-driven, low-power processing. A custom multimodal tactile dataset was collected using the proposed e-skin prototype to model four emotional touch interactions: stress, neutral, comfort, and affection. Experimental results demonstrate that the proposed approach achieves a high emotion classification accuracy of up to 92%, with an average accuracy of 88.75% across all classes. The neuromorphic SNN significantly reduces inference latency to approximately 8 ms, compared to 38 ms for a conventional CNN-based model, while maintaining energy-efficient operation suitable for edge deployment. The results validate the effectiveness of combining multimodal tactile sensing with neuromorphic processing to enable real-time, emotion-aware human–robot interaction. Full article

(This article belongs to the Proceedings of The 6th International Electronic Conference on Applied Sciences)

► Show Figures

Figure 1

32 pages, 7928 KB

Open AccessArticle

eXCube2: Explainable Brain-Inspired Spiking Neural Network Framework for Emotion Recognition from Audio, Visual and Multimodal Audio–Visual Data

by N. K. Kasabov, A. Yang, Z. Wang, I. Abouhassan, A. Kassabova and T. Lappas

Biomimetics 2026, 11(3), 208; https://doi.org/10.3390/biomimetics11030208 - 14 Mar 2026

Viewed by 779

Abstract

This paper introduces a biomimetic framework and novel brain-inspired AI (BIAI) models based on spiking neural networks (SNNs) for emotional state recognition from audio (speech), visual (face), and integrated multimodal audio–visual data. The developed framework, named eXCube2, uses a three-dimensional SNN architecture NeuCube [...] Read more.

This paper introduces a biomimetic framework and novel brain-inspired AI (BIAI) models based on spiking neural networks (SNNs) for emotional state recognition from audio (speech), visual (face), and integrated multimodal audio–visual data. The developed framework, named eXCube2, uses a three-dimensional SNN architecture NeuCube that is spatially structured according to a human brain template. The BIAI models developed in eXCube2 are trainable on spatio- and spectro-temporal data using brain-inspired learning rules. Such models are explainable in terms of revealing patterns in data and are adaptable to new data. The eXCube2 models are implemented as software systems and tested on speech and video data of subjects expressing emotional states. The use of a brain template for the SNN structure enables brain-inspired tonotopic and stereo mapping of audio inputs, topographic mapping of visual data, and the combined use of both modalities. This novel approach brings AI-based emotional state recognition closer to human perception, provides a better explainability and adaptability than existing AI systems. It also results in a higher or competitive accuracy, even though this was not the main goal here. This is demonstrated through experiments on benchmark datasets, achieving classification accuracy above 80% on single-modality data and 88.9% when multimodal audio–visual data are used, and a “don’t know” output is introduced. The paper further discusses possible applications of the proposed eXCube2 framework to other audio, visual, and audio–visual data for solving challenging problems, such as recognizing emotional states of people from different origins; brain state diagnosis (e.g., Parkinson’s disease, Alzheimer’s disease, ADHD, dementia); measuring response to treatment over time; evaluating satisfaction responses from online clients; cognitive robotics; human–robot interaction; chatbots; and interactive computer games. The SNN-based implementation of BIAI also enables the use of neuromorphic chips and platforms, leading to reduced power consumption, smaller device size, higher performance accuracy, and improved adaptability and explainability. This research shows a step toward building brain-inspired AI systems. Full article

► Show Figures

Figure 1

22 pages, 1747 KB

Open AccessReview

Talking Head Generation Through Generative Models and Cross-Modal Synthesis Techniques

by Hira Nisar, Salman Masood, Zaki Malik and Adnan Abid

J. Imaging 2026, 12(3), 119; https://doi.org/10.3390/jimaging12030119 - 10 Mar 2026

Viewed by 1229

Abstract

Talking Head Generation (THG) is a rapidly advancing field at the intersection of computer vision, deep learning, and speech synthesis, enabling the creation of animated human-like heads that can produce speech and express emotions with high visual realism. The core objective of THG [...] Read more.

Talking Head Generation (THG) is a rapidly advancing field at the intersection of computer vision, deep learning, and speech synthesis, enabling the creation of animated human-like heads that can produce speech and express emotions with high visual realism. The core objective of THG systems is to synthesize coherent and natural audio–visual outputs by modeling the intricate relationship between speech signals, facial dynamics, and emotional cues. These systems find widespread applications in virtual assistants, interactive avatars, video dubbing for multilingual content, educational technologies, and immersive virtual and augmented reality environments. Moreover, the development of THG has significant implications for accessibility technologies, cultural preservation, and remote healthcare interfaces. This survey paper presents a comprehensive and systematic overview of the technological landscape of Talking Head Generation. We begin by outlining the foundational methodologies that underpin the synthesis process, including generative adversarial networks (GANs), motion-aware recurrent architectures, and attention-based models. A taxonomy is introduced to organize the diverse approaches based on the nature of input modalities and generation goals. We further examine the contributions of various domains such as computer vision, speech processing, and human–robot interaction, each of which plays a critical role in advancing the capabilities of THG systems. The paper also provides a detailed review of datasets used for training and evaluating THG models, highlighting their coverage, structure, and relevance. In parallel, we analyze widely adopted evaluation metrics, categorized by their focus on image quality, motion accuracy, synchronization, and semantic fidelity. Operating parameters such as latency, frame rate, resolution, and real-time capability are also discussed to assess deployment feasibility. Special emphasis is placed on the integration of generative artificial intelligence (GenAI), which has significantly enhanced the adaptability and realism of talking head systems through more powerful and generalizable learning frameworks. Full article

(This article belongs to the Special Issue AI-Driven Multimodal Image and Video Processing: Advances and Applications)

► Show Figures

Figure 1

35 pages, 1070 KB

Open AccessEditor’s ChoiceArticle

Adaptive Deep Learning Framework for Emotion Recognition in Social Robots: Toward Inclusive Human–Robot Interaction for Users with Special Needs

by Eryka Probierz and Adam Gałuszka

Electronics 2026, 15(5), 924; https://doi.org/10.3390/electronics15050924 - 25 Feb 2026

Viewed by 761

Abstract

Emotion recognition is a key capability of social robots operating in real-world human-centered environments, especially when interacting with users with special needs. Such users may express emotions in atypical, subtle, or strongly context-dependent ways. These characteristics pose significant challenges for conventional emotion recognition [...] Read more.

Emotion recognition is a key capability of social robots operating in real-world human-centered environments, especially when interacting with users with special needs. Such users may express emotions in atypical, subtle, or strongly context-dependent ways. These characteristics pose significant challenges for conventional emotion recognition systems. This paper proposes an adaptive deep learning framework for emotion recognition in social robots. The framework is designed to support inclusive and accessible human–robot interaction. It combines region-based convolutional neural networks with adaptive learning mechanisms. These mechanisms explicitly model individual variability, contextual information, and interaction dynamics. Multiple deep architectures are evaluated to assess robustness across diverse emotional expressions, including those influenced by cognitive, sensory, or developmental differences. Rather than relying on fixed emotion models, the proposed approach emphasizes adaptability. The system dynamically adjusts its perception strategies to user-specific expressive patterns. Experimental validation is conducted using context-aware emotion datasets. Performance is evaluated in terms of detection accuracy, robustness to variability, and generalization across emotion categories. The results show that adaptive mechanisms improve recognition performance in scenarios characterized by non-standard or low-intensity expressions, compared to static baseline models. This study highlights the importance of flexible, context-sensitive perception for inclusive social robotics. It also discusses design implications for deploying emotion-aware robots in assistive, educational, and therapeutic settings. Overall, the proposed framework represents a step toward socially intelligent robots capable of engaging more effectively with users with special needs. Full article

(This article belongs to the Special Issue Research on Deep Learning and Human-Robot Collaboration)

► Show Figures

Figure 1

34 pages, 8501 KB

Open AccessArticle

A Multimodal Adaptive Framework for Social Interaction with the MiRo-E Robot

by Yufeng Yang, Pei Shan Yap, Sobanawartiny Wijeakumar, Aly Magassouba and Nikhil Deshpande

Sensors 2026, 26(4), 1209; https://doi.org/10.3390/s26041209 - 12 Feb 2026

Viewed by 853

Abstract

Adaptivity is a key component of social human–robot interaction (HRI) towards achieving more natural and human-like interactions. Current interactive systems tend to rely on preset and repetitive verbal communication and isolated nonverbal interactions, which results in unappealing engagement. This study proposes an integrated [...] Read more.

Adaptivity is a key component of social human–robot interaction (HRI) towards achieving more natural and human-like interactions. Current interactive systems tend to rely on preset and repetitive verbal communication and isolated nonverbal interactions, which results in unappealing engagement. This study proposes an integrated framework that combines a coordinated nonverbal interaction system based on real-time emotion expression with a fine-tuned large language model-based verbal communication system, resulting in more engaging and context-aware interaction. The design utilises the MiRo-E as the zoomorphic social interaction platform, with the aim of enhancing the consistency across verbal and nonverbal modalities and improving user engagement through adaptive and emotionally aligned responses. To evaluate the effectiveness of the approach, a user study was conducted with tasks designed to assess user engagement, task performance, and the perceived naturalness of interaction. Task performance metrics and subjective questionnaire responses indicate that the framework significantly enhances user experience, improving task completion rates, engagement, and perceived naturalness. Full article

(This article belongs to the Special Issue Smart Sensing System for Intelligent Human–Computer Interaction)

► Show Figures

Figure 1

17 pages, 15849 KB

Open AccessArticle

A Study on the Appearance and Behavioral Patterns of Robots for Fostering Attachment in Users

by Younseal Eum, Cheonyu Park, Gihun Kang, Yeonghun Chun and Jeakweon Han

Appl. Sci. 2026, 16(3), 1290; https://doi.org/10.3390/app16031290 - 27 Jan 2026

Viewed by 680

Abstract

As the importance of emotional interaction between humans and robots continues to gain attention, numerous studies have been conducted to identify the characteristics and effects of emotional HRI (Human–Robot Interaction) elements applied to robots. However, no study has yet combined various HRI elements [...] Read more.

As the importance of emotional interaction between humans and robots continues to gain attention, numerous studies have been conducted to identify the characteristics and effects of emotional HRI (Human–Robot Interaction) elements applied to robots. However, no study has yet combined various HRI elements into a single robot and conducted large-scale user experiments to determine which HRI element users prefer the most. This study selected four characteristics that facilitate attachment and emotional bonding between humans and animals: grooming, emotional transfer, imprinting, and cooperative hunting (play). These four characteristics were incorporated into the design and behavioral patterns of the robot EDIE as HRI elements. To allow users to effectively experience these elements, a 30 min runtime robot performance content featuring EDIE as the main character was developed. This large-scale experiment in the form of a performance enabled participants to engage with all four HRI elements and then respond to a survey identifying their most preferred element. Over two experiments involving a total of 3760 participants, this study examined trends in user preferences regarding the robot’s characteristics. By identifying the most effective HRI elements for fostering user attachment to robots, the findings aim to contribute to the harmonious coexistence of humans and robots. Full article

(This article belongs to the Special Issue Novel Approaches and Applications in Human–Robot Interactions)

► Show Figures

Figure 1

19 pages, 2708 KB

Open AccessArticle

A TPU-Based 3D Printed Robotic Hand: Design and Its Impact on Human–Robot Interaction

by Younglim Choi, Minho Lee, Seongmin Yea, Seunghwan Kim and Hyunseok Kim

Electronics 2026, 15(2), 262; https://doi.org/10.3390/electronics15020262 - 7 Jan 2026

Viewed by 1467

Abstract

This study outlines the design and evaluation of a biomimetic robotic hand tailored for Human–Robot Interaction (HRI), focusing on improvements in tactile fidelity driven by material choice. Thermoplastic polyurethane (TPU) was selected over polylactic acid (PLA) based on its reported elastomeric characteristics and [...] Read more.

This study outlines the design and evaluation of a biomimetic robotic hand tailored for Human–Robot Interaction (HRI), focusing on improvements in tactile fidelity driven by material choice. Thermoplastic polyurethane (TPU) was selected over polylactic acid (PLA) based on its reported elastomeric characteristics and mechanical compliance described in prior literature. Rather than directly matching human skin properties, TPU was perceived as providing a softer and more comfortable tactile interaction compared to rigid PLA. The robotic hand was anatomically reconstructed from an open-source model and integrated with AX-12A and MG90S actuators to simplify wiring and enhance motion precision. A custom PCB, built around an ATmega2560 microcontroller, enables real-time communication with ROS-based upper-level control systems. Angular displacement analysis of repeated gesture motions confirmed the high repeatability and consistency of the system. A repeated-measures user study involving 47 participants was conducted to compare the PLA- and TPU-based prototypes during interactive tasks such as handshakes and gesture commands. The TPU hand received significantly higher ratings in tactile realism, grip satisfaction, and perceived responsiveness (p < 0.05). Qualitative feedback further supported its superior emotional acceptance and comfort. These findings indicate that incorporating TPU in robotic hand design not only enhances mechanical performance but also plays a vital role in promoting emotionally engaging and natural human–robot interactions, making it a promising approach for affective HRI applications. Full article

(This article belongs to the Special Issue Enabling Intelligent Human–Robot Interaction: Design, Operation, and Learning)

► Show Figures

Figure 1

13 pages, 1196 KB

Open AccessArticle

Socially Assistive Robot Hyodol for Depressive Symptoms of Community-Dwelling Older Adults in Medically Underserved Areas: A Preliminary Study

by Han Wool Jung, Yujin Kim, Hyojung Kim, Min-kyeong Kim, Hyejung Lee, Jin Young Park, Woo Jung Kim and Jaesub Park

J. Clin. Med. 2026, 15(1), 217; https://doi.org/10.3390/jcm15010217 - 27 Dec 2025

Viewed by 1153

Abstract

Background/Objectives: Socially assistive robots effectively support elderly care when they incorporate personalization, person-centered principles, rich interactions, and careful role setting with psychosocial alignment. Hyodol, a socially assistive robot designed for elderly people, embodies a grandchild’s persona, emulating the grandparent–grandchild relationship. Based [...] Read more.

Background/Objectives: Socially assistive robots effectively support elderly care when they incorporate personalization, person-centered principles, rich interactions, and careful role setting with psychosocial alignment. Hyodol, a socially assistive robot designed for elderly people, embodies a grandchild’s persona, emulating the grandparent–grandchild relationship. Based on the behavioral activation principles and a human-centered approach, this robot continuously supports users’ emotional well-being, health management, and daily routines. Methods: The current study evaluated Hyodol’s impact on depressive symptoms and other quality of life factors among older adults living in medically underserved areas. A total of 278 participants were assessed for depressive symptoms, loneliness, medication adherence, and user acceptance. Results: After six months of use, participants showed significant reductions in overall depressive symptoms, with a 45% decrease in the proportion of individuals at high risk of depression. Significant improvements were also observed in loneliness and medication adherence. Participants reported high levels of user acceptance and satisfaction, exceeding 70% of the total score. Participants who engaged more frequently in free chat with Hyodol showed greater improvements in depressive symptoms. Conclusions: These results highlight Hyodol’s potential as a promising tool for enhancing mental healthcare and overall well-being in this population. This at-home mental-healthcare framework can complement primary care and, if its effects are confirmed in controlled trials, could contribute to reducing healthcare burden and preventing the onset and escalation of depressive symptoms. Full article

(This article belongs to the Special Issue Innovations in the Treatment for Depression and Anxiety)

► Show Figures

Figure 1

18 pages, 1678 KB

Open AccessArticle

Body Knowledge and Emotion Recognition in Preschool Children: A Comparative Study of Human Versus Robot Tutors

by Alice Araguas, Arnaud Blanchard, Sébastien Derégnaucourt, Adrien Chopin and Bahia Guellai

Behav. Sci. 2026, 16(1), 29; https://doi.org/10.3390/bs16010029 - 23 Dec 2025

Viewed by 860

Abstract

Social robots are increasingly integrated into early childhood education, yet limited research exists examining preschoolers’ learning from robotic versus human demonstrators across embodied tasks. This study investigated whether children (aged between 3 and 6) demonstrate comparable performance when learning body-centered tasks from a [...] Read more.

Social robots are increasingly integrated into early childhood education, yet limited research exists examining preschoolers’ learning from robotic versus human demonstrators across embodied tasks. This study investigated whether children (aged between 3 and 6) demonstrate comparable performance when learning body-centered tasks from a humanoid robot compared to a human demonstrator. Sixty-two typically developing children were randomly assigned to a robot or a human condition. Participants completed three tasks: body part comprehension and production, body movement imitation, and emotion recognition from body postures. Performance was measured using standardized protocols. No significant main effects of demonstrator type emerged across most tasks. However, age significantly predicted performance across all measures, with systematic improvements between 3 and 6. A significant age × demonstrator interaction was observed for sequential motor imitation, with stronger age effects for the human demonstrator condition. Preschool children demonstrate comparable performance when interacting with a humanoid robot versus a human in body-centered tasks, though motor imitation shows differential developmental trajectories. These findings suggest appropriately designed social robots may serve as supplementary pedagogical tools for embodied learning in early childhood education under specific conditions. The primacy of developmental effects highlights the importance of age-appropriate design in both traditional and technology-enhanced educational contexts. Full article

► Show Figures

Figure 1

22 pages, 11862 KB

Open AccessArticle

Do We View Robots as We Do Ourselves? Examining Robotic Face Processing Using EEG

by Xaviera Pérez-Arenas, Álvaro A. Rivera-Rei, David Huepe and Vicente Soto

Brain Sci. 2026, 16(1), 9; https://doi.org/10.3390/brainsci16010009 - 22 Dec 2025

Viewed by 935

Abstract

Background/Objectives: The ability to perceive and process emotional faces quickly and efficiently is essential for human social interactions. In recent years, humans have started to interact more regularly with robotic faces in the form of virtual or real-world robots. Neurophysiological research regarding how [...] Read more.

Background/Objectives: The ability to perceive and process emotional faces quickly and efficiently is essential for human social interactions. In recent years, humans have started to interact more regularly with robotic faces in the form of virtual or real-world robots. Neurophysiological research regarding how the brain decodes robotic faces relative to human ones is scarce and, as such, warrants further research to explore these mechanisms and their social implications. Methods: This study uses event-related potentials (ERPs) to examine the neural correlates during an emotional face categorization task involving human and robotic stimuli. We examined differences in brain activity elicited by viewing robotic and human faces expressing both happy and neutral emotions. ERP waveforms’ amplitudes for the P100, N170, P300, and P600 components were calculated and compared. Furthermore, mass univariate analysis of ERP waveforms was carried out to explore effects not limited to brain regions previously reported in the literature. Results: Results showed robotic faces evoked increased waveform amplitudes at early components (P100 and N170) as well as at the later P300 component. Further, only mid-latency and late cortical components (P300 and P600) showed amplitude differences resulting from emotional valences, aligning with dual-stage models of face processing. Conclusions: These results advance our understanding of face processing during human–robot interaction and contribute to our understanding of brain mechanisms underlying interactions when viewing social robots, setting new considerations for their use in brain health settings and broader cognitive impact. Full article

(This article belongs to the Section Cognitive, Social and Affective Neuroscience)

► Show Figures

Figure 1

17 pages, 3220 KB

Open AccessArticle

ArecaNet: Robust Facial Emotion Recognition via Assembled Residual Enhanced Cross-Attention Networks for Emotion-Aware Human–Computer Interaction

by Jaemyung Kim and Gyuho Choi

Sensors 2025, 25(23), 7375; https://doi.org/10.3390/s25237375 - 4 Dec 2025

Viewed by 841

Abstract

Recently, the convergence of advanced sensor technologies and innovations in artificial intelligence and robotics has highlighted facial emotion recognition (FER) as an essential component of human–computer interaction (HCI). Traditional FER studies based on handcrafted features and shallow machine learning have shown a limited [...] Read more.

Recently, the convergence of advanced sensor technologies and innovations in artificial intelligence and robotics has highlighted facial emotion recognition (FER) as an essential component of human–computer interaction (HCI). Traditional FER studies based on handcrafted features and shallow machine learning have shown a limited performance, while convolutional neural networks (CNNs) have improved nonlinear emotion pattern analysis but have been constrained by local feature extraction. Vision transformers (ViTs) have addressed this by leveraging global correlations, yet both CNN- and ViT-based single networks often suffer from overfitting, single-network dependency, and information loss in ensemble operations. To overcome these limitations, we propose ArecaNet, an assembled residual enhanced cross-attention network that integrates multiple feature streams without information loss. The framework comprises (i) channel and spatial feature extraction via SCSESResNet, (ii) landmark feature extraction from specialized sub-networks, (iii) iterative fusion through residual enhanced cross-attention, (iv) final emotion classification from the fused representation. Our research introduces a novel approach by integrating pre-trained sub-networks specialized in facial recognition with an attention mechanism and our uniquely designed main network, which is optimized for size reduction and efficient feature extraction. The extracted features are fused through an iterative residual enhanced cross-attention mechanism, which minimizes information loss and preserves complementary representations across networks. This strategy overcomes the limitations of conventional ensemble methods, enabling seamless feature integration and robust recognition. The experimental results show that the proposed ArecaNet achieved accuracies of 97.0% and 97.8% using the public databases, FER-2013 and RAF-DB, which were 4.5% better than the existing state-of-the-art method, PAtt-Lite, for FER-2013 and 2.75% for RAF-DB, and achieved a new state-of-the-art accuracy for each database. Full article

(This article belongs to the Special Issue Sensor-Based Behavioral Biometrics)

► Show Figures

Figure 1

20 pages, 31486 KB

Open AccessArticle

Design and Implementation of a Companion Robot with LLM-Based Hierarchical Emotion Motion Generation

by Yoongu Lim, Jaeuk Cho, Duk-Yeon Lee, Dongwoon Choi and Dong-Wook Lee

Appl. Sci. 2025, 15(23), 12759; https://doi.org/10.3390/app152312759 - 2 Dec 2025

Cited by 1 | Viewed by 1748

Abstract

Recently, human–robot interaction (HRI) with social robots has attracted significant attention. Among them, companion robots, which exhibit pet-like behaviors and interact with people primarily through non-verbal means, particularly require the generation of appropriate gestures. This paper presents the design and implementation of a [...] Read more.

Recently, human–robot interaction (HRI) with social robots has attracted significant attention. Among them, companion robots, which exhibit pet-like behaviors and interact with people primarily through non-verbal means, particularly require the generation of appropriate gestures. This paper presents the design and implementation of a companion cat robot, named PEPE, with a large language model (LLM)-based hierarchical emotional motion generation method. To design the cat-like companion robot, an analysis of feline emotional behaviors was conducted to identify the body parts and motions essential for effective emotional expression. Based on this analysis, the required degrees of freedom (DoFs) and structural configuration for PEPE were derived. To generate expressive gestures efficiently and reliably, a hierarchical LLM-based emotional motion generation method was proposed. The process defines the robot’s structural features, establishes a gesture generation code format, and incorporates emotion-based guidelines grounded in feline behavioral analysis to mitigate LLM hallucination and ensure physical feasibility. The proposed method was implemented on the physical robot, and eight emotional gestures were generated—Happy, Angry, Sad, Fearful, Joyful, Excited, Positive Feedback, and Negative Feedback. A user study with 15 participants was conducted to validate the system. The high-arousal gestures—Angry, Joyful, and Excited—were rated significantly above the neutral clarity threshold (p < 0.05), demonstrating clear user recognition. Meanwhile, low-arousal gestures exhibited neutral-level perceptions consistent with their subtle motion profiles. These results confirm that the proposed LLM-based framework effectively generates expressive, physically executable gestures for a companion robot. Full article

► Show Figures

Figure 1

31 pages, 3310 KB

Open AccessArticle

Companion Robots Supporting the Emotional Needs of the Elderly: Research Trends and Future Directions

by Hui Zeng, Yuxin Sheng and Jinwei Zhu

Information 2025, 16(11), 948; https://doi.org/10.3390/info16110948 - 3 Nov 2025

Cited by 3 | Viewed by 6277

Abstract

The accelerating global population aging has brought increasing attention to the loneliness and emotional needs experienced by older adults due to shrinking social networks and the loss of relatives and friends, which significantly impair their quality of life and psychological well-being. In this [...] Read more.

The accelerating global population aging has brought increasing attention to the loneliness and emotional needs experienced by older adults due to shrinking social networks and the loss of relatives and friends, which significantly impair their quality of life and psychological well-being. In this context, companion robots powered by artificial intelligence are increasingly regarded as a scalable and sustainable form of emotional intervention that can address older people’s affective and social requirements. This study systematically reviews research trends in this field, analyzing the structure of emotional needs among older users and their acceptance mechanisms toward robot functionalities. First, a keyword co-occurrence analysis was conducted using VOSviewer on relevant literature published between 2000 and 2025 from the Web of Science database, revealing focal research topics and emerging trends. Subsequently, questionnaire surveys and in-depth interviews were carried out to identify emotional needs and functional preferences among elderly users. Findings indicate that the field is characterized by increasing interdisciplinary integration, with affective computing and naturalistic interaction becoming central concerns. Empirical results reveal significant differences in need structures across age groups: the oldest-old prioritize safety monitoring and daily assistance, whereas the young-old emphasize social interaction and developmental activities. Regarding emotional interaction, older adults generally prefer natural and non-intrusive expressive styles and exhibit reserved attitudes toward highly anthropomorphic designs. Key factors influencing acceptance include practicality, ease of use, privacy protection, and emotional warmth. The study concludes that effective companion robot design should be grounded in a nuanced understanding of the heterogeneous needs of the aging population, integrating functionality, interaction, and emotional value. Future development should emphasize adaptive and customizable capabilities, adopt natural yet restrained interaction strategies, and strengthen real-world cross-cultural and long-term evaluations. Full article

► Show Figures

Graphical abstract

24 pages, 1312 KB

Open AccessArticle

Differences in Human Response When Interacting in Real and Virtual (VR) Human–Robot Scenarios

by Jonas Birkle and Verena Wagner-Hartl

Automation 2025, 6(4), 58; https://doi.org/10.3390/automation6040058 - 15 Oct 2025

Viewed by 1288

Abstract

The utilization of robots has become an integral aspect of industrial operations. In this particular context, the study of the interaction of humans and robots aims to integrate their relevant capabilities with the intention of attaining maximum efficiency. Moreover, in the private sector, [...] Read more.

The utilization of robots has become an integral aspect of industrial operations. In this particular context, the study of the interaction of humans and robots aims to integrate their relevant capabilities with the intention of attaining maximum efficiency. Moreover, in the private sector, interaction with robots is already common in many places. Acceptance, trust, and perceived emotions vary widely depending on specific contexts. This highlights the necessity for adequate training to mitigate fears and enhance trust and acceptance. Currently, no such training is available. Virtual realities have frequently proven to be helpful platforms for the implementation of training. This study aims to evaluate the suitability of virtual realities for training in this specific application area. For this purpose, simple object handovers were performed in three different scenarios (reality, virtual reality, and hybrid reality). Subjective evaluations of the participants were extended by psychophysiological (ECG and EDA) and performance measures. In most cases, the results show no significant differences between the scenarios, indicating that personal perception during interaction is transferable to a virtual reality. This demonstrates the general suitability of virtual realities in this context. Full article

(This article belongs to the Section Robotics and Autonomous Systems)

► Show Figures

Figure 1

Search Results (178)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (178)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI