Next Article in Journal
Towards a New Business Model for Streaming Platforms Using Blockchain Technology
Previous Article in Journal
Enhancing Efficiency and Security in Unbalanced PSI-CA Protocols through Cloud Computing and Homomorphic Encryption in Mobile Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Visual Data and Pattern Analysis for Smart Education: A Robust DRL-Based Early Warning System for Student Performance Prediction

Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3086, Australia
*
Authors to whom correspondence should be addressed.
Future Internet 2024, 16(6), 206; https://doi.org/10.3390/fi16060206
Submission received: 27 April 2024 / Revised: 28 May 2024 / Accepted: 6 June 2024 / Published: 11 June 2024

Abstract

:
Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) have revolutionised e-learning by creating personalised, adaptive, and secure environments. However, challenges such as privacy, bias, and data limitations persist. E-FedCloud aims to address these issues by providing more agile, personalised, and secure e-learning experiences. This study introduces E-FedCloud, an AI-assisted, adaptive e-learning system that automates personalised recommendations and tracking, thereby enhancing student performance. It employs federated learning-based authentication to ensure secure and private access for both course instructors and students. Intelligent Software Agents (ISAs) evaluate weekly student engagement using the Shannon Entropy method, classifying students into either engaged or not-engaged clusters. E-FedCloud utilises weekly engagement status, demographic information, and an innovative DRL-based early warning system, specifically ID2QN, to predict the performance of not-engaged students. Based on these predictions, the system categorises students into three groups: risk of dropping out, risk of scoring lower in the final exam, and risk of failing the end exam. It employs a multi-disciplinary ontology graph and an attention-based capsule network for automated, personalised recommendations. The system also integrates performance tracking to enhance student engagement. Data are securely stored on a blockchain using the LWEA encryption method.

1. Introduction

The development of information and communication technology in recent years has significantly transformed numerous sectors, including education. The rapid proliferation of artificial intelligence (AI) and big data has been a major driver in the evolution of online education. Online education not only saves time for both students and teachers, but also encourages students to think and process information independently. Teachers guide students to enhance their academic skills, soft skills, and personality development, preparing them for a brighter future [1].
E-learning environments boost students’ knowledge and academic performance through collaborative tasks and computer-assisted activities. Computer-Supported Collaborative Learning (CSCL) is a pedagogical strategy that facilitates learning through online or computer-mediated social interaction. This approach allows participants to share and produce information using technological tools, potentially improving learning outcomes. E-learning systems are popular for their scalability, ability to offer uninterrupted learning 24/7, and cost-effectiveness [2,3]. AI is poised to further enhance these systems by providing personalised content. An AI-based adaptive and personalised e-learning system offers specific and targeted content to each learner, unlike traditional systems where all students at the same grade level receive identical content. Personalisation in e-learning, which adapts content delivery to a learner’s strengths and weaknesses, can make learning more effective. However, most current AI-based personalised e-learning strategies lack integration and do not provide a comprehensive framework. Creating such tailored e-learning platforms can address the limitations of individual models [4,5,6]. Personalisation means evaluating and instructing each student differently, and AI systems can achieve this by identifying suitable topics and levels for each learner. For example, if a learner struggles with a topic, instructors may revisit it using a different teaching approach, while students with a better understanding can progress to more complex material [7,8,9].
Moreover, while adaptive learning models like VARK cater to students’ learning preferences [10,11], they often overlook non-academic factors in performance prediction [12,13,14,15,16]. Furthermore, security is another major concern, as current e-learning platforms are vulnerable to data breaches [17,18,19,20,21]. The proposed model seeks to address these issues by integrating AI with Federated Learning, creating a secure, real-time, personalised learning ecosystem that takes both academic and non-academic factors into account.

1.1. Research Contribution

This research aims to improve the academic well-being of students by providing robust, secure, and tailored recommendations, leveraging advanced AI techniques. The key contributions of this study are outlined as follows:
  • Enhanced Security Against Cyber Threats: this work introduces federated learning-based authentication combined with the Cycle-GAN algorithm to mitigate risks, such as malicious traffic and primary vulnerabilities, enhancing the security framework within the e-learning environment.
  • Improved Accuracy in Performance Predictions: We have implemented the MV-MOC algorithm for clustering weekly student engagement data using an intelligent software agent. This method integrates various student and instructor activities, significantly improving the precision of early performance predictions.
  • Boosted Prediction Reliability: The development of an ID2QN-based early warning system aims to increase the accuracy of performance forecasts, thereby elevating the true positive rate of student evaluations. This has been complemented by creating a multi-disciplinary ontology graph to refine and expedite the recommendation process.
  • Enhanced Academic Support: The research introduces Att-CapsNet, an automated, personalised recommendation engine that extracts data from the ontology graph to formulate targeted academic advice. This system also tracks student progress over time, supporting an increase in overall academic performance rates.

1.2. Organisation

This paper is organised into the following sections: The related work section (Section 2) reviews existing systems, particularly focusing on deep reinforcement learning (DRL) in education, and highlights research gaps. This is followed by Section 3, which details the problem statement. The implementation section (Section 4) discusses the technical development of the system, the integration of visual data analysis, and deployment considerations in an e-learning environment. The results and discussion section (Section 5) evaluates the system’s performance compared to existing systems. Section 6 concludes this work by summarising contributions, acknowledging limitations, and suggesting future research directions.

2. Literature Review

This literature review delves into the realms of e-learning, particularly focusing on student performance assessments and recommendation generations. Recent studies have significantly contributed to understanding and enhancing student engagement and performance through various technological and methodological innovations.
The exploration of emotional engagement in e-learning environments has been a pivotal focus. For instance, the study identified in [22] employed machine learning algorithms, specifically convolutional neural networks (CNNs), to analyse students’ emotional states through eye movements and facial expressions during online learning sessions. This approach delineates a method for categorising student engagement levels, albeit with challenges, such as overfitting. Similarly, the research presented in [23] undertakes the classification of human emotions into distinct categories like surprise, anger, and sadness using a Bayesian classifier, underscoring the importance of emotional assessment but also noting limitations when excluding interactive and skill-based parameters.
In the domain of predictive analytics for identifying at-risk students, several methodologies have been employed. The hidden Markov model used in [24] exemplifies an intelligent approach to predicting student dropout risks, although it notably lacks incorporating demographic features, which could enhance prediction accuracy. Likewise, refs. [25,26], have advanced the application of gradient boosting and artificial neural networks (ANN) for early warning systems and performance analysis, respectively, each discussing specific challenges, such as handling large datasets and computational demands. The development of recommendation systems to enhance e-learning experiences has seen innovative applications of machine learning and deep learning techniques.
Studies like [27,28] have contributed to this field by utilising ontology and autoencoders for course recommendations, highlighting the balance between leveraging historical data and addressing individual learner needs.
Similarly, the combination of machine learning techniques with fuzzy methodologies in [29] presents an advanced approach for designing performance prediction systems. By integrating fuzzy logic, this model aims to refine the accuracy of predictions and recommendations, addressing the complexity of student data and learning paths. In the context of adaptive learning environments, ref. [30] introduces the application of Deep Q-Network (DQN) algorithms from Deep Reinforcement Learning (DRL), proposing a method for distributing learning rules and exercises intelligently. However, this approach is not without its challenges, particularly concerning the security and integrity of the learning system, as highlighted by potential vulnerabilities in the login methodology.
Further, the integration of deep learning for personalised e-learning environments, as discussed in [31,32], showcases the shift towards more adaptive and learner-centred approaches, although they also reveal gaps in evaluating and integrating learner information effectively. Learning style identification has also garnered attention, with [33] employing various machine learning algorithms to detect and cater to different learning preferences, signifying a move towards more personalised education. However, the challenge remains in accurately capturing and responding to the diverse learning modalities of students.
Moreover, the application of machine learning extends into predictive analytics for academic success, with studies like [34,35,36] exploring different algorithms and data preprocessing techniques to forecast student performance. These studies underline the potential of predictive analytics, while also pointing out the limitations of relying solely on historical academic data or a narrow set of predictive factors.
In the domain of Information Retrieval and Natural Language Processing (NLP), a new study [37] presents a fake news detection model called MST-FaDe. This model utilises a multiscale transformer architecture to accurately identify and categorise fake news across mixed-language content. This research enhances the ability to detect fake news with high precision and recall, especially in environments where content does not adhere to a single language. Integrating this model into educational AI technologies can protect against the spread of misinformation and create a more inclusive and secure learning environment.
In synthesising these works, it becomes evident that while advancements in machine learning and predictive analytics offer significant promise for enhancing e-learning environments, they also present challenges, such as data complexity, model overfitting, and the need for comprehensive feature integration. The emerging trends underscore the importance of holistic and adaptive approaches that consider a wide range of factors affecting student performance and engagement. This review sets the stage for our research into the E-FedCloud platform, which aims to integrate these insights into a cohesive system for improving e-learning outcomes.
By addressing the noted limitations and building on the foundational work reviewed, we aspire to contribute meaningfully to the evolving landscape of educational technology. Table 1 provides a summary of the literature survey.

3. Problem Definition

Our previous work, AISAR [38], developed a model using artificial intelligence algorithms to enhance academic performance in e-learning environments. This model encompassed several processes: student authentication, score estimation, clustering, performance prediction, and personalised recommendations. Authentication was conducted through PINs, passwords, and IDs, while student scores were estimated via a recurrent neural network. Students were then categorised into low, average, and excellent groups using a density-based spatial clustering algorithm. Performance prediction employed a threshold-assisted map reduction method to categorise students as very low, low, and normal. For generating recommendations, the State-Action-Reward-State-Action (SARSA) algorithm was implemented. However, this model had several limitations: the authentication approach was susceptible to modern security threats; the clustering method inadequately classified student involvement, leading to high false-positive rates; and the performance prediction lacked real-time data processing capabilities, resulting in reliability and availability issues. Additionally, the SARSA-based recommendation system did not adequately cater to the needs of below-average students requiring personalised support.
The authors in [39] propose a machine learning-based model for predicting student performance, utilising behaviour and learning information. This process includes data pre-processing, feature extraction, classification, and the development of a machine-based recommendation model. This model is built upon features such as learning behaviour consolidation, interaction, knowledge acquisition, and preparation. However, the study fails to consider crucial factors, such as student interactiveness, participation, emotions, and skills, which could compromise the effectiveness of predictions. Additionally, the study underscores the necessity of addressing issues related to complexity, overfitting, and underfitting to enhance the accuracy of prediction outcomes.
In [40], the authors proposed using personalised feedback and machine learning algorithms to predict learning behaviours in e-learning modules. Data were collected from performance indicators, educational metrics, and demographic variables. Pre-trained gated recurrent units were employed to classify students into pass or fail categories. Although personalised feedback was provided, these methods did not significantly improve the learning outcomes for struggling students. The study predominantly focused on identifying aspects and behaviours related to students, rather than considering teacher-related aspects that could influence student performance.
The authors in [41] proposed a deep learning algorithm to predict student performance and generate recommendations based on an early warning system. The data underwent pre-processing, grade prediction, and clustering using the k-nearest neighbour algorithm. The findings indicated that deep neural networks surpassed the performance of other machine learning models. However, the study faced limitations, including the omission of student and teacher legitimacy, the lack of personalised recommendations based on grades, and the lower accuracy and high computational complexity associated with the k-nearest neighbour algorithm.
The study [42] developed an automated framework to generate recommendations based on student academic performance using machine learning algorithms. This framework employed natural language processing for semantic analysis. Data were collected from user profiles, pre-processed, and analysed using an agent-based methodology. The results demonstrated that agent-based predictive recommendations surpassed the performance of other methods. However, the study did not consider factors such as student engagement results, teacher behaviour, and effective tracing of recommendations for further assistance.

4. Proposed Methods

To enhance the e-learning experience, the proposed E-FedCloud system adopts a comprehensive approach, as depicted in Figure 1, which outlines the system’s overall architecture. The process begins by ensuring robust authentication for both students and instructors, utilising diverse credentials, such as ID, password, voice, facial data, and keystroke dynamics. This is achieved through federated learning combined with the Cycle GAN algorithm. Following authentication, the system shifts focus to student engagement detection, employing deep learning and threshold-based entropy classification methods. This in-depth evaluation includes analyses of student interactions, academic behaviours, instructor behaviours, and academic emotions. Leveraging deep learning and the Att-CapsNet algorithm, E-FedCloud then generates personalised suggestions for students, tailored according to their assessed levels of participation.
Each student’s performance is taken into account when generating these recommendations, which may include initiatives like scholarships for students at risk of dropping out and additional support for those underperforming, encompassing options like retesting and adjustments to their learning styles. The system employs the ID2QN algorithm, which considers a broad array of factors, including academic achievement, demographics, the teaching methods of course instructors, and students’ prior academic backgrounds, thereby enhancing the accuracy of performance prediction. E-FedCloud also uses majority voting-based clustering algorithms to conduct weekly engagement status assessments, categorising students based on their interactivity, academic behaviour, facial emotions, and instructor actions. Furthermore, the system routinely validates weekly statuses and offers opportunities for retesting to monitor students’ progress. This comprehensive approach ensures a holistic and adaptable e-learning environment, meeting the varied needs of students and consistently fostering their academic development.
The primary focus of this research is to develop an accurate, effective, and secure performance prediction and recommendation model to assess and enhance student academics. To achieve this objective, we have integrated Artificial Intelligence (AI), Federated Learning (FL), and Cloud Computing (CC) technologies.
AI is employed to facilitate automation with high reliability; FL technology is utilised to ensure the privacy of entities; and CC technology is harnessed to provide and store computational resources on-demand without compromising quality.
The entities involved in this study include students, Course Instructors (CIs), Intelligent Software Agents (ISAs), and Cloud Servers (CS).
The detailed process, along with the functioning of each entity, is provided in the following Section. This can be understood by reviewing the sequential research processes outlined below:
  • Federated learning-based authentication
  • Majority voting-based multi-objective student engagement clustering
  • Deep reinforcement learning (DRL)-based early warning system and multi-disciplinary ontology graph construction
  • Automated personalised recommendation generation and tracking

4.1. Federated Learning-Based Authentication

The study [43] significantly expands the discussion on data privacy challenges and personalised education solutions, contributing to a richer literature review. It explores a low-latency edge computation offloading strategy for trust evaluation in finance-level Artificial Intelligence of Things (AIoT) environments. The goal is to minimise latency in data processing and decision-making by relocating these processes to edge devices closer to data sources. This approach aims to improve the speed and reliability of AI operations, thus enhancing the efficiency of technology-driven environments.
Similarly, the implementation of federated learning-based authentication in e-learning enhances data privacy and security. This approach allows for the decentralised training of machine learning models across numerous devices without necessitating the sharing of sensitive data, thereby preserving user privacy [44]. It acts as a safeguard against potential data breaches by decentralising data storage and ensuring adherence to privacy regulations. Moreover, federated learning enables the delivery of personalised learning experiences. It optimises bandwidth usage, establishing it as a bandwidth-efficient methodology. By integrating these advanced techniques, the paper underlines the importance of adopting innovative solutions to overcome the prevalent challenges in e-learning platforms. Federated learning, in particular, plays a pivotal role in maintaining the confidentiality and integrity of data, while facilitating a customised educational experience for learners. This synergy between cutting-edge AIoT strategies and federated learning underscores the evolving landscape of secure, personalised, and efficient e-learning environments. To mitigate malicious activities in the adaptive e-learning environment, a robust and privacy-preserving authentication scheme has been developed for both course instructors (CIs) and students, utilising Federated Learning (FL) technology. This scheme involves CIs, students, and cloud servers (CS).
The authentication process begins with the registration initiated by CIs and students, who send their requests to the CS. The CS comprises two centralised sub-servers, the Student Server (SS) and the Course Instructor Server (CIS), where students and CIs, respectively, register their credentials. The credentials, which include ID, password, face data, voice data, and keystroke dynamics, are treated as local models.
Subsequently, the sub-servers create the global model using a Cycle General Adversarial Network (CGAN). CGAN, consisting of generators (δ) and discriminators (ϵ) for each domain (features from students and course instructors), aims to translate characteristics from one domain to another.
Generators δ1 and δ2: convert characteristics between domains. δ1 transforms SS into CIs, while δ2 transforms CIs into SS.
δ 1 ( α ) ( β )
δ 2 ( β ) ( α )
Discriminator  ε 1  and  ε 2 : evaluate the resemblance of generated features to real features in their respective domains.
For ε 1 (Discriminating SS)
ε 1 β [ 0 , 1 ]
For ε 2 (Discriminating CIs)
ε 2 α [ 0 , 1 ]
Cycle consistency loss: ensures that a feature translated from one domain to another and back again should resemble the original feature, preserving information. We will refer to the translated features as α (SS) and β (CIs).
For δ 1 (SS to CIs like) and its reverse:
L c y c l e   δ 1 = δ 1 δ 2 α α | 1
For δ 2 (CIs to SS like) and its reverse:
L c y c l e   δ 2 = δ 2 δ 1 β β | 1
Here, L i d e n t i t y encourages generated features to resemble the original, while L c y c l e ensures cycle consistency. Equation (7) represents the generator loss ( L δ ) .
L δ = L i d e n t i t y δ 1 + L i d e n t i t y δ 2 + L c y c l e ( δ 1 δ 2 ) + L c y c l e ( δ 2 δ 1 )
Here, L r e a l encourages the discriminator to correctly identify real features, while L f a k e encourages the discriminator to accurately identify generated features as fake. Equation (8) represents the discriminator loss ( L ε ) .
L ε = L r e a l ε 1 + L f a k e ( ε 1 δ 1 ) + L r e a l ε 2 + L f a k e ( ε 2 δ 2 )
The process for implementing the Cycle Generative Adversarial Network (CGAN) is detailed in Pseudocode 1. This pseudocode outlines the steps for optimising the generator and discriminator networks over multiple epochs.
Pseudocode 1: CGAN Implementation
Input: n u m e p o c h s ,   d a t a l o a d e r ,   δ 1 ,   δ 2 ,   ε 1 ,   ε 2
for epoch in range
  for   batch   in   d a t a l o a d e r
  for epoch in range
      Optimise   ε ( ε 1 ,   ε 2 ,   δ 1 ,   δ 2 , batch)
   for epoch in range
      Optimise   δ ( δ 1 ,   δ 2 ,   ε 1 ,   ε 2 , batch)
  update   network   weights   ( δ 1 ,   δ 2 ,   ε 1 ,   ε 2 )
Output:   Trained   δ ( δ 1 , δ 2 ) ,   Trained   ε ( ε 1 ,   ε 2 )
The generated global models are shared with the students and CIs for authentication purposes. It is important to note that the CI models and student models are swapped between sub-servers to ensure a secure and private environment. By utilising these trained models, issues commonly associated with conventional authentication methods, such as sniffing, spoofing, and shoulder surfing attacks, can be effectively mitigated through the proposed, federated learning-based authentication approach.

4.2. Majority Voting-Based Multi-Objective Student Engagement Clustering

Only authenticated students and course instructors (CIs) are permitted to participate in the adaptive e-learning course. Throughout the online course, through multiple distributed interactions, Intelligent Software Agents (ISAs) are responsible for capturing students’ academic behaviours, interactivity, emotions, and other relevant behaviours, as well as CIs’ behaviours and teaching styles. This information is utilised to determine the weekly engagement status of the students. To be more specific, the responsibilities of ISAs are outlined below:

4.2.1. Student Interactiveness Acquisition

Data quality in an e-learning environment, especially regarding emotional data collected through cameras, is critical. Encouraging camera use, while ensuring informed consent and privacy, involves establishing clear guidelines on camera positioning, lighting, and background. Additionally, setting minimum technical standards for camera resolution and internet connectivity is essential. To technologically enhance data quality, implementing real-time feedback mechanisms is crucial. These mechanisms alert students when the camera is off or if the video quality is compromised due to inadequate lighting or incorrect angles. Furthermore, employing pre-processing algorithms, such as brightness and contrast adjustments, noise reduction, and image stabilisation, can significantly improve the clarity of facial features, ensuring a more accurate analysis of student emotions.
During online courses, course instructors (CIs) engage with students through various methods, such as posing questions, conducting seminars, and administering short quizzes related to lectures, among other interactive activities. Based on these daily interactions, Intelligent Software Agents (ISAs) evaluate the students’ levels of interactivity each week, categorising them into two classes: “highly interactive” and “less interactive”. This evaluation employs the threshold-based Shannon entropy (TSE) method, which quantifies the frequency of interactions exceeding a predefined threshold, thereby assessing engagement levels.
Shannon entropy, denoted as H(ϰ), measures the uncertainty in a dataset. It is calculated for a discrete random variable ϰ with probability distribution P(ϰ) using the formula:
H ( ϰ ) = x P ϰ   · l o g 2 ( P ϰ )
Here, x represents the possible values of ϰ, and P(ϰ) is the probability of each value occurring. TSE specifically focuses on values exceeding a specified threshold, represented as ℧. The unit of Shannon entropy is bits when using the base-2 logarithm (log2). To compute TSE (H), it first establishes the probability distribution for interactions surpassing the threshold and then employs the Shannon entropy formula.
H = H ( ϰ > )
Here, ϰ > ℧ represents the set of values exceeding the threshold value. H (ϰ > ℧) denotes the Shannon entropy for values greater than the threshold. TSE is calculated and then utilised to determine each student’s interactiveness grade for the week. For instance, if TSE is below a certain threshold, the student could be categorised as “less interactive” (represented as ϱlow), whereas if TSE is above the threshold, the student could be classified as “highly interactive” (represented as ϱhigh).
The process for calculating the Threshold-based Shannon Entropy (TSE) is outlined in Pseudocode 2. This pseudocode details the steps involved in determining student interactiveness based on daily interaction data and a specified threshold.
Pseudocode 2: TSE Calculation
Input: R ( i n t e r a c t i v e   d a t a ) , (threshold)
I n i t i a l i s e   H to 0.0
For   each   daily   count   in   R :
 If ℘ (probability or some related metric) > ℧:
   Calculate H using Equation (9) for probability above ℧
If   H < s o m e   v a l u e :
 Interactiveness = “Less Interactive” (ϱ_low)
Else:
 Interactiveness = “Highly Interactive” (ϱ_high)
Output:   classify   R   based   on   H

4.2.2. Student Academic Emotion Acquisition

Enhancing data quality in e-learning platforms can be achieved through advanced technological measures, including real-time feedback systems, pre-processing algorithms, and systematic quality checks. Specifically, the YOLOv6 algorithm [45] significantly improves the handling of varying image qualities by integrating features from SEC-YOLO and MCE-YOLO [46], focusing on essential facial regions like the eyes, forehead, and cheeks for emotion analysis. This deep learning approach enables the precise classification of student emotions—frustration, confusion, boredom, and engagement—by leveraging robust feature extraction that remains effective even in lower-quality video streams.
Intelligent Software Agents (ISAs) utilise the YOLOv6 algorithm to monitor and analyse students’ facial emotions daily, employing the threshold-based Shannon Entropy (TSE) method for real-time emotion classification based on the clarity and visibility of key facial features. This dual approach, combining the YOLOv6’s advanced detection capabilities with the TSE method, allows for accurate emotion categorisation and the compilation of weekly emotional insights without compromising data quality or student privacy.
Overall, the integration of YOLOv6, which marries single-channel detection with multi-channel depth for enhanced object recognition, alongside a targeted facial feature analysis, ensures a comprehensive and nuanced understanding of student emotional states. This methodological synergy not only improves the accuracy of emotion detection, but also supports a more personalised and responsive e-learning environment.
To maintain high-quality emotional data collection, providing alternatives for students who are unable or unwilling to use cameras is crucial. Periodic check-ins or mood surveys can supplement or replace camera-based emotional data. Regularly updating and training the YOLOv6 model with new data samples will enhance its accuracy and robustness under varied conditions.

4.2.3. Student Behaviour Acquisition

Student behaviour acquisition is a process that monitors and analyses student behaviours in an e-learning context to improve engagement classification. This involves collecting data on various student behaviours, such as attendance rate, actions (such as frequently pausing the course video), chitchatting, disturbances, and malpractice during exams. Processing these behaviours provides insights into their significance in the learning process. The Threshold-based Shannon Entropy (TSE) method is used to categorise these behaviours into ‘good’ and ‘bad’, based on their entropy values. The ultimate goal is to improve engagement classification results, allowing the e-learning system to assess student engagement levels and tailor the learning experience to enhance both engagement and academic performance. This approach demonstrates a comprehensive effort to use behavioural data to improve the effectiveness of e-learning systems.

4.2.4. CI Behaviours and Teaching Style Acquisition

Finally, the daily behaviours of the course instructors (CIs) and their teaching styles, including their level of interactiveness, course time management, and teaching methods (e.g., PowerPoint presentations, voice, video, etc.), are classified into two categories, acceptable and non-acceptable, using the Threshold-based Shannon Entropy (TSE) method.
Based on the acquisition results mentioned above, the weekly engagement status of the students is computed using the Majority Voting Multi-Objective Clustering (MV-MOC) method. In this context, the multiple objectives pertain to the students’ weekly academic and behavioural status, encompassing interactiveness, academic emotions, and behaviours, respectively.
Multi-Objective Optimisation: MV-MOC extends conventional clustering by optimising multiple objectives concurrently. Each objective corresponds to a distinct clustering solution, and the algorithm aims to find a compromise among these objectives.
Pareto Front: The Pareto Front is a set of solutions that are not dominated. A solution is considered non-dominant if it cannot be improved on one objective without negatively impacting another. The Pareto front illustrates the trade-offs between these objectives.
Majority Voting: MV-MOC assigns data points to clusters based on the consensus of multiple clustering solutions using majority voting. The diversity between Solutions X and Y, based on their cluster assignments, can be calculated. This can be achieved using the Jaccard Index (J), which measures the dissimilarity or diversity between two sets, X and Y.
J   ( X , Y ) = X Y X Y
Here, X Y represents the number of data points assigned to the same cluster in both Solution X and Solution Y (the intersection of sets X and Y ), while X Y represents the total number of unique data points assigned to clusters in Solutions X and Y (the union of sets X and Y ).
The Pareto Dominance Test is performed to determine whether Solution X dominates Solution Y based on multiple objectives. Solution X is considered to dominate Solution Y if it is equal to or better than Solution Y in all objectives. This dominance check can be expressed for each objective i as follows:
f i ( S X )   f i ( S Y )
Here, f i ( S X ) represents the value of objective i in Solution X , and f i ( S Y ) represents the value of objective i in Solution Y . For Solution X to dominate Solution Y , this inequality must hold for all objectives i . If it holds for at least one objective, Solution X is considered to dominate or be equivalent for some objectives, but not all.
In summary, these equations are used to analyse the diversity between two clustering solutions (Diversity Calculation) and to check if one solution dominates another based on multiple objectives (Pareto Dominance Check). In the final step of the MV-MOC algorithm, the number of times each data point has been assigned to each cluster across all Pareto front solutions is calculated. Subsequently, each data point is assigned to the cluster that receives the majority of votes, determining its final assignment. Optionally, the quality of the clustering solution can be evaluated using cluster validity f indices, such as the silhouette score, which measures how well the data points are clustered. represents the Pareto front solutions.
The process for the Majority Voting Multi-Objective Clustering (MV-MOC) is detailed in Pseudocode 3, which outlines the steps for clustering data based on multiple objectives and determining the engagement status of students.
Pseudocode 3: MV-MOC Calculation
Input: R ,   τ , H
Cluster data for each objective separately
Calculate  J   using   J   ( X , Y ) =   X Y   X Y
Verify X for all objectives f i ( S X ) f i ( S Y )
Generate
A majority vote on
Optionally calculate f .
Implement a termination condition
Obtain Final cluster assignments
Output: Engaged not engaged
To ensure that the algorithm does not operate indefinitely, a termination condition is imposed based on either a fixed number of iterations or convergence. Ultimately, the final clustering solution consists of clusters obtained through majority voting based on Pareto-front solutions, enabling the effective classification of data points according to their engagement status or other relevant objectives. Concerning these objectives, the majority voting method clusters students into classes, such as engaged and non-engaged students. To clarify, students exhibiting good behaviour and positive academic emotions but lower levels of interactiveness would be categorised as engaged students, and vice versa.

4.3. DRL-Based Early Warning System & Multi-Disciplinary Ontology Graph Construction

The Improved Duelling Deep Q Network (ID2QN) serves as an early warning system employing a deep reinforcement learning (DRL) model to forecast student performance. This prediction is based on their engagement status, demographic details, and other pertinent data. Designed to anticipate student performance early, the ID2QN aims to identify students requiring additional support or resources for success. Unlike traditional DRL methods that rely on a table for storing Q-value data, which becomes impractically intensive with growing environmental complexity, ID2QN utilises a neural network for direct state-to-action mapping, negating the need for a Q-value table. The ID2QN algorithm distinguishes between two distinct value functions: the state value function and the action advantage function. The integration of these functions facilitates the determination of the Q-value, assisting in the selection of the most suitable action. The model adopts a dual neural network structure; one network is responsible for generating Q-values for the present state-action pair, while another calculates Q-values for future state-action scenarios. These networks’ parameters are adjusted through a loss function and the stochastic gradient descent algorithm to minimise discrepancies between anticipated and actual Q-values. ID2QN stands out for its capability to efficiently learn the value of each state without the need to explicitly determine the most advantageous action at every juncture. This efficiency renders it a potent model for early performance prediction, particularly within the realm of student engagement and success, especially in environments where accurately assessing the value of actions and states poses significant computational challenges.
The workflow of the Deep Reinforcement Learning System using ID2QN for Early Performance Prediction is as follows:
  • Data Input and Pre-processing:
Input: Collect weekly engagement data, students’ demographic details, and course instructors’ (CIs) behaviours and teaching styles alongside students’ academic histories for input into the neural network.
Process: these inputs are utilised for the early prediction of performance among students showing signs of disengagement.
  • Neural Network Structure and Q-value Estimation:
Implement the DQN algorithm within a neural network framework, formalised by the equation:
ϱϱ(ψ, ϖ, φ) ≈ ϱ(ψ, ϖ)
Here, ψ represents the state, ϖ denotes potential educational interventions (actions), and φ embodies the network parameters. The strategy eradicates the need for traditional Q-value tables by facilitating state-to-action mappings directly within the neural network. Q-values, ϱ(ψ, ϖ, φ), are estimated to represent the expected rewards of certain actions given specific states.
  • Loss Function and Parameter Updating:
The Loss Function, L(φ), assesses the gap between predicted and actual rewards, guiding neural network training, and is defined as:
L ( φ ) = Ε [ ( γ + ς max ϖ ϱ ψ , ϖ , φ ϱ ( ψ , ϖ , φ ) ) 2 ]
Network parameters (φ) are updated using the stochastic gradient descent to minimise L(φ), thereby refining prediction capabilities.
  • Duelling Network Architecture:
ID2QN adopts a dual-component structure:
Evaluate Net: Generates current state-action Q-values, ϱ(ψ, ϖ, φ).
Target Net: Outputs future state-action Q-values, ϱ(ψ′, ϖ′, φ′).
The neural network output divides into two distinct functions: the state-value function V(ψ, φ) and the action-advantage function A(ϖ, φ), integrated as:
ϱ ψ , ϖ , φ , φ V , φ A = V ψ , φ , φ V + A ( ψ , ϖ , φ , φ A )
This delineation enhances decision-making by independently evaluating the current state’s value from the advantages of the specific actions.
  • ID2QN Specifics:
Emphasise centralising the advantage function to improve network robustness as:
ϱ ψ , ϖ , φ , φ V , φ A = V ( ψ , φ , φ V ) + A ψ , ϖ , φ , φ A 1 | A | A A ( ψ , ϖ , φ , φ A )
The model distinctly evaluates the value of each state V(ψ) and the added value of each action A(ϖ), facilitating clearer differentiation between actions.
  • Early Performance Prediction and Intervention:
    -
    Using the ID2QN model to predict student performance and categorise them based on risk levels derived from the neural network’s Q-value output.
    -
    Developing targeted interventions based on individual risk assessments indicated by the DRL analysis.
  • Personalised Recommendation Generation:
    -
    Leveraging outputs from the ID2QN model to guide the Att-CapsNet system in crafting personalised educational recommendations.
    -
    Continuously refine these recommendations, as per students’ changing engagement and performance metrics.
  • Evaluation and Feedback Loop:
    -
    Evaluate the impact and efficiency of DRL-based interventions on enhancing student outcomes.
    -
    Iterate and refine neural network parameters and the entire predictive model based on continuous feedback and observed educational impacts, aiming for ongoing improvement and greater accuracy in early warning predictions.
This refined workflow, supported by detailed equations and methodologies, demonstrates the application of the ID2QN model within an e-learning context to identify students requiring additional support and to customise interventions to improve their educational experiences effectively.
This study illustrates the integration between the Deep Reinforcement Learning (DRL) Early Warning System and Cycle-GAN within an e-learning framework. While Cycle-GAN fortifies the platform’s authentication and privacy measures, the DRL-based early warning system, leveraging the ID2QN model, evaluates student engagement and performance. Their integration underpins a comprehensive e-learning environment where secure and authenticated access, facilitated by federated learning and Cycle-GAN, underwrites the integrity and privacy of data processed by the DRL system. This integration is crucial for the precise and proactive identification of students’ academic needs.
Specifically, the Cycle-GAN (Generative Adversarial Network) technique is deployed for federated learning-based authentication to enhance the platform’s defence against cyber threats, including malicious traffic and vulnerabilities. It simplifies the authentication process within a federated learning framework, safeguarding data integrity during node-to-node transfers. Meanwhile, DRL aims to teach systems to adapt and make informed decisions based on environmental interactions, offering applications in e-learning, such as tailoring educational pathways, allocating resources dynamically, and formulating adaptive assessments.
Therefore, while Cycle-GAN and DRL are both grounded in sophisticated machine learning methodologies, they address distinct needs within the e-learning domain. Cycle-GAN focuses on securing data through enhanced authentication protocols, whereas DRL is utilised to enrich the learning experience, adapting educational content and strategies to align with student preferences and performance. This, in turn, enables accurate and early prediction of student performance.

4.4. Automated Personalised Recommendation Generation and Tracking

The deep learning algorithm, known as the Attention-Based Capsule Network (Att-CapsNet), automatically generates personalised recommendations for students by leveraging the multi-disciplinary ontology graph of student performance. The Att-CapsNet model integrates the capsule network with a channel attention mechanism, specifically the Squeeze-and-Excitation (SE) block, and primarily comprises four layers, described below:
Convolutional Layer: the initial layer is a convolutional layer that employs 3 × 3 convolution kernels, a step size of 1, and the ReLU activation function to extract local information.
Attention Layer: SE blocks, known for their simplicity of implementation, enhance feature extraction within the model and facilitate classification. The squeeze operation in this layer aims to capture the overall features of a channel, with each feature map from the convolutional layer.
Primary Caps Layer: After the SE block, PrimaryCaps receives each feature map and attention weight. The PrimaryCaps layer differs from the standard convolution layer. After this layer, we can produce capsules, also known as vectors, which can store significant information. The SE block then produces attention feature maps by multiplying the original feature maps by the learned channel weights to perform a scale operation.
DigitCaps Layer: The DigitCaps layer stores Ponzi and non-Ponzi capsules. Vectors represent the ultimate result. Capsule networks squash. The likelihood of an entity’s presence is determined by the length of the output vector, while retaining its orientation.
To generate recommendations for a student, the relevant student information is extracted from the multi-disciplinary ontology graph, which illustrates the relationships among various factors influencing student performance and their actual performance outcomes. Using these queried data, the proposed Att-CapsNet recommender system offers a tailored set of recommendations for each student.
For example, a student at risk of dropping out is analysed in conjunction with their performance-affecting factors and may be recommended various options, such as scholarship opportunities, accommodation arrangements, and more, all personalised to their needs. In the case of students facing the risk of failing or achieving lower marks, the system assesses the behaviour of course instructors (CIs) and modifies their teaching styles. This modification may involve simplifying content through animations, incorporating gamified e-learning elements, offering multiple attempts and feedback-based question-and-answer sections (i.e., retests), and so on.
Additionally, the proposed approach continually monitors and tracks student performance on a weekly basis, aiming to enhance performance and provide recommendations to boost student engagement in the course.
In a different context, the lightweight encryption algorithm (LWEA) introduces a secure, computationally efficient encryption method. This encryption method consumes less power due to its reduced key size. The process involves three distinct algorithms:
  • Secret key generation and text encryption
  • Secret key encryption and text encryption merging
  • Text decryption using a secret key
The strategy relies on the one-time pad technique, where a random number generator generates the secret key. Notably, the secret key is transmitted within the encrypted file itself, adding an extra layer of security to the process.
All information, including student authentication details, engagement status, performance status, ontology graph data, and recommendation outcomes, is securely stored within the blockchain in an encrypted format utilising the Lightweight Encryption Algorithm (LWEA) [47]. This approach ensures the utmost security and confidentiality, making the data unreadable to anyone without the proper decryption key. The use of LWEA, known for its efficiency in environments with limited computational resources, alongside blockchain technology, provides a robust and tamper-resistant solution, ensuring sensitive information remains protected from unauthorised access, breaches, or tampering, thereby enhancing the overall security and integrity of the e-learning platform.

5. Experimental Results

To simulate the proposed research method, JDK 1.8, Netbeans 12.3, and Wamp Server 2.0 (MySQL 5.1.36) were utilised. These tools are efficient and provide all the necessary specifications for the proposed technique.
In evaluating the system, the study used the Open University Learning Analytics dataset (OULAD), developed by Kuzilek et al. in 2017 [48], specifically designed for e-learning testing environments. This VLE-based dataset includes a wide array of data, featuring information on 32,593 students across 22 courses, coupled with detailed records of click activities totalling 10,655,280 entries. The OULAD-VLE dataset is extensive, encompassing three distinct types of data: demographics, performance, and learning behaviour.
The dataset features a range of information categories, including course details, student demographics, registration data, assessments, VLE interactions, student results, and student assessment scores. Course information comprises the code name, presentation duration, and semester offering. Demographic details cover student identification numbers, regional locations, and age groups. Registration data reveal enrolment patterns and dropout rates. The assessment section offers a comprehensive view of each course’s evaluation methods, including the type of assessment, its weight, and the students’ final scores. Logs of VLE interactions provide an in-depth look at student engagement with the online platform, which is crucial for measuring engagement levels and understanding how they correlate with learning outcomes. Student results detail the final outcomes for each course attempt, facilitating analyses against demographic and engagement metrics. Lastly, student assessment scores give a clearer picture of individual performances across different assessment types and stages within each course.
The OULAD enables in-depth learning analytics studies focused on identifying factors contributing to student success and developing predictive models to improve educational outcomes. Researchers can use this dataset to explore relationships between student engagement, assessment performance, demographic factors, and final results, enhancing their understanding of effective online learning practices.
This work utilised a variety of data types to assess the model’s effectiveness. Student demographic data (e.g., age, gender, nationality, and socio-economic status) were used to assess the model’s generalisability across diverse populations, and academic history data (grades, test scores, and assignment completion rates) were analysed to understand student performance patterns and trends. Engagement metrics (login frequency, time spent on materials, forum participation, and quiz/assignment completion) measured student interaction with the e-learning platform. For instance, data might show that students who log in daily and actively participate in forums tend to have higher academic performance.
Additionally, multi-modal biometric data (e.g., facial expressions, eye movements, and keystroke dynamics) [data reflecting physiological and behavioural responses] were captured to identify emotional states, like confusion or disengagement, triggering support mechanisms. For example, a student’s frequent eye movements away from the screen might indicate a lack of focus. Behavioural data (number of video pauses, replays, and multitasking instances) provided insights into students’ learning behaviours. Frequent pausing of lecture videos, for instance, might indicate difficulty understanding the material.
By incorporating these diverse types of data, our evaluation process demonstrates how the DRL-based early warning system can accurately predict student performance and provide personalised recommendations to enhance their academic success. This comprehensive data approach ensures a robust evaluation and validation of our model’s effectiveness.

5.1. Comparative Analysis

This section compares the proposed method with several existing ones, including PCT [43], MST-FaDe [37], Fuzzy SVM [29], and Artificial Intelligence-Based Student Assessment and Recommendation (AISAR) [38]. It evaluates its effectiveness using performance metrics, such as accuracy, precision, recall, F1-scores, true positive rates, false positive rates, true negative rates, and false negative rates.

5.1.1. No. of Epochs vs. Accuracy

Certainly, a simple linear equation can be used to represent the link between the number of training epochs and accuracy (∀):
= · E + I
Here, E represents the number of training epochs, and the rate of change in accuracy per epoch is given by ℘. It indicates how much accuracy increases with each additional epoch. I is the starting accuracy, often known as the intercept, which represents the accuracy at epoch 0. Figure 2 and Table 2 present the number of epochs vs. accuracy.
As the number of epochs increases, the accuracy rate also increases. Choosing five epochs as a comparison point in machine learning model training can indicate early performance, assess model efficiency, and benchmark against standard practices. It also highlights resource efficiency, as fewer epochs mean less computational time and energy usage. For example, in several epochs (e.g., 5), the accuracy of the proposed method reaches 65%, while existing methods, such as PCT, achieve 15%, MST-FaDe achieves 25%, Fuzzy SVM achieves 35%, and AISAR achieves 45%. With a higher number of epochs (e.g., 25), the proposed method attains an accuracy level of 73%, while PCT reaches 39%, MST-FaDe reaches 49%, Fuzzy SVM reaches 59%, and AISAR reaches 61%. Ultimately, the proposed method demonstrates a higher accuracy level compared to existing ones.

5.1.2. No of Epochs vs. F1-Score

Certainly, the relationship between the number of training epochs and the F1-Score can be succinctly explained through a simple equation. The F1-Score is a widely used metric for evaluating the balance between precision and recall in classification problems. The formula is traditionally expressed as:
F = 2 · ( E · K ) E + K
The F1-Score (F) is the harmonic mean of precision (E) and recall (K)). Precision (E) is defined as the ratio of true positives to the sum of true positives and false positives, while recall (K) is defined as the ratio of true positives to the sum of true positives and false negatives. Consequently, the F1-Score provides a single value that encapsulates model performance, considering both false positives and false negatives. This offers a balanced perspective, particularly useful in scenarios with uneven class distributions.
Table 3 represents numerical outcomes, and Figure 3 illustrates the relationship between the number of epochs and the F1-score. As the number of epochs increases, the F1-score correspondingly rises. In the proposed method, F1-scores of 50%, 65%, and 95% are achieved with 5, 25, and 50 epochs, respectively. Conversely, the PCT method yields F1-scores of 42%, 52%, and 76%; the MST-FaDe method yields F1-scores of 43%, 57%, and 79%; and the fuzzy SVM method yields F1-scores of 45%, 62%, and 80% for the same respective epochs. Meanwhile, AISAR registers F1-scores of 46%, 63%, and 81% for the corresponding epochs. Existing approaches demonstrate lower F1-scores when compared to the proposed method.

5.1.3. Precision vs. Recall Rate

A single equation can succinctly represent the relationship between precision and recall as:
E = k k + r
Here, precision ( E ) is a metric representing the accuracy of a classification model, defined as the ratio of true positives ( k ) to the total predicted positives ( k + r ). True positives ( k ) refer to instances correctly identified as positive by the model, while false positives ( r ) denote instances that are incorrectly labelled as positive.
In contrast, K can be expressed using a different equation:
K = k k + Z
Here, recall K of a classification model is represented as the ratio of true positives to total actual positives ( k + Z ). False negatives ( Z ) are instances that the model fails to identify as positive. Table 4 illustrates the numerical outcomes and Figure 4 illustrates the relationship between precision and recall.
PCT initiates with a low precision of 0.08, gradually improving precision as the recall rate rises, reaching an accuracy of 0.85 and a recall rate of 1.0; MST-FaDe initiates with a low precision of 0.10, increasing accuracy over time while maintaining a constant recall rate, reaching an accuracy of 0.90 and a recall rate of 1.0; Fuzzy SVM initiates with a low precision of 0.11, steadily enhancing precision as the recall rate ascends, achieving an accuracy of 0.95 and a recall rate of 1.0. AISAR begins with a reasonable precision of 0.12, which rapidly improves alongside the recall rate, attaining a precision of 0.96 at a recall rate of 1.0. The proposed model demonstrates the highest precision among the five models, commencing at 0.13 and progressively improving, culminating in a precision of 0.99 at a recall rate of 1.0.

5.1.4. True Negative Rate vs. False Negative Rate

The true negative rate ( A ) and false negative rate ( B ) are pivotal metrics used in binary classification to assess a classifier’s performance. Typically, these rates are computed using the confusion matrix, which is a table that summarises the classifier’s predictions. A , Specificity, represents the fraction of actual negatives (class 0) that the classifier correctly predicts as negatives. Here is the A / B equation, along with a brief explanation:
A = Z Z + r
B represents the proportion of true negatives (class 0) that the classifier incorrectly predicts as positives and can be calculated using the following equation:
B = r r + k
At a false negative rate (FNR) of 0.1, PCT has a true negative rate (TNR) of 50 (1–0.50) and gradually enhances its TNR as the FNR increases, reaching 0.05 at a FNR of 1.0. MST-FaDe has a true negative rate (TNR) of 49 (1–0.51) at a false negative rate (FNR) of 0.1. Its TNR steadily improves as the FNR rises, reaching 0.05 with a false negative rate of 1.0 at an FNR of 0.1. Fuzzy SVM has a TNR of 0.48 (1–0.52) and gradually enhances its TNR as the FNR increases, reaching 0.05 at an FNR of 1.0. AISAR exhibits a similar pattern, beginning with a slightly higher TNR of 0.47 (1–0.53) at an FNR of 0.1 and improving to 0.04 at an FNR of 1.0.
The proposed model starts with the highest initial TNR of 0.46 (1–0.54) at an FNR of 0.1 and consistently outperforms the other models, achieving the highest TNR of 0.01 at an FNR of 1.0. Table 5 represents the numerical results and Figure 5 illustrates the trade-off between TNR and FNR for each of the five models, demonstrating that the proposed model optimally balances minimising false negatives and maximising true negatives.

5.1.5. True Positive Rate vs. False Positive Rate

The true positive rate ( A ), also recognised as sensitivity or recall, and the false positive rate ( s ) are critical metrics in binary classification utilised to evaluate a classifier’s performance. Both rates can be derived using the confusion matrix.
A signifies the proportion of actual positives (class 1) that are accurately predicted as positives, as detailed in Equation (20). On the other hand, s represents the proportion of actual negatives (class 0) that the classifier erroneously predicts as positives. The relationship for s can be described as:
s = k k + r
In practical terms, PCT starts with a true positive rate (TPR) of 0.45 and gradually elevates the TPR with a rising false positive rate (FPR), achieving 0.77 at an FPR of 1.0. The true positive rate (TPR) of MST-FaDe is initially set at 0.49, and it is progressively increased with an increasing false positive rate (FPR) to reach 0.84 at an FPR of 1.0. Fuzzy SVM starts with a TPR of 0.52 and gradually elevates the TPR with a rising FPR, achieving 0.90 at an FPR of 1.0. AISAR exhibits a similar trajectory, initiating with a TPR of 0.53 at an FPR of 0.1 and escalating to 0.91 at an FPR of 1.0. The proposed model begins with a TPR of 0.54 at an FPR of 0.1, consistently outperforming the prior models, and achieves the highest TPR of 0.98 at an FPR of 1.0. Table 6 represents the numerical outcomes, and Figure 6 illustrates the trade-off between TPR and FPR for all models, demonstrating that the proposed model consistently sustains a higher TPR while managing FPR, indicative of enhanced classification task performance.

5.1.6. Specificity vs. Number of Epochs

A statistic called the specificity metric is used in classification tasks to assess a model’s accuracy in identifying negative situations. It is expressed as the ratio of true negatives ( N ) to the total of false positives ( F N ) and true negatives ( N ). The specificity formula is as follows in mathematics:
S = T N T N + F N
T N stands for True Negatives, or the quantity of negative situations that are accurately detected. F N stands for False Positives, or the quantity of positive instances that were mistakenly recognised.
Figure 7 displays the specificity, while Table 7 shows the specificity’s numerical results. When compared to other current techniques, like PCT, MST-FaDe, fuzzy SVM, and AISAR, the recommended approach provides exceptional specificity. That is the overall utility as the number of epochs increases. The greatest specificity of the suggested methods is 98% in terms of epochs; by comparison, the maximum specificities of PCT, MST-FaDe, fuzzy SVM, and AISAR are 70, 80, 85, and 89 bits per second, respectively. Our suggested strategy outperforms the current ones in terms of specificity.

5.2. Research Summary

Students and Course Instructors (CIs) are initially enrolled in matching sub-servers using credentials such as ID, password, face data, voice data, and keystroke dynamics. The credentials of students and CIs are treated as local models. Once these local models are established, the sub-servers employ CGAN to construct the global model and perform authentication based on this model. The TSE approach is employed to assess students’ interactiveness, academic emotions, and behaviour, as well as the behaviours and teaching styles of CIs. The weekly engagement status of the students is then determined using the MV-MOC approach. The performance of non-engaged students is subsequently predicted early using the ID2QN. Personalised recommendations for students are automatically generated using a deep learning technique called Att-CapsNet and are stored on the blockchain using the LWEA. Table 8 presents the numerical evaluation of the current and proposed strategies.

6. Conclusions

In conclusion, this research establishes a detailed framework for the management of both students and course instructors, enhancing engagement assessment within the educational sphere. It lays down a solid groundwork for user authentication, incorporating multi-modal biometric data to ensure security. The application of the Threshold-based Shannon Entropy (TSE) method allows for a nuanced analysis of student behaviour, emotional responses, and instructional strategies, shedding light on diverse aspects of the educational process. Utilising the MV-MOC technique, the study enables educators to closely monitor and stimulate student participation on a weekly basis, fostering a more engaged learning environment. Furthermore, the integration of predictive modelling, exemplified by the ID2QN algorithm, aids in the timely identification of students in need of additional academic support. Additionally, this study innovates by offering personalised recommendations through the Att-CapsNet system, further secured by blockchain technology via the Lightweight Encryption Algorithm (LWEA), marking a significant step towards a more integrated and individualised educational approach.
Performance-wise, the proposed model outperforms existing systems, such as PCT, MST-FaDe, Fuzzy SVM, and AISAR, exhibiting significant improvements in accuracy (97%), F1-score (95%), precision-recall balance (0.99), and true positive rates, while simultaneously minimising false negatives and positives. This system has been refined to enhance agility, personalisation, and security, operating effectively without added complexity. These qualities underscore its robustness and efficiency across a range of classification and prediction scenarios, markedly enhancing the quality and responsiveness of e-learning platforms.
However, the proposed work has its limitations, which should be addressed. These may include the dependency on high-quality, multi-modal biometric data, which may not be readily available or may raise privacy concerns among users. Additionally, the complexity of the ID2QN and Att-CapsNet algorithms may require substantial computational resources, potentially limiting their applicability in resource-constrained environments. Furthermore, the effectiveness of the proposed model in diverse educational settings, with varying curricula and student populations, needs to be thoroughly evaluated. Future work could focus on addressing these limitations, exploring the scalability of the system, and extending its applicability to broader educational contexts.

Author Contributions

W.B.: developed the system; wrote the literature review; tested and analysed the data; wrote, edited, and formatted the paper. N.C.: provided overall leadership and direction for the research; contributed to and edited the literature review; reviewed the paper. A.S.S.: co-supervised the research; and contributed to reviewing and editing the paper. S.B.: worked on the authentication parts. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

This work was tested using the OULAD dataset. It is available online at https://analyse.kmi.open.ac.uk/open_dataset (accessed on 26 September 2022).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gao, P.; Li, J.; Liu, S. An introduction to key technology in artificial intelligence and big data-driven e-learning and e-education. Mob. Netw. Appl. 2021, 26, 2123–2126. [Google Scholar] [CrossRef]
  2. Qazi, A.; Hardaker, G.; Ahmad, I.S.; Darwich, M.; Maitama, J.Z.; Dayani, A. The role of Information & Communication Technology in Elearning environments: A systematic review. IEEE Access 2021, 9, 45539–45551. [Google Scholar]
  3. Giannakas, F.; Troussas, C.; Voyiatzis, I.; Sgouropoulou, C. A deep learning classification framework for early prediction of team-based academic performance. Appl. Soft Comput. 2021, 106, 107355. [Google Scholar] [CrossRef]
  4. Murtaza, M.; Ahmed, Y.; Shamsi, J.A.; Sherwani, F.; Usman, M. AI-based personalised e-learning systems: Issues, challenges, and solutions. IEEE Access 2022, 10, 81323–81342. [Google Scholar] [CrossRef]
  5. Bozkurt, A.; Karadeniz, A.; Baneres, D.; Guerrero-Roldán, A.E.; Rodríguez, M.E. Artificial intelligence and reflections from the educational landscape: A review of AI Studies in half a century. Sustainability 2021, 13, 800. [Google Scholar] [CrossRef]
  6. Rahayu, N.W.; Ferdiana, R.; Kusumawardani, S.S. A systematic review of ontology use in E-Learning recommender system. Comput. Educ. Artif. Intell. 2022, 3, 100047. [Google Scholar] [CrossRef]
  7. Chen, S.Y.; Wang, J.H. Individual differences and personalised learning: A review and appraisal. Univers. Access Inf. Soc. 2021, 20, 833–849. [Google Scholar] [CrossRef]
  8. Mishra, S.; Tyagi, A.K. The role of machine learning techniques in internet of things-based cloud applications. In Artificial Intelligence-Based Internet of Things Systems; Springer: Berlin/Heidelberg, Germany, 2022; pp. 105–135. [Google Scholar]
  9. Sychev, O.; Penskoy, N.; Anikin, A.; Denisov, M.; Prokudin, A. Improving comprehension: Intelligent tutoring system explaining the domain rules when students break them. Educ. Sci. 2021, 11, 719. [Google Scholar] [CrossRef]
  10. Stecuła, K.; Wolniak, R. Influence of COVID-19 pandemic on the dissemination of innovative e-learning tools in higher education in Poland. J. Open Innov. Technol. Mark. Complex. 2022, 8, 89. [Google Scholar] [CrossRef]
  11. El-Sabagh, H.A. Adaptive e-learning environment based on learning styles and its impact on development students’ engagement. Int. J. Educ. Technol. High. Educ. 2021, 18, 53. [Google Scholar] [CrossRef]
  12. Namoun, A.; Alshanqiti, A. Predicting student performance using data mining and learning analytics techniques: A systematic literature review. Appl. Sci. 2020, 11, 237. [Google Scholar] [CrossRef]
  13. Albreiki, B.; Zaki, N.; Alashwal, H. A systematic literature review of student performance prediction using machine learning techniques. Educ. Sci. 2021, 11, 552. [Google Scholar] [CrossRef]
  14. Maican, M.A.; Cocoradă, E. Online foreign language learning in higher education and its correlates during the COVID-19 pandemic. Sustainability 2021, 13, 781. [Google Scholar] [CrossRef]
  15. Wolbring, G.; Lillywhite, A. Equity/equality, diversity, and inclusion (EDI) in universities: The case of disabled people. Societies 2021, 11, 49. [Google Scholar] [CrossRef]
  16. Issah, I.; Appiah, O.; Appiahene, P.; Inusah, F. A systematic review of the literature on machine learning application of determining the attributes influencing academic performance. Decis. Anal. J. 2023, 7, 100204. [Google Scholar] [CrossRef]
  17. Salehi, A.; Han, R.; Rudolph, C.; Grobler, M. DACP: Enforcing a dynamic access control policy in cross-domain environments. Comput. Netw. 2023, 237, 110049. [Google Scholar] [CrossRef]
  18. Priya, K.D.; Sumalatha, L. Secure Framework for Cloud based E-Education Using Deep Neural Networks. In Proceedings of the 2021 2nd International Conference on Intelligent Engineering and Management (ICIEM), London, UK, 28–30 April 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 406–411. [Google Scholar]
  19. Salehi, A.S.; Rudolph, C.; Grobler, M. A dynamic cross-domain access control model for collaborative healthcare application. In Proceedings of the 2019 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), Arlington, VA, USA, 8–12 April 2019; IEEE: Piscataway, NJ, USA, 2021; pp. 643–648. [Google Scholar]
  20. Rafiq, M.S.; Jianshe, X.; Arif, M.; Barra, P. Intelligent query optimization and course recommendation during online lectures in E-learning system. J. Ambient Intell. Hum. Comput. 2021, 12, 10375–10394. [Google Scholar] [CrossRef]
  21. Salehi Shahraki, A.; Lauer, H.; Grobler, M.; Sakzad, A.; Rudolph, C. Access Control, Key Management, and Trust for Emerging Wireless Body Area Networks. Sensors 2023, 23, 9856. [Google Scholar] [CrossRef] [PubMed]
  22. Sharma, P.; Joshi, S.; Gautam, S.; Filipe, V.; Reis, M.C. Student Engagement Detection Using Emotion Analysis, Eye Tracking and Head Movement with Machine Learning. arXiv 2019, arXiv:1909.12913. [Google Scholar]
  23. Du, Y.; Crespo, R.G.; Martínez, O.S. Human emotion recognition for enhanced performance evaluation in e-learning. Prog. Artif. Intell. 2022, 12, 199–211. [Google Scholar] [CrossRef]
  24. Mubarak, A.A.; Cao, H.; Zhang, W. Prediction of students’ early dropout based on their interaction logs in online learning environment. Interact. Learn. Environ. 2020, 30, 1414–1433. [Google Scholar] [CrossRef]
  25. Adnan, M.; Habib, A.; Ashraf, J.; Mussadiq, S.; Raza, A.A.; Abid, M.; Bashir, M.; Khan, S.U. Predicting at-Risk Students at Different Percentages of Course Length for Early Intervention Using Machine Learning Models. IEEE Access 2021, 9, 7519–7539. [Google Scholar] [CrossRef]
  26. Rodríguez-Hernández, C.; Musso, M.F.; Kyndt, E.; Cascallar, E.C. Artificial neural networks in academic performance prediction: Systematic implementation and predictor evaluation. Comput. Educ. Artif. Intell. 2021, 2, 100018. [Google Scholar] [CrossRef]
  27. Joy, J.; Raj, N.S.; Renumol, V.G. Ontology-based E-learning Content Recommender System for Addressing the Pure Cold-start Problem. ACM J. Data Inf. Qual. 2021, 13, 16. [Google Scholar] [CrossRef]
  28. Gomede, E.; Barros, R.M.; Mendes, L.D. Deep auto encoders to adaptive E-learning recommender system. Comput. Educ. Artif. Intell. 2021, 2, 100009. [Google Scholar] [CrossRef]
  29. Kumar, T.; Sankaran, K.S.; Ritonga, M.; Asif, S.; Kumar, C.S.; Mohammad, S.; Sengan, S.; Asenso, E. Fuzzy Logic and Machine Learning-Enabled Recommendation System to Predict Suitable Academic Program for Students. Math. Probl. Eng. 2022, 2022, 5298468. [Google Scholar] [CrossRef]
  30. Sayed, W.S.; Noeman, A.M.; Abdellatif, A.; Abdelrazek, M.; Badawy, M.G.; Hamed, A.; El-Tantawy, S. AI-based adaptive personalised content presentation and exercises navigation for an effective and engaging E-learning platform. Multimed. Tools Appl. 2023, 82, 3303–3333. [Google Scholar] [CrossRef] [PubMed]
  31. Yang, X.; Tan, L. The Construction of Accurate Recommendation Model of Learning Resources of Knowledge Graph under Deep Learning. Sci. Program. 2022, 11, 1010122. [Google Scholar] [CrossRef]
  32. Safarov, F.; Kutlimuratov, A.; Abdusalomov, A.B.; Nasimov, R.; Cho, Y. Deep Learning Recommendations of E-Education Based on Clustering and Sequence. Electronics 2023, 12, 809. [Google Scholar] [CrossRef]
  33. Rasheed, F.; Wahid, A. Learning style detection in E-learning systems using machine learning techniques. Expert Syst. Appl. 2021, 174, 114774. [Google Scholar] [CrossRef]
  34. González, M.R.; Ruíz, M.D.; Ortin, F. Massive LMS log data analysis for the early prediction of course-agnostic student performance. Comput. Educ. 2021, 163, 104108. [Google Scholar] [CrossRef]
  35. Hussain, S.; Khan, M.Q. Student-Performulator: Predicting Students’ Academic Performance at Secondary and Intermediate Level Using Machine Learning. Ann. Data Sci. 2021, 10, 637–655. [Google Scholar] [CrossRef] [PubMed]
  36. Hessen, S.H.; Abdul-kader, H.M.; Khedr, A.E.; Salem, R.K. Developing Multiagent E-Learning System-Based Machine Learning and Feature Selection Techniques. Comput. Intell. Neurosci. 2022, 2022, 2941840. [Google Scholar] [CrossRef] [PubMed]
  37. Guo, Z.; Zhang, Q.; Ding, F.; Zhu, X.; Yu, K. A Novel Fake News Detection Model for Context of Mixed Languages through Multiscale Transformer. IEEE Trans. Comput. Soc. Syst. 2023, 1–11. [Google Scholar] [CrossRef]
  38. Bagunaid, W.; Chilamkurti, N.K.; Veeraraghavan, P. AISAR: Artificial Intelligence-Based Student Assessment and Recommendation System for E-Learning in Big Data. Sustainability 2022, 14, 10551. [Google Scholar] [CrossRef]
  39. Qiu, F.; Zhang, G.; Sheng, X.; Jiang, L.; Zhu, L.; Xiang, Q.; Jiang, B.; Chen, P. Predicting students’ performance in e-learning using learning process and behaviour data. Sci. Rep. 2022, 12, 453. [Google Scholar] [CrossRef] [PubMed]
  40. Wang, X.; Zhang, L.; He, T. Learning Performance Prediction-Based Personalised Feedback in Online Learning via Machine Learning. Sustainability 2022, 14, 7654. [Google Scholar] [CrossRef]
  41. Tao, T.; Sun, C.; Wu, Z.; Yang, J.; Wang, J.J. Deep Neural Network-Based Prediction and Early Warning of Student Grades and Recommendations for Similar Learning Approaches. Appl. Sci. 2022, 12, 7733. [Google Scholar] [CrossRef]
  42. Shahbazi, Z.; Byun, Y. Agent-Based Recommendation in E-Learning Environment Using Knowledge Discovery and Machine Learning Approaches. Mathematics 2022, 10, 1192. [Google Scholar] [CrossRef]
  43. Zhu, F.; Ma, F.; Ding, Z.; Guo, J.; Yang, J.; Yu, K. A Low-Latency Edge Computation Offloading Scheme for Trust Evaluation in Finance-Level Artificial Intelligence of Things. IEEE Internet Things J. 2024, 11, 114–124. [Google Scholar] [CrossRef]
  44. Mistry, D.; Mridha, M.F.; Safran, M.; Alfarhood, S.; Saha, A.K.; Che, D. Privacy-Preserving On-Screen Activity Tracking and Classification in E-Learning Using Federated Learning. IEEE Access 2023, 11, 79315–79329. [Google Scholar] [CrossRef]
  45. Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; et al. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976. [Google Scholar] [CrossRef]
  46. Saydirasulovich, S.N.; Abdusalomov, A.; Jamil, M.K.; Nasimov, R.; Kozhamzharova, D.; Cho, Y.-I. A YOLOv6-based improved fire detection approach for smart city environments. Sensors 2023, 23, 3161. [Google Scholar] [CrossRef] [PubMed]
  47. Fu, J.; Wang, N.; Cai, Y. Privacy-preserving in healthcare blockchain systems based on lightweight message sharing. Sensors 2020, 20, 1898. [Google Scholar] [CrossRef]
  48. Kuzilek, J.; Hlosta, M.; Zdrahal, Z. Open University Learning Analytics dataset. Sci. Data 2017, 4, 170171. [Google Scholar] [CrossRef]
Figure 1. Overall system architecture.
Figure 1. Overall system architecture.
Futureinternet 16 00206 g001
Figure 2. Number of epochs vs. accuracy.
Figure 2. Number of epochs vs. accuracy.
Futureinternet 16 00206 g002
Figure 3. Number of epochs vs F1-Score.
Figure 3. Number of epochs vs F1-Score.
Futureinternet 16 00206 g003
Figure 4. Precision vs. recall.
Figure 4. Precision vs. recall.
Futureinternet 16 00206 g004
Figure 5. True negative rates vs. false negative rates.
Figure 5. True negative rates vs. false negative rates.
Futureinternet 16 00206 g005
Figure 6. True positive rates vs. false positive rates.
Figure 6. True positive rates vs. false positive rates.
Futureinternet 16 00206 g006
Figure 7. Number of epochs vs. specificity.
Figure 7. Number of epochs vs. specificity.
Futureinternet 16 00206 g007
Table 1. Summary of the literature survey.
Table 1. Summary of the literature survey.
ReferenceObjectiveModel or Algorithm Limitations
[22]Analysing student engagement in online classesCNNSusceptibility to overfitting and underfitting issues with the adoption of CNN for distraction and facial emotion analysis
[23]Assessing human emotions to enhance distraction-free e-learningBayesian ClassifierFocusing solely on student emotions, without incorporating interactiveness and skills, limits performance prediction
[24]Predicting student dropout risk in e-learning using a Hidden Markov ModelHMMNeglect of key demographic factors impacts prediction accuracy
[25]Designing an early warning system for student performance using machine learningGradient BoostingLimited analysis of demographic and non-academic factors
[26]Predicting student performance using ANNANNProne to overfitting and computational complexity
[27]Resolving course selection anxiety in e-learning through ontology-based course recommendationsOntology-based recommendation systemProne to overfitting and computational complexity
[28]Developing a robust recommendation system for adaptive e-learning using various autoencodersCollaborative Filtering-based Deep Autoencoders Handling massive datasets with machine learning algorithms introduces complexity
[29]Performance prediction using machine learning and fuzzy methodologyFuzzy SVMHandling massive datasets with machine learning algorithms introduces complexity
[30]Designing an adaptive e-learning environment using DRLDQNVulnerability to login manipulation by malicious attackers
[31]Designing an e-learning recommendation system using Deep Learning and Knowledge GraphsDeep learning and Knowledge GraphsNeglected student–resource relationship evaluation, leading to suboptimal recommendations
[32]Providing robust online course recommendations using DNNDNNUnsuitability of K-Means clustering for data with varying density and size
[33]Identifying learning styles using machine learning algorithmsNaïve Bayes, Linear Discriminant Analysis, K-Nearest Neighbors, Random Forest, Logistic Regression, Decision Tree, Support Vector MachineLimited factors, increased false positive rates
[34]Predicting student performance using logged data in a learning management systemSupport Vector Machine, Multi-layer Perceptron, Logistic Regression, Gaussian Model, Decision Tree ModelExclusively relying on previous grades overlooks other factors influencing student academics
[35]Developing a student performance predictor using machine learningDecision Tree Regression, Decision Tree ClassifierLimited assessment of demographic and non-academic factors in performance prediction
[36]Academic performance prediction using machine learning agentsLogistic Regression, Decision Tree, Random Forest, Naïve BayesLimited factors, increased false positive rates
Table 2. Numeric results of accuracy.
Table 2. Numeric results of accuracy.
Number of EpochsAccuracy
PCTMST-FaDeFuzzy SVMAISARProposed
51525354565
101729374166
152232424368
203646565871
253949596173
304151616378
354353636882
404959697185
455161717792
506065757997
Table 3. Numeric results of F1-score.
Table 3. Numeric results of F1-score.
Number of EpochsF1-Score
PCTMST-FaDeFuzzy SVMAISARProposed
54243454650
104345484951
154548525356
204852575961
255257626365
305762656671
356265727476
406572767780
457276798088
507679808195
Table 4. Numeric results of precision.
Table 4. Numeric results of precision.
Number of EpochsPrecision
PCTMST-FaDeFuzzy SVMAISARProposed
0.10.080.100.110.120.13
0.20.190.210.240.250.28
0.30.250.290.310.350.39
0.40.350.390.420.450.48
0.50.450.480.510.530.59
0.60.550.590.620.650.69
0.70.680.700.730.760.79
0.80.750.780.810.860.89
0.90.770.800.820.870.90
10.850.900.950.960.99
Table 5. Numeric results of true negative rates.
Table 5. Numeric results of true negative rates.
False Negative RateTrue Negative Rate
PCTMST-FaDeFuzzy SVMAISARProposed
0.10.450.490.520.530.54
0.20.470.520.620.660.69
0.30.500.650.740.770.79
0.40.530.700.750.780.80
0.50.580.720.760.790.81
0.60.620.740.770.810.85
0.70.650.760.810.830.86
0.80.750.800.860.890.92
0.90.780.830.890.910.96
10.800.850.950.960.99
Table 6. Numeric results of true positive rates.
Table 6. Numeric results of true positive rates.
False Positive RateTrue Positive Rate
PCTMST-FaDeFuzzy SVMAISARProposed
0.10.450.490.520.530.54
0.20.470.520.620.660.69
0.30.500.650.740.770.79
0.40.530.700.750.780.80
0.50.580.720.760.790.81
0.60.620.740.770.810.85
0.70.650.760.810.830.86
0.80.700.780.820.840.92
0.90.740.800.860.870.96
10.770.840.900.910.98
Table 7. Numeric results of specificity.
Table 7. Numeric results of specificity.
Number of EpochsSpecificity
PCTMST-FaDeFuzzy SVMAISARProposed
201020252835
402030354047
604050555865
805060656779
1007080858998
Table 8. Numerical evaluation of proposed vs. existing methods.
Table 8. Numerical evaluation of proposed vs. existing methods.
S. NoSuccess MetricsPCTMST-FaDeFuzzy SVMAISARProposed
1Accuracy (%)6065757997
2F1-Score (%)7679808195
3Recall Rate0.850.900.950.960.99
4True Negative Rate0.800.850.9450.960.99
5True Positive Rate0.770.840.900.910.98
6Specificity7080858998
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bagunaid, W.; Chilamkurti, N.; Shahraki, A.S.; Bamashmos, S. Visual Data and Pattern Analysis for Smart Education: A Robust DRL-Based Early Warning System for Student Performance Prediction. Future Internet 2024, 16, 206. https://doi.org/10.3390/fi16060206

AMA Style

Bagunaid W, Chilamkurti N, Shahraki AS, Bamashmos S. Visual Data and Pattern Analysis for Smart Education: A Robust DRL-Based Early Warning System for Student Performance Prediction. Future Internet. 2024; 16(6):206. https://doi.org/10.3390/fi16060206

Chicago/Turabian Style

Bagunaid, Wala, Naveen Chilamkurti, Ahmad Salehi Shahraki, and Saeed Bamashmos. 2024. "Visual Data and Pattern Analysis for Smart Education: A Robust DRL-Based Early Warning System for Student Performance Prediction" Future Internet 16, no. 6: 206. https://doi.org/10.3390/fi16060206

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop