1. Introduction
Students interact with a university through various activities such as portal login, uploading assignments, and class attendance, among many others, and these actions often leave behind digital traces that can be used to analyze how the university is managing and providing help for students. Such traces can be stored as potentially widely applicable data in Learning Analytics (LA). It is the process of analyzing, measuring, collating, and reporting data to make decisions about the progress made by students and educators. LA is emerging as a top priority for many institutions (
Gonzalez-Nucamendi et al., 2022), especially leveraged during world crises such as COVID-19.
Learning Analytics (LA) and the scope of predicting students’ performance is multidimensional (
Waheed et al., 2020) and, therefore, can be tackled through various approaches from multiple perspectives to explore and analyze the problem, including identifying at-risk students, understanding the factors affecting the student’s performance, and the early prediction of dropouts and withdrawals during or after a course (
Shafiq et al., 2022). Thus, choosing the right approach highly depends on the prediction goal and the dataset structure. In this study, LA is demonstrated by extracting students’ behavior and interaction features using log files from Learning Management Systems (LMSs) to analyze student performance.
The study of the relationship between behavioral interactions and academic performance in courses supported by Virtual Learning Environments (VLEs) has been the focus of LA (
Agudo-Peregrina et al., 2014;
Wong & Li, 2020). This aspect has emerged as the most crucial for planning and implementing effective learning processes. Researchers have used physiological sensors and human observers to collect data on learners’ behavior in Virtual Learning Environments (VLEs) (
Aguagallo et al., 2023;
Prakash, 2023). However, data collection using human observers is labor-intensive and may be biased (
Llerena-Izquierdo et al., 2023). To overcome these challenges, researchers have developed mechanisms to automatically collect real-time data on learners’ interaction behavior in online learning (
Khor & Dave, 2022).
Data collection mechanisms, such as logs, open up new opportunities for data collection and provide insights into learners’ behavior and learning processes in VLEs. By applying AI and ML tools to analyze the collected data, researchers aim to understand the behavioral patterns contributing to higher learning gains and predicting learning outcomes (
Lavidas et al., 2022). This information can be used to provide scaffolds and adaptive feedback to enhance the learning experience in VLEs.
The platform Modular Object-Oriented Dynamic Learning Environment (Moodle) has long been identified as the most popular and preferred open-source LMS (
Moodle, 2024). It has been known as an advanced learning platform used in multiple disciplines, particularly in STEM education. It is mainly used within university settings, focusing on undergraduate studies. The platform has effectively improved student performance, satisfaction, and engagement while enhancing flexibility in learning environments (
Gamage et al., 2022;
Furqon et al., 2023). Using LMSs for obtaining digital traces to analyze student behavior online eliminates the need for time-consuming data collection methods. However, LMS log data are raw and may not provide tangible measurements, necessitating investigations of how and whether such data can be utilized for Learning Analytics.
The analysis of the effect of Learning Management Systems (LMSs) on student performance in different educational settings has gained attention among researchers, and it is still ongoing (
Işıkgöz, 2024;
Khairy et al., 2024). Studies have shown that utilizing LMSs has a beneficial effect on academic performance among students and fosters a favorable perception of LMS implementation in educational endeavors (
Furqon et al., 2023). Additionally, using LMSs allows for the analysis of student trajectories and identifying factors that lead to successful or unsuccessful learning outcomes (
Shaimov et al., 2022). Research conducted in the Computer Science Department of a Nigerian institution found that using LMSs improved students’ academic performance and awareness of collaborative and academic study (
Ifeanyi & Chinonso, 2023). Furthermore, statistically based approaches using data from the Moodle LMS have indicated that the number of interactions with the LMS can serve as a good indicator of student performance and can be used to track their learning progress (
Suay et al., 2022). On the other hand, analysis of log-based profiling and behavior on LMSs revealed that an online learner with fewer interactions could drop out or achieve lower academic performance (
Çakiroğlu et al., 2024).
Moreover, given that the nature of student performance is multifactorial and complex, relying on limited academic and LMS factors may not capture the entire picture of why students perform in a certain way, and their performance can be heavily influenced by other factors such as socio-economic background and financial factors, among many.
In a blended learning environment setting, which is the focus of this paper, the integration of traditional factors alongside those derived from LMSs is necessary to provide a comprehensive understanding of student success and informs targeted interventions. Factors such as age, gender, and socioeconomic status significantly influence academic performance, as highlighted in studies that incorporate these variables into performance assessments (
Loan et al., 2024). Other traditional factors include Nationality, where research indicates that incorporating such factors is crucial for understanding disparities in educational outcomes (
Sommerville & Singaram, 2018;
Banda et al., 2023).
Fernández-Leyva et al. (
2021) found that such factors significantly influence academic success, particularly among immigrant students, where cultural and social skills play a pivotal role.
Furthermore, several studies have shown a positive correlation between class attendance and academic performance (
Jiang, 2022;
Ng et al., 2022;
Latif Khan et al., 2019). In one study, lecture class attendance was found to have a positive correlation with final exam results. Another study used a random forest algorithm to identify the most important independent variables affecting student performance and found that the number of school absences was one of the critical factors. A data-mining approach also revealed that absenteeism was a significant factor in predicting student performance. Furthermore, research conducted on undergraduate medical students showed that attendance had a significant impact on their academic achievements. Overall, these findings suggest that attendance and leave of absence play a crucial role in determining student performance. However, these studies did not consider the different types of attendance in class, particularly in a university learning environment, where class can be conducted in the form of lectures, tutorials, and practical assignments, which may reveal interesting insights for a more tailored intervention approach.
While traditional metrics are essential, the growing reliance on online learning necessitates a balanced approach that incorporates both sets of factors to effectively monitor and enhance student performance. For example, academic results such as marks or grades are highly important to predict the performance of the student; however, these are still considered insufficient indicators of why the student is performing in this way or at that level. Therefore, these are suggested to be used in conjunction with online metrics to provide a holistic view of performance (
Shou et al., 2024), potentially reducing the misinterpretations of student capabilities.
1.1. Research Questions
This study addresses the importance of understanding how student interactions within the Moodle platform impact their academic performance by examining both the log variables (such as the total logins, time spent on the platform, interaction with different activities, and hourly login patterns) and traditional factors (such as demographics, academic history, family background, and attendance data). This research aims to uncover insights into the relationship between online engagement and academic success by addressing four (4) key research questions formulated below:
RQ1. How do traditional features such as the student’s demographic profile, academic history, and family background correlate with the final academic mark?
RQ2. What attendance features are significantly associated with academic performance?
RQ3. How do different patterns of LMS login and engagement contribute to academic success or failure among undergraduate students?
RQ4. Which features of the LMS demonstrate the strongest correlation with academic performance among undergraduate students?
1.2. Research Highlight and Contribution
This study is substantial, as it sheds light on how online interactions in blended educational settings impact student performance. It delivers constructive information for educators to improve their teaching strategies and use of LMSs. Additionally, it was found that most of the studies in the literature focus on overall behavior patterns in LMSs, ignoring other factors, such as temporal distribution (
Li et al., 2022), that can capture distinct patterns and students’ behavior. Therefore, we aim to analyze the impact of such features on students’ performance, potentially contributing to the ongoing evolution of online education tools. Furthermore, as for attendance, this feature may include different attendance types in higher education, such as lectures or tutorials, each of which can reveal interesting findings catered to each type and may call for specific interventions by institutions; however, in the literature, researchers often use overall attendance only (
Shafiq et al., 2022;
Li et al., 2022).
Our contribution to the literature can be summarized as follows:
Establish robust correlations using rigorous statistical methods and heatmaps.
Analyze a taxonomy of factors and available features in universities that can capture undergraduate student performance.
Provide valuable insights and investigate students’ online learning behavior.
Identify key factors that are significant determinants of academic performance, aiding future researchers in the feature selection process for accurate predictive models, as well as institutions by identifying valuable markers for intervention and support strategies.
Expand on the existing literature by examining temporal online features and engagement analysis of LMS patterns—an area that has received less attention.
This study used a new dataset from a private university in Malaysia, making it particularly beneficial for similar institutions.
The rest of the paper is organized as follows.
Section 2 explains the methodology adopted for the analysis, highlighting the data collection, pre-processing, feature extraction, tools, and statistical tests used.
Section 3 includes the results and key findings of the tests.
Section 4 consists of the discussion and implications by relating the findings with the existing literature. In
Section 5, our research concludes by summarizing the importance and content of this study paper, the limitations, and, finally, future work.
3. Results
This section presents the results and key findings of the analysis and statistical tests to answer the research questions highlighted in
Section 1.1 (please refer to
Table 5).
3.1. The Impact of Demographic and Academic History Factors on Academic Performance
This dataset had 175 male students and 46 female students. Using Tukey HSD, the tests revealed a negative mean difference of approximately −16.9013 units, indicating that, on average, male students had a lower attendance rate than female students. The adjusted p-value was less than 0.05, indicating that the difference in mean attendance rates was statistically significant. This suggests that the difference was unlikely to have occurred by random chance alone. In summary, evidence suggested a statistically significant difference in mean attendance rates between female and male students. The negative mean difference indicated that, on average, female students had a higher attendance rate in tutorials than male students. This was further supported by the Mann–Whitney U test, a non-parametric test that revealed a p-value of 0.0323.
The bar plot in
Figure 4 compares the average final mark across different categories, such as male and female. This provides a quick overview of the performance using the gender variable. The findings reveal that, on average, female students scored higher or performed better in exams than male students.
Before exploring these findings in more detail to verify the plot using statistical tests, normality tests needed to be performed to check whether there was a normal distribution between each group (male with final mark) and (female with final mark). For this purpose, the Shapiro–Wilk test was conducted. It was found that since the p-value of the male group was significantly less than the standard significance level of 0.05, it was suggested that this group did not follow a normal distribution. Meanwhile, the female group’s p-value was larger than 0.1705, indicating that it fell within the normal distribution.
Therefore, due to the above findings, non-parametric tests that were robust to violations of normality needed to be performed. In this case, the Mann–Whitney U Test was performed to measure the significant difference between the distribution of the two independent groups illustrated in
Table 6.
As the p-value was less than the significance level (<0.05), we can reject the null hypothesis and conclude that there was a significant difference between the two groups.
The Mann–Whitney U test, also known as the Wilcoxon rank-sum test, is specifically designed to compare two independent groups. Since “Nationality” consists of several groups, more than two, using different statistical tests, such as ANOVA or Kruskal–Wallis, is required. However, the assumptions of normality and homogeneity must be met to perform the ANOVA test. This was tested for Nationality with the final mark, and the results revealed violation in these terms; therefore, using a nonparametric test such as Kruskal–Wallis was more appropriate.
The Kruskal–Wallis test results indicate significant differences in the “Final_Mark” scores among at least some nationalities, as revealed by the p-value of less than 0.05, which supports the rejection of the null hypothesis. The H-statistic measures the variability in the data, which can be attributed to the differences between groups.
Following these findings, it is interesting to explore which groups are statistically significant; in other words, it is worth finding which nationalities result in statistical differences with the final mark. For this purpose, post hoc tests or pairwise comparisons should be performed to identify which specific groups differ from each other.
Tukey’s Honest Significant Difference (HSD) post hoc test was performed in this case.
Table 7 shows the pairwise comparisons between different nationalities for the “Final_Mark” variable. The results included a specific comparison of groups, the mean difference, the
p-value, and the confidence interval for the mean difference.
Figure 5 represents the correlation heatmap revealing the correlations between all features relating to the demographic, socioeconomic, educational background, administrative, financial, and academic performance factors.
Using the Spearman test, a correlation coefficient of 0.86 indicates a strong positive linear relationship between the number of qualifications and the final mark. As the number of qualifications increases, the final mark also tends to increase, as illustrated in
Figure 5.
Furthermore, correlation analysis on the effect of “age” on academic performance revealed a coefficient between age and final marks of −0.75. This negative correlation suggests a moderately strong inverse relationship between age and final marks. In other words, final marks tend to decrease as age increases, and vice versa. The strength of the correlation indicates a relatively consistent pattern.
3.2. The Impact of Parental Occupation on Students’ Exam Performances
Recognizing the influence of parental occupation and involvement is essential to support a student’s academic success. In this study, we studied the impact of different parents’ occupations on achieving good performance.
Figure 6 represents a WordCloud demonstrating the top three occupations of students’ parents: business, housewife, and manager.
Parents’ occupations were assessed based on the student’s academic performance using the Kruskal test, which revealed an H-Statistic of 95.413 and a
p-Value of 0.023, less than the significance level of 0.05. This suggests a statistical difference between the student’s performance and different groups of parents’ occupations. Furthermore, post hoc analysis was conducted using Dunn’s test, which revealed the following results in
Table 8.
The post hoc analysis with Dunn’s test reveals substantial mean variations in final marks based on parents’ employment. Students whose parents work in “Business” earn substantially higher final grades than those whose parents work in “Air traffic control”, as an “Employee”, and as a “District officer”. There is substantial proof of these differences, as the p-values for these comparisons are less than the significance level of 0.05. On the other hand, students whose parents are in “Air traffic control”, are an “Employee”, are a “District officer”, or are an “Executive” tend to receive lower marks, performing poorly than students whose parents are a “Secretary”. The low p-values and negative mean differences indicate substantial inequalities in academic achievement influenced by the parents’ jobs. These results highlight how parents’ professional backgrounds affect their children’s educational performance.
Furthermore, the question of whether being a first-generation student was found to not affect academic performance, as the p-value was 0.787, larger than 0.05, reveals no significant difference.
3.3. Class Attendance Effect on Students’ Exam Performances
Boxplots were used before exploring the relationship between attendance and student performance. These are valuable tools for identifying outliers in a dataset. Outliers in boxplots are often defined as individual data points that fall outside the “whiskers” of the boxplot. Any data points beyond the upper and lower whiskers are considered potential outliers. In this case, no outliers are defined in the attendance dataset.
By extracting the start and end hours, calculating the class duration, and then categorizing time slots into Morning, Afternoon, and Evening using the if else function, it is found that Morning slots/classes have higher attendance rates than Afternoon classes. However, based on the t-test results, there is no significant difference in class durations between students who attended (“Y”) and those who did not (“N”). The p-value is more significant than 0.05, suggesting that any observed differences could be due to random variation, and there is not enough evidence to conclude that attendance status is associated with a significant difference in class durations.
Lectures generally have higher attendance rates compared to tutorials. The lecture absence rate has a moderately positive correlation of 0.59 with the tutorial absence rate, which suggests a moderate tendency for students who miss lectures to also miss tutorials.
Before analyzing the effect of attendance on student’s performance, normality tests were conducted using the Q-Q (Quantile–Quantile) plot, which is a graphical tool used to assess whether a dataset follows a specific theoretical distribution, such as the normal distribution. It compares the quantiles of the observed data against the quantiles of the expected distribution. As a result of
Figure 7 and the Shapiro–Wilk statistical test, the dependent variable, which is the final mark, does not follow a normal distribution. This is due to some points departing from the straight line and the
p-value being less than 0.05, which is relatively small.
Due to the above findings, non-parametric tests are more suitable, as they are less sensitive to outliers, work well with small sample sizes, and do not assume a specific distribution for the underlying population. In other words, they can capture a non-linear relationships as well. Instead of relying on the Pearson correlation, the Spearman rank correlation is performed to test the relationship between attendance rates and final marks, as shown in
Figure 8.
In summary, the tutorial attendance rate has a statistically significant positive effect on final marks, while the lecture attendance rate does not show a significant monotonic relationship. Furthermore, the attendance rate for tutorials correlates positively with students’ exam final mark of 0.43. This suggests that as the tutorial attendance increases, the final mark increases. In other words, students who attend tutorial classes are more likely to score well in the final exam.
3.4. Assessing LMS Login Behaviour with Attendance and Exam Performance
Figure 9,
Figure 10 and
Figure 11 provide an intuitive way to understand the relative proportion of logins during weekdays (coral sector) compared to weekends (blue sector). The pie charts grasp the temporal distribution of average logins and identify patterns in login behavior based on the day of the week.
As illustrated in
Figure 12a, tutorial attendance correlates positively with weekend logins by having a strong 0.79 correlation.
Furthermore, weekend logins and final exam marks positively correlate, using a Spearman correlation coefficient of 0.64. This positive correlation indicates that students who engage more on weekends (higher weekend logins) tend to achieve higher final exam marks. This could imply a positive relationship between online engagement during weekends and academic success. This is illustrated in
Figure 12b.
Regarding hourly login, a Spearman correlation coefficient of 0.64 between late PM logins (19:59 to 23:59) and the final exam mark indicates a moderately strong positive monotonic relationship between these two variables. Students who engage more during late evening hours (late PM logins) tend to achieve higher final exam marks. This could imply a positive relationship between night-time online engagement and academic success.
3.5. Assessing LMS Engagement Behaviour with Attendance and Exam Performance
To investigate the different patterns of LMS logins and engagement among students who passed and failed, a Kruskal–Wallis test was performed, where the passing score was assumed to be set to 50 out of 100. In
Figure 13, the bar plot shows the obtained
p-values based on the significance levels (<0.05), which is indicated by the red dashed line for all features extracted from Moodle.
Moreover, as supported by the Figure, we can conclude that actively engaging with the module on the platform, such as viewing the page, contributes to better understanding and performance. This is indicated by the positive p-value of 0.006.
Students who obtained a passing score in their module may be more engaged with peer interaction and collaborative learning. This is indicated by the 2.48 × 10−8 p-value on the “forum create” feature. This group of students is more likely to create or initiate discussion in the online course forum.
In terms of login behavior, we identified that the total number of logins to the platform and weekday logins does not differ significantly between “Pass” and “Fail” students.
Figure 14a,b reveal the correlation of LMS academic engagement with the attendance and final mark, respectively.
However, weekend logins were statistically significant among the groups, with a
p-value of 0.0053. This was further investigated by taking a quantitative approach using box plots to calculate the difference in median final marks among students who logged in during the weekend and those who did not, as demonstrated in
Figure 15.
3.6. Assessing Time Spent on the LMS with Attendance and Exam Performance
To study whether the total average time spent daily by students in minutes affected their attendance and exam performance, a heatmap was used to study the correlation, using the Spearman rank test as illustrated in
Figure 16a,b.
Lastly, the time spent on the platform in minutes among “pass” and “fail” students showed a statistically significant difference, with a p-value of 0.008. This emphasizes that investing more time in accessing the module materials and engaging in different activities on the platform may lead to student success.
4. Discussion
In this section, we discuss the results and their implications, highlighting the contributions made by this study to the existing literature and extensively addressing the research questions.
The results of our study indicate that students with a higher number of qualifications tend to achieve higher final marks. This positive correlation suggests a valuable finding, as it provides insights into the relationship between academic qualifications and exam performance. If this relationship holds, it might be worthwhile for educational institutions to consider the number of qualifications as a potential factor influencing student success, suggesting that prior academic experience may contribute to overall performance.
Our study also investigated whether cultural differences impact student performance, as revealed by the findings on the Nationality of students. Given that some international students, such as those from Libya, may not be familiar with the Malay language or its cultural context, this could present additional challenges for them in adapting to the teaching and learning atmosphere and achieving higher marks in subjects that emphasize these aspects, such as MPU modules. This supports the conclusion that there are actual disparities in academic performance among these groups, and the findings are not merely due to chance, as indicated by the p-value. Such insights can help educational institutions better support international students.
Attendance count and whether the student takes a leave of absence are strong predictors of student performance. In this study, it was revealed that, overall, female students had better class attendance than male students; this was in line with
Mitra et al.’s (
2022) results, where a positive correlation was revealed. However, our findings offer new insights into analyzing student performance according to different class types rather than focusing on overall attendance. For example, this study investigated how traditional factors incorporated with online LMS features are used to understand student engagement in a bigger picture. We reveal that more weekend logins indicate higher tutorial attendance, which could have many implications related to engagement in learning. The relationship is said to be monotonic, meaning an increase in both variables; however, it is not necessarily at a constant rate. This might be due to the inconsistent trends in students’ logins over time.
In this study, it was found that students attending lectures are more likely to participate in tutorials. However, other factors might contribute to this correlation, and further analyses or experiments would be needed to establish causation.
The above findings suggest that students who attend tutorials regularly are more likely to be engaged in the learning process and demonstrate higher levels of weekend login activity, emphasizing their commitment to academic success, as they are proactive about their studies during weekends, possibly working on assignments, reviewing materials, or seeking additional resources online. The positive correlation highlights a consistent pattern of learning behavior among students. Those who invest time in tutorials during the week may carry this commitment into the weekend, maintaining a steady approach to their academic responsibilities.
Furthermore, our study revealed that tutorial attendance is more prominent in impacting exam performance. This highlights the importance of a holistic approach to student engagement, where it suggests that students who passed their module exhibit a higher frequency of logins outside of their class hour, as they tend to put more effort into reviewing the module materials and completing their tutorials during the weekend. Further investigation revealed that, on average, students who regularly log on to the platform on the weekends tend to achieve a final mark of 6.0 points higher than those who do not, representing a 10% relative increase in their score.
In research, it has been well documented that parental occupation has a significant impact on students’ academic performance. Prior studies reveal that students whose parents work in professional fields tend to have higher academic achievements (
Hussain, 2022;
Roshita et al., 2023), a trend mirrored in the findings of our study as well. More specifically, it highlighted the superior performance of students whose parents work in “Business” roles, which may be associated with high educational expectations, instilling values of diligence and perseverance in their upbringings. Conversely, other occupations such as “Air traffic control”, “Employee”, “District officer”, or “Executive” tend to affect the performance of students, where they might face academic challenges. These results emphasize the critical influence of parents’ occupations on educational outcomes.
Furthermore, age has been considered by many studies as a feature that can be used to analyze and predict academic performance. However, our study reveals an inverse correlation between age and the final mark, with older individuals tending to have lower performance. This was further supported by the findings of
Bravo-Agapito et al.’s (
2021) study, indicating age as a negative predictor of performance. This may be due to the decline in cognitive mechanisms necessary for learning something new that occurs with age (
Ameen et al., 2019). Therefore, while age may have an impact on academic performance, it is not the sole determining factor, and other individual characteristics that were beyond the scope of this study should also be considered.
Traditional factors such as student’s demographics and socioeconomic background may have some influence on rationalizing differential outcomes and identifying the performance of students, but it must be used with caution (
You, 2016). However, this does not eliminate the necessity for the inclusion of these features in prediction modeling to facilitate robust profiling studies of academic performance. In terms of demographic features such as gender, our study indicates that female students tend to perform better in final exams, and having a scholarship may be associated with a statistically significant impact on final marks in the dataset, suggesting that students with scholarships tend to perform better in final exams.
Forum tools on online educational platforms are utilized to anticipate participation and discussion among students. This feature has remained persistent in the existing literature, as it can strongly predict a student’s performance (
Shafiq et al., 2023). Active participation in forums fosters knowledge sharing, problem solving, and community building, positively impacting learning outcomes. Our study contributes to the literature by demonstrating that creating discussion in forums is significant among pass and fail students, suggesting that educational institutions should put more emphasis on this tool to improve learning practices. This was in line with the results found by
Pa et al. (
2023) on the factors related to predicting the final results, which suggested that more focus on forum discussion is required.
Overall, these findings highlight the importance of active engagement, participation, and time investment in academic success. Students who pass tend to demonstrate higher levels of interaction with course materials, assignments, and peers, suggesting a more comprehensive and dedicated approach to their studies. By contrast, students who fail may benefit from strategies to increase their engagement and involvement with course content, assignments, and collaborative learning opportunities to improve their chances of success. This was in line with
Jovanović et al.’s (
2021) findings, where it was concluded that the total time spent online, actively contributing to the forum, and access to course materials proved to be the most significant predictors of students’ learning outcomes (measured through the final course grade) across courses.
Among the login behavior features, it is concluded that the total number of logins to the platform and weekday logins do not affect academic performance. This could be explained by the instructional design of the courses in the university, where students are required to access the platform on a weekly basis, often during their classes, whether low- or high-performing students; therefore, the results did not reveal any significance for such features. However, total logins to the platform may be useful in understanding student online engagement patterns; for example, our results revealed that modules such as MPU and UCM that are heavy on tutorial tasks may exhibit higher course logins by students on weekends. Furthermore, we notice that the number of log entries tends to increase during week 15, assuming the modules in this study expand over a 14-week period for one semester in the university; week 15 is where students may likely refer more to course materials to study in an attempt to prepare for their exams.
Furthermore, as revealed by the results, there is a positive correlation between the class absence rate and the time spent on the e-learning platform, which may seem counterintuitive at first glance; however, it could be due to several factors such as compensatory behavior, where students who are often absent for class may feel the need to compensate and spend more time to catch up on the missed materials, or it could be due to the complexity of the course material.
Ultimately, it is found that module/course views, forum creation, overall assignment interaction, and time spent on the platform are among the top LMS variables that contribute significantly to students’ academic performance. The analyzed features seem to complement each other, such as tutorial attendance and login activity on the weekend. While the importance of such variables has already been widely mentioned in the literature (
Li et al., 2022;
Arizmendi et al., 2023;
Alnasyan et al., 2024;
Wang & Mousavi, 2023), the latter features related to the weekend and hourly logins received less attention (
Quinn & Gray, 2019), and their significance across multiple courses is a new contribution that this study brings to the field.
5. Conclusions
The study examined a comprehensive set of online and traditional features covering several aspects of students’ learning behavior in an LMS, involving indicators of engagement levels and interaction regularity. Besides the overall analysis of students’ online behavior, temporal analysis across multiple courses was performed.
By analyzing log variables alongside traditional factors, future researchers can gain deeper insights into how students engage with e-learning platforms, which in turn helps educators tailor their teaching methods and optimize their educational resources. For example, in this study, it is revealed that instructors need to encourage their students to engage more with the content available on Moodle, especially using the Forum tool, as it can spark discussion and motivate them to score higher in their final exam.
The study is also beneficial to future research, as it uncovers the relationship between online and offline features that could be leveraged to build accurate predictive models in the field of Learning Analytics. Moreover, as the results revealed significant and insignificant features in Moodle logs, such insights can greatly aid future researchers in building effective feature selection methods.
5.1. Limitations
This paper is subject to certain limitations that can be addressed by future research. The study was based on a sample of 19 blended courses that were assumed to be homogeneous regarding institutional settings, discipline, and nominal learning design. Therefore, this is a current limitation of the study, as less information on the courses was provided. Furthermore, several other forms of content on Moodle exist, such as quiz components, videos, and lecture recordings, which were not considered in this study due to their lack of use in the courses involved in this study.
5.2. Future Direction
As e-learning tools evolve, students may exhibit different interaction and behavioral patterns on learning platforms; therefore, in the future, it is worth identifying heterogeneous groups of students to build predictive models with the ability to provide tailored and group-focused interventions. Furthermore, researchers could investigate more patterns beyond the accessibility features in Moodle logs, as this could provide more valuable information on what kind of resources students prefer and make more informed decisions on their teaching and learning. Future research could also investigate the underlying factors contributing to the inverse relationship between age and academic performance, such as external commitments like work and family responsibilities, cognitive changes, or disparities in educational experiences.