Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem

Sun, Geng; Cui, Tingru; Beydoun, Ghassan; Chen, Shiping; Dong, Fang; Xu, Dongming; Shen, Jun

doi:10.3390/su9060898

Open AccessArticle

Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem

by

Geng Sun

^1,*,

Tingru Cui

¹,

Ghassan Beydoun

²,

Shiping Chen

³

,

Fang Dong

⁴,

Dongming Xu

⁵ and

Jun Shen

¹

School of Computing and Information Technology, University of Wollongong, Northfields Ave, Wollongong, NSW 2522, Australia

²

School of Systems, Management and Leadership, University of Technology Sydney, 15 Broadway, Ultimo, NSW 2007, Australia

³

CSIRO Data61, Marsfield, NSW 2122, Australia

⁴

School of Computer Science and Engineering, Southeast University, Nanjing 211189, China

⁵

UQ Business School, The University of Queensland, Brisbane, QLD 4072, Australia

^*

Author to whom correspondence should be addressed.

Sustainability 2017, 9(6), 898; https://doi.org/10.3390/su9060898

Submission received: 5 April 2017 / Revised: 12 May 2017 / Accepted: 19 May 2017 / Published: 26 May 2017

(This article belongs to the Special Issue Smart X for Sustainability)

Download

Browse Figures

Versions Notes

Abstract

:

Micro Learning through open educational resources (OERs) is becoming increasingly popular. However, adaptive micro learning support remains inadequate by current OER platforms. To address this, our smart system, Micro Learning as a Service (MLaaS), aims to deliver personalized OER with micro learning to satisfy their real-time needs. In this paper, we focus on constructing a knowledge base to support the decision-making process of MLaaS. MLaas is built using a top-down approach. A conceptual graph-based ontology construction is first developed. An educational data mining and learning analytic strategy is then proposed for the data level. The learning resource adaptation still requires learners’ historical information. To compensate for the absence of this information initially (aka ‘cold start’), we set up a predictive ontology-based mechanism. As the first resource is delivered to the beginning of a learner’s learning journey, the micro OER recommendation is also optimized using a tailored heuristic.

Keywords:

adaptive learning; micro open learning; educational data mining and learning analytics; cold start problem

1. Introduction

In the information age, the development and dissemination of learning resources are booming at a much higher speed and wider range than their traditional shapes. People have shown increasing interest in getting access to online learning resources and getting involved in online learning activities, especially via mobile devices [1]. Many leading universities have opened up access to their courses. Indeed, access to open education resources (OERs) is exponentially increasing. This boom of OERs gains wider popularity in the entire higher and adult education sector, and has also attracted many researchers’ attention from educational, social, and computational views [2]. According to the latest statistics, millions of people have attended the virtual classroom of online open learning to access OERs, which are produced and updated on a daily basis. This leads to an emerging concomitant trend: that of open learning [3].

Open learning is recognized as a novel and effective learning method that could lead to a revolution in traditional learning, distance learning and electronic learning (e-learning) which have been widely used in the first decades of 21th century. Nevertheless, current OER delivery still faces challenges and its sustained success remains in doubt. Recent studies actually suggest that massive open online courses (MOOCs), a common open learning environment, are currently suffering from low completion rates [4]. Most learners who enroll in MOOC courses end up dropping out. Educational professionals have focused much of their effort on exploring open learning, OER and MOOC formats, but the concomitant pedagogical innovations for mobile learning are yet to receive sufficient attention [5]. Indeed, there are many opportunities to improve open learning and OER delivery.

Our previous studies [6,7] demonstrated that micro learning is becoming a mainstream online learning mode; but coupled with mobile platforms, the knowledge attainment process is often fragmented. Using mobile devices, learners are easily affected by their mood or environmental distractions [8]. In this paper, we will present our research on profiling micro learning processes through OERs and building a knowledge base to support the decision-making process of micro OER adaptation. This knowledge base will be built using a top-down approach. A construction of augmented ontologies oriented to micro open learning will be illustrated first, followed by a data processing strategy. Due to the brevity of learner information at the commencement of the micro open learning, we will introduce an ontological approach to technically address the cold start problem.

2. Background

2.1. Nature of OERs and Open Learning Delivery in Mobile Environments

Open learning is different from on-campus, e/m-learning modes. OERs are “digital learning resources offered online freely and openly to teachers, educators, students, and independent learners in order to be used, shared, combined, adapted, and expanded in teaching, learning and research” [3]. Open learning is the combination of informal learning and formal learning. Learners enjoy high flexibility in online open learning because there is no strict time constraint for joining and quitting. Learners engaged in open learning are from different age groups and cultural backgrounds, with a wide range of geographic distribution.

Generally, OERs can be differentiated from MOOC and open courseware (OCW). Contrary to MOOC, OCW only offers course materials rather than entire courses. In other words, OER can be structured (MOOC content), unstructured (i.e., OCW), or even both. OER providers and instructors have tried to promote their courses and affiliated educational products at full stretch. They have leveraged mobile learning (m-learning) for learners to easily participate in learning activities regardless of restrictions in time and location.

Another aspect is that mobile learning activities in open learning normally consist of two sections: online learning and offline learning [9]. Since mobile learners can freely download materials onto their mobile devices to view offline, they do not often stay on open learning platforms and attend virtual classrooms [10]. In fact, accessing OERs online is only a part of learning; more tasks associated with learning would require offline activity [11], such as data collection, data analysis, and report writing for an assignment. Logically, mobile open learning is conducted through online systems that include guided and instructional materials, transaction details and deliverable resources [12]. Hence, while learners are able to accomplish many open learning tasks offline, for some necessary procedures, such as data entry and work submission, they need to go back online to conduct these specific tasks.

2.2. Micro Learning

Micro learning refers to short-term learning activities in small learning units [13]. Its learning process can cover time spans from a few seconds (e.g., in mobile learning) to up to 15 min [14]. With mobile devices, learners normally accomplish learning objectives in a short time period. According to prior study [14], micro learning can be defined by the assumption that a short time span is needed to complete a relevant learning task. Hence, micro learning is booming with the wide use of mobile devices, and is becoming a major learning tool in the mobile environment. Micro learning shares some similar characteristics with mobile learning, as both are individually referable, self-contained, reusable and re-mixable [15].

Micro learning resources are available on-demand to facilitate just-in-time learning [16]. These small learning bytes cannot be learned on-the-go, but require less effort. They can aid quick assimilation, thus reducing the dependency on a fixed time slot or the need to take a large chunk of time out of learners’ working day [13]. As micro learning evolves, micro-content delivery with a sequence of micro interactions enables users to learn without information overload [16]. Compared to traditional learning modes, the overall effort required to progress through an entire concept will proceed in a continuous, or even intermittent, way rather than a consecutive way [16].

3. Research Challenges and Design

3.1. Research Design

3.1.1. System Framework and Previous Work

In our previous studies [6,7], we have discussed the popularity of adopting micro learning in accessing OER, especially through mobile devices. The necessity of improving existing mechanisms of micro learning support has also been stated. Having studied the present status of research and development of open learning and OERs, we are motivated to carry out research to provide learners with adaptive OERs by means of micro learning with regard to their individual needs. In other words, we are dedicated to tailoring OERs into chunks of relatively short time length, and allocating them to learners at the right time. This approach was realized by Software as a Service, Micro learning as a Service (MLaaS). In optimal conditions, through use of MLaaS, learners can easily complete the learning process by using their fragmented pieces of time. For example, a learner may spend 15 min using mobile devices to learn a piece of a MOOC course on his or her way home from work by train. In this case, an ideal course module delivered to him or her should be limited to that time length (e.g., 15 min), to ensure a micro but complete learning experience.

The framework of MLaaS is shown in Figure 1. As a data-rich system, MLaaS will be able to exploit detailed learner activity data not only for recommending what the next micro learning activity for a particular student should be, but also for predicting how that student will perform that future learning content.

In our pilot work [17], we proposed peer-to-cloud and peer-to-peer models for resource sharing and storage in service-oriented contexts. Such models can have higher upload and download speeds than a traditional cloud model, user model or peer-to-server-peer model, and can be more robust to the failures of peers or servers in the cloud environment [17,18]. Hence, we adopt this design and apply its concept as the topology of the new system for micro open learning.

The P2P sub-network of the proposed system is to conform with the nature of open learning, where varieties of P2P learning occur frequently and randomly. This P2P tier guarantees that P2P learning can be organized instantly, and the first-hand resources can be shared and exchanged, regardless of access to the cloud.

From the top-down view, MLaaS borrows the cloud service to maximize the capability of hosting. The cloud part of the system consists of four domains: data tracking, data collection, data processing and data storage.

The functions of modules in MLaaS’s cloud-end are outlined in prior works [5,19]. A noticeable feature of the system is that there are three file transmission channels:

A channel between learners and an instructor-created OER pool in the cloud storage part (i.e., Channel A in Figure 1).
A channel between learners and a learner-generated OER pool in the cloud storage part (i.e., Channel B in Figure 1).
A channel among all learners engaged in open learning (i.e., Channel C in Figure 1).

Once a learner indicates his or her desire to carry out micro learning and sends such a request from a mobile device, OERs will be transmitted through one of the three channels.

Where the OERs actually come from in the cloud resources pools (i.e., from which exact cloud nodes the OERs are retrieved and invoked) will be defined and externally supported by third-party service-selection and resource-allocation services from mainstream service providers. This problem has been well studied; typical solutions can be found in the work reported in [20].

We have reported on the architecture and technical details of MLaaS in prior studies [19,21]. A comprehensive description is beyond the scope of this paper. It is worth noting that MLaaS only produces micro OERs, rather than OERs. That is to say, normal OERs available online are collected by MLaaS and clustered in the OER pools, as shown in Figure 1. For this reason, despite MLaaS owning its data collection mechanism, it shares some demographic and educational data with the platforms or providers from which the OERs originate. This helps in learner profiling, which will be introduced in the Section 5.1, even if a new learner registration in MLaaS is informal, and without sufficient demographic and educational data provided.

The system framework has thereby briefly been introduced here as background, and we will now move on to the focus of this paper: ontology construction, data processing strategy and cold-start problem.

3.1.2. Research Problem Identification and Design

Given all decisions of micro OER adaptation are made by the Adaptive Engine, it acts as the core of the system. It consumes the results from all other services and transmits its output straight to the user interfaces. For this reason, the MLaaS is conceived to meet the standard of a data-rich system, and a knowledge base serves as its think tank. Basically, the knowledge base is constructed using a top-down approach, by making use of semantics means, from the pattern level to data level. In other words, several ontologies are drawn at first, followed by ‘data processing’ work. We attempt to combine the pattern and rule discovery processes of micro learning with a survey of the education literature to produce features that could affect the learning experience and outcomes in a mobile environment [21]. In detail, ‘data processing’ work involves all operations on data, from the very beginning to the end, such as entity extractions, relationship extractions, resolution disambiguation and so on.

Given that we have the overall system framework in place, we consequently adopt a conceptual graph-based approach for dealing with the ontology construction [22,23]. These graphs profile the features that play a significant role in an ongoing micro learning process, and also depict how features were mutually affected by and interrelated with each other. According to our design, the profiling procedure is carried out from two sides, the learner side and the OER side.

While the profiling proceeds forward, some new problems appear which contradict our original design intentions. One of the most important problems is that the system, MLaaS, knows little about the learners, because either ‘OER’ or ‘learner’ is new to this emerging educational setting. This creates serious difficulties for beginning the ‘data processing’ process. Profile construction is impossible with insufficient information about the learner at the commencement of open learning. Therefore, the learner profile cannot be fully filled in with valid data.

In this paper, the research focuses on the knowledge base for micro OER recommendation and delivery. Naturally, based on the volume of retrievable data, this problem can be approached from two sides.

If a learner is well-known by the MLaaS, an educational data mining and learning analytics (EDM/LA) approach will be applied to his or her historical data to understand his or her learning patterns and preferences. Thereby, a well-grounded recommendation can be made based on his or her personalized settings and particular surroundings.
If a learner is relatively poorly known by the MLaaS, (i.e., this is a new learner to the OER environment), this will be treated as a cold-start problem and tackled by filling in the gaps with predicted data, so that a recommendation will be made based on demographic information. Freshly generated information, along with the cold-start recommendation, will populate the first version of a learner’s profile.

3.2. Research Challenges

3.2.1. EDM/LA for Micro Learning

Student learning data collected by open learning systems are explored to develop predictive models by applying educational data mining methods that classify data or identify relationships. These models play a key role in building adaptive learning systems, in which adaptations or interventions based on the model’s predictions can be used to change what students experience next, or even to recommend academic services to support their learning.

Analyzing these newly logged events requires new techniques for working with unstructured text and image data, data from multiple sources, and vast amounts of data (“big data”). Big data does not have a fixed size; any number assigned to define it would change as computing technology advances to handle more data. For example, Manyika et al. defines big data as “Datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze” [24].

At this cutting edge, EDM and LA are widely used in research. They are used to build models in several areas that can influence online learning systems. As its name implies, EDM is a state of the art method that applies data mining techniques to educational data. It is concerned with many developing methods, and acts on exploring the unique types of data in educational settings. Using these methods, students and educational settings can be better understood [25]. To enable smart and adaptive micro learning for MOOC, EDM and LA are key concepts that we employ to build the basis of the dynamic learner model construction.

Generally, OCW data is locked away in independent data silos hosted by different OCW/OER providers. This makes it much less useful than it could be. It is difficult to develop tools for consuming data from multiple silos. Searching OCW/OER across multiple silos means invoking the user interface of each one, and receiving the results in separate groups. The presence of data silos makes accessing data and interoperability between repositories harder in several ways.

Browsing OERs also has a problem in that each silo has its own organizational structure. Some silos have no way to link to a particular item, and so hinder the free flow of information. The presence of OCW silos impedes the interoperability, discovery, synthesis, and flow of knowledge. As a result, it is a difficulty for teachers, students and self-learners to look for resources, and sometimes they make decisions based on incomplete information. Linked data have the potential to create bridges between OCW data silos.

3.2.2. Cold Start Problem in Micro OER Delivery

In computer science literature, widely used adaptive recommendation methods generally consist of two main categories, i.e., memory-based and model-based algorithms [26]. Although they have been found in many successful cases of recommender systems, for example, the Amazon online store, it is usually difficult to provide reliable recommendations due to the insufficiency of initial data of ratings or preferences. This leads to the occurrence of the cold start problem. Commonly, the cold start problem is triggered by three factors: new community, new item and new users.

The cold start problem becomes more severe in open learning, especially in micro learning through OERs [14]. Both open learning and OER are relatively new products, which have been emerging in very recent years. Meanwhile, the followers of this novel trend, no matter whether they are new education pursuers or regular learners migrating from other online learning modes, are forming a completely new community. On the other hand, the learning demands and expectations of learners engaged in open learning are much more practical than conventional university students. In other words, they are mostly self-regulated, so that they have a great deal of flexibility in deciding when to join or quit the online course, and they can switch among courses frequently and at will [27]. Consequently, for OER providers, it is difficult to establish a model and update it accordingly for any individual learner because they do not have any historical data to hand.

In micro open learning, or micro learning over OERs, it is very normal to find that learners take part in and deviate from the learning scenarios frequently, as well as turning on and off the learning activities at will. That is to say, the overall situations of micro learning vary completely from individual to individual. Moreover, it is very common for freshmen to join open learning, or for existing learners to unfold a brand-new course learning profile at any time. All in all, then, (1) there are a large number of new learners in open learning; (2) new learners usually initiate access of new learning resources; and (3) learners who went through learning resources in the same branch of a discipline will form new communities.

If treated carelessly, the cold start problem may lead to the loss of learners who were previously engaged in open learning, but decide to stop using the OER delivery system or be unwilling to adopt the learning mode [28]. The reasons behind the situation are mainly due to the lack of accuracy in the recommendations received in that first stage, in which the learners have not yet cast a significant number of votes or rating to feed the recommender systems. Basically, the sparsity of data affects user satisfaction, and then it can further affect user acceptance of the new open learning mode.

3.3. Contribution

Extended from our published works [19,29], which presented the general framework and educational background of micro open learning, this research will provide innovative deliverables by the following means:

Top-down processing of semantic knowledge base building.
Conceptual graph-based ontology construction for the pattern level.
Data source documentation and data processing strategies for the data level.
Complete ontology-based mechanism for tackling the cold start problem.

4. Conceptual Graph-Based Ontology Construction for Micro Open Learning and Proposed Data Processing Strategy

4.1. Conceptual Graph-Based Ontology Construction

Generally, a workable knowledge base has a two-tier structure: a pattern level at the top and data level at the bottom [30]. For the pattern level, the ontologies are constructed based on conceptual graphs, as we briefly described in Section 3.1.1. By this means, the ontologies represent the formal dimensions of the data processing workflow, and can drive data processing with a priori knowledge, thereby reducing the search space [23,31].

By accomplishing a comprehensive survey of literature in the fields of pedagogy, psychology, e-learning and mobile learning, we sorted out features that might play key roles in the micro open learning experience and achievement. These conceptual graphs also represent how features were affected by and interrelated with each other in the ongoing micro open learning process. This will be introduced in the subsequent Section 4.1.1 and Section 4.1.2, from the OER (item) side and learner (user) side, respectively.

4.1.1. Augmented Micro OER Ontology

From the item-based perspective, we deepen the sights from normal e/m-learning into the micro learning environment. For this reason, the general ontology of OER is augmented to adapt the needs of micro learning.

In an augmented micro OER ontology, an annotation of a micro OER is self-describing; with metadata exploring its educational parameters, such as typology (video, audio, text, etc.), type of interaction (expositive, active, mixed, two-way), didactic model (e.g., inductive, deductive, learning by doing, etc.), and non-functional attributes, such as QoS, semantic density and so on [32]. Each node in an augmented OER ontology indicates a micro OER chunk. A chunk is the smallest unit in the micro learning settings—normally a finely-cut piece of an OER from its provider—with an apparently shorter time length (preferably less than 15 min) than its original shape. It can be a mini concept or knowledge point, tinier than what teachers used to deliver; or it can be a cut of course video or lecture notes; or course settings delivered along with a concept, such as assessment, task, reading material and so on [21].

No chunk is totally independent, and each of them is part of a relational web rather than merely a conceptual object [33]. This ontology is used to explicitly classify the OERs for recommendation among a pedagogically defined set of distinctive main concepts, fed as the raw material into the reasoning process of MLaaS [29,33].

A conceptual graph of the augmented OER ontology is shown as Figure 2.

4.1.2. Augmented Micro Learning Learner Profile Taxonomy

From a user-based perspective, the main ontology, on which all learner profiles are based, is named the Benchmark ontology, where the element Learner is put at the center of the graph [29]. Acting as an instance of a preset domain ontology, a specific learner profile oriented to micro learning is a set of nodes from the Benchmark ontology matched with a node in the augmented micro OER ontology. It contains plenty of annotations in terms of their learning behaviors and context. A conceptual graph of the benchmark ontology is shown as Figure 3.

4.2. EDM and LA Strategy

Reporting learners’ data visually and statistically to demonstrate their unique learning story, and also their learning constraints (such as time availability), is crucial. This plays a significant role in assessing learners’ study status, estimating learners’ study progress, and carrying out strategic decision-making. This process is responsible for the benchmark setting for routine data extraction from the open learning platform.

For the bottom level (i.e., the data level) of the knowledge base, the technical operation of semantic learner profiles and knowledge base construction for micro open learning is based on data that populates the graphs from two sources: explicit data collection (e.g., through mandatory requests); and implicit data tracking (e.g., automatic extraction) [33].

In addition, rather than developing the domain ontology for OERs by ourselves, a general structure of courseware ontology is built jointly by making use of existing ontologies, which were extracted from major OER providers, such as universities involved in major open courseware alliances (e.g., participating institutions in edX (https://www.edx.org/schools-partners)), or from the Linked Open Data Cloud community (http://lod-cloud.net/) [34].

The investigation of ‘big’ open learning data is organized at the OER side. Among the massive OERs, three main types of relations are foreseen:

ConsistsOf is an inclusion relation. This relation can be generally found between two OERs or one OER and one micro OER. Two items with this relation are located in different hierarchies of the augmented micro OER ontology.
RequiredSequence is a strong order between two items (OER or micro OER), where the former micro OER must necessarily be learnt before the latter one, due to course settings and educational consideration.
RecommendedSequence is a weak order relation between two items (OER and micro OER), where the former micro OER is suggested to be learnt before the latter one, according to the instructor’s guidance, but is not mandatory.
It is certainly possible for two items (OER or micro OER) to have no relation at all.
Both relations regarding sequence can be inherited by entities’ descendants, for example, if there is a RecommendedSequence(R₁, R₂) indicating an OER R₁ is preferably learnt prior to R₂, then, for MR₁ ∈ R₁ and MR₂∈ R₂, there is a RecommendedSequence(MR₁, MR₂).

The purpose of the EDM/LA is to amend, enrich and validate the aforementioned ontologies built manually and extracted semi-automatically, and verify and weigh the importance of discovered relations. Our combination of EDM/LA is realized on the basis of two components [35]; on-campus mobile learning data (i.e., structured data), and ‘big’ open learning data (i.e., unstructured data). In particular, we are carrying out the experimental EDM/LA by conducting a substantial analysis of the real data of learning behaviors of students from a public university in Australia. The data are collected from the main learning management system (LMS) and data warehouse of the university. This analysis aims to identify the regular patterns of students getting involved in blended learning (i.e., on-campus learning and e/m learning); for example, whether and how often they adopt micro learning modes to accomplish learning tasks, to explore the major factors that affect their learning habits, and most importantly, to understand the rules for the ways in which features listed in the personalized learner model are mutually affected by, interrelate with, and act upon their learning outcomes. At this stage, we are discovering potential trends, which cannot be directly shown from the data we have gathered. We can then apply such findings to open learning scenes and infer what is behind the scene. The detailed data sets are illustrated in Table 1:

The study is subsequently extended and applied to a larger scale, by analyzing ‘big’ data from real open learning activities. Data mining means with different aims are shown in the first column of Table 2.

To a large extent, the establishment of the data level can involve integrating heterogeneous OCW repositories, refining and blending available OERs into the micro learning context and publishing their metadata as linked data. Because in recent years some educators and researchers have made great efforts to publish and popularize the OER in terms of the linked data concept, a workflow developed with this extended aim is generally divided into six phases:

Identify and select heterogeneous data sources to determine the scope of the content.
Model vocabularies for OER domains.
Data extraction.
Generate standardized data descriptions (e.g., RDF data).
Publish linked data.
Consume and display linked data.

5. Ontological Approach for the Cold Start Problem

5.1. Representation of Learner Profile

Adopting ontologies as the basis of the learner profile is crucial in addressing the cold start problem in micro OER delivery. It allows the initial learner behavior to be matched with priori knowledge defined in the ontologies and relationships among them.

The learner profile is managed by MLaaS in two parts: a static part and a dynamic part. The static part can be represented by a vector, which contains demographic and educational information. By matching these two augmented ontologies for item and user, respectively, the dynamic part of a learner node is denoted as a pair, L_j = {MR_u, ML_j}, L_j ∈ L. Herein, the element MR_u denotes the uth micro OER, as described in Section 4.1.2, which is a particular version of the micro OER ontology, and a three-dimensional element ML_j {P_u_,j, TA_j, D_j} is exclusive to jth learner during the micro learning process. Herein, the element P_u_,j indicates the learner’s preferences, TA_j indicates the jth learner’s instant time availability, and D_j denotes the level of distraction in terms of the given learning environment and surroundings.

Whenever MLaaS gathers any information from the learner’s learning process over OER, the learner profile will be updated with regard to ML_j.

5.2. Preference Propagation

Provided the cold start condition for the first micro OER delivery, a learner is required to quickly mark down a preference for a specific micro OER. Consequently, a spreading activation approach is applied to maintain the preference against its parent node (i.e., the R_v is the vth OER from which the MR_u is derived) as well as updating the learner profile. It propagates the learners’ preferences upwards in the hierarchy of the micro OER ontology based on activation values. In other words, the preference obtained from a micro OER is applied to its ancestor and spread in its superclass (i.e., OER) level. An example of the spreading activation is shown in Figure 4.

A partial view of augmented micro OER ontology in the ‘information technologies’ area is shown in Figure 4. Specifically, it describes an ‘e-business’ OER from an Australian provider, OpenLearning (https://www.openlearning.com/). At the bottom level of the ontology, nodes which are depicted with an oval shape typically conform to the standard of micro OER. Red integers shown in nodes with a rectangle shape are preference values from a learner versus target OERs. Algorithm 1 is proposed to execute the process of preference propagation.

Algorithm 1 Preference Propagation

Input: Dynamic part of learner profile L_j ={ MR_u, ML_j }, a trial micro OER MR_u , L_j ∈ L

Output: Updated dynamic part of learner profile with updated P_u,j value in the triple dimensional set ML

P(R_v)and Activation(R_v), preference value and activation value for the OER R_v

//Step 1: Spreading Activation

begin: Initialize PriorityQueue;//PriorityQueue is the set of OERs within the same discipline where R_v belongs to

Set Activation of all micro OER to 0

for each L_j ∈ L do

if (MR_u ∈ R_v) then

Activation(R_v)= P(R_v)

PriorityQueue.Add(R_v)

end if

end for

while PriorityQueue.Count >0 do

Sort PriorityQueue; //activation values in descending order

Select the first item MR_u in PriorityQueue // R_v with highest P value

Remove R_v in PriorityQueue

for each R_v do

LinkedOERs=GetLinkedOERs(R_v)//get linked nodes of R_v

for each R_w in LinkedOERs do //propagate activation to its neighbors

Activation(R_w)+=Activation(R_v)*Weight(R_v, R_w )

PriorityQueue.Add(R_v)

Sort PriorityQueue

end for

end while

//Step2: Learner Profile Normalization

for each L_j ∈ L do

P(R_v)= P(R_v)+ Activation(R_v)

k = 1 / \sqrt{\sum_{t = 1}^{v} {(R_{v})}^{2}}

//normalization factor

P(R_v) = P(R_v)*k //normalization

P_v,j=P(R_v)

end for

end

The normalization factor acts on preventing the propagated preferences from escalating continuously to such an extent that they exceed a reasonable range, which could result in difficulty of data processing in the forthcoming process. The confidence degree for the propagated preference of OER is recorded as CD(P_v_,j).

5.3. Instant Time Availability

The system is able to obtain explicit information on how long the learner can (or would like to) spend on a micro OER through mobile devices in real time. As a mandatory request, a learner is required to input his instant time availability at the beginning of every micro learning activity.

According to the system settings, the instant time availability, TA_j, is suggested to be represented by an integer from 1 to 15. However, if the learner is not very sure how long he is able to spend on the micro OER at once, he is free to leave a time span, which can be continuous integers in the same range.

5.4. Learner Feature Prediction

5.4.1. Demographic Classification

In our prior work [21], we have discussed the key issues that might cause distraction in micro open learning, which generally comes from two sides, the social side and the environmental side.

In addition, MLaaS investigates existing learners’ degrees of distraction as reference, and senses every learner’s location information through built-in functions in mobile devices. Based on the given taxonomy and augmented ontology, we carry out a demographic classification that aims to cluster learners into cohorts, in order to match them with micro-pieces of OERs.

The mechanism of classification is implemented because learners who have similar static information—such as employment and/or education background, occupation—and a similar learning environment/location, are more likely to face similar levels of distraction. For the same reason, their overall time availabilities are more likely to fall in the same range. Herein MLaaS tries to associate a learner with a pre-clustered learner group by applying the stereotyping technique to fulfill the requirements of demographic classification.

For a newly joined learner, L_j, an ensemble method of a binary classifier and a one-against-all model is utilized to obtain multi-class classifications [36,37] in order to predict its category, C_j. The system is trained with an existing set of learners, L. Typical binary classification techniques, e.g., C4.5 decision tree [38,39] or Naive Bayes classifier [40], can be employed to serve as the base algorithm (i.e., training algorithm F in Algorithm 2) in order to produce a suitable classifier, CF_k. A new learner L_j is classified with the label k, whose CF_k produces the highest value of

\hat{y}

.

Algorithm 2 New Learner Classification (One-against-All)

Input: Sample (Current learner set L), Lables y ( where y_i is the label for a sample learner L_i and y_i ∈ {1,2,…K} ), Training Algorithm F, a new learner L_j

Output: Category of the new learner L_j, C_j

begin:

for each k in {1,2,…K} do

set a new label vector z_i for y_i,

if (y_i=k) then

z_i=1

else

z_i=0

end if

C_k=F(L,z_i) //use binary classification technique to produce classifiers

end for

for L_j ∈ Ldo

\hat{y} = \arg \max_{k \in {1, 2 . . . . k}} C F_{k} (L_{j})

end for

output C_j =k //category of the new learner L_j

end

Given L_j is categorized into C_k, afterwards, the learner’s neighborhood, NB_j, is calculated by Algorithm 3. This aims to match a new learner’s category with an existing learner’s category.

Algorithm 3 Neighbourhood Calculation

Input: new learner set N and Existing learner set L

Output: set of neighbours, NB_j, of a new learnerL_j

begin: Build a binary classifier

Execute the one-against-all model//as in Algorithm 2

Build the ensemble method of multiclass classifier//categorize new learners

for each L_j in N do

NB_j=null

Predict C_j// L_j’s category

for each L_i in l do

Retrieve C_i

if C_j== C_i then

NB_j.add(L_i)

end if

end for

end

Hence, the demographic classification is realized according to learners’ static and location information. Once new learners join the open learning scenario, MLaaS responds immediately by classifying them into clusters.

5.4.2. Similarity Measure between Two Learners

MLaaS is responsible for finding similar existing learners in the discovered demographic categories, so as to recommend micro OERs to them that have been recognized as suitable to learn in a given time span, situation and environment.

Learners’ learning location information is sensed from the location service embedded in the mobile devices. Thus, the similarity between two learners, L_i and L_j, is evaluated using Equation (1).

s i m (i, j) = [(\sum_{l = 1}^{m} S_{l} W_{l})^{2} + (S L o_{i, j} W_{i, j})^{2}]^{1 / 2}

(1)

where S_l is the similarity value of the lth attribute in the static part of learner profile, and W_l is its corresponding weight. SLo_i_,j denotes their similarity of location, and W_i_.j denotes the weight of the location factor.

5.4.3. Distraction Prediction

Thus, in terms of Equation (2), the distraction value can be estimated in accordance with the action that any member in a same cluster indicates the predicted distraction level.

D_{j, L o_{a}} = \frac{1}{2} * (\frac{\sum s i m (i . j) • d_{i, L o_{a}}}{\sum s i m (i, j)} + d_{j, L o_{a}})

(2)

where d_j,_Loa is the self-identified degree of distraction the learner L_j felt in the location Lo_a, acquired by mandatory request. This follows the expectation that learners who have similar general situations (i.e., social factors) and surroundings (i.e., environmental factors) have a high probability of having similar degrees of distraction.

The confidence degree for the predicted distraction is depicted as CD(D_i_,Loa).

5.5. Integration of Recommendation Results

5.5.1. Downwards Propagation

In the Section 5.1 we have merely obtained the preference of a learner on an ‘entire’ OER rather than on a micro OER, now the preference values are again propagated downwards through the ontology hierarchy. Consequently, each micro OER node receives an estimated preference value from its ancestor. This propagation process is executed with a decay factor. For each micro OER the final preference value, P_u_,j, can be calculated use the following Equation (3).

P_{u, j} = \frac{\sum P_{R, j} C D (P_{R, j}) + \sum P_{v, j} C D (P_{v, j}) Q (u, v)}{\sum C D (P_{R, j}) + \sum C D (P_{v, j}) Q (u, v)}

(3)

where R is the set of all the nodes in the higher hierarchy than MR_u, R_v is a direct ancestor of node MR_u and Q(u,v) depends on the count of level between MR_u and R_v.

Q (u, v) = L (| u v |) = {\begin{cases} L (0) = 1 \\ L (l) = (1 - λ) L (l - 1) \end{cases}

(4)

and the confidence degree for the descendant node, in regards to P_u_,j, is calculated as the average of the confidence values of its ancestors, decreased by a decay factor, μ.

C D (P_{u, j}) = \frac{\sum C D (R)}{| R |} - μ

(5)

Having settled all values for the three attributes denoting preferences, instant time availability and degree of distraction in the set ML, a complete learner profile is constructed from the sparse initial information by the MLaaS.

5.5.2. Micro OER Sorting Rules

For each micro OER, once MLaaS has acquired its final preference value and confidence degree, those nodes which do not meet the minimum requirement of confidence degree are rejected by the system.

A list of recommended micro OERs is generated, in which the ones with higher learner interest are placed at the top. For two micro OERs MR_u and MR_w, their sequence is determined according to some heuristic rules which are defined in accordance with the extraction of three kinds of relations discussed in Section 4.1.1. These rules are executed sequentially with priority.

If there is a RequiredSequence relation between these two micro OERs, the prerequisite one is placed above (refer to Section 4.1.1).
If in the preference regarding these two OERs, P_u_,j, P_w_,_j, the former is higher than the latter one, then MR_u is above MR_w.
If, in absolute terms, the confidence degree CD(P_u_,j) is high and the CD(P_w_,j) is low, then MR_u is above MR_w.
If there is a RecommendedSequence relation between these two micro OERs, the one which is suggested to be accessed first is placed above (refer to Section 4.1.1).
The micro OER which is more related to the learner’s education background, or falls in relevant disciplines or inter-disciplines, is placed with priority if the disciplinary difference between these two candidate micro OERs is obvious.
Otherwise, the recommended micro OER list is randomly ordered if none of the above rules applies.

Herein, the first rule is deemed to be a hard rule which should be strictly obeyed, and the rest rules are soft rules that can be violated on a case-by-case basis in consideration of educational factors.

5.5.3. Recommendation Results Optimization

MLaaS consumes the value P and D in conjunction with their TA to compare with the attributions and requirements annotated in the metadata of the augmented OER ontology.

The next step is to integrate the outcomes from Section 5.4. A fitness function will convert these selected multidimensional arrays into one variable. Hence, this problem is hereby properly transferred to a multi-objective optimization problem.

To initiate the constrained multi-objective optimization, candidate learning path solutions (chromosomes) are randomly generated where each of them is a learning path with a series of micro OERs. For a chromosome, its violation degree is investigated by examining the relations between each contiguously prior/posterior micro OER pair against the first 5 rules listed in Section 5.4.2, and then summing up. For such a pair in a chromosome, its violation degree, VD(MR^t, MR^t⁺¹), is calculated by the weighted sum of its violations of rule 2 to rule 5, where MR^t is the tth micro OER in k and MR^t⁺¹ is the (t + 1)th. The higher the violation degree is, the more serious the candidate learning path violates the rules. The violation degree of a candidate learning path, k, is calculated using the following Equation (6):

V D_{k} = \sum V D (M R^{t}, M R^{t + 1})

(6)

Thereafter, let the variable RA_u denote the degree of required attention of a given micro learning resource, MR_u, whose real-time suitability for micro learning, RT_u_,j, is calculated by comparing with the learner, L_j’s predicted distraction, using the following Equation (7):

R T_{u, j} = {{(R A_{u})}^{2} + {[C D (D_{j, L o_{a}}) * D_{j, L o_{a}}]}^{2}}^{1 / 2}

(7)

Hence, for the candidate learning path, k, RT_k_,j denotes the sum of the real-time suitability of micro OERs it contains. Similarly, P_k_,j sums up all the predicted preferences from the learner L_j versus micro OERs that k contains.

η = \min (α V D_{k} + β R T_{k, j} + γ / P_{k, j}^{1} + δ / P_{k, j})

(8)

where α, β, γ and δ serve as weight for each variable and suggestively α > β > γ > δ,

P_{k, j}^{1}

denotes the L_k’s preference value of the first micro OER in the candidate learning path k.

Algorithm 4 indicates typical steps for making the first recommendation

Algorithm 4 Micro OER Recommendation in a Cold Start Condition

Input: P_u_,j (the Learner L_k’s predicted reference to the micro OER MR_u), D_j_,Loa (predicted distraction level), CD(P_u_,j) and CD(D_i_,Loa) (their confidence degree), RA_u (the degree of required attention of MR_u), TA_j (the instant time availability), rules (1st–6th)

Output: the tag of a micro OER which acts as the first delivery

begin: Randomly generate candidate learning paths as chromsomes

for each chromosome k do

Select micro OER it contains

for each MR_u in a chromosome k,

Caculate its P_u_,j and CD(P_u_,j).

Import D_j_,Loa , CD(D_i_,Loa) and RA_u

Caculate its RT_u_,j

end for

Calculate k’s VD_k

Use Equation (8) to evaluate its fitness η

end for

while iteration times < max iteration time do

apply heuristic approach to generate new candidate solutions

for each new chromosome k’ do

check time length of the first micro OER in k’,

{T L}_{k^{'}}^{1}

if

{T L}_{k^{'}}^{1}

is in the range of TA_j

keep k’

otherwise

reject k’

end if

evaluate the fitness of k’, η, using Equation (8)

end for

replace chromosomes with higher η values

end while

output the selected chromosome k’’with minimum η and satisfied

{T L}_{k^{″}}^{1}

select the first micro OER in k’’as the first delivery

end

By this means, the heuristic Algorithm 4 infers a suitable micro OER as the first attempt of learning resource recommendation in the novel open learning experience through MLaaS.

Along with the successful launch of a solution to the well-known cold start problem in micro learning, learners’ upcoming behaviors will be continuously acquired by MLaaS to feed the reasoning engine.

6. Conclusions

In this paper, we introduced a study aimed at dealing with the adaptive micro OER delivery. We proposed a tailored system for this MLaaS. Aside from its technical details and working principles, we primarily focused on the construction of the knowledge base. It was built using a top-down approach, by having ontologies at the pattern level first. Using this, a strategy on processing data was then developed at the lower level. This supported the decision-making process of the micro OER recommendation system. However, because both the system and user are new, learners’ information deficit in MLaaS delayed the commencement of adaptive micro open learning and MLaaS operation. A detailed approach was therefore provided to deal with this so called ‘cold start problem’ based on predicting learners’ features from the sparse initial information.

Our future work will extend the EDM/LA work, and prototyping this ontological approach for the cold start problem and further developing its corresponding component in MLaaS. This will be further evaluated by measuring the prediction accuracy. We will also engage real learners to compare the quality of recommendations. Apart from the ‘new user’ cold start problem discussed in this paper, we will look for solutions to deal with ‘new items’, i.e., micro OERs, by using a queue-jumping method to insert them into established learning paths.

Acknowledgments

This research has been conducted with the support of the Australian Government Research Training Program Scholarship.

Author Contributions

Geng Sun, Tingru Cui and Jun Shen conceived and designed this research; Ghassan Beydoun and Dongming Xu contributed on the ontology construction; Shiping Chen and Fang Dong provided contributions to the cloud architecture and approach for cold start problem; Geng Sun and Tingru Cui wrote this paper; Geng Sun and Jun Shen revised this paper critically; all authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Hilton, J. Open Educational Resources and College Textbook Choices: A Review of Research on Efficacy and Perceptions. Educ. Technol. Res. Dev. 2016, 64, 573–590. [Google Scholar] [CrossRef]
Baggaley, J. MOOC Rampant. Distance Educ. 2013, 34, 368–378. [Google Scholar] [CrossRef]
Hylen, J.; Damme, D.V.; Mulder, F.; D’Antoni, S. Open Educational Resources: Analysis of Responses to the OECD Country Questionnaire; OECD Education Working Papers; OECD: Paris, France, 2012. [Google Scholar]
Nawrot, I.; Doucet, A. Building engagement for MOOC students: Introducing support for time management on online learning platforms. In Proceedings of the WWW’14 Companion, Seoul, Korea, 7–11 April 2014. [Google Scholar]
Warrd, I.; Koutropoulos, A.; Keskin, N.O.; Abajian, S.C.; Hogue, R.; Rodriguez, C.O.; Gallagher, M.S. Exploring the MOOC format as a pedagogical approach for m-learning. In Proceedings of the 10th World Conference on Mobile and Contextual Learning, Beijing, China, 19–21 October 2011. [Google Scholar]
Sun, G.; Cui, T.; Chen, S.; Guo, W.; Shen, J. MLaaS: A cloud based system for mobile micro learning in MOOC. In Proceedings of the 4th IEEE International Conference on Mobile Service (MS), New York, NY, USA, 27 June–2 July 2015; pp. 120–127. [Google Scholar]
Sun, G.; Cui, T.; Li, K.; Xu, D.; Chen, S.; Shen, J.; Guo, W. Towards bringing adaptive micro learning into MOOC courses. In Proceedings of the 15th IEEE International Conference on Advanced Learning Technologies (ICALT), Hualien, Taiwan, 6–9 July 2015. [Google Scholar]
Dennen, V.P.; Hao, S. Intentionally Mobile Pedagogy: The M-COPE Framework for Mobile Learning in Higher Education. Technol. Pedagog. Educ. 2014, 23, 397–419. [Google Scholar] [CrossRef]
Trifonova, A.; Ronchetti, M. Mobile learning: Is anytime + anywhere = always online? In Proceedings of the 6th IEEE International Conference on Advanced Learning Technologies, Kerkrade, The Netherlands, 5–7 July 2006. [Google Scholar]
Attewell, J. A technology update and m-learning project summary. In Mobile Technology and Learning; Learning and Skills Development Agency: London, UK, 2005. [Google Scholar]
Wu, H.; Hamdi, L.; Mahe, N. Tango: A flexible mobility-enabled architecture for online and offline mobile enterprise applications. In Proceedings of the 11th International Conference on Mobile Data Management, Kansas City, MO, USA, 23–26 May 2010. [Google Scholar]
Lee, M.J.W.; McLoughlin, C. Teaching and Learning in the Web 2.0 Era: Empowering Students through Learner-Generated Content. Int. J. Instr. Technol. Distance Learn. 2007, 4, 21–24. [Google Scholar]
Kovachev, D.; Cao, Y.; Klamma, R.; Jarke, M. Learn-as-you-go, new ways of cloud based micro-learning for the mobile web. In Proceedings of the 10th International Conference on Web-based Learning, Hongkong, China, 8–10 December 2011. [Google Scholar]
Bruck, P.A.; Motiwalla, L.; Foerster, F. Mobile learning with micro-content: A framework and evaluation. In Proceedings of the 25th Bled eConference, Bled, Slovenia, 18–21 June 2012; pp. 527–542. [Google Scholar]
Hug, T.; Lindner, M. ML: Emerging concepts, practices and technologies after e-Learning. In Proceedings of the Micro Learning, Innsbruck, Austria, 23–24 June 2005. [Google Scholar]
Souza, M.I.; Amaral, S.F.D. Educational Micro content for Mobile Learning Virtual Environments. Creative Educ. 2014, 5, 672–681. [Google Scholar] [CrossRef]
Sun, Z.; Shen, J. A high performance peer to cloud and peer model augmented with hierarchical secure communications. J. Syst. Softw. 2013, 86, 1790–1796. [Google Scholar] [CrossRef]
Shen, J.; Beydoun, G.; Yuan, S.; Low, G. Comparison of bio-inspired algorithms for peer selection in services composition. In Proceedings of the IEEE International Conference on Services Computing, Washington, DC, USA, 4–9 July 2011; pp. 250–257. [Google Scholar]
Sun, G.; Cui, T.; Guo, W.; Beydoun, G.; Xu, D.; Shen, J. Micro learning adaptation in MOOC: A software as a service and a personalized learner model. In Proceedings of the 14th International Conference on Web-based Learning (ICWL), Guangzhou, China, 5–8 November 2015; pp. 174–184. [Google Scholar]
Wang, L.; Shen, J. Multi-phase ant colony system for multi-party data-intensive service provision. IEEE Trans. Serv. Comput. 2016, 9, 264–276. [Google Scholar] [CrossRef]
Sun, G.; Cui, T.; Yong, J.; Shen, J.; Chen, S. MLaaS: A Cloud-based System for Delivering Adaptive Micro Learning in Mobile MOOC Learning. IEEE Trans. Serv. Comput. 2015. [Google Scholar] [CrossRef]
Starr, R.R.; de Oliveira, J.M.P. Concept Maps as the First Step in an Ontology Construction Method. Inf. Syst. 2013, 38, 771–783. [Google Scholar] [CrossRef]
Xu, D.; Wijesooriya, C.; Wang, Y.-G.; Beydoun, G. Outbound logistics exception monitoring: A multi-perspective ontologies approach with intelligent agents. Expert Syst. Appl. 2011, 38, 13604–13611. [Google Scholar] [CrossRef]
Manyika, J.; Chui, M.; Brown, B.; Bughin, J.; Dobbs, R.; Roxburgh, C.; Byers, A.H. Big Data: The Next Frontier for Innovation, Competition, and Productivity. 2011. Available online: http://abesit.in/wp-content/uploads/2014/07/big-data-frontier.pdf (accessed on 26 May 2017).
Romero, C.; Ventura, S. Educational Data Mining: A Review of the State-of-the-Art. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 2010, 40, 601–618. [Google Scholar] [CrossRef]
Sieg, A.; Mobasher, B.; Burke, R. Ontology-based collaborative recommendation. In Proceedings of the 8th Workshop on Intelligent Techniques for Web Personalization and Recommender Systems, Big Island, HI, USA, 20 June 2010; pp. 20–32. [Google Scholar]
Miranda, S.; Mangione, G.R.; Orciuoli, F.; Gaeta, M.; Loia, V. Automatic generation of assessment objects and remedial works for MOOCs. In Proceedings of the 12th International Conference on Information Technology Based Higher Education and Training, Antalya, Turkey, 10–12 Octorber 2013. [Google Scholar]
Lika, B.; Kolomvatsos, K.; Hadjiefthymiades, S. Facing the Cold Start Problem in Recommender Systems. Expert Syst. Appl. 2014, 41, 2065–2073. [Google Scholar] [CrossRef]
Sun, G.; Cui, T.; Beydoun, G.; Chen, S.; Shen, J. Profiling and supporting adaptive micro learning on open education resources. In Proceedings of the 4th International Conference on Advanced Cloud and Big Data (CBD), Chengdu, China, 13–14 August 2016; pp. 158–163. [Google Scholar]
Cambria, E.; Olsher, D.; Rajagopal, D. SenticNet3: A common and common-sense knowledge base for cognition-driven sentiment analysis. In Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI), Quebec, QC, Canada, 27–31 July 2014; pp. 1515–1521. [Google Scholar]
Dou, D.; Wang, H.; Liu, H. Semantic data mining: A survey of ontology-based approaches. In Proceedings of the 9th IEEE International Conference on Semantic Computing (ICSC), Anaheim, CA, USA, 7–9 February 2015; pp. 244–251. [Google Scholar]
Miranda, S.; Albano, G. Personalized Learning in Mathematics. J. e-Learn. Knowl. Soc. 2015, 11, 25–42. [Google Scholar]
Moreno, A.; Valls, A.; Isern, D.; Marin, L.; Borras, J. SigTur/E-Destination: Ontology-Based Personalized Recommendation of Tourism and Leisure Activities. Eng. Appl. Artif. Intell. 2013, 26, 633–651. [Google Scholar] [CrossRef]
Capuano, N.; Dell’Angelo, L.; Orciuoli, F. Ontology extraction from existing education al content to improve personalized e-Learning experiences. In Proceedings of the 3rd IEEE International Conference on Semantic Computing, Berkeley, CA, USA, 14–16 September 2009; pp. 577–582. [Google Scholar]
Markellou, P.; Mousourouli, I.; Spiros, S.; Tsakalidis, A. Using semantic web mining technologies for personalized e-learning experiences. Proceedings of The Web-based Education, Grindelwald, Switzerland, 21–23 February 2005; pp. 461–826. [Google Scholar]
Milgram, J.; Cheriet, M.; Sabourin, M. One against one or one against all: Which one is better for handwriting recognition with SVMs. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition, La Baule, France, 23–26 October 2006. [Google Scholar]
Galar, M.; Fernandez, A.; Barrenechea, E.; Bustince, H.; Herrera, F. An Overview of Ensemble Methods for Binary Classifiers in Multi-class Problems: Experimental Study on One-vs.-One and One-vs.-All Schemes. Pattern Recognit. 2011, 44, 1761–1766. [Google Scholar] [CrossRef]
Kotsiantis, S.B. Supervised Machine Learning: A Review of Classification Techniques. Informatica 2007, 31, 249–268. [Google Scholar]
Polat, K.; Gunes, S. A Novel Hybrid Intelligent Method Based on C4.5 Decision Tree Classifier and One-against-All Approach for Multi-class Classification Problems. Expert Syst. Appl. 2009, 36, 1587–1592. [Google Scholar] [CrossRef]
Dalton, L.A.; Dougherty, E.R. Optimal Classifiers with Minimum Expected Error within a Bayesian Framework-Part II: Properties and Performance Analysis. Pattern Recognit. 2013, 46, 1288–1300. [Google Scholar] [CrossRef]

Figure 1. Topology of the Architecture of MLaaS.

Figure 2. Conceptual Graph of Micro OER Ontology.

Figure 3. Concept Graph of Benchmark Ontology for a Learner Profile in Micro Open Learning.

Figure 4. Partial View of the Augmented Micro OER Ontology and Spreading Activation for a Learner’s Preference on OER.

Table 1. EDM/LA Data Sources from University Warehouse.

Data Type	Purpose
Learners’ exact time of logon/out for each time	To know how long they stay online each time
The IP address or gateway information of their internet connection	To know their exact learning location and surroundings
Mobile device information, mobile operator information and mobile OSs	To know their general situation
Their personal enrollment information (full time or part time, nationality)	To know their learning time availability, organization and language skills
Their residential information (session address and permanent address)	To understand their distance to campus and the potential modes of transportation they adopt)
Subjects they have chosen (current)	To know their academic background and field
Subjects they have chosen (historical)	To know their academic background and field
Historical grades	To know their academic background and infer level of pre-knowledge
Course materials they have accessed (material type, topic, length, requirement associated with them)	To know their learning habits (how they prefer learning resources to be passed on)
Course requirement/milestones set in LMS (by instructor)	To know the suggested learning schedule
Their detailed learning activities (What they do when staying online and how long they spend on each specific learning activity, type of resource they access for each specific time)	To know their learning habits, learning engagements, learning speed and so on.
Their interactions with LMS and learner-generated content (from forum and thread, etc.)	To know their preferences, interests and to measure their engagement.
Frequencies of their participation in interactive learning activities (e.g., forum, thread)	To know their engagement
Extent of completeness for each learning activity	To know whether they finished an entire step of learning or drop off halfway
The learning paths they have gone through (the sequence of their access of learning resources over LMS)	To further establish optimal learning paths
Their learning achievement (grades and final marks if possible)	To know how their learning behaviors affect their learning outcomes
Groups or teams they have participated in	To know their collaborative learning performance and similarities/changes of learning time frame among learners

Table 2. EDM/LA Scheme for Open Learning Data.

Technique	Object	Purpose
Prediction	Well-defined micro OERs	To establish a recommendation model for students in similar situations in the future
Structure Discovery	Well-defined micro OERs	For web documents using clustering methods in order to personalize e-learning based on maximal frequent item sets
Latent Knowledge Estimation	Non-micro OERs	To discover which stages of them are generally finished within relatively larger time length
Structure Discovery	Non-micro OERs	To determine time spans where the pauses made by learners usually fall in
Factor Analysis	Non-micro OERs	To find out the actual reasons why learners spent more time on these stages and made such pauses
Latent Knowledge Estimation	Non-micro OERs	To measure potential suitability of micro learning (from learners’ frequencies of using fragmented time pieces)
Factor Analysis	Non-micro OERs	To identify resources’ suitability for micro learning, for example, whether hands-on practice is needed, or whether the OER delivery is necessarily associated with lots of writing or computation work which is inconvenient to complete on mobile devices
Prediction	Subscription OERs	To determine when to push information to learners in the best timing and remind them
Clustering	All micro OERs	To determine their correlations for better repository purpose
Relationship Mining	Time Availability	To discover the correlation between their overall time availability and learners’ types
Clustering/Prediction	Time Availability	To involve similar learners into cohorts and build a potential time frame for their overall learning schedule
Latent Knowledge Estimation	Learning habit (learning time distribution)	To discover whether there are regular patterns of time organization within time frame among learners in or across cohorts
Latent Knowledge Estimation	Learners’ latest learning contents and activities	To retrieve and profile learners’ learning recentness
Categorization	Learning habits	To set up a unique learning habit summary for each learner
Relationship Mining	Learners’ learning location data	To know the degree of distraction and how it interrelates to disruption from external environment
Relationship Mining	Learners’ mobile app usage	To know the degree of distraction and how it interrelates to disruption from the content on mobile internet
Social Network Analysis	OERs in affiliated social networks	To distinguish information that can be useless, harmful and may cause time wasted for learners.
Social Network Analysis	Other content in affiliated social networks	To screen well-recognized information in order to recommend to learners as their learning augmentation besides the OERs (text mining technique employed)

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sun, G.; Cui, T.; Beydoun, G.; Chen, S.; Dong, F.; Xu, D.; Shen, J. Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem. Sustainability 2017, 9, 898. https://doi.org/10.3390/su9060898

AMA Style

Sun G, Cui T, Beydoun G, Chen S, Dong F, Xu D, Shen J. Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem. Sustainability. 2017; 9(6):898. https://doi.org/10.3390/su9060898

Chicago/Turabian Style

Sun, Geng, Tingru Cui, Ghassan Beydoun, Shiping Chen, Fang Dong, Dongming Xu, and Jun Shen. 2017. "Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem" Sustainability 9, no. 6: 898. https://doi.org/10.3390/su9060898

APA Style

Sun, G., Cui, T., Beydoun, G., Chen, S., Dong, F., Xu, D., & Shen, J. (2017). Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem. Sustainability, 9(6), 898. https://doi.org/10.3390/su9060898

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Towards Massive Data and Sparse Data in Adaptive Micro Open Educational Resource Recommendation: A Study on Semantic Knowledge Base Construction and Cold Start Problem

Abstract

1. Introduction

2. Background

2.1. Nature of OERs and Open Learning Delivery in Mobile Environments

2.2. Micro Learning

3. Research Challenges and Design

3.1. Research Design

3.1.1. System Framework and Previous Work

3.1.2. Research Problem Identification and Design

3.2. Research Challenges

3.2.1. EDM/LA for Micro Learning

3.2.2. Cold Start Problem in Micro OER Delivery

3.3. Contribution

4. Conceptual Graph-Based Ontology Construction for Micro Open Learning and Proposed Data Processing Strategy

4.1. Conceptual Graph-Based Ontology Construction

4.1.1. Augmented Micro OER Ontology

4.1.2. Augmented Micro Learning Learner Profile Taxonomy

4.2. EDM and LA Strategy

5. Ontological Approach for the Cold Start Problem

5.1. Representation of Learner Profile

5.2. Preference Propagation

5.3. Instant Time Availability

5.4. Learner Feature Prediction

5.4.1. Demographic Classification

5.4.2. Similarity Measure between Two Learners

5.4.3. Distraction Prediction

5.5. Integration of Recommendation Results

5.5.1. Downwards Propagation

5.5.2. Micro OER Sorting Rules

5.5.3. Recommendation Results Optimization

6. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI