Next Article in Journal
Embedded Exponentially-Fitted Explicit Runge-Kutta-Nyström Methods for Solving Periodic Problems
Previous Article in Journal
Brain Tissue Evaluation Based on Skeleton Shape and Similarity Analysis between Hemispheres
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents

by
Jose Aguilar
1,2,*,†,
Camilo Salazar
2,†,
Henry Velasco
3,†,
Julian Monsalve-Pulido
2,† and
Edwin Montoya
2,†
1
Escuela de Sistemas, Facultad de Ingeniería, Universidad de los Andes, Mérida 5101, Venezuela
2
GIDITIC, Universidad EAFIT, Carrera 49 No. 7 Sur 50, Medellin 050001, Colombia
3
LANTIA SAS, Medellin 050001, Colombia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Computation 2020, 8(2), 30; https://doi.org/10.3390/computation8020030
Submission received: 17 February 2020 / Revised: 23 March 2020 / Accepted: 23 March 2020 / Published: 15 April 2020
(This article belongs to the Section Computational Engineering)

Abstract

This paper analyses the capabilities of different techniques to build a semantic representation of educational digital resources. Educational digital resources are modeled using the Learning Object Metadata (LOM) standard, and these semantic representations can be obtained from different LOM fields, like the title, description, among others, in order to extract the features/characteristics from the digital resources. The feature extraction methods used in this paper are the Best Matching 25 (BM25), the Latent Semantic Analysis (LSA), Doc2Vec, and the Latent Dirichlet allocation (LDA). The utilization of the features/descriptors generated by them are tested in three types of educational digital resources (scientific publications, learning objects, patents), a paraphrase corpus and two use cases: in an information retrieval context and in an educational recommendation system. For this analysis are used unsupervised metrics to determine the feature quality proposed by each one, which are two similarity functions and the entropy. In addition, the paper presents tests of the techniques for the classification of paraphrases. The experiments show that according to the type of content and metric, the performance of the feature extraction methods is very different; in some cases are better than the others, and in other cases is the inverse.
Keywords: feature extraction; content analysis; educational contents; semantic representation; information retrieval; recommendation system feature extraction; content analysis; educational contents; semantic representation; information retrieval; recommendation system

Share and Cite

MDPI and ACS Style

Aguilar, J.; Salazar, C.; Velasco, H.; Monsalve-Pulido, J.; Montoya, E. Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents. Computation 2020, 8, 30. https://doi.org/10.3390/computation8020030

AMA Style

Aguilar J, Salazar C, Velasco H, Monsalve-Pulido J, Montoya E. Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents. Computation. 2020; 8(2):30. https://doi.org/10.3390/computation8020030

Chicago/Turabian Style

Aguilar, Jose, Camilo Salazar, Henry Velasco, Julian Monsalve-Pulido, and Edwin Montoya. 2020. "Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents" Computation 8, no. 2: 30. https://doi.org/10.3390/computation8020030

APA Style

Aguilar, J., Salazar, C., Velasco, H., Monsalve-Pulido, J., & Montoya, E. (2020). Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents. Computation, 8(2), 30. https://doi.org/10.3390/computation8020030

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop