Jointly Modeling Aspect Information and Ratings for Review Rating Prediction

Peng, Qingxi; You, Lan; Feng, Hao; Du, Wei; Zheng, Kesong; Zhu, Fuxi; Xu, Xiaoya

doi:10.3390/electronics11213532

Open AccessArticle

Jointly Modeling Aspect Information and Ratings for Review Rating Prediction

by

Qingxi Peng

¹

,

Lan You

^2,*,

Hao Feng

¹,

Wei Du

¹,

Kesong Zheng

¹,

Fuxi Zhu

¹ and

Xiaoya Xu

²

¹

School of Information Engineering, Wuhan College, Wuhan 430212, China

²

Faculty of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China

^*

Author to whom correspondence should be addressed.

Electronics 2022, 11(21), 3532; https://doi.org/10.3390/electronics11213532

Submission received: 2 September 2022 / Revised: 13 October 2022 / Accepted: 15 October 2022 / Published: 29 October 2022

(This article belongs to the Special Issue Intelligent Data Analysis in Cyberspace)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Although matrix model-based approaches to collaborative filtering (CF), such as latent factor models, achieve good accuracy in review rating prediction, they still face data sparsity problems. Many recent studies have exploited review text information to improve the performance of predictions. The review content that they use, however, is usually on the coarse-grained text level or sentence level. In this paper, we propose a joint model that incorporates review text information with matrix factorization for review rating prediction. First, we adopt an aspect extraction method and propose a simple and practical algorithm to represent the review by aspects and sentiments. Then, we propose two similarity measures: aspect-based user similarity and aspect-based product similarity. Finally, aspect-based user and product similarity measures are incorporated into a matrix factorization to build a joint model for rating prediction. To this end, our model can alleviate the data sparsity problem and obtain interpretability for the recommendation. We conducted experiments on two datasets. The experimental results demonstrate the effectiveness of the proposed model.

Keywords:

rating prediction; matrix factorization; product review; aspect analysis

1. Introduction

E-commerce websites enable people to rate products and services with 1 to 5 stars after purchasing goods. These ratings are important to both merchants and customers. Merchants can use ratings to improve their production and sales strategies, while potential customers can use them to make better decisions. However, the volume of reviews is growing so rapidly that it is becoming increasingly difficult for users to browse reviews to find relevant information. Therefore, review rating predictions have become an extensively investigated issue in both academia and industry. The predictions enable researchers to estimate how satisfied a user will be with a product, without some extra text.

Most of the previous solutions consider the rating prediction as a recommendation system. The concept of context-aware recommendation technology proposed that context information can be introduced into the recommendation system, thereby improving the accuracy of the recommendation [1]. The use of contextual information to improve recommendations has experienced an upsurge in interest in the recommendation systems community. On E-commerce websites, users tend to write a review when they vote on products or services. Based on the above ideas, many works exploit various features from the review text, such as words, patterns, syntactic structure, and semantic topics, to improve the performance of rating prediction [2,3,4,5,6,7,8]. The above studies usually exploit text-level or sentence-level information in rating prediction. Although review analysis at the document level and sentence level is useful, it is still coarse-grained. It is worth noting that reviewers usually describe aspects of products or services to express their sentiments and convince other people. To obtain a more fine-grained review analysis, we need to delve into the aspect level. Aspect-level review text analysis is considered to be a fine-grained analysis in a large number of works [8,9,10,11,12,13,14,15,16]. Our focus of this paper is on how to use aspect-based information in reviews to improve the accuracy of rating prediction. We first present a simple but effective rule-based algorithm to extract the aspect and corresponding sentiment from reviews. Then, we compute the aspect-based user and product similarity from the review text. Finally, we integrate the similarity into matrix factorization to obtain a joint model, thereby improving the accuracy of rating prediction.

In this paper, we propose a novel joint model, which incorporates the aspect-based product similarity and the aspect-based user similarity into matrix factorization. We first present a simple and effective aspect and corresponding sentiment extraction algorithm and then apply it to represent the review. Then, aspect-based products, as well as user similarity, are proposed. Finally, we propose a joint model, which incorporates the similarity measure into matrix factorization. Rather than performing context pre-filtering or post-filtering on the context recommendation [1], we model the aspect-based information in a single learning stage, which enables us to explore the implicit information of users and products simultaneously.

The main contributions of this study are summarized below:

We present simple and powerful aspects and corresponding sentiment extraction algorithms and apply them to represent the review text.
Two aspect-based similarity measures according to users and products are proposed.
We propose a joint model, which incorporates aspect-based information into matrix factorization for review rating prediction.

The remainder of the paper is organized as follows: Section 2 discusses related work on review rating prediction and matrix factorization techniques. In Section 3, an aspect and corresponding sentiment algorithm is proposed, which combines a bootstrap algorithm and sentiment lexicon. Reviews are represented by aspects and sentiment polarity. Then, aspect-based product and user similarity are proposed. Moreover, we propose a model to incorporate the above similarity measure into the matrix factorization algorithm for review rating prediction. Section 4 presents the empirical experiments used to evaluate the proposed model. Finally, the conclusions of our study are given in Section 5.

2. Related Work

2.1. Review Rating Prediction

Early researchers generally adopted classification or regression methods for rating prediction. In their study [17], Pang and Lee regarded rating prediction as a multi-classification problem and used classification and regression methods to find a solution. Goldberg and Zhu presented a graph-based semi-supervised learning algorithm to address the problem [18]. Lu et al. proposed an approach to predict the rating according to the strength of adverbs and adjectives in the review text [19]. Qu et al. introduced a bag-of-opinion to represent the review and adopted a constrained ridge regression algorithm to handle the rating prediction [20].

Many recent researchers have regarded rating prediction as a recommendation problem and exploited review text to help improve prediction performance. Wang et al. proposed a probabilistic rating regression model [8]. They also proposed a unified generative model for prediction, which did not require pre-specified aspect keywords [21]. In another study, Li et al. [22] modeled user, product, and text features as a three-dimension tensor to improve the performance of rating prediction. McAuley and Leskovec proposed a probabilistic model that combines latent rating dimensions with latent review topics for rating prediction [5]. Gao et al. modeled the rating as the similarity between user and product. They combined the topic modeling and regression model to predict the rating [23]. Tan and colleagues [24] exploited the topic-based user preference similarity in a traditional collaborative filtering algorithm to solve the data sparsity problem. Lei et al. proposed a matrix factorization method that incorporated three factors—user sentiment similarity, interpersonal sentimental influence, and product reputation similarity [7]. Yu et al. proposed a model combining the latent factor model and the latent Dirichlet Allocation [3]. By combining user sentiments in the review and the rating score, their model improved the predictive ability. Yu et al. proposed a recommendation algorithm by integrating the user’s social status with a matrix factorization model [25]. Ning et al. proposed a regression model based on generative convolutional neural networks [26]. They employed metadata instead of review text for rating prediction. Chambua et al. proposed a tensor factorization model with the semantic similarity between review texts [27]. Wu et al. proposed an enhanced review-based rating prediction by exploiting aside information and user influence [28]. Their model achieved 1.32% improvements on average in terms of MSE compared to existing models. A few existing studies employ attention mechanisms to differentiate the importance of reviews. Tay et al. proposed a multi-pointer learning scheme that learns to combine multiple views of user–item interactions [29]. Chen et al. introduced a novel attention mechanism to explore the usefulness of reviews, and proposed a neutral attention regression model with review-level explanations for recommendations [30]. Their model could not only predict precise ratings, but also learned the usefulness of each review simultaneously. Liu et al. proposed a hybrid neural recommendation model to learn the deep representations for users and items from both ratings and reviews [31]. Their model contains three major components, i.e., a rating-based encoder to learn deep and explicit features from the rating patterns of users and items, a review-based encoder to model users and items from text reviews, and the prediction module for recommendation according to the rating- and review-based representations of users and items.

The above works have improved rating prediction performance with the help of text-level review analysis. Aspect-level review analysis, however, can further improve the predictive ability.

2.2. Matrix Factorization Techniques

As the Netflix Prize competition has demonstrated, matrix factorization models are superior to classic near-neighbor techniques for producing recommendations [32]. Recommendation systems rely on different types of input data, often placed in a matrix, with one dimension representing users and the other dimension representing items of interest.

Matrix factorization models map both users and items to a joint latent factor space of dimensionality f, such that user–item interactions are modeled as inner productions in this space. Accordingly, each item j is associated with a vector

V_{j} \in R^{f}

, and each user u is associated with a vector

U_{u} \in R^{f}

. For a given item j, the elements of

V_{j}

measure the extent to which the item possesses those factors, positive or negative. For a given user u, the elements of

U_{u}

measure the extent of interest that the user has in items that are high on the corresponding factors, again, positive or negative. The resultant dot product captures the interaction between the user and the item’s characteristics. This approximates user

u ’ s

rating of item j, which is denoted by

R_{u j}

, leading to the estimate

{\hat{R}}_{u j} = U_{u}^{T} V_{j}

(1)

For each given training case, the system predicts

R_{u j}

and computes the associated prediction error.

The goal of rating prediction is, given training data

R_{u j}

, to find a mapping

U_{u}

and

V_{j}

, such that

E_{u j} = R_{u j} - {\hat{R}}_{n j}

(2)

is a minimum, where

{\hat{R}}_{n j}

is the predicted rating given as the product j by the user u.

To learn the factor vectors (

U_{u}

and

V_{j}

), the system minimizes the regularized squared error on the set of known ratings:

\sum_{u = 1}^{K} \sum_{j = 1}^{N} {(R_{u j} - U_{u}^{T} V_{j})}^{2} + λ_{U} ∣ ∣ U ∣ ∣_{F}^{2} + λ_{V} ∣ ∣ V ∣ ∣_{F}^{2}

(3)

Here,

R_{u j}

is the training set, and

U_{u}^{T} V_{j}

is the true rating.

The algorithm learns the model by fitting the previously observed ratings. However, the goal is to generalize those previous ratings in a way that predicts future, unknown ratings. Thus, the system should avoid overfitting the observed data by regularizing the learned parameters whose magnitudes are penalized. The constant

λ_{U}

and

λ_{V}

control the extent of regularization and are usually determined by cross-validation.

Two approaches to minimize Equation (3) are stochastic gradient descent (SGD) and alternating least squares (ALS). We adopt SGD in this paper. To optimize Equation (3), the algorithm iterates over each rating on the training set. We set

λ_{U} = λ_{V} = λ

for simplification. For each pair of (

U_{u}

,

V_{j}

,

R_{u j}

), the algorithm defines a new loss function

E_{u j}

.

E_{u j} = \frac{1}{2} \sum_{u = 1}^{K} \sum_{j = 1}^{N} {(R_{u j} - U_{u}^{T} V_{j})}^{2} + \frac{λ}{2} (∣ ∣ U ∣ ∣_{F}^{2} + ∣ ∣ V ∣ ∣_{F}^{2})

(4)

For parameter

U_{u}

and

V_{j}

, the SGD method first separately finds their partial derivatives.

\frac{\partial E_{u j}^{'}}{\partial U_{u}} = E_{u j} \cdot (- V_{j}) + λ U_{u}

(5)

\frac{\partial E_{u j}^{'}}{\partial V_{j}} = E_{u j} \cdot (- U_{u}) + λ V_{j}

(6)

According to the SGD method, the iteration equation is

V_{j} \leftarrow V_{j} + γ \cdot (E_{u j} \cdot U_{u} - λ \cdot V_{j})

(7)

U_{u} \leftarrow U_{u} + γ \cdot (E_{u j} \cdot V_{j} - λ \cdot U_{u})

(8)

The parameter

γ

is the learning rate. Finally, we use submatrices U and V for rating prediction.

The other method of solution of the matrix factorization is the ALS method. The ALS techniques rotate between fixing the values of

U_{u}

and

V_{j}

. When all values of

U_{u}

are fixed, the system recomputes

V_{j}

by solving the least-squares problem, and vice versa. This ensures that each step decreases the value of Equation (4) until convergence is achieved.

Many researchers have used the matrix factorization technique to solve the problem of rating prediction. Pero and Horvath incorporated ratings provided by users and opinions inferred from their reviews in matrix factorization [33]. Zhang et al. proposed a kernel-based attribute-aware matrix factorization model to integrate the attribute information of items into matrix factorization for rating prediction [34]. Zhang et al. proposed a framework that combined network embedding and probabilistic matrix factorization for improved predictive ability [35]. In this paper, we also take the above strategy, and integrate the fine-grained aspect-based information into a standard matrix factorization technique for rating prediction.

3. Methodology

Traditional methods either discard review text or treat all of the text as a whole. To combine the aspect information, we should resort to aspect analysis, including aspect extraction and aspect summarization. To make the method easier, we transform the aspect information into user similarity and product similarity. Then, we model the transformed aspect information in a classical matrix factorization model. Our model is concise and easy to interpret. First, we represent the review text with aspect information. Then, we compute the aspect-based product similarity and the aspect-based user similarity. Next, the aspect-based similarities are modeled in matrix factorization, to predict the review rating. Figure 1 shows the review rating prediction flowchart. The hyphen in the left figure represents the value that needs to be predicted, while the red number in the right figure represents the predicted value.

In Table 1, we list the notations of the following parts.

3.1. Aspect Sentiment Representation

To exploit the aspect information of reviews, we should extract it from the review text. We adopt an aspect segmentation algorithm, presented in [8]. Given a collection of reviews and a set of aspect keywords, the algorithm splits the reviews into sentences with aspect assignments. We modified the algorithm with sentiment lexicons to separate the aspect and corresponding sentiments. Suppose a review r = {<w1, q1>, <w2, q2>, …< wN, qN>}, where

w_{i}

is the aspect keyword,

w_{k}

is the aspect category.

w_{i}

∈

W_{k}

,

q_{i}

is the sentiment keyword,

ϱ^{+}

represents a positive sentiment lexicon, and

ϱ^{-}

represents a negative sentiment lexicon.

q_{i}

∈

ϱ^{+}

∪

ϱ^{-}

.

In Figure 2, user

u_{1}

expresses four aspects concerning product

i_{2}

: breakfast, people, location, and room. With algorithm RAS (Algorithm 1), we can easily segment the review text into four aspects and corresponding sentiment polarities. We formally transform the review text as (breakfast, great), (people, nice), (location, good), (room, disappointed). It is important to note that the aspect keyword in the first aspect is breakfast. In our method, breakfast is included into the category of food. The aspect keyword in the second—people—is included into the category of staff. Since the positive sentiment value is set to 1, and the negative sentiment value is set to 0, we obtain (food, 1), (staff, 1), (location, 1), and (room, 0). It is important to note that our approach will not handle neutral words. Therefore, there are only two situations, positive and negative, in our method.

Algorithm 1 Review Aspect and Sentiment Algorithm, RAS

1:: Input: r = {<wi, qi>∣ i = 1, ..N} // review collection
2:: r = {w1, q2, ..wk} // aspect category
3:: D = $ϱ^{+}$ ∪ $ϱ^{-} / / s e n t i m e n t l e x i c o n$
4:: Output: r = {<wi, plt>∣ i = 1, ..N} // $p l t$ is Boolean value, 1 means positive sentiment, 0 means negative sentiment.
5:: For i = 1 to do
6:: if $w_{i} \in W_{k} do$
7:: $if q_{i} \in ϱ^{+} then r = r \cup < W k, 1 >$
8:: $if q_{i} \in ϱ^{-} then r = r \cup < W k, 0 >$
9:: End do
10:: End do
11:: Return r

3.2. Aspect-Based Similarity Measure

3.2.1. Aspect-Based Product Similarity

In the above section, one piece of the review was represented by aspects and corresponding sentiments. In this section, we build a product–aspect matrix. We define the value of the product j

s^{t h}

review aspect k as

w_{s j k}

. One product may correspond to multiple reviews. For total S reviews, if there are more positive sentiments than negative sentiments in aspect k, we define

w_{s j k}

as 1. If there are more negative sentiments than positive sentiments, we define

w_{s j k}

as −1. Otherwise, we define

w_{s j k}

as 0. Then, we obtain the product–aspect matrix M as

M_{j k} = \{\begin{matrix} 1, & i f \sum_{s = 1}^{S} w_{s j k} > S / 2 \\ - 1, & i f \sum_{s = 1}^{S} w_{s j k} < S / 2 \\ 0, & e l s e \end{matrix}

(9)

According to traditional item-based collaborative filtering, product similarity can be defined as cosine similarity. Here, we define aspect-based product similarity using improved cosine similarity [36]. The similarity matrix between product j and product n can be defined as

S_{j n} = \frac{\sum_{k = 1}^{E} (M_{j k} - {\bar{M}}_{k}) (M_{n k} - {\bar{M}}_{k})}{(\sqrt{\sum_{k = 1}^{E} {(M_{j k} - {\bar{M}}_{k})}^{2}}) (\sqrt{\sum_{k = 1}^{E} {(M_{n k} - {\bar{M}}_{k})}^{2}})}

(10)

where

M_{j k}

and

M_{n k}

represent values in the aspects of products j and n;

{\bar{M}}_{k}

represents the mean value of aspect k.

3.2.2. Aspect-Based User Similarity

In this subsection, we first build a user–aspect matrix. Since one user may post multiple reviews, we choose the majority principle. We define the value of user u

s_{t h}

review aspect k as

w_{s u k}

. One user may post multiple reviews. For the total reviews, if there are more positive sentiments than negative sentiments in aspect k, we define

w_{s u k}

as 1. If there are more negative sentiments than positive sentiments, we define

w_{s u k}

as −1. Otherwise, we define

w_{s u k}

as 0. Then, we obtain the user–aspect matrix N as

N_{u k} = \{\begin{matrix} 1, & i f \sum_{s = 1}^{S} W_{s u k} > S / 2 \\ - 1, & i f \sum_{s = 1}^{S} W_{s u k} < S / 2 \\ 0, & e l s e \end{matrix}

(11)

According to traditional item-based collaborative filtering, user similarity can be defined as cosine similarity. We define aspect-based user similarity using improved cosine similarity [32]. The similarity matrix between user u and user n can be defined as

T_{u m} = \frac{\sum_{k = 1}^{E} (N_{u k} - {\bar{N}}_{k}) (N_{m k} - {\bar{N}}_{k})}{(\sqrt{\sum_{k = 1}^{E} {(N_{u k} - {\bar{N}}_{k})}^{2}}) (\sqrt{\sum_{k = 1}^{E} {(N_{m k} - {\bar{N}}_{k})}^{2}})}

(12)

where

N_{u k}

and

N_{m k}

represent values in the aspect of user u and m;

{\bar{N}}_{k}

represents the mean value of aspect k.

3.3. Joint Model for Rating Prediction

Standard matrix factorization can be expressed as Equation (4). Given user–product rating matrix R, it represents a rating from user u to product j.

U_{u}^{T} V_{j}

is the prediction rating.

R_{u j}

is the actual rating. Now, the aspect-based user similarity and the product similarity are incorporated into the above objective function. Our joint aspect-based similarity model is as in Equation (13).

L (U, V) = \{\begin{matrix} \frac{1}{2} \sum_{u = 1}^{M} \sum_{j = 1}^{N} {(R_{u j} - U_{u}^{T} V_{j})}^{2} + \frac{α}{2} \sum_{j = 1}^{N} \sum_{n = 1}^{N} {(S_{j n} - V_{j}^{T} V_{n})}^{2} + \\ \frac{β}{2} \sum_{u = 1}^{M} \sum_{m = 1}^{M} {(T_{u m} - U_{u}^{T} U_{m})}^{2} + \frac{λ}{2} (∣ ∣ U ∣ ∣_{F}^{2} + ∣ ∣ V ∣ ∣_{F}^{2}) \end{matrix}

(13)

On the one hand, this model can alleviate the problem of low accuracy caused by sparse data. On the other hand, our model takes advantage of aspect information, which is less difficult than modeling review text directly.

The objective function is minimized by the SGD algorithm as Equations (14) and (15).

\frac{\partial L}{\partial U_{u}} = \sum_{j = 1}^{N} (R_{u j} - U_{u}^{T} V_{j}) (- V_{j}) + β \sum_{m = 1}^{M} (T_{u m} - U_{u}^{T} U_{m}) (- U_{m}) + λ U_{u}

(14)

\frac{\partial L}{\partial V_{j}} = \sum_{u = 1}^{M} (R_{u j} - U_{u}^{T} V_{j}) (- U_{u}) + α \sum_{n = 1}^{N} (S_{j n} - V_{j}^{T} V_{n}) (- V_{n}) + λ V_{j}

(15)

The stochastic gradient descent algorithm is as Algorithm 2. We call our method as Aspect-Based User and Product Matrix Factorization (AUPMF).

Algorithm 2 Aspect-Based User and Product Matrix Factorization, AUPMF

1:: Input:R // rating matrix
2:: $S_{j n}$ // aspect-based product similarity
3:: $T_{u m}$ // aspect-based user similarity
4:: $α, β$ // weight parameter
5:: $λ$ // normalization parameter
6:: ${i t e r}_{m a x}$ // iteration limit
7:: $ϵ$ // stop condition
8:: Output: $\hat{R}$ // user–product rating matrix
9:: // data preprocessing
10:: Initialize $U^{(0)} V^{(0)}$ with random value
11:: t = 0; //Iteration number
12:: $τ = 0$ ; //Convergence flag
13:: Compute $L^{(t)}$ ; //Equation (13)
14:: While( $t < {i t e r}^{m a x} a n d τ = 0) do$
15:: $η = 1$ ;
16:: $Compute \frac{\partial L}{\partial U^{(t)}}, \frac{\partial L}{\partial V^{(t)};}$ //Equations (14) and (15)
17:: $While (L (U^{(t)} - η \frac{\partial L}{\partial U^{(t)}}, V^{(t)} - η \frac{\partial L}{\partial V^{(t)}}) \geq L^{(t)}) do$
18:: $η = η / 2;$
19:: $U^{(t + 1)} = U^{(t)} - η \frac{\partial L}{\partial U^{(t)}}, V^{(t + 1)} = V^{(t)} - η \frac{\partial L}{\partial V^{(t)}}; / / U p d a t e$
20:: Compute $L^{(t + 1)}$ ; //Equation (13)
21:: If $(L^{(t)} - L^{(t + 1)} \leq ε)$
22:: $τ = 1;$
23:: $t = t + 1;$
24:: $End$
25:: $End$
26:: Return $\hat{R} = U^{(t) T} V^{(t)}$

4. Experiments and Analysis

To evaluate the effectiveness of the proposed model, this section uses real-life review data to conduct experiments. First, we analyze the impact of the weight parameters on the proposed model. Second, we compare our proposed approach with five existing models to demonstrate our model. The third experiment studies the impact that matrix density has on the predictive ability of the model. The fourth experiment investigates the influence of the latent dimension. The experimental results demonstrate the effectiveness of the proposed approach.

In this section, we first describe the review data that we used for evaluating the proposed model and then discuss the experiments.

4.1. The Dataset and Preprocessing

Our hardware and software configuration are Intel(R) Core(TM)i7-5600U CPU with 2.60 GHz and 8.0 G memory, Windows 2012, Python 3.5.2, NLTK 3.0, Numpy 1.11.2, SciPy 0.17.0, Scikit-Learn 0.19.1.

4.2. Experimental Setup

Our experimental data come from the review data of Yelp. Yelp is a famous rating website that has large numbers of restaurants, shopping malls, hotels, etc. Yelp allows users to post review text and ratings on the website. After a series of preprocessing steps, we obtain the Yelp data as follows.

As shown in Table 2, our Yelp dataset includes two subsets: a restaurant dataset with 1,344,405 reviews, and a hotel dataset with 96,384 reviews.

We manually set the aspect seed keywords for restaurants and hotels as listed in Table 3 and Table 4.

We conduct 5-fold cross-validation in the experiments. The data have been split into five parts. Four parts have been treated as training data, while the last part has been treated as the test data. This paper chooses the Mean Squared Error (MSE) as the evaluation standard. The MSE is defined as

M S E = \frac{1}{M} \sum_{u, j} {({\hat{R}}_{u j} - R_{u j})}^{2}

(16)

where M is the total number of reviews in our collection.

{\hat{R}}_{u j}

and

{\hat{R}}_{u j}

are the predicted rating and the actual rating in the test data. The result is the mean value of five experiments. The metric measures how much our predicted rating deviates from the true rating. A smaller MSE value indicates better performance.

4.3. Baselines

We use several baselines to compare with our approaches.

Basic Matrix Factorization (BasicMF): Koren etc. propose the standard matrix factorization [32], which only uses rating to train the model. The BasicMF model optimizes Equation (4) using the SGD algorithm with Equations (7) and (8) until the iteration ends.
Word-Based Similarity Matrix Factorization (WSMF): WSMF directly uses word similarity in the review, to improve the standard matrix factorization. First, it lists all the words in the review text, and exploits TF-IDF to sort important words to build features. It then transforms the review into an N-dimensional vector. The similarity of reviews is the cosine similarity of the above two vectors. Finally, we incorporate the similarity into matrix factorization.
Matrix Factorization with Bias (BiasMF): BiasMF exploits user and product bias information in matrix factorization to improve the rating prediction [32].
Sentiment-Based Rating Prediction method (RPS): Lei etc. propose a sentient-based method [7]. It first builds a sentiment lexicon, and then calculates the sentiment of the review with a series of rules. Next, it proposes three important factors (user sentiment similarity, item reputation similarity, and interpersonal sentiment influence), and fuses them into matrix factorization.
Hidden Factors and Hidden Item Topics (HFT): The HFT model uses a traditional latent factor model to combine latent rating dimensions with latent review topics [5]. The accuracy of HFT is higher than that of the traditional LFM model. The HFT is a state-of-the-art algorithm for rating prediction.

4.4. Evaluation Results

To verify the effectiveness of the AUPMF model, we perform comparisons with existing models. We also employ five-fold cross-validation. All the results are represented by means and variance of five results.

4.4.1. Impact of Weight Parameter

The weight parameters

α

and

β

, respectively, represent the proportion of aspect-based product similarity and user similarity in the proposed model. For the weight parameter

α

, a larger

α

means that the joint model relies more on product similarity. On the contrary, a smaller

α

means that the joint model relies less on product similarity. If

α

= 0, the joint model will not rely on product similarity, and it will only rely on user similarity to learn the latent factor vector. For weight parameter

β

, a larger

β

means that the joint model relies more on user similarity. On the contrary, a smaller

β

means that the joint model relies less on user similarity. If

β

= 0, the joint model will not rely on user similarity, and only relies on product similarity to learn the latent factor vector.

We use the AUPMF algorithm to conduct experiments on the restaurant and hotel datasets. First, we set the aspect number as 6. The aspect seed words are set manually as listed in Table 3 and Table 4. To study the weight parameter

α

and

β

, we set the values of

α

and

β

from 0 to 100, respectively, in steps of 10. We also set normalization parameter

λ

= 1, the number of latent features f = 20, and number of iterations to 1000.

Figure 3 shows how the weight parameters

α

and

β

impact the rating prediction in the restaurant dataset. The weight parameters

α

and

β

indeed influence the effectiveness of the proposed model. As

α

increases, MSE passes through a minimum, which means that the rating prediction initially goes up and then decreases. Parameter

β

also has the same effect on rating prediction. It is shown in Figure 3 that in the restaurant dataset, the MSE has a minimum value of 1.312 when

α

= 20 and

β

= 60. Therefore, the optimal values are

α

= 20 and

β

= 60.

We continue the experiment with the above settings in the hotel dataset, and keep other parameters unchanged. Then, we study the impact that the weight parameters

α

and

β

have on the rating prediction in the hotel dataset.

Figure 4 shows how the weight parameters

α

and

β

impact the effectiveness of the rating prediction in the hotel dataset. The parameters

α

and

β

indeed affect the performance of the prediction model. As above, the MSE shows similar trends for the restaurant and hotel datasets. With an increase in parameters

α

and

β

, the MSE value first decreases, which means higher accuracy of prediction. When a certain value is reached, the MSE increases with increasing

α

and

β

, which means lower accuracy.

The data presented in Figure 4 also show that the optimal values of the weight parameters

α

and

β

are the same in the two datasets. For the restaurant dataset, the model acquires the highest accuracy when

α

= 20 and

β

= 60, whilst, for the hotel dataset, the model also acquires the highest accuracy when

α

= 20 and

β

= 60. It can be seen from the experimental results that for the weight parameters

α

and

β

,

β

is much larger than

α

. It suggests that our model relies more on aspect-based user similarity than on aspect-based product similarity. In the review dataset, the products represent restaurant and hotels, and the number is relatively small compared with the number of users. Therefore, the impact of the product on the rating prediction is relatively small.

The value of MSE in Figure 3 is less than the value of MSE in Figure 4, indicating that our model’s predictive ability varies for different datasets. The reasons for this will be discussed in Section 4.4.4. Now, we set

α

= 20 and

β

= 60 in the following experiments.

4.4.2. Influence of Latent Dimension

For matrix factorization-based models, the latent dimension is an important parameter to tune. Our model involves such a parameter, f. In Section 4.4.1, we temporarily set it as 20. This section records how the number of latent factors influences the predictive ability of AUPMF. We vary it from 5 to 50 with a step of 5, and examine how the performance changes with regard to the latent dimension. As shown in Figure 5, using f = 20 yields the best performance in the restaurant dataset, and f = 25 in the hotel dataset. In order to facilitate the procedures, we still set f = 20 in the following experiment.

4.4.3. Comparison of Rating Prediction

Figure 6 compares the MSE for the two datasets determined using the six different models. The following conclusions can be obtained.

Due to data sparsity, the standard matrix factorization could not achieve better results. The MSEs of BasicMF for the restaurant and hotel datasets are 1.740 and 1.762, respectively.
WSMF performs worse than BasicMF, with MSEs of 1.920 and 1.971 for the two datasets, respectively. WSMF directly employs a word vector on standard matrix factorization, which reduces the predictive ability of the model.
BiasMF employs the bias information of the user and product to improve the matrix factorization, gaining stronger prediction. The MSEs of BiasMF are 1.575 and 1.621, respectively, for the two datasets.
The RPS model fuses several types of information to reduce the MSE; values of 1.437 and 1.534 were achieved for the two datasets, respectively. This model, however, relies on the sentiment lexicon, which affects the stability of prediction. It can be seen from Figure 5 that the deviation of several experiments is large.
Compared with the above baseline models, the HFT’s predictive ability is relatively strong. The average MSE was 1.346 and 1.458 for the restaurant and hotel datasets, respectively.
The results obtained from the experiments indicate that AUPMF performs consistently and significantly better than the baseline methods. This is illustrated in Figure 6 in terms of the MSE. The average MSE of AUPMF is 0.03 lower than that of HFT for the restaurant dataset, which means higher accuracy. For the hotel dataset, the average MSE of AUPMF is 0.03 lower than that of HFT, meaning that the predictive ability is stronger than that achieved using HFT.

All of the methods provide better prediction for the restaurant dataset compared with the hotel dataset. The reason is that the two datasets are different in sparsity. The hotel dataset is sparser than the restaurant dataset, and therefore the rating prediction is worse. We will discuss the effect of matrix density on rating prediction in Section 4.4.4.

4.4.4. Influence of Matrix Density

An important problem to be solved in this paper is the influence of matrix density on prediction. In order to examine the influence of matrix density, we conducted experiments with a range of different matrix densities. Suppose that m users post x reviews on n products. The user–product rating matrix density is

\frac{x}{m \times n}

. To obtain matrices with different densities, we construct them from the original matrix according to the method proposed by Li et al. [22]. We only conduct this experiment in the restaurant dataset. We define a threshold value

δ

. A large

δ

value means that users and products with large numbers of reviews will be kept, resulting in a dense matrix. A small

δ

value means that users and products with fewer reviews will be kept, resulting in a sparse matrix. There are four sub-datasets arranged on the X-axis of Figure 6. Their matrix densities

δ

are 0.034, 0.142, 0.207, and 0.292. The predicted MSEs for the different sub-datasets are plotted on the Y-axis. The experiments in this section are only carried out using the three best rating prediction models, i.e., RPS, HFT, and AUPMF.

As shown in Figure 6, the predictive ability of all models increases as the matrix density increases. When the matrix becomes dense, all models obtain more information and the performance of the different models will be improved.

As can be seen from Figure 7, both AUPMF and HFT outperform RPS in all sub-datasets. When the matrix density is 0.034, the MSE value of AUPMF is 0.03 and 0.13 lower than that of HFT and RPS, respectively. When the matrix density is 0.142, the MSE value of AUPMF is 0.03 and 0.14 lower than that of HFT and RPS, respectively. In one instance, HFT provides the best prediction; when the matrix density is 0.207, the MSE value of HFT is 0.01 and 0.06 lower than AUPMF and RPS. When the matrix density is 0.292, the MSE value of AUPMF is 0.02 and 0.07 lower than HFT and RPS. From the above analysis, we can see that the performance of AUPMF and HFT is relatively close. On three of four sub-datasets, the performance of AUPMF is better than that of HFT. Only when the matrix density of the sub-dataset is 0.207, the MSE of AUPMF is higher than that of HFT, showing that its predictive ability is slightly lower than that of HFT. When the matrix density of the sub-datasets is relatively small (0.142 and 0.034), the performance of AUPMF exceeds that of HFT, showing the strong robustness of the AUPMF model.

According to Table 2, the matrix density of the restaurant dataset is 0.0094, and the matrix density of the hotel dataset is 0.0043. In Figure 7, the matrix density of the constructed sub-datasets is higher than that of the original datasets, so the predictive ability of the model is improved. This also explains why the model proposed in Section 4.4.3 has stronger predictive ability for the restaurant dataset compared with the hotel dataset.

5. Conclusions

In this paper, we propose a joint aspect-based user and product model for review rating prediction. Our method first represents the review with aspect-based sentiment. Then, it presents the aspect-based user similarity and product similarity. Next, the aspect-based similarities are incorporated into a matrix factorization model. To assess our proposed methods, we conducted four experiments on two datasets. The results show that the proposed model is effective and outperforms existing approaches.

Author Contributions

Conceptualization, methodology, funding acquisition, original draft, Q.P.; validation, formal analysis, experiment, L.Y.; resources, data curation, writing—review and editing, W.D. and X.X.; investigation, experiment, H.F. and F.Z.; experimental analysis, conclusions, K.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Hubei Provincial Enterprise-Level Intelligent Application Excellent Young and Middle-Aged Scientific Technological Innovation Team, Natural Science Foundation of Hubei Province, No. ZRMS2019001565, the Technology Innovation Special Program of Hubei Province (No. 2022BAA044) and the Artificial Intelligence Application Research Center of Wuhan College.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All the details of this work, including data and algorithm codes, are available from the corresponding author: [email protected].

Acknowledgments

The authors would like to thank the reviewers for their helpful suggestions, which have considerably improved the quality of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Adomavicius, G.; Mobasher, B.; Ricci, F.; Tuzhilin, A. Context-aware recommender systems. AI Mag. 2011, 32, 67–80. [Google Scholar]
Ganu, G.; Elhadad, N.; Marian, A. Beyond the Stars: Improving rating predictions using review text content. In Proceedings of the Twelfth International Workshop on the Web and Database (WebDB 2009), Providence, RI, USA, 28 June 2009. [Google Scholar]
Yu, D.; Mu, Y.; Jin, Y. Rating prediction using review texts with underlying sentiments. Inf. Process. Lett. 2017, 117, 10–18. [Google Scholar] [CrossRef]
Ganu, G.; Kakodkar, Y.; Marian, A. Improving, the, quality, of, predictions, using, textual, information in online user reviews. Inf. Syst. 2013, 38, 1–15. [Google Scholar] [CrossRef]
McAuley, J.; Leskovec, J. Hidden factors and hidden topics: Understanding rating dimensions with review text. In Proceedings of the 7th ACM Conference on Recommender Systems, Hongkong, 12–16 October 2013; pp. 165–172. [Google Scholar]
Jin, Z.; Li, Q.; Zeng, D.D.; Zhan, Y.; Liu, R.; Wang, L.; Ma, H. Jointly modeling review content and aspect ratings for review rating prediction. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, Pisa, Italy, 17–21 July 2016; pp. 893–896. [Google Scholar]
Lei, X.; Qian, X.; Zhao, G. Rating prediction based on social sentiment from textual reviews. IEEE Trans. Multimed. 2016, 18, 1910–1921. [Google Scholar] [CrossRef]
Wang, H.; Lu, Y.; Zhai, C. Latent aspect rating analysis on review text data: A rating regression approach. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 24–28 July 2010; pp. 783–792. [Google Scholar]
Thet, T.T.; Na, J.C.; Khoo, C.S. Aspect-based sentiment analysis of movie reviews on discussion boards. J. Inf. Sci. 2010, 36, 823–848. [Google Scholar] [CrossRef]
Mukherjee, A.; Liu, B. Aspect extraction through semi-supervised modeling. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, ACL, Jeju Island, Korea, 8–14 July 2012; pp. 339–348. [Google Scholar]
Moghaddam, S.; Ester, M. Aspect-based opinion mining from product reviews. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, OR, USA, 12–16 August 2012; p. 1184. [Google Scholar]
Lu, Y.; Zhai, C.; Sundaresan, N. Rated aspect summarization of short comments. In Proceedings of the 18th International Conference on World Wide Web, Madrid, Spain, 20–24 April 2009; pp. 131–140. [Google Scholar]
Li, P.; Wang, Y.; Gao, W.; Jiang, J. Generating aspect-oriented multi-document summarization with event-aspect model. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, ACL, Edinburgh, UK, 27–31 July 2011; pp. 1137–1146. [Google Scholar]
Jo, Y.; Oh, A.H. Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM International Conference on Web Search and Data Mining, Hong Kong, China, 9–12 February 2011; pp. 815–824. [Google Scholar]
Zhang, L.; Liu, B. Aspect and entity extraction for opinion mining. In Data Mining and Knowledge Discovery for Big Data; Springer: Berlin/Heidelberg, Germany, 2014; pp. 1–40. [Google Scholar]
You, L.; Peng, Q.; Xiong, Z.; He, D.; Qiu, M.; Zhang, X. Integrating aspect analysis and local outlier factor for intelligent review spam detection. Future Gener. Comput. Syst. 2020, 102, 163–172. [Google Scholar] [CrossRef]
Pang, B.; Lee, L. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Ann Arbor, MI, USA, 25–30 June 2005; pp. 115–124. [Google Scholar]
Goldberg, A.B.; Zhu, X. Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization. In Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, ACL, New York, NY, USA, 9 June 2006; pp. 45–52. [Google Scholar]
Lu, Y.; Kong, X.; Quan, X.; Liu, W.; Xu, Y. Exploring the sentiment strength of user reviews. In Proceedings of the 11th International Conference on Web-Age Information Management, Jiuzhaigou, China, 15–17 July 2010; pp. 471–482. [Google Scholar]
Qu, L.; Ifrim, G.; Weikum, G. The bag-of-opinions method for review rating prediction from sparse text patterns. In Proceedings of the 23rd International Conference on Computational Linguistics, Beijing, China, 23–27 August 2010; pp. 913–921. [Google Scholar]
Wang, H.; Lu, Y.; Zhai, C. Latent aspect rating analysis without aspect keyword supervision. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, 21–24 August 2011; pp. 618–626. [Google Scholar]
Li, F.; Liu, N.N.; Jin, H.; Zhao, K.; Yang, Q.; Zhu, X. Incorporating reviewer and product information for review rating prediction. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, AAAI, Barcelona, Spain, 16–22 July 2011; pp. 1820–1825. [Google Scholar]
Gao, Y.; Yu, W.; Chao, P.; Zheng, Z.; Zhang, R. Analyzing review for rating prediction and item recommendation. J. East China Norm. Univ. 2015, 2015, 80–90. [Google Scholar]
Tan, Y.; Zhang, M.; Liu, Y.; Ma, S. Collaborative Recmooendation Framework Based on Rating and Textual Reviews. Pattern Recognit. Artif. Intell. 2016, 29, 359–366. [Google Scholar]
Yu, Y.; Gao, Y.; Wang, H.; Sun, S. Integrating User Social Status and Matrix Factorization for Item Recommendation. J. Comput. Res. Dev. 2018, 55, 113–124. [Google Scholar]
Ning, X.; Yac, L.; Wang, X.; Benatallah, B.; Dong, M.; Zhang, S. Rating prediction via generative convolutional neural networks based regression. Pattern Recognit. Lett. 2018, 132, 12–20. [Google Scholar] [CrossRef]
Chambua, J.; Niu, Z.; Yousif, A.; Mbelwa, J. Tensor factorization method based on review text semantic similarity for rating prediction. Expert Syst. Appl. 2018, 114, 629–638. [Google Scholar] [CrossRef]
Wu, S.; Zhang, Y.; Zhang, W.; Bian, K.; Cui, B. Enhanced review-based rating prediction by exploiting aside information and user influence. Knowl. Based Syst. 2021, 222, 107015. [Google Scholar] [CrossRef]
Tay, Y.; Luu, A.T.; Hui, S.C. Multi-pointer co-attention networks for recommendation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 2309–2318. [Google Scholar] [CrossRef] [Green Version]
Chen, C.; Zhang, M.; Liu, Y.; Ma, S. Neural attentional rating regression with review-level explanations. In Proceedings of the 2018 World Wide Web Conference on World Wide Web, International World Wide Web Conferences Steering Committee, Lyon, France, 23–27 April 2018; pp. 1583–1592. [Google Scholar]
Liu, H.; Wang, Y.; Peng, Q.; Wu, F.; Gan, L.; Pan, L.; Jiao, P. Hybrid neural recommendation with joint deep representation learning of ratings and reviews. Neurocomputing 2020, 374, 77–85. [Google Scholar] [CrossRef]
Koren, Y.; Bell, R.; Volinsky, C. Matrix factorization techniques for recommender systems. Computer 2009, 42, 42–49. [Google Scholar] [CrossRef]
Pero, Š.; Horváth, T. Opinion-driven matrix factorization for rating prediction. In Proceedings of the 21th International Conference on User Modeling, Adaptation, and Personalization, Rome, Italy, 10–14 June 2013; pp. 1–13. [Google Scholar] [CrossRef]
Zhang, J.D.; Chow, C.Y.; Xu, J. Enabling kernel-based attribute-aware matrix factorization for rating prediction. IEEE Trans. Knowl. Data Eng. 2016, 29, 798–812. [Google Scholar] [CrossRef]
Zhang, M.; Hu, B.; Shi, C.; Wu, B.; Wang, B. Matrix factorization meets social network embedding for rating prediction. In Proceedings of the Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data, Macau, China, 23–25 July 2018; pp. 121–129. [Google Scholar] [CrossRef]
Sarwar, B.; Karypis, G.; Konstan, J.; Riedl, J. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International Conference on World Wide Web, Hong Kong, 1–5 May 2001; pp. 285–295. [Google Scholar] [CrossRef]

Figure 1. Review rating prediction flowchart.

Figure 2. Review aspect and sentiment representation.

Figure 3. MSE vs. weight parameter in restaurant dataset.

Figure 4. MSE vs. weight parameter in hotel dataset.

Figure 5. MSE vs. number of latent dimensions.

Figure 6. MSE vs. different models for two datasets.

Figure 7. MSE vs. different matrix densities for restaurant dataset.

Table 1. Notations.

Symbol	Description
U	m × f matrix, represents user’s preference for a product
V	n × f matrix, indicates that a product belongs to a preference
R	rating matrix
$U_{u}$	f dimensional column vector of user u
$V_{j}$	f dimensional column vector of product j
$R_{u j}$	rating of user u to product j
$λ$	normalization parameter
$M_{j k}$	product–aspect matrix, product j to aspect k, the value is +1, −1 or 0
$S_{j n}$	aspect-based similarity matrix between product j and n
$N_{u k}$	user–aspect matrix, user u to aspect k, the value is +1, −1 or 0
$T_{u m}$	aspect-based similarity matrix between user u and m
$α$	weight parameter to balance product weight
$β$	weight parameter to balance user weight

Table 2. Yelp data statistics.

	Restaurant	Hotel
reviews	1,344,405	126,384
products	7438	2372
users	19,150	12,305

Table 3. Aspect seed words for restaurant dataset.

Aspect	Seed Words
value	money, price, dollars, cash, check, quality
service	waiter, manager, staff, hostess
meat	beef, bbq, pork, hamburger, hotdog
decor	design, ceiling, decor, window, space
dessert	dessert, chocolate, ice cream, macaroons
ambiance	ambiance, atmosphere, experience

Table 4. Aspect seed words for hotel dataset.

Aspect	Seed Words
room	room, suite, view, bed
value	value, price, quality, worth
location	location, traffic, car, restaurant
cleanliness	clean, dirty, maintain, smell
check in/front desk	stuff, check, help, reservation
service	service, food, breakfast, buffet

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Peng, Q.; You, L.; Feng, H.; Du, W.; Zheng, K.; Zhu, F.; Xu, X. Jointly Modeling Aspect Information and Ratings for Review Rating Prediction. Electronics 2022, 11, 3532. https://doi.org/10.3390/electronics11213532

AMA Style

Peng Q, You L, Feng H, Du W, Zheng K, Zhu F, Xu X. Jointly Modeling Aspect Information and Ratings for Review Rating Prediction. Electronics. 2022; 11(21):3532. https://doi.org/10.3390/electronics11213532

Chicago/Turabian Style

Peng, Qingxi, Lan You, Hao Feng, Wei Du, Kesong Zheng, Fuxi Zhu, and Xiaoya Xu. 2022. "Jointly Modeling Aspect Information and Ratings for Review Rating Prediction" Electronics 11, no. 21: 3532. https://doi.org/10.3390/electronics11213532

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Jointly Modeling Aspect Information and Ratings for Review Rating Prediction

Abstract

1. Introduction

2. Related Work

2.1. Review Rating Prediction

2.2. Matrix Factorization Techniques

3. Methodology

3.1. Aspect Sentiment Representation

3.2. Aspect-Based Similarity Measure

3.2.1. Aspect-Based Product Similarity

3.2.2. Aspect-Based User Similarity

3.3. Joint Model for Rating Prediction

4. Experiments and Analysis

4.1. The Dataset and Preprocessing

4.2. Experimental Setup

4.3. Baselines

4.4. Evaluation Results

4.4.1. Impact of Weight Parameter

4.4.2. Influence of Latent Dimension

4.4.3. Comparison of Rating Prediction

4.4.4. Influence of Matrix Density

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI