Modeling user rating preference behavior to improve the performance of the collaborative filtering based recommender systems

Mubbashir Ayub; Mustansar Ali Ghazanfar; Zahid Mehmood; Tanzila Saba; Riad Alharbey; Asmaa Mahdi Munshi; Mayda Abdullateef Alrige

doi:10.1371/journal.pone.0220129

Abstract

One of the main concerns for online shopping websites is to provide efficient and customized recommendations to a very large number of users based on their preferences. Collaborative filtering (CF) is the most famous type of recommender system method to provide personalized recommendations to users. CF generates recommendations by identifying clusters of similar users or items from the user-item rating matrix. This cluster of similar users or items is generally identified by using some similarity measurement method. Among numerous proposed similarity measure methods by researchers, the Pearson correlation coefficient (PCC) is a commonly used similarity measure method for CF-based recommender systems. The standard PCC suffers some inherent limitations and ignores user rating preference behavior (RPB). Typically, users have different RPB, where some users may give the same rating to various items without liking the items and some users may tend to give average rating albeit liking the items. Traditional similarity measure methods (including PCC) do not consider this rating pattern of users. In this article, we present a novel similarity measure method to consider user RPB while calculating similarity among users. The proposed similarity measure method state user RPB as a function of user average rating value, and variance or standard deviation. The user RPB is then combined with an improved model of standard PCC to form an improved similarity measure method for CF-based recommender systems. The proposed similarity measure is named as improved PCC weighted with RPB (IPWR). The qualitative and quantitative analysis of the IPWR similarity measure method is performed using five state-of-the-art datasets (i.e. Epinions, MovieLens-100K, MovieLens-1M, CiaoDVD, and MovieTweetings). The IPWR similarity measure method performs better than state-of-the-art similarity measure methods in terms of mean absolute error (MAE), root mean square error (RMSE), precision, recall, and F-measure.

Citation: Ayub M, Ghazanfar MA, Mehmood Z, Saba T, Alharbey R, Munshi AM, et al. (2019) Modeling user rating preference behavior to improve the performance of the collaborative filtering based recommender systems. PLoS ONE 14(8): e0220129. https://doi.org/10.1371/journal.pone.0220129

Editor: Chi Ho Yeung, Education University of Hong Kong, CHINA

Received: March 5, 2019; Accepted: July 9, 2019; Published: August 1, 2019

Copyright: © 2019 Ayub et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All the relevant data and its supporting information files (including Java source code) of the manuscript can be accessed by visiting the following publicly accessible URL of figshare data repository: https://figshare.com/articles/Article_PLoS_One_data_files_PONE-D-19-02116/8039942.

Funding: The authors received no specific funding.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The advancements in machine learning revolutionized the e-commerce business in the last decade. The companies are taking advantage of these advancements by providing users a plethora of online resources for shopping. People are now more interested in buying things online rather than in the old traditional way. In addition, users connect with their friends and colleagues through social networking sites and get reviews about different products. This paradigm shift of shopping and user attitude led to the information overload problem, where users can buy items from millions of online shopping stores. This leads companies to deal with so-called big data problem [1]. Recommender systems aim at solving the information overload problem by recommending products and information to users based on their need and preferences of the community [2, 3].

A few renowned examples of recommender system are Amazon [4], YouTube [5], and Google news [6]. Recommender system uses several categories for creation and generation of information [7]. There are two basic entities in the recommender systems, namely users and items. An active user is a user that is utilizing the recommender system, expresses opinions and provides ratings about different products. The recommender systems apply intelligent filtering methods to rate and recommend items to active users. The two main categories of recommender system are content-based filtering (CBF) and collaborative filtering (CF). The CBF gathers information using the content of items. The main concept of CBF is that the users may like a similar type of items as they liked in the past. In this type of recommender system filtering, user profile record the user interaction with the recommender system and preserve the users’ interests and preferences. For example, LIBRA which is a recommender system for the books. The main concept of CF is that people who agree in the past also agree in the future too. An active user makes a prediction on the target items to find out other similar users called neighbors. The CF is further divided into two types, which are memory-based CF and model-based CF. The memory-based CF uses the rating data to find similar users or items [8]. Different commercial systems are using the memory-based CF for the recommendation. It is efficient and easy to implement. The main focus of this article is to overcome the issues of memory-based CF for the recommender system. The model-based CF discovers latent factors and uses this latent factor to make predictions. Some common examples of the model-based CF are clustering models [9], Bayesian networks[10], singular value decomposition (SVD)[7], and kernel-mapping recommender (KMR) system based algorithms[11]. The memory-based CF gives more significant results for the recommender system. However, its shortcoming is that it takes too much time in case of a large number of users and items. The model-based CF gives faster recommendations as compared to the memory-based CF because it can run off-line to reduce recommendation time. However, shortcomings of model-based CF are that it cannot gives accurate results as compared to the memory-based CF as well as it cannot give out of box recommendation. Despite several benefits, CF also suffers from some limitations like data sparsity and cold-start problems [7]. Data sparsity problem arises when the number of rated items are very low as compared to the items to predict. It occurs due to an overlap between the items rated by two users is too narrow or not exist. The cold start scenario is a problem for a particular item, which holds no ratings or an extremely low number of ratings [12]. In some conditions, CF lacks the ability to provide reliable recommendations due to diverse user interests or when items have different contents [7]. In E-commerce, this problem is called Multiple-interest and Multiple-content recommendation problem [13]. Considering the aforementioned problems of CF, researchers proposed different hybrid methods to improve the performance of the recommender systems such as demographic and semantic information [14–19]. These recommender system methods also use traditional similarity measures like the Pearson correlation coefficient (PCC) [13, 20, 21].

Traditional CF-based methods compute similarities between users for all co-rated items as well as for those items that are different than the target items. Therefore, the neighbors for the active user remain the same even for different items. The most important part of the CF-based method is to find similarities between different users. The commonly used recommender systems are based on traditional similarity measures like PCC or cosine vector similarity [13, 20, 21], which consider only local context information. There are still some issues in traditional similarity measures despite their enormous success. Traditional similarity measures calculate user similarity without considering user RPB. Generally, many users do not rate items according to their quality. These users can be categorized into two types; one that rates every item with almost the same rating leading to zero variance in their ratings. Second, those who give an average rating to all items regardless of the item quality. This creates a serious problem while calculating similarities among users, which often leads to poor recommendations[22]. The miniature research is carried to handle this type of user behavior despite its high impact on recommendations [23–26]. There are several design objectives, which need to be intended to make the recommender system successful, which are discussed below:

a) Accurate: Accuracy is one of the most important design objectives of the recommender system. The accuracy helps to build the trust of users when they interact with the recommender system. When a user buys any product recommended to him and starts using it after some time the user realize that the system has given the wrong recommendation about that product. Consequently, a user stops trusting that recommender system. Therefore, the main objective of a recommender system is to give an accurate prediction for items to any user. The proposed IPWR similarity measure method gives accurate recommendations as compared with the state-of-the-art similarity measure methods used for the recommender systems.

b) Scalable: A good recommender system should be able to handle large datasets and generate predictions in real time. When the number of users and items increases, the search space grows as well, then it may be difficult to give result in real time, if the recommender system is not scalable. There is always a conflict between accuracy and scalability of a recommender system.

c) Overspecialization: In CF-based methods, items are recommended to a user, which are most similar to a user profile. Typical methods of recommender systems cannot give any recommendations about non-co-rated items. The IPWR similarity measure method also overcomes such a scenario by considering ratings of the non-co-rated items.

In this article, we present a novel method for recommender system known as IPWR similarity measure. It takes into account the user RPB towards an item rating to improve standard PCC similarity measure method. To record the user RPB, two methods are proposed: the first method uses mean and variance for each user and it is known as IPWR with variance. The second method uses mean and standard deviation (SD) for each user and it is known as IPWR with SD. The results from either method are then linearly combined with improved PCC similarity measure method. The performance of the IPWR similarity measure method is evaluated on the four state-of-the-art datasets for recommender systems using state-of-the-art similarity measure methods. The IPWR similarity measure outperforms state-of-the-art similarity measures in terms of MAE, RMSE, precision, recall, and F-measure.

The main contributions of this article are as follows:

A simple yet highly effective similarity measure method is proposed to model the rating preference behavior (RPB) of users.
Standard PCC similarity is improved by overcoming some of its shortcomings. These shortcomings are discussed in a subsection of “Related Work” section entitled as “Shortcomings of standard PCC”.
Improved PCC similarity is then adaptively combined with RPB of users.
The IPWR similarity measure method not only considers local context information but also take into account the global preference of user ratings.
The IPWR similarity measure method can handle the scenarios if no co-ratings are found between two users, as in generally cold start scenarios.

The rest of the sections of this article are organized as follows: Section 2 describes the state-of-the-art literature related to recommender systems. Section 3 describes the methodology of the IPWR similarity measure. Section 4 presents the experimental results and discussion of the IPWR similarity measure method and its comparison with state-of-art similarity measure methods. Section 5 concludes the IPWR similarity measure method and presents future research directions.

Related work

Collaborative filtering (CF) is now commonly used in many fields for personalized recommendation [2, 12, 13, 27–32]. However, there are also some issues in collaborative filtering (CF), like accuracy, scalability, and cold start, etc. In this paper, the main focus is to improve the prediction accuracy. In CF, items are recommended to users’ according to their preferences, therefore, it is very important that the history of users’ preferences must be available. Different researchers worked on prediction accuracy to improve the performance of the recommender systems. For instance, Ahn et al. [20] propose a solution for CF known as proximity impact popularity (PIP) measure to address the shortcomings of standard PCC and cosine similarity. The PIP measure is the combination of three different aspects of user ratings, which are proximity, impact, and popularity. The PIP similarity only considers the local information of user rating, while the global preference of user ratings is ignored. Moreover, the results of the recommender system using PIP similarity measure are not normalized, which makes it difficult to combine it with other similarity measures. To resolve this issue, the weighted Pearson correlation coefficient (WPCC) method is proposed in. In WPCC[33], the idea of detaining confidence is considered that can be placed on the neighbors. When the number of rated items increases, the confidence also increases and vice versa. Jamali et al.[34] propose a similarity measure, which is based on the sigmoid function. This similarity measure can weaken the similarity of small common rated items among users. J. Bobadilla[35] propose adjusted cosine similarity to overcome the deficiencies of traditional cosine similarity, however, it does not consider the users’ preferences.

Bobadilla et al.[36] propose a novel similarity measure, which utilizes two similarity measures that are the mean squared difference and Jaccard similarity measure. Another metric called mean Jaccard difference (MJD) is proposed to address the cold start problem. Three steps are included in this metric to address the cold start problem. Firstly, the similarity metric is selected. The second step is an evaluation, in which weights are evaluated using neural networks. The last step is a prediction, which is obtained according to the selected similarity metric. [35]proposed a novel similarity measure known as a singularity-based similarity measure. In this similarity measure, it is assumed that the obtained results can be improved by taking contextual information. The user ratings are grouped as positive and negative and the singularity value of user and item is computed. The experimental results show the effectiveness of the proposed similarity. The significance-based similarity measure is proposed by Bobadilla et al.[35]. In this method, the significance of an item, the significance of a user, and the significance of an item for a user are computed. Then according to significance, similarity among users is computed using a standard Pearson correlation coefficient or cosine similarity. It also uses a data smoothing technique for similarity measure, which is the most widely used technique of recommender systems. Different sparsity measures are also used to improve the accuracy of the recommender system. H. Ma et al. [37] propose a similarity measure, in which information of users and items is taken into account and threshold for both are set, respectively. SongJie Gong et al.[38] propose another method to fill the missing ratings by merging SVD and item-based recommender. It uses the item-based method to recommend items to the user.

Szwabe et al. [39] propose a hybrid recommender system method that occupies two-stage data. It processes the data with content features that describe the items and users’ preferences. It improves the accuracy of a system without raising the computational complexity. Moreover, probabilistic matrix factorization is also merged in the recommender system to address issues like data sparsity, cold start, etc. N. Polatidis[40] also propose a novel similarity measure, which uses four different thresholds on a number of co-rated items using PCC to improve the accuracy of the recommender system. Liu et al.[33] also propose a novel similarity measure known as the new heuristic similarity method (NHSM). It computes three parameters, which are proximity, significance, and singularity for each co-rated item. After that, each computed parameter is multiplied by modified Jaccard similarity. The obtained similarity is then again multiplied with a function named as URP to obtain the resultant NHSM similarity [20]. The computation of NHSM-based similarity is complex and lengthy, which makes it difficult to produce a result in real time for the recommender system. All factors in NHSM are again and again multiplied, which ultimately weakens the performance and combining these results with some other similarity measure becomes difficult. A novel similarity measure based on the Bhattacharyya coefficient is proposed by Bidyut Kr. Patra et al. [41]. This method considers both co-rated and non-co-rated items for similarity measure. The resultant similarity measure is a linear combination of the Bhattacharyya coefficient, PCC similarity, and Jaccard similarity. Shuang-Bo Sun et al. [42] propose a novel similarity measure, which combines triangle and Jaccard similarities to improve the performance of the recommender system. Sadasivam et al. [43] propose a novel similarity measure for recommender system, which modifies the Bhattacharyya coefficient using an exponential function and then combined it with Jaccard followed by proximity, significance, and singularity (PSS) measures using a weighted scheme.

Shortcomings of standard PCC

The standard PCC suffers from some shortcomings, which are discussed below in conjunction with Cosine and CPCC similarity measures.

Shortcoming 1: Flat value of ratings.

In case user1 rating vector is flat such as (1,1,1) or (3,3,3) or (5,5,5) and user2 rating vector is (1,5,1), PCC will be not a number (NaN). Cosine value will be 0.777 and CPCC value depends upon whether the rating vector is above, equal to or below the median rating value of the rating scale. CPCC value will be +0.333, if rating vector consists of rating values less than median value (i.e. median value = 3), and will be -0.333 if rating values are greater than median value and CPCC, is NaN if all rating values are 3.

Shortcoming 2: Only single co-rated item.

In case two users contain a single co-rated item, then PCC will be NaN and Cosine will be 1.0. There are two cases for CPCC. In the first case, when the value of rating for both users is equal then the value of CPCC is 1.0 for all values above or below than median value and NaN if rating value is equal to the median value. In the second case, if both users common rating value is different, then CPCC is also NaN.

Shortcoming 3: Ignorance of user rating preference behavior (RPB).

The rating preferences may vary from user to user. Some users may rate every item high and some may rate every item low. This scenario of user RPB is not considered in standard PCC.

Shortcoming 4: Ignorance of corresponding item average rating in case of user-based CF.

In case of user-based CF, standard PCC only consider the average rating of users and ignores the average rating of the corresponding item. Similarly, in the case of item-based CF, user averages are also ignored.

Keeping in view the aforementioned shortcomings of the standard PCC, the proposed IPWR similarity measure method is named as improved PCC weighted with RPB (IPWR). In the IPWR similarity measure method, user RPB is modeled as a Cosine function of user averages and variance or SD. Almost all the aforementioned method in the related work, as well as state-of-the-art similarity measure methods for recommender system, ignore this behavior of a user rating. After that, calculated user RPB is linearly combined with improved PCC to enhance the performance of the IPWR similarity measure method.

The IPWR similarity measure method

In this section, we explain the methodology of the IPWR similarity measure that is used in memory-based CF to improve the performance of the recommender system. We denote the set of users by U = {a,b,c,…,z}3320 and a set of items are denoted by I = {i₀,i₁,i₂,…,i_m}. Each user (e.g. denoted by a) rates a set of items denoted by I_a. The rating of a user a for item i is denoted by R_a,i and it can be any real number (normally ratings are represented by real numbers in some range [min, max]). The mathematical representation of different similarity measures that are used as a performance comparison with the proposed IPWR similarity measure is presented in Table 1.

Download:

Table 1. A mathematical representation of different similarity measures.

https://doi.org/10.1371/journal.pone.0220129.t001

As discussed earlier, almost all methods of the similarity measure for the recommender system uses the co-ratings provided by the user. There are many users whose rating preference behavior is different than normal users. They tend to rate items according to their own behavior. Some users may rate every item low whether the item is good or bad. They may do this for bad items or even for good items. There is the second category of users who rate every item high whether that item is good or bad. These types of behavior of the user are termed as rating preference behavior (RPB). In this article, to handle such behaviors, the IPWR similarity measure method uses variance or standard deviation (SD) of each user using a Cosine function. The variance for the user a is calculated as follows: (7) where R_a,j represent the rating of user a for item j, represents the mean of ratings for the user a and I_a represents a set of items rated by user a. The SD for the user a can be calculated as follows: (8) where var_a in Eq (8) denotes the variance of usera. The calculated values of variance and SD can be used to calculate RPB for two users separately. The RPB_(a,b) function which uses variance is denoted by RPB_(a,b)using var and if SD is used then it is denoted by RPB_(a,b)using SD and mathematical represented using Eq (9) and Eq (10), respectively as follows: (9) (10) In Eq (9) and Eq (10), and represents the mean of ratings by user a and mean of ratings for user b, respectively. The cosine function is used to model the RPB of two users. The cosine function is the most commonly used function, which is used by the similarity measure methods either in CF or CB-based recommender systems in the literature. In addition, if two users have the same average rating and variance or SD value, then there RPB will be equal to 1. The use of the Cosine function also results in normalized values whose range is from -1 to +1, which is another reason to use cosine function. In the proposed IPWR similarity measure method, we intend to improve the standard PCC. The standard PCC works only on co-rated items and suffers from shortcoming 4 as discussed in the subsection of related work section entitled as “shortcomings of standard PCC”. To tackle shortcoming 4, both user and item average ratings are used as mentioned in Eq (11). The resultant similarity is given the name of improved PCC similarity measure which is denoted by Sim_IPCC. Furthermore, the standard PCC ignores users rating pattern, which is also estimated by IPWR similarity measure method using full rating information in the form of user averages and variance or SD. (11) where the variables involved in Eq (11) are the same as used in Table 1 for standard PCC. The final similarity is named as improved PCC weighted with RPB_(a,b) and denoted by IPWR_(a,b) is mathematically represented using Eq (12). The IPWR similarity measure considers both RPB_(a,b) and Sim_IPCC _{(a, b)} by combining both factors using an adaptive weighting scheme. Two weights α and β are chosen, α is applied to RPB_(a,b) and β is applied to Sim_IPCC _{(a, b)}. This also ensures that IPWR similarity measure method considers the user rating behavior and it also normalizes high rating effect as well as the low rating effect of each user.

(12)

(13)

The weights of α and β are determined in a separate subsequent section entitled as “Determining best weights for α and β”. The range of values for IPWR_(a,b) are from -1 to +1 and thus a similarity threshold θ_s is also required to be put on the similarity value generated by Eq (12) or Eq (13). The reason for adding both RPB_(a,b) and Sim_IPCC_(a,b) is that similarity range of Sim_IPCC_(a,b) is from -1 to +1, while RPB_(a,b) similarity range is also from -1 to +1. Now if two users have a slightly negative Sim_IPCC_(a,b) similarity but a high positive value for RPB_(a,b) then the overall similarity value for Eq (12) will become greater than zero implying a positive similarity between these two users. However, if RPB_(a,b) is not used in conjunction with Sim_IPCC_(a,b) then both users are treated as a dis-similar user by Sim_IPCC_(a,b). Similarly, if two users have a slightly positive Sim_IPCC_(a,b) similarity while a high negative value for RPB_(a,b) then overall similarity IPWR_(a,b) consider these two users as dis-similar users.

The final recommendations are generated using Eq (14), which is known as Resnick’s formula [44] and either Eq (12) or Eq (13) can be used by IPWR_(a,b) similarity measure method and defined mathematically as follows: (14) where p denotes a user belonging to nearest neighbor (NN) network of the user a. The top similar users of a are identified as nearest neighbors of the user a.

The pseudo code for the IPWR similarity measure method is outlined below and entitled as “Algorithm 1”. The experimental results of the IPWR similarity measure method are reported using five-fold cross validation[45] on each of the publically available dataset. The Pearson similarity is computed using Eq (11) and RPB is computed using either Eq (9) or Eq (10). The results of step 4 and step 5 are combined using Eq (12). Final prediction is generated using Eq (14).

______________________________________________________________________________________________________

Algorithm 1: Procedure of recommendation by the IPWR similarity measure method

Input: Rated dataset

Output: Predicted rating

______________________________________________________________________________________________________

1. Perform five-fold cross validation on ratings dataset to obtain test and training set.

2. Select target user a and test item i₀ from the test set.

3. Find all users b who have rated i₀ in the training set.

4. Find improved Pearson similarity Sim_IPCC_(a,b) between a and b using Eq (11).

5. Find rating preference behavior RPB_(a,b) of a and b using either Eq (9) or Eq (10).

6. Combine results from steps 4 and 5 using either Eqs (12) or (13).

7. Make prediction on target item i₀ of target user a using Eq (14).

8. Return .

_____________________________________________________________________________________

Consider an example, which demonstrates the working of the IPWR similarity measure method. In this example, Table 2 is showing an instance of a user-item based rating matrix. In the current situation, the rating matrix consist of five users and five items. The five users are denoted by a to e, while five items are denoted by i₁ to i₅. In this example, we want to predict a rating of an item i₅ for the user a. The distinct feature of this user-item based rating matrix is that user e rating vector is flat which corresponds to shortcoming 1 of standard PCC as mentioned earlier in the subsection entitled as “Shortcomings of standard PCC”. For this reason, although user a and user e consist of exactly the same value of co-rated items, PCC is NaN. User a and user c consist of single co-rated item (i.e. i₃ only), which corresponds to the shortcoming 2 of the standard PCC.

Download:

Table 2. An example of a user-item based rating matrix.

https://doi.org/10.1371/journal.pone.0220129.t002

Table 3 contain various parameters, which are computed from Table 2. The variance of each user is computed using Eq (7), RPB of users is computed using Eq (9), Sim_PCC value is computed using Eq (1), and Sim_IPCC value is computed using Eq (11).

IPWR_(a,b) = 0.5*0.073+0.5*0.0563 = 0.0646

IPWR_(a,c) = 0.5*0.978+0.5*1.0 = 0.744

IPWR_(a,d) = 0.5*(-0.09) +0.5*(-0.079) = -0.084

IPWR_(a,e) = 0.5*(0.785) +0.5*(0.707) = 0.372

Download:

Table 3. Computed values of different statistics from Table 2.

https://doi.org/10.1371/journal.pone.0220129.t003

From Table 3, it can be noted that Sim_PCC_(a,c) and Sim_PCC_(a,e) values are NaN, but at the same time values of Sim_IPCC and IPWR are real numbers that effectively abolishing shortcomings 1 and 2 of the standard PCC. The value of Sim_PCC_a,d = 0.0, which is very interesting. The users b,d, and e rated target item i₅. By applying the similarity threshold θ_s, a set of nearest neighbors (NN) for the user a are identified and NN = {b,e}. The value of user b rating for an item i₅ is 1.0 and the value of user e rating for an item i₅ is 3.0.

Experimental setup and performance evaluation metrics

Datasets

The standard datasets (namely Epinions [46], MovieLens-100K (ML-100K) [47], MovieLens-1M (ML-1M)[48], CiaoDVD[49], and MovieTweetings [50]) are used for the performance evaluation of the IPWR similarity measure method. The details about these datasets are as follows:

a) Epinions dataset: The Epinions is an online community website that allows users to review different products and services. Users can also rate the other user’s review on a numerical scale. This dataset contains 664823 ratings on the scale of 1.0 (worst rating) to 5.0 (best rating) with the step size of 1.0. This dataset contains 139738 items that are rated by 40163 users with 99.90% sparsity. The value of mean rating per user is 10.39 with a maximum of 1023 ratings per user. The value of mean rating per item is 4.75 with a maximum of 2026 ratings per item.

The sparsity is calculated as follows: (15)

b) MovieLens-100K (ML-100K) dataset: The ML-100K dataset contains 943 users that rated different movies on a scale of 1.0 (worst rating) to 5.0 (best rating). The most rated value of this dataset is 4.0. This dataset includes 100000 user ratings over 1682 movies and each user rated at least 20 movies. This dataset is used by different state-of-the-art similarity measure methods for recommender system and its sparsity is 93.70%.

c) MovieLens-1M (ML-1M) dataset: The group lens research group collected and made publically available this dataset from the MovieLens website. On this web site, users can rate and review different movies. This dataset contains 6040 users, 3952 movies, and 1000209 user ratings. The ratings take values from 1.0 (worst rating) to 5.0 (best rating) with the step size of 1.0. The sparsity of this dataset is 95.80%. The value of mean rating per user is 15.63 with a maximum of 2314 ratings per user and value of mean rating per item is 269.80 with a maximum of 3428 ratings per user.

d)CiaoDVD dataset: The CiaoDVD dataset contains 72665 user ratings on the scale of 1.0 (worst rating) to 5.0 (best rating) with a step size of 1.0. This dataset contains 16121 items rated by 17615 users with 99.90% sparsity. The value of mean rating per user is 1.13 with a maximum of 1106 ratings per user. The value of mean rating per item is 4.48 with a maximum of 424 ratings per item.

The rating distribution for all four datasets is shown in Table 4. In the IPWR similarity measure method, RPB function comprises of user average rating and variance or SD, so the IPWR similarity measure method considers statistics of user average ratings, variance, and SD in intervals of 0.5. These statistics are shown in (Fig 1A–1E). It can be observed that maximum value of the variance/SD ratings occurs in the interval of 0.5–1.0 and a maximum value of average ratings of the users occur in the interval of 3.5–4.0 for the ML-1M dataset. For the Epinions dataset, the variance/SD ratings of a maximum number of users occur in the interval 0–0.5 while the maximum value of average ratings of the users occurs in the interval of 4.0–4.5. For CiaoDVD dataset, the variance/SD ratings of a maximum number of users also occur in the interval of 0–0.5 and maximum average ratings of the users occur in the interval of 4.5–5.0. Similarly, for ML-100K dataset, the variance/SD ratings of a maximum number of users occur in the intervals of 0.5–1.0/1.0–1.5, while the maximum value of average ratings of the users occurs in the interval of 3.5–4.0. According to the details shown in Fig 1(E) for the MovieTweetings dataset, it can be noted that the variance/SD of more than 50% of users occurs in the interval of 0–0.5. Furthermore, maximum intervals for user average values are found to be 8.0–8.5 and 9.5–10.0. Keeping these facts into view, the RPB function of Eq (9) and Eq (10) give better performance for the Epinions, CiaoDVD, and MovieTweetings datasets as compared to the ML-1M and ML-100K datasets. Its reason is that variance/SD ratings of a maximum number of users occur in the interval of 0–0.5. It is also obvious from the results of (Fig 1A–1E) that user averages and variances/SD ratings are not well distributed. The variance/SD occur in the interval of initial scale to median scale, while maximum average ratings of the users occur in the opposite side (i.e. median scale to maximum scale). This is the main reason behind the modeling of RPB function in terms of user average rating and variance/SD value.

Download:

Fig 1.

(a-e) Percentage of user average/variance/SD ratings for the reported datasets.

https://doi.org/10.1371/journal.pone.0220129.g001

Download:

Table 4. Rating distributions of the reported datasets.

https://doi.org/10.1371/journal.pone.0220129.t004

Performance evaluation metrics

The performance of the IPWR similarity measure method is evaluated using five-fold cross-validation due to its extensive usage in state-of-the-art similarity measure methods for recommender system and average results are reported. An alternate choice to five-fold cross-validation is Leave-one-out method which requires high computational complexity as compared to five-fold cross-validation method[45]. The performance of the IPWR similarity measure method is measured in terms of MAE, RMSE, precision, recall, and F-measure. The MAE calculates the average absolute deviation among the predicted ratings given by the recommender system and true ratings given by the user. The RMSE takes an average of squared error result by giving more weight to higher value errors and less weight to smaller value errors. The mathematical representation of the MAE and RMSE are as follows: (16) (17) where N denotes the total number of items for which the prediction process is performed. The main goal of any recommender system is to decrease the MAE and RMSE.

The precision and recall assess the specificity and sensitivity of a recommender system by measuring the frequency of items, respectively. The most suitable way to measure the precision and recall is to predict the top N items for known ratings. All the experimental results of the IPWR similarity measure method are reported by setting N = 5. The fundamental supposition is the division among relevant and irrelevant items in every user’s dataset. Precision and recall are empirically defined in Table 5 and are mathematically expressed in Eq (18) and Eq (19) as follows: (18) (19)

Download:

Table 5. Confusion matrix: Each row represents an actual class and each column represents predicted class.

https://doi.org/10.1371/journal.pone.0220129.t005

However, a tradeoff exists for precision and recall in the sense that if one value increases then other value decreases and vice versa. To overcome this tradeoff, the state-of-the-art similarity measure methods for recommender system also uses F-measure as a performance evaluation metric, which is mathematically defined using Eq (20) as follows: (20)

Experimental results and discussions

Three different cases are used to measure the performance of the IPWR similarity measure method. In the first case, the impact of the varying similarity threshold (denoted by θ_s) on the performance of the IPWR similarity measure method is analyzed. In the second case, the best weights of α and β are determined for each dataset using an adaptive weighting scheme. In the third case, the impact on the performance of the IPWR similarity measure method is analyzed by varying neighbor’s size and its performance comparisons are performed with state-of-the-art similarity measure methods. The experimental details about these three cases are given in the following subsections.

Methods used for comparison

The performance of the IPWR similarity measure method is compared with state-of-the-art similarity measure methods. These similarity measure methods include PCC, CPCC, WPCC, SPCC, Cosine, PIP, Singularity measure. and NHSM. The detail about these state-of-the-art similarity measure methods is as follows:

a) PCC similarity measure.

The value of PCC similarity measure method is calculated using Eq (1). The range of PCC value is from -1 to +1. The -1 corresponds to the worst similarity value and +1 corresponds to the best similarity value. For all similarity measure methods, a similarity threshold is also required to be imposed on produced similarity values. The value of the similarity threshold (denoted by θ_s) for PCC similarity measure is greater than zero. This implies that users having negative similarity are ignored.

b) CPCC similarity measure.

This similarity measure method is calculated using Eq (2). The CPCC similarity measure categorizes all rating values as positive or negative. A rating value is positive if it is above the median rating of the rating scale and negative if it is below the median rating of the rating scale. For all reported datasets, the value of the median rating is set to 3. Like PCC, CPCC result range is also from -1 to +1 and the similarity threshold (θ_s) is also set to greater than zero.

c) WPCC similarity measure.

This similarity measure method is calculated using Eq (3). This method gives more weight to users whose number of common rated items are greater than some threshold and its value is set to 50 [20]. The range of values for the WPCC similarity measure method is from -1 to +1.

d) SPCC similarity measure.

This is an exponential version of the standard PCC similarity measure method and it is calculated using Eq (4). Its possible values are from -1 to +1 and the similarity threshold (θ_s) is set to greater than zero.

e) COSINE similarity measure.

This similarity measure method is introduced by [12]. Its possible values are from 0 to 1, which indicate that all users with similarity greater than zero are selected for the prediction process.

f) PIP similarity measure.

This similarity measure method first computes a Boolean function followed by an agreement between two user ratings. After that, it calculates PIP factors based on whether the agreement is true or false. The value of the PIP similarity measure method is greater than zero and it can be any real number. The value of the PIP similarity measure method is calculated using Eq (5).

g) Singularity measure.

In this similarity measure, the user ratings are grouped as positive and negative, and the singularity value of user and item is computed. The prediction is generated based upon computed singularity value.

h) NHSM similarity measure.

This similarity measure method is calculated using Eq (6). This method considers both local and global preference of a user rating. Its value range is from 0 to 1.

Effect of the similarity threshold

To estimate the impact of the similarity threshold (θ_s), its values are varied from 0 to 1.0 with a step size of 0.1 at a fixed nearest neighbor size of 5. It is obvious from Fig 2(A) of the CiaoDVD dataset that no significant change is observed until θ_s = 0.9. The worst results for RMSE are produced when θ_s = 1.0 and suddenly jumps from 1.102 to 1.85 when corresponding MAE results remain unchanged. The precision value decreases from 0.794 to 0.779. Similarly, the value of recall at θ_s = 1.0 decreases to 0.533 from 0.582. The value of F-measure reaches to 0.634 from 0.672. Similarly, in Fig 2(B) of the ML-100K dataset, results of all evaluation parameters remain almost the same till θ_s = 0.9. At θ_s = 1.0, MAE value increases from 0.773 to 0.939, RMSE value increases from 0.995 to 1.326. The value of precision at θ_s = 1.0 decreases from 0.619 to 0.499, recall increases from 0.432 to 0.495, and value of F-measure decreases from 0.507 to 0.497. In Fig 2(C) of the ML-1M dataset, similar behavior of ML-100K dataset is observed. After θ_s = 0.9, MAE, and RMSE values are increases, while the value of the precision, recall, and F-measure is decreasing. In Fig 2(D) of the Epinions dataset, MAE values remain almost constant till θ_s = 0.7 and after that, it starts increasing. For RMSE, performance remains almost constant till θ_s = 0.7, and at θ_s = 0.8 and θ_s = 0.9 its increases, while at threshold = 1.0, performance decreases to 1.199. Which means that values of the precision, recall, and F-measure are also decreasing due to an increase in the threshold value. After observing these experimental details of the reported datasets, it can be concluded that values of the MAE and RMSE increases with increase in the value of the similarity threshold (θ_s) while values of the precision, recall, and F-measure are decreases.

Download:

Fig 2.

(a-d) Effect of similarity threshold on reported datasets.

https://doi.org/10.1371/journal.pone.0220129.g002

Determining best weights for α and β

In order to determine the best weights of α and β to achieve improved performance of the IPWR similarity measure method as compared with state-of-the-art similarity measure methods, its performance is evaluated by varying different weights of α and β from 0 to 1. The experimental details about the performance of the IPWR similarity measure method on different weights of α and β are given in Table 6 for all the reported datasets. In Table 6, the bold values indicate the best weights of α and β which gives the best performance of the IPWR similarity measure method in terms of the performance evaluation metrics for all the reported datasets. For ML-100K dataset, the best results are found when α = 0.5 and β = 0.5. Similarly, by setting α = 0.1 and β = 0.9, better results are gathered as compared to the case, when α = 0 and β = 1.0. This implies that RPB produces an important effect on the recommendation performance of the IPWR similarity measure method. For Epinions, CiaoDVD, and ML-1M datasets, the best performance of the IPWR similarity measure method is obtained by setting weights of α = 0.4 and β = 0.6. In these datasets, it is also obvious that performance is improved when weights of α and β are increased from 0.0, 1.0 to 0.1, and 0.9. Furthermore, the worst performance is obtained, when α = 1.0 and β = 0.0, which indicates that RPB alone is not able to yield good results. In the case of the MovieTweetings (10-star rating) dataset, the best performance of the IPWR similarity measure method is achieved by setting the weight of α = 0.4 and β = 0.6.

Download:

Table 6. Estimating the best weights of α and β for reported datasets (bold values in each row indicate the best performance, while bold values of α and β indicate best weights selected on the bases of lowest MAE and highest F-measure values).

https://doi.org/10.1371/journal.pone.0220129.t006

Effect of the number of neighbors and comparison of state-of-the-art similarity measure methods with IPWR similarity measure method

In this section, details about performance comparison of the IPWR similarity measure method with state-of-the-art similarity measure methods (i.e. Standard PCC, CPCC, WPCC, SPCC, COSINE, PIP, Singularity measure, and NHSM) is carried. The performance of the IPWR similarity measure method and its competitor methods are analyzed in terms of the performance evaluation metrics (i.e. MAE, RMSE, Precision, Recall, and F-measure) by varying a different number of neighbors whose details are shown in (Fig 3A–3E) to (Fig 7A–7E).

Download:

Fig 3.

(a-e) Performance comparison of state-of-the-art similarity measure methods with the IPWR similarity measure method in terms of MAE, RMSE, precision, recall, and F-measure on the Epinions dataset.

https://doi.org/10.1371/journal.pone.0220129.g003

Download:

Fig 4.

(a-e) Performance comparison of state-of-the-art similarity measure methods with the IPWR similarity measure method in terms of MAE, RMSE, precision, recall, and F-measure on the ML-100K dataset.

https://doi.org/10.1371/journal.pone.0220129.g004

Download:

Fig 5.

(a-e) The performance comparison of state-of-the-art similarity measure methods with the IPWR similarity measure method in terms of MAE, RMSE, precision, recall, and F-measure on the ML-1M dataset.

https://doi.org/10.1371/journal.pone.0220129.g005

Download:

Fig 6.

(a-e) The performance comparison of state-of-the-art similarity measure methods with the IPWR similarity measure method in terms of MAE, RMSE, precision, recall, and F-measure on the CiaoDVD dataset.

https://doi.org/10.1371/journal.pone.0220129.g006

Download:

Fig 7.

(a-e) The performance comparison of state-of-the-art similarity measure methods with the IPWR similarity measure method in terms of MAE, RMSE, precision, recall, and F-measure on the MovieTweetings dataset.

https://doi.org/10.1371/journal.pone.0220129.g007

Performance analysis on the Epinions (5-star rating) dataset.

The performance analysis of the IPWR similarity measure method with its competitor methods is presented in (Fig 3A–3E) for the Epinions dataset. (Fig 3A–3E) present performance analysis of the IPWR similarity measure method with its competitor methods in terms of the performance evaluation metrics (i.e. MAE, RMSE, Precision, Recall, and F-measure) versus the different number of neighbors. The performance analysis indicates that IPWR similarity measure method outperforms as compared to its competitor similarity measure methods in terms of the performance evaluation metrics. In order to verify the results of the IPWR similarity measure method among its competitor methods, statistical analysis is performed on the experimental results of the reported datasets using non-parametric Wilcoxon matched-pairs signed-rank test and paired t-test whose details are presented in Tables 7–10. The statistical analysis is performed by setting a standard value of the level of significance at 0.05 (95%) and results are analyzed in terms of the z-score, p-value, and t-score. For all the reported datasets, the value of p is less than the value of the significance level (i.e. α≤0.05), which proof the robust performance of the IPWR similarity measure method as compared to its competitor similarity measure methods. The p-value also indicate that the performance of the IPWR similarity measure method is strongly significant. It means that there is significant differences exist between IPWR similarity measure method and its competitor similarity measure methods. The negative sign of z-score also indicates the robustness of the IPWR similarity measure method as compared to its competitor similarity measure methods. Moreover, we have applied paired t-test to reassess the statistical significance of the experimental results of the IPWR similarity measure method and its competitor similarity measure methods. The value of the degrees of freedom (df) for the paired t-test is set to 11. The negative sign of t-score also indicates that the IPWR similarity measure method gives the best performance as compared to its competitor similarity measure methods. Furthermore, all the results of t-score are highly significant which shows that there is significant difference exist between IPWR similarity measure method and its competitor similarity measure methods.

Download:

Table 7. Statistical significance of the experimental results of the IPWR similarity measure method based on the MAE with state-of-the-art similarity measure methods on the Epinions dataset (* indicate the best performance).

https://doi.org/10.1371/journal.pone.0220129.t007

Download:

Table 8. Statistical significance of the experimental results of the IPWR similarity measure method based on the MAE with state-of-the-art similarity measure methods on the MovieLens-100K dataset (* indicate the best performance).

https://doi.org/10.1371/journal.pone.0220129.t008

Download:

Table 9. Statistical significance of the experimental results of the IPWR similarity measure method based on the MAE with state-of-the-art similarity measure methods on the MovieLens-1M dataset (* indicate the best performance).

https://doi.org/10.1371/journal.pone.0220129.t009

Download:

Table 10. Statistical significance of the experimental results of the IPWR similarity measure method based on the MAE with state-of-the-art similarity measure methods on the CiaoDVD dataset (* indicate the best performance).

https://doi.org/10.1371/journal.pone.0220129.t010

Performance analysis on the MovieLens-100K (ML-100K) (5-star rating) dataset.

The performance comparisons in terms of the performance evaluation metrics (i.e. MAE, RMSE, Precision, Recall, and F-measure) of the IPWR similarity measure method with its competitor similarity measure methods is presented in (Fig 4A–4E) for the ML-100K dataset. After analyzing the experimental details of (Fig 4A–4E), it can be concluded that IPWR similarity measure method outperforms as compared to its competitor similarity measure methods. Furthermore, performance analysis of the IPWR similarity measure method in terms of accuracy is also better than its competitor similarity measure methods because it considers the average rating of an item and an average rating of a user simultaneously. Similarly, the RPB of a user is ignored by its competitor similarity measure methods, while IPWR similarity measure also considers user RPB, which result in improved performance.

Performance analysis on the MovieLens-1M (ML-1M) (5-star rating) dataset.

(Fig 5A–5E) present performance analysis of the IPWR similarity measure method with its competitor similarity measure methods by a varying number of neighbors and analyzing performance in terms of the MAE, RMSE, Precision, Recall and F-measure on the ML-1M dataset. It is a large dataset with a sparsity of 95.80%. For this dataset, the IPWR similarity measure method also performs better than its competitor similarity measure methods except for the NHSM similarity measure method. The reason for the better performance of the NHSM similarity measure method is its proximity, significance, and singularity (PSS) factors, which are calculated for each common rating individually. However, the results of the NHSM similarity measure method are very close to IPWR similarity measure method. In the case of RMSE, the performance of the IPWR similarity measure method is better as compared to the NHSM similarity measure method.

Performance analysis on the CiaoDVD (5-star rating) dataset.

(Fig 6A–6E) present performance analysis for a different number of neighbors in terms of performance evaluation metrics for the CiaoDVD dataset. In this dataset, mean ratings per user are 1.13 and mean ratings per item are 4.48. The IPWR similarity measure method also performs better on this dataset because of the consideration of the user RPB and improved PCC. It is also observed that the increase in the number of neighbors does not affect the performance of the IPWR similarity measure method. This implies that a small number of neighbors give the same results as a large number of neighbors, which also reduces the computational cost of the IPWR similarity measure method as compared to its competitor similarity measure methods.

Performance analysis on the MovieTweetings (10-star rating) dataset.

The MovieTweetings dataset is also publicly available dataset which is crawled from twitter. This dataset consists of movie ratings in the range of [1–10]. In this rating scale, 1 indicates the worst rating and 10 indicates the best rating of a movie given by a user. This dataset contains a total of 759746 user ratings given by 56304 users using a total of 32810 movies. The detail of the user rating scales and user rating distribution for the MovieTweetings dataset is presented in Table 11. The sparsity of MovieTweetings dataset is 99.90%. The best performance of the IPWR similarity measure method is achieved by setting the weight of α = 0.4 and β = 0.6 whose experimental details are presented in Table 6 for the MovieTweetings dataset. The performance comparisons in terms of the performance evaluation metrics (i.e. MAE, RMSE, Precision, Recall, and F-measure) of the IPWR similarity measure method with its competitor similarity measure methods is presented in (Fig 7A–7E) for the MovieTweetings dataset. After analyzing the experimental details of (Fig 7A–7E), it can be concluded that IPWR similarity measure method outperforms in terms of evaluation metrics (i.e. MAE, RMSE, and Precision) as compared to its competitor similarity measure methods because it considers the average rating of an item and an average rating of a user simultaneously. Similarly, the RPB of a user is ignored by its competitor similarity measure methods, while IPWR similarity measure method also considers user RPB, which result in improved performance.

Download:

Table 11. User rating scale and rating distribution for the MovieTweetings dataset.

https://doi.org/10.1371/journal.pone.0220129.t011

The statistical details of the IPWR similarity measure and its competitor similarity measure methods for the MovieTweetings dataset are presented in Table 12. The statistical analysis is performed using a non-parametric Wilcoxon matched-pairs signed-rank test and paired t-test to investigate and provide statistical evidence regarding the robust performance of the IPWR similarity measure method as compared to its competitor similarity measure methods. The value of the degrees of freedom (df) for the paired t-test is set to 11 for the MovieTweetings dataset. According to the statistical results presented in Table 12, the negative sign of z-score and t-score shows that IPWR similarity measure method shows the best performance as compared to its competitor methods. Furthermore, z-score results of the IPWR similarity measure are highly significant in all cases as its p-values are less than the level of significance at 0.05 (95%) however in case of t-score, all the results are significant except of NHSM and IPWR with SD similarity measure methods which give insignificant results due to least significant difference exist between these two methods.

Download:

Table 12. Statistical significance of the experimental results of the IPWR similarity measure method based on the MAE with state-of-the-art similarity measure methods on the MovieTweetings dataset (_* indicate the best performance).

https://doi.org/10.1371/journal.pone.0220129.t012

Conclusion and future work

In this article, we identify and analyze some limitations of the state-of-the-art similarity measure methods, especially the PCC similarity measure method. These similarity measures are used by collaborative filtering based methods to find similar users’ and items’ profiles. User RPB is one of the most important aspects, which is ignored by traditional similarity measurement methods. Typically, different users have different RPB and based upon this behavior, they tend to rate items with values that have not many variations. In this article, we have proposed an improved similarity measure method that uses the user’s RPB pattern to find similar users. The RPB pattern is modeled as a function of user rating averages and user variance or standard deviation. The proposed IPWR similarity measure method overcomes some inherent shortcomings of a standard PCC similarity measure and it also considers the RPB pattern of users to achieve better performance. The extensive experiments are performed to check the effectiveness of the IPWR similarity measure method. The performance of the IPWR similarity measure method is compared against state-of-the-art similarity measure methods using four publically available datasets. The results show that the IPWR similarity measure performs better than conventional and state-of-the-art similarity measure methods like NHSM and PIP. It is also observed from experimental results that IPWR similarity measure method performs better on sparse datasets (i.e. Epinions, CiaoDVD, and MovieTweetings datasets) than dense datasets (i.e. ML-100K and ML-1M datasets). In future work, we intend to learn weights of α and β using various machine learning methods such as support vector machine (SVM), particle swarm intelligence (PSO), and artificial neural networks (ANN). Although the IPWR similarity measure method considers both local and global information of user ratings in terms of user RPB, one important information, which is the actual rating value of non-co-rated items are ignored. In the future, we will also try to incorporate this information using the IPWR similarity measure method. Furthermore, a friendship network of a user can also be used as an additional information source in the extremely cold start or sparse conditions.

References

1. Boratto L., Carta S., and Fenu G., Investigating the role of the rating prediction task in granularity-based group recommender systems and big data scenarios. Information Sciences, 2017. 378: p. 424–443.
- View Article
- Google Scholar
2. Prugel-Bennett, M.A.G.a.A., A Scalable, Accurate Hybrid Recommender System. IEEE Third International Conference on Knowledge Discovery and Data Mining, 2010: p. 94–98.
3. Li Y.-M., Wu C.-T., and Lai C.-Y., A social recommender mechanism for e-commerce: Combining similarity, trust, and relationship. Decision Support Systems, 2013. 55(3): p. 740–752.
- View Article
- Google Scholar
4. https://www.amazon.com/, Accessed on 14th May 2018.
5. Shumeet B., Rohan S., Sivakumar D., Jing Y., Yagnik J., Kumar S., et al., Video suggestion and discovery for YouTube: taking random walks through the view graph. International Conference on World Wide Web, 2008: p. 895–904.
6. https://news.google.com/, Accessed on 14th May 2018.
7. Ghazanfar, M.A. and Prugel-Bennett, A. A scalable, accurate hybrid recommender system. in 2010 Third International Conference on Knowledge Discovery and Data Mining. 2010. IEEE.
8. Ayub, M., Ghazanfar, M.A., Maqsood, M., and Saleem, A. A Jaccard base similarity measure to improve performance of CF based recommender systems. in 32nd International Conference on Information Networking (ICOIN). 2018. IEEE.
9. Zahra S., Ghazanfar M.A., Khalid A., Azam M.A., Naeem U., and Prugel-Bennett A., Novel centroid selection approaches for KMeans-clustering based recommender systems. Information sciences, 2015. 320: p. 156–189.
- View Article
- Google Scholar
10. Xiong, L., Chen, X., Huang, T.-K., Schneider, J., and Carbonell, J.G. Temporal collaborative filtering with bayesian probabilistic tensor factorization. in Proceedings of the 2010 SIAM international conference on data mining. 2010. SIAM.
11. Ghazanfar, M.A. and Prϋgel-Bennett, A. Exploiting context in kernel-mapping recommender system algorithms. in Sixth International Conference on Machine Vision (ICMV 2013). 2013. International Society for Optics and Photonics.
12. Adomavicius G. and Tuzhilin A., Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge & Data Engineering, 2005(6): p. 734–749.
- View Article
- Google Scholar
13. Bobadilla J., Ortega F., Hernando A., and Gutiérrez A., Recommender systems survey. Knowledge-based systems, 2013. 46: p. 109–132.
- View Article
- Google Scholar
14. Ricci F., Rokach L., and Shapira B., Introduction to recommender systems handbook. 2011: Springer.
15. Mehmood Z., Anwar S.M., Ali N., Habib H.A., and Rashid M., A novel image retrieval based on a combination of local and global histograms of visual words. Mathematical Problems in Engineering, 2016. 2016.
- View Article
- Google Scholar
16. Mehmood Z., Anwar S.M., and Altaf M., A novel image retrieval based on rectangular spatial histograms of visual words. Kuwait Journal of Science, 2018. 45(1).
- View Article
- Google Scholar
17. Mehmood Z., Abbas F., Mahmood T., Javid M.A., Rehman A., and Nawaz T., Content-based image retrieval based on visual words fusion versus features fusion of local and global features. Arabian Journal for Science and Engineering, 2018. 43(12): p. 7265–7284.
- View Article
- Google Scholar
18. Alahmadi, D.H. and Zeng, X.-J. Twitter-based recommender system to address cold-start: A genetic algorithm based trust modelling and probabilistic sentiment analysis. in 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI). 2015. IEEE.
19. Lee W.-P. and Ma C.-Y., Enhancing collaborative recommendation performance by combining user preference and trust-distrust propagation in social networks. Knowledge-Based Systems, 2016. 106: p. 125–134.
- View Article
- Google Scholar
20. Ahn H.J., A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem. Information Sciences, 2008. 178(1): p. 37–51.
- View Article
- Google Scholar
21. Guo, G., Integrating trust and similarity to ameliorate the data sparsity and cold start for recommender systems. Proceedings of ACM Conference on Recommender Systems, 2013: p. 451–454.
22. Safoury L. and Salah A., Exploiting user demographic attributes for solving cold-start problem in recommender system. Lecture Notes on Software Engineering, 2013. 1(3): p. 303–307.
- View Article
- Google Scholar
23. Guo G., Jie Zhang, and Daniel Thalmann., Merging trust in collaborative filtering to alleviate data sparsity and cold start. Knowledge-Based Systems, 2014. 57: p. 57–68.
- View Article
- Google Scholar
24. Al-Oufi S., Kim H.-N., and El Saddik A., A group trust metric for identifying people of trust in online social networks. Expert Systems with Applications, 2012. 39(18): p. 13173–13181.
- View Article
- Google Scholar
25. Golbeck, J. and Hendler, J. Filmtrust: Movie recommendations using trust in web-based social networks. in Proceedings of the IEEE Consumer communications and networking conference. 2006. Citeseer.
26. Liu, B.Y.Y.L.D. and Liu, J. Social Collaborative Filtering by Trust. in Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, IEEE. Citeseer.
27. Bobadilla J., F.O., Hernando A., Bernal J., A collaborative filtering approach to mitigate the new user cold start problem. Knowledge-Based Systems, 2011. 26: p. 225–238.
- View Article
- Google Scholar
28. Bobadilla J., F.O., Hernando A., Bernal J., A collaborative filtering similarity measure based on singularities, Inform. Process. Manage. 2012. 48: p. 204–217.
- View Article
- Google Scholar
29. Jabeen S., Mehmood Z., Mahmood T., Saba T., Rehman A., and Mahmood M.T., An effective content-based image retrieval technique for image visuals representation based on the bag-of-visual-words model. PloS one, 2018. 13(4): p. e0194526. pmid:29694429
- View Article
- PubMed/NCBI
- Google Scholar
30. Sarwar A., Mehmood Z., Saba T., Qazi K.A., Adnan A., and Jamal H., A novel method for content-based image retrieval to improve the effectiveness of the bag-of-words model using a support vector machine. Journal of Information Science, 2019. 45(1): p. 117–135.
- View Article
- Google Scholar
31. Mehmood Z., Rashid M., Rehman A., Saba T., Dawood H., and Dawood H., Effect of complementary visual words versus complementary features on clustering for effective content-based image search. Journal of Intelligent & Fuzzy Systems, 2018(Preprint): p. 1–14.
- View Article
- Google Scholar
32. Qazi K.A., Nawaz T., Mehmood Z., Rashid M., and Habib H.A., A hybrid technique for speech segregation and classification using a sophisticated deep neural network. PloS one, 2018. 13(3): p. e0194151. pmid:29558485
- View Article
- PubMed/NCBI
- Google Scholar
33. Liu H., Hu Z., Mian A., Tian H., and Zhu X., A new user similarity model to improve the accuracy of collaborative filtering. Knowledge-Based Systems, 2014. 56: p. 156–166.
- View Article
- Google Scholar
34. Jamali, M. and Ester, M. Using a trust network to improve top-N recommendation. in Proceedings of the third ACM conference on Recommender systems. 2009. ACM.
35. Bobadilla J., Ortega F., and Hernando A., A collaborative filtering similarity measure based on singularities. Information Processing & Management, 2012. 48(2): p. 204–217.
- View Article
- Google Scholar
36. Bobadilla J., Hernando A., Ortega F., and Gutiérrez A., Collaborative filtering based on significances. Information Sciences, 2012. 185(1): p. 1–17.
- View Article
- Google Scholar
37. Ma H., King I., and Lyu M.R., Learning to recommend with explicit and implicit social relations. ACM Transactions on Intelligent Systems and Technology (TIST), 2011. 2(3): p. 29.
- View Article
- Google Scholar
38. Gong, S., Ye, H., and Dai, Y. Combining singular value decomposition and item-based recommender in collaborative filtering. in 2009 Second International Workshop on Knowledge Discovery and Data Mining. 2009. IEEE.
39. Szwabe, A., Ciesielczyk, M., and au>Janasiewicz, T. Semantically enhanced collaborative filtering based on RSVD. in International Conference on Computational Collective Intelligence. 2011. Springer.
40. Polatidis N. and Georgiadis C.K., A multi-level collaborative filtering method that improves recommendations. Expert Systems with Applications, 2016. 48: p. 100–110.
- View Article
- Google Scholar
41. Patra B.K., Launonen R., Ollikainen V., and Nandi S., A new similarity measure using Bhattacharyya coefficient for collaborative filtering in sparse data. Knowledge-Based Systems, 2015. 82: p. 163–177.
- View Article
- Google Scholar
42. Sun S.-B., Zhang Z.-H., Dong X.-L., Zhang H.-R., Li T.-J., Zhang L., et al., Integrating Triangle and Jaccard similarities for recommendation. PloS one, 2017. 12(8): p. e0183570. pmid:28817692
- View Article
- PubMed/NCBI
- Google Scholar
43. Saranya K. and Sadasivam G.S., Modified heuristic similarity measure for personalization using collaborative filtering technique. Applied Mathematics and Information Sciences, 2017. 11(1): p. 317–25.
- View Article
- Google Scholar
44. Ayub, M., Ghazanfar, M.A., Maqsood, M., and Saleem, A. A Jaccard base similarity measure to improve performance of CF based recommender systems. in 2018 International Conference on Information Networking (ICOIN). 2018. IEEE.
45. Tan Z. and He L., An efficient similarity measure for user-based collaborative filtering recommender systems inspired by the physical resonance principle. IEEE Access, 2017. 5: p. 27211–27228.
- View Article
- Google Scholar
46. http://www.trustlet.org/downloaded_epinions.html.
47. http://www.movielens.umn.edu.
48. Harper F.M. and Konstan J.A., The movielens datasets: History and context. Acm transactions on interactive intelligent systems (tiis), 2016. 5(4): p. 19.
- View Article
- Google Scholar
49. Guo, G., Zhang, J., Thalmann, D., and Yorke-Smith, N. Etaf: An extended trust antecedents framework for trust prediction. in Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 2014. IEEE Press.
50. https://github.com/sidooms/MovieTweetings/tree/master/latest.

[ref1] 1. Boratto L., Carta S., and Fenu G., Investigating the role of the rating prediction task in granularity-based group recommender systems and big data scenarios. Information Sciences, 2017. 378: p. 424–443.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Prugel-Bennett, M.A.G.a.A., A Scalable, Accurate Hybrid Recommender System. IEEE Third International Conference on Knowledge Discovery and Data Mining, 2010: p. 94–98.

[ref3] 3. Li Y.-M., Wu C.-T., and Lai C.-Y., A social recommender mechanism for e-commerce: Combining similarity, trust, and relationship. Decision Support Systems, 2013. 55(3): p. 740–752.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. https://www.amazon.com/, Accessed on 14th May 2018.

[ref5] 5. Shumeet B., Rohan S., Sivakumar D., Jing Y., Yagnik J., Kumar S., et al., Video suggestion and discovery for YouTube: taking random walks through the view graph. International Conference on World Wide Web, 2008: p. 895–904.

[ref6] 6. https://news.google.com/, Accessed on 14th May 2018.

[ref7] 7. Ghazanfar, M.A. and Prugel-Bennett, A. A scalable, accurate hybrid recommender system. in 2010 Third International Conference on Knowledge Discovery and Data Mining. 2010. IEEE.

[ref8] 8. Ayub, M., Ghazanfar, M.A., Maqsood, M., and Saleem, A. A Jaccard base similarity measure to improve performance of CF based recommender systems. in 32nd International Conference on Information Networking (ICOIN). 2018. IEEE.

[ref9] 9. Zahra S., Ghazanfar M.A., Khalid A., Azam M.A., Naeem U., and Prugel-Bennett A., Novel centroid selection approaches for KMeans-clustering based recommender systems. Information sciences, 2015. 320: p. 156–189.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref10] 10. Xiong, L., Chen, X., Huang, T.-K., Schneider, J., and Carbonell, J.G. Temporal collaborative filtering with bayesian probabilistic tensor factorization. in Proceedings of the 2010 SIAM international conference on data mining. 2010. SIAM.

[ref11] 11. Ghazanfar, M.A. and Prϋgel-Bennett, A. Exploiting context in kernel-mapping recommender system algorithms. in Sixth International Conference on Machine Vision (ICMV 2013). 2013. International Society for Optics and Photonics.

[ref12] 12. Adomavicius G. and Tuzhilin A., Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge & Data Engineering, 2005(6): p. 734–749.
View Article
Google Scholar

[19] View Article

[20] Google Scholar

[ref13] 13. Bobadilla J., Ortega F., Hernando A., and Gutiérrez A., Recommender systems survey. Knowledge-based systems, 2013. 46: p. 109–132.
View Article
Google Scholar

[22] View Article

[23] Google Scholar

[ref14] 14. Ricci F., Rokach L., and Shapira B., Introduction to recommender systems handbook. 2011: Springer.

[ref15] 15. Mehmood Z., Anwar S.M., Ali N., Habib H.A., and Rashid M., A novel image retrieval based on a combination of local and global histograms of visual words. Mathematical Problems in Engineering, 2016. 2016.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref16] 16. Mehmood Z., Anwar S.M., and Altaf M., A novel image retrieval based on rectangular spatial histograms of visual words. Kuwait Journal of Science, 2018. 45(1).
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref17] 17. Mehmood Z., Abbas F., Mahmood T., Javid M.A., Rehman A., and Nawaz T., Content-based image retrieval based on visual words fusion versus features fusion of local and global features. Arabian Journal for Science and Engineering, 2018. 43(12): p. 7265–7284.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref18] 18. Alahmadi, D.H. and Zeng, X.-J. Twitter-based recommender system to address cold-start: A genetic algorithm based trust modelling and probabilistic sentiment analysis. in 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI). 2015. IEEE.

[ref19] 19. Lee W.-P. and Ma C.-Y., Enhancing collaborative recommendation performance by combining user preference and trust-distrust propagation in social networks. Knowledge-Based Systems, 2016. 106: p. 125–134.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref20] 20. Ahn H.J., A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem. Information Sciences, 2008. 178(1): p. 37–51.
View Article
Google Scholar

[39] View Article

[40] Google Scholar

[ref21] 21. Guo, G., Integrating trust and similarity to ameliorate the data sparsity and cold start for recommender systems. Proceedings of ACM Conference on Recommender Systems, 2013: p. 451–454.

[ref22] 22. Safoury L. and Salah A., Exploiting user demographic attributes for solving cold-start problem in recommender system. Lecture Notes on Software Engineering, 2013. 1(3): p. 303–307.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref23] 23. Guo G., Jie Zhang, and Daniel Thalmann., Merging trust in collaborative filtering to alleviate data sparsity and cold start. Knowledge-Based Systems, 2014. 57: p. 57–68.
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref24] 24. Al-Oufi S., Kim H.-N., and El Saddik A., A group trust metric for identifying people of trust in online social networks. Expert Systems with Applications, 2012. 39(18): p. 13173–13181.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref25] 25. Golbeck, J. and Hendler, J. Filmtrust: Movie recommendations using trust in web-based social networks. in Proceedings of the IEEE Consumer communications and networking conference. 2006. Citeseer.

[ref26] 26. Liu, B.Y.Y.L.D. and Liu, J. Social Collaborative Filtering by Trust. in Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, IEEE. Citeseer.

[ref27] 27. Bobadilla J., F.O., Hernando A., Bernal J., A collaborative filtering approach to mitigate the new user cold start problem. Knowledge-Based Systems, 2011. 26: p. 225–238.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref28] 28. Bobadilla J., F.O., Hernando A., Bernal J., A collaborative filtering similarity measure based on singularities, Inform. Process. Manage. 2012. 48: p. 204–217.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref29] 29. Jabeen S., Mehmood Z., Mahmood T., Saba T., Rehman A., and Mahmood M.T., An effective content-based image retrieval technique for image visuals representation based on the bag-of-visual-words model. PloS one, 2018. 13(4): p. e0194526. pmid:29694429
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref30] 30. Sarwar A., Mehmood Z., Saba T., Qazi K.A., Adnan A., and Jamal H., A novel method for content-based image retrieval to improve the effectiveness of the bag-of-words model using a support vector machine. Journal of Information Science, 2019. 45(1): p. 117–135.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref31] 31. Mehmood Z., Rashid M., Rehman A., Saba T., Dawood H., and Dawood H., Effect of complementary visual words versus complementary features on clustering for effective content-based image search. Journal of Intelligent & Fuzzy Systems, 2018(Preprint): p. 1–14.
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref32] 32. Qazi K.A., Nawaz T., Mehmood Z., Rashid M., and Habib H.A., A hybrid technique for speech segregation and classification using a sophisticated deep neural network. PloS one, 2018. 13(3): p. e0194151. pmid:29558485
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref33] 33. Liu H., Hu Z., Mian A., Tian H., and Zhu X., A new user similarity model to improve the accuracy of collaborative filtering. Knowledge-Based Systems, 2014. 56: p. 156–166.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref34] 34. Jamali, M. and Ester, M. Using a trust network to improve top-N recommendation. in Proceedings of the third ACM conference on Recommender systems. 2009. ACM.

[ref35] 35. Bobadilla J., Ortega F., and Hernando A., A collaborative filtering similarity measure based on singularities. Information Processing & Management, 2012. 48(2): p. 204–217.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref36] 36. Bobadilla J., Hernando A., Ortega F., and Gutiérrez A., Collaborative filtering based on significances. Information Sciences, 2012. 185(1): p. 1–17.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref37] 37. Ma H., King I., and Lyu M.R., Learning to recommend with explicit and implicit social relations. ACM Transactions on Intelligent Systems and Technology (TIST), 2011. 2(3): p. 29.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref38] 38. Gong, S., Ye, H., and Dai, Y. Combining singular value decomposition and item-based recommender in collaborative filtering. in 2009 Second International Workshop on Knowledge Discovery and Data Mining. 2009. IEEE.

[ref39] 39. Szwabe, A., Ciesielczyk, M., and au>Janasiewicz, T. Semantically enhanced collaborative filtering based on RSVD. in International Conference on Computational Collective Intelligence. 2011. Springer.

[ref40] 40. Polatidis N. and Georgiadis C.K., A multi-level collaborative filtering method that improves recommendations. Expert Systems with Applications, 2016. 48: p. 100–110.
View Article
Google Scholar

[89] View Article

[90] Google Scholar

[ref41] 41. Patra B.K., Launonen R., Ollikainen V., and Nandi S., A new similarity measure using Bhattacharyya coefficient for collaborative filtering in sparse data. Knowledge-Based Systems, 2015. 82: p. 163–177.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref42] 42. Sun S.-B., Zhang Z.-H., Dong X.-L., Zhang H.-R., Li T.-J., Zhang L., et al., Integrating Triangle and Jaccard similarities for recommendation. PloS one, 2017. 12(8): p. e0183570. pmid:28817692
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar

[ref43] 43. Saranya K. and Sadasivam G.S., Modified heuristic similarity measure for personalization using collaborative filtering technique. Applied Mathematics and Information Sciences, 2017. 11(1): p. 317–25.
View Article
Google Scholar

[99] View Article

[100] Google Scholar

[ref44] 44. Ayub, M., Ghazanfar, M.A., Maqsood, M., and Saleem, A. A Jaccard base similarity measure to improve performance of CF based recommender systems. in 2018 International Conference on Information Networking (ICOIN). 2018. IEEE.

[ref45] 45. Tan Z. and He L., An efficient similarity measure for user-based collaborative filtering recommender systems inspired by the physical resonance principle. IEEE Access, 2017. 5: p. 27211–27228.
View Article
Google Scholar

[103] View Article

[104] Google Scholar

[ref46] 46. http://www.trustlet.org/downloaded_epinions.html.

[ref47] 47. http://www.movielens.umn.edu.

[ref48] 48. Harper F.M. and Konstan J.A., The movielens datasets: History and context. Acm transactions on interactive intelligent systems (tiis), 2016. 5(4): p. 19.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref49] 49. Guo, G., Zhang, J., Thalmann, D., and Yorke-Smith, N. Etaf: An extended trust antecedents framework for trust prediction. in Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 2014. IEEE Press.

[ref50] 50. https://github.com/sidooms/MovieTweetings/tree/master/latest.

Figures

Abstract

Introduction

Related work

Shortcomings of standard PCC

Shortcoming 1: Flat value of ratings.

Shortcoming 2: Only single co-rated item.

Shortcoming 3: Ignorance of user rating preference behavior (RPB).

Shortcoming 4: Ignorance of corresponding item average rating in case of user-based CF.

The IPWR similarity measure method

Experimental setup and performance evaluation metrics

Datasets

Performance evaluation metrics

Experimental results and discussions

Methods used for comparison

a) PCC similarity measure.

b) CPCC similarity measure.

c) WPCC similarity measure.

d) SPCC similarity measure.

e) COSINE similarity measure.

f) PIP similarity measure.

g) Singularity measure.

h) NHSM similarity measure.

Effect of the similarity threshold

Determining best weights for α and β

Effect of the number of neighbors and comparison of state-of-the-art similarity measure methods with IPWR similarity measure method

Performance analysis on the Epinions (5-star rating) dataset.

Performance analysis on the MovieLens-100K (ML-100K) (5-star rating) dataset.

Performance analysis on the MovieLens-1M (ML-1M) (5-star rating) dataset.

Performance analysis on the CiaoDVD (5-star rating) dataset.

Performance analysis on the MovieTweetings (10-star rating) dataset.

Conclusion and future work

References