项武. 基于评分区间相似性的协同过滤推荐算法[J]. 微电子学与计算机, 2010, 27(7): 125-128,132.
引用本文: 项武. 基于评分区间相似性的协同过滤推荐算法[J]. 微电子学与计算机, 2010, 27(7): 125-128,132.
XIANG Wu. A Collaborative Filtering Algorithm Based on Similarity of Rating Interval[J]. Microelectronics & Computer, 2010, 27(7): 125-128,132.
Citation: XIANG Wu. A Collaborative Filtering Algorithm Based on Similarity of Rating Interval[J]. Microelectronics & Computer, 2010, 27(7): 125-128,132.

基于评分区间相似性的协同过滤推荐算法

A Collaborative Filtering Algorithm Based on Similarity of Rating Interval

  • 摘要: 研究协同过滤推荐系统处理大规模稀疏评价数据的精度问题,针对目前余弦相似性处理极稀疏的用户评价矩阵不能获取满意推荐质量,且处理大规模推荐数据时存在性能缺陷的情况,提出基于评分区间相似性的协同过滤推荐算法.将用户评价的数值范围进行定性划分,以项目在各评价区间上的得分建模项目质量,以用户在各评价区间内的评分建模用户评分取向.使用余弦相似性度量方法衡量用户和项目间的相似程度,进而确定最近邻居.此方法的实现过程降低了稀疏性对最近邻发现的影响,且算法具有快速处理大规模评价数据的能力.使用标准的MovieLens数据集,测试提出的推荐算法,实验结果表明,本算法相比传统的最近邻推荐算法具有更好的推荐精度.

     

    Abstract: This work focused on recommended accuracy of the collaborative filtering system to process large-scale sparse data. The cosine similarity measurement and correlation similarity measurement could not handle the sparse rating matrix very well. And these methods undermined the recommended system performance when dealing with large-scale data. This paper provided a new approach based on the similarity of rating interval. The user rating range was divided into several qualitative intervals. According to the rating ratios, user preferences and item qualities were represented by vectors. Finally, this work used the cosine similarity method to determine the nearest neighbors. This approach reduces the impact of the sparse matrix and has the ability to deal with massive data. On MovieLens data set, experiment result shows that the algorithm has reliable and accurate performance.

     

/

返回文章
返回