杨长春, 徐小松, 叶施仁, 周猛. 基于文本相似度的微博网络水军发现算法[J]. 微电子学与计算机, 2014, 31(3): 82-85.
引用本文: 杨长春, 徐小松, 叶施仁, 周猛. 基于文本相似度的微博网络水军发现算法[J]. 微电子学与计算机, 2014, 31(3): 82-85.
YANG Zhang-chun, XU Xiao-song, YE Shi-ren, ZHOU Meng. A Method to Find Water Armies in Weibo Based on Text Similarity[J]. Microelectronics & Computer, 2014, 31(3): 82-85.
Citation: YANG Zhang-chun, XU Xiao-song, YE Shi-ren, ZHOU Meng. A Method to Find Water Armies in Weibo Based on Text Similarity[J]. Microelectronics & Computer, 2014, 31(3): 82-85.

基于文本相似度的微博网络水军发现算法

A Method to Find Water Armies in Weibo Based on Text Similarity

  • 摘要: 微博中水军发表的评论内容具有重复或者相似性,提出了基于文本相似度的微博网络水军发现算法.评论内容可以用特征码来表示.特征码再通过高效的B-Tree来索引,使整个系统具有极高的处理效率.根据水军发帖的重复性或者相似性很高的特点,通过对多个相同或相似的评论内容进行统计分析找出出现次数频繁的用户,初步定义为水军.再对这些用户的评论内容进行分析,发现他们的评论内容基本上都是具有重复性.试验表明,该方法能够准确、有效地找出水军账户.

     

    Abstract: The comments issued by the Water Army were repeatability and similarity.A method to find Water armies in weibo based on Text similarity was proposed.Comments are represented by signatures.By indexing the signatures on efficient B-Tree,the whole system had extreme processing efficiency.According to the repeatability and similarity characteristics of the comments issued by the Water Armies,the users which defined as Water Armies had a high number of occurrences when analyzed on the same or similar comments.And then analyzed the comments published by those users,found that those comments were almost similar.Experiments showed that this method could find the Water Armies accurately and effectively.

     

/

返回文章
返回