马宾. 一种改进的并行K_近邻网络舆情分类算法研究[J]. 微电子学与计算机, 2015, 32(6): 62-66,72. DOI: 10.19304/j.cnki.issn1000-7180.2015.06.014
引用本文: 马宾. 一种改进的并行K_近邻网络舆情分类算法研究[J]. 微电子学与计算机, 2015, 32(6): 62-66,72. DOI: 10.19304/j.cnki.issn1000-7180.2015.06.014
MA Bin. Study of an Improved K_Nearest Neighbor Algorithm for Network Public Opinion Classification[J]. Microelectronics & Computer, 2015, 32(6): 62-66,72. DOI: 10.19304/j.cnki.issn1000-7180.2015.06.014
Citation: MA Bin. Study of an Improved K_Nearest Neighbor Algorithm for Network Public Opinion Classification[J]. Microelectronics & Computer, 2015, 32(6): 62-66,72. DOI: 10.19304/j.cnki.issn1000-7180.2015.06.014

一种改进的并行K_近邻网络舆情分类算法研究

Study of an Improved K_Nearest Neighbor Algorithm for Network Public Opinion Classification

  • 摘要: 针对网络舆情信息数据量大、内容分散、数据结构复杂等特点,研究一种基于Hadoop平台的并行K_近邻分类算法实现网络舆情信息分类.利用Hadoop平台分布式存储与数据并行处理特性,设计基于MapReduce封装的并行K_近邻网络舆情分类算法.对改进的K_近邻网络舆情分类算法的分类能力与分类效率进行实验验证,并应用于网络舆情数据分类测试.结果表明,基于Hadoop平台的并行K_近邻网络舆情分类算法能够有效提升网络舆情文档分类效果与分类效率,可以实现网络舆情快速、正确的分类处理.

     

    Abstract: According to the characters of network public opinion information, which are large-scale data, dispersed content and complex large amount, content dispersed and structure, a parallel K_nearest neighbors(KNN) classification algorithm based on Hadoop platform for network public opinion information classification is studied. In the light of Hadoop platform distributed storage and data parallel processing features, a parallel KNN network public opinion classification algorithm based on MapReduce package is designed. The classification ability and efficiency of the improved KNN network public opinion classification algorithm are experimental verified, and the algorithm is applied to network public opinion data classification tests. The results show that the parallel KNN classification algorithm based Hadoop platform can effectively improve the classification effect and efficiency of network public opinion documents,achieving network public opinion fast、correct classification.

     

/

返回文章
返回