Citation: YU Yan-fang. A Novel Web Text Mining System Based on Improved Genetic Algorithm[J]. Microelectronics & Computer, 2010, 27(4): 103-105,110.

A Novel Web Text Mining System Based on Improved Genetic Algorithm

Abstract: Text classification is an important technique in text mining and has been widely applied in information management, search engines, recommendation systems, and other fields. Most existing text classification methods are difficult to apply to large-scale text datasets. To address this, a Web text mining system based on an improved genetic algorithm is proposed. The improved genetic algorithm substantially increases the classification efficiency of the text mining system. Experimental results show that the method is suitable for large-scale text datasets, and that the rules it extracts achieve high classification accuracy and fast classification speed.
