Vocabulary Correlation Text Clustering Based on HowNet
-
Abstract
The vocabulary correlation method based on HowNet is given out in this paper.This method removes isolated points through statistical z-score. The initial clustering centers are selected according to the sparse division of docunents. The relevance and semantic similarity of words are studied,which improve the precesion of docunent clustersing and reduce the time consumption.
-
-