Combining Structure and Content Similaritiesmeasure for XML Document
-
Abstract
This paper proposed a document similarity calculation method considering the XML document content and structure information in this paper. Different methods was used to calculate the document content similarity and structural information, and different emphasis was laied on them.Then the comprehensive similarity of the document can be attained. Experimental results on real data sets show that the method integrated structure and content information can improve the accuracy of calculation of XML documents similarity.
-
-