姚卓, 蔡皖东, 姚烨. Web网站死链检测方法[J]. 微电子学与计算机, 2012, 29(12): 103-107,111.
引用本文: 姚卓, 蔡皖东, 姚烨. Web网站死链检测方法[J]. 微电子学与计算机, 2012, 29(12): 103-107,111.
YAO Zhuo, CAI Wan-dong, YAO Ye. Website Dead Links Detection Method[J]. Microelectronics & Computer, 2012, 29(12): 103-107,111.
Citation: YAO Zhuo, CAI Wan-dong, YAO Ye. Website Dead Links Detection Method[J]. Microelectronics & Computer, 2012, 29(12): 103-107,111.

Web网站死链检测方法

Website Dead Links Detection Method

  • 摘要: 网站作为大规模的信息集合体,包含了大量的Web链接.有些Web链接经过一段时间之后,因种种原因而失效或者出现错误,从而形成死链.本文提出一种Web网站死链检测方法.根据Web链接的调度过程,自动获取网站链接信息;根据Web链接的结构特点和网页检索操作,对死链进行分析和检测;针对链接的相互引用问题和用户体验与页面深度的关系,对采集的数据进行预处理.实验结果表明,该方法能有效地提高死链的检测覆盖率和处理效率.

     

    Abstract: For large-scale information collection, Web sites contain considerable links.After a period of time, some web references that for a variety of reasons will not lead to a valid or correct web page, which is called dead link.The paper puts forward a dead link testing method.Achieve link messages automatically according to processes of URL dispatch.Analysis and detect dead links based on structural characteristic of web links and chain of actions needed to retrieve a web page;preprocess data collected before, due to a large number of relevant content links existed in a web page and relationship between user experience and page depth.Experiment results show that the method can improve coverage percentages and processing efficiency of dead links detection effectively.

     

/

返回文章
返回