基于深度玻尔兹曼机的文本特征提取研究

张立民; 刘凯

基于深度玻尔兹曼机的文本特征提取研究

张立民,
刘凯

Document Features Extraction Based on DBM

摘要

摘要: 鉴于深度学习模型在进行知识推理相关研究时提取抽象概念的优势,在目前对文本特征提取性能较好的浅层结构RSM的基础上构建以RSM为特征抽取器的深度玻尔兹曼机模型.通过新模型的能量函数和网络连接关系,对模型组成单元的后验概率进行推导,并结合新的交叉熵稀疏惩罚因子,给出模型的详细学习算法.经20-newgroups文档集上测试证明,经过交叉熵稀疏惩罚因子影响后的新模型提取出的特征在对文本表征上性能较好,相比于浅层模型RSM,其分类准确度更高,概念更加抽象,在处理大规模文本分析上具有良好的可行性.

Abstract: According to the advantages of deep learning model in the extraction of abstract concept, a new Deep Boltzmann Machine is designed based on the outperform model-Replicate Softmax Model. Based on Replicate Softmax Model and structure of the new Deep Boltzmann Machine, energy function of this model is proposed and the detail learning algorithm is introduced. The learning can be made more efficient by using a layer-by-layer "pre-training" phase that allows variational inference to be initialized with a single bottom up pass. The values of the latent variables in the deepest layer are easy to infer and give a much better representation of each document than low learning. The 20-newsgroups document sets experiment results illustrated that the novel algorithm learn good generative models, get the better competence of a shallow model- Replicate Softmax Model in handling with an extract abstract concept and has good feasibility in large scale document analysis.

HTML全文

参考文献(0)

施引文献

资源附件(0)