张青露,陈壹华.基于特征共现的无监督哈希检索算法[J]. 微电子学与计算机,2024,41(5):22-30. doi: 10.19304/J.ISSN1000-7180.2023.0289
引用本文: 张青露,陈壹华.基于特征共现的无监督哈希检索算法[J]. 微电子学与计算机,2024,41(5):22-30. doi: 10.19304/J.ISSN1000-7180.2023.0289
ZHANG Q L,CHEN Y H. An unsupervised hash retrieval algorithm based on feature co-occurrence[J]. Microelectronics & Computer,2024,41(5):22-30. doi: 10.19304/J.ISSN1000-7180.2023.0289
Citation: ZHANG Q L,CHEN Y H. An unsupervised hash retrieval algorithm based on feature co-occurrence[J]. Microelectronics & Computer,2024,41(5):22-30. doi: 10.19304/J.ISSN1000-7180.2023.0289

基于特征共现的无监督哈希检索算法

An unsupervised hash retrieval algorithm based on feature co-occurrence

  • 摘要: 现有无监督哈希检索算法的关注点在于哈希映射过程中的信息损失以及生成哈希的质量问题,忽略了图像特征本身对检索精度的影响。为进一步提高检索的精度,提出一种改进的基于特征共现的无监督哈希检索算法(Unsupervised Hash retrieval algorithm based on Feature Co-occurrence, UHFC)。该算法共分为两个阶段:深度特征提取和无监督哈希生成。为提高图像特征的质量,UHFC在卷积神经网络(Convolutional Neural Network, CNN)结构的最后一层卷积后引入了共现层,用来提取特征之间的依赖关系。并用共现激活值的均值来表示共现程度,解决原共现操作存在相同两个通道的共现值不一致的问题;接着,在特征融合部分UHFC设计一种适用于共现特征融合的,结合空间注意力机制的注意特征融合方法(Attention Feature Fusion method based on Spatial attention, AFF-S)。通过注意力机制自主学习共现特征与深度特征融合的权重,降低特征融合过程中背景因素的干扰,提高最终图像特征的表达能力。最后,根据最优传输策略,UHFC采用双半分布哈希编码对图像特征到哈希码的映射过程进行监督,并在哈希层后添加一层分类层通过KL损失进一步提高哈希码所包含的图片信息,整个训练过程中无需数据集的标注,实现无监督哈希的生成。实验表明,UHFC对哈希编码质量改善较好,在Flickr25k和Nus-wide数据集上其平均均值精度(mean Average Precision, mAP)分别达到了87.8%和82.8%,相比于baseline方法分别提高了2.1%与1.2%,效果明显。

     

    Abstract: The existing algorithms of unsupervised hash retrieval focus on the information loss in the process of hash mapping and the quality of hash generation, but ignore the impact of image features on the retrieval accuracy. In order to further improve the retrieval accuracy, this paper proposes an improved Unsupervised Hash retrieval algorithm based on Feature Co-occurrence (UHFC), which is divided into two stages: deep feature extraction and unsupervised hash generation. In order to improve the quality of image features, UHFC introduces a co-occurrence layer after the last convolution layer of Convolutional Neural Network (CNN) structure to extract the dependency relationship between features. The mean value of co-occurrence activation value is used to represent the degree of co-occurrence to solve the problem of inconsistent co-present value of the same two channels in the original co-occurrence operation. Then, in the feature fusion part of UHFC, an Attentional Feature Fusion method based on Spatial attention(AFF-S) mechanism is designed for co-occurrence feature fusion. By self-learning the weight of co-occurrence feature and depth feature fusion by attention mechanism, the interference of background factors in the process of feature fusion is reduced, and the expressive ability of final image features is improved. Finally, according to the optimal transmission strategy, UHFC adopts Bi-half distributed hash coding to supervise the mapping process of image features to hash code, and adds a classification layer after the hash layer to further improve the image information contained in the hash code through KL loss. In the whole training process, no data set labeling is required to realize the generation of unsupervised hash. Experiments have shown that UHFC better improve quality of hash code, in Flickr25k and Nus - wide data sets its mean Average Precision (mAP) reached 87.8% and 82.8% respectively, compared to the baseline method is increased by 2.1% and 1.2%, respectively, effect is obvious.

     

/

返回文章
返回