何南,石书桦.融合红外特征的可见光图像目标检测算法研究[J]. 微电子学与计算机,2023,40(3):29-36. doi: 10.19304/J.ISSN1000-7180.2022.0388
引用本文: 何南,石书桦.融合红外特征的可见光图像目标检测算法研究[J]. 微电子学与计算机,2023,40(3):29-36. doi: 10.19304/J.ISSN1000-7180.2022.0388
HE N,SHI S H. Research on visible image object detection algorithm based on infrared features fusion[J]. Microelectronics & Computer,2023,40(3):29-36. doi: 10.19304/J.ISSN1000-7180.2022.0388
Citation: HE N,SHI S H. Research on visible image object detection algorithm based on infrared features fusion[J]. Microelectronics & Computer,2023,40(3):29-36. doi: 10.19304/J.ISSN1000-7180.2022.0388

融合红外特征的可见光图像目标检测算法研究

Research on visible image object detection algorithm based on infrared features fusion

  • 摘要: 为提升复杂场景下基于可见光图像的目标检测性能,将深层卷积神经网络与多源信息融合技术相结合,提出了一种自适应融合红外特征的可见光目标检测算法. 该算法以红外和可见光图像作为输入,通过卷积、激活结合残差结构的方式分别提取目标红外和可见光特征,并利用空间和通道注意力机制提升目标所属类别以及所在图像区域的特征权重. 其次,将提取的红外特征以自适应加权的方式融入对应维度的可见光特征中,充分弥补目标在单模态模型下的局限. 最后,针对多尺度目标,设计了金字塔采样结构,通过交替上采样和下采样方式来充分融合目标全局及局部特征,增强网络尺度不变性. 通过实验验证,所提注意力机制、特征自适应融合以及金字塔采样结构都能有效提升目标检测效果,相比于同类型红外-可见光目标检测方法,该方法可以充分融合目标多模态特征,并有效降低噪声干扰,使网络具有更高的检测性能. 同时,在实际电网设备检测中,所提方法也表现出较高泛化能力和鲁棒性,可以准确高效的实现目标设备的识别及定位.

     

    Abstract: In order to improve the performance of visible light object detection in complex scenes, a visible light object detection algorithm adaptively fused with infrared features is proposed by combining deep convolutional neural network with multi-source information fusion technology. The algorithm takes infrared and visible images as input, extracts infrared and visible features by means of convolution and activation combined with residual structure, and uses spatial and channel attention mechanisms to improve the category of the target and the feature weight of the image region where the target resides. Secondly, the extracted infrared features are incorporated into the visible features of the corresponding dimension in the way of adaptive weighting, which fully makes up for the limitation of the object in the single-mode model; Finally, for multi-scale objects, a pyramid sampling structure is designed. By alternating up-sampling and down-sampling, the global and local features of the feature are fully integrated to enhance the scale invariance of the network. Experiments show that the proposed attention mechanism, feature adaptive fusion and pyramid sampling structure can effectively improve the effect of object detection. Compared with the same type of infrared visible light object detection method, this method can fully integrate the multi-modal features of the object, effectively reduce noise interference, and make the network have higher detection performance. At the same time, in the actual power grid equipment detection, this method also shows high generalization ability and robustness, and can accurately and efficiently achieve the identification and positioning of object equipment.

     

/

返回文章
返回