赵伟杰, 巢建树, 王新文, 明瑞成. 融合自监督和自注意力的输电线语义分割网络[J]. 微电子学与计算机, 2023, 40(12): 61-69. DOI: 10.19304/J.ISSN1000-7180.2022.0868
引用本文: 赵伟杰, 巢建树, 王新文, 明瑞成. 融合自监督和自注意力的输电线语义分割网络[J]. 微电子学与计算机, 2023, 40(12): 61-69. DOI: 10.19304/J.ISSN1000-7180.2022.0868
ZHAO Weijie, CHAO Jianshu, WANG Xinwen, MING Ruicheng. Semantic segmentation networks for powerlines incorporating self-supervision and self-attention[J]. Microelectronics & Computer, 2023, 40(12): 61-69. DOI: 10.19304/J.ISSN1000-7180.2022.0868
Citation: ZHAO Weijie, CHAO Jianshu, WANG Xinwen, MING Ruicheng. Semantic segmentation networks for powerlines incorporating self-supervision and self-attention[J]. Microelectronics & Computer, 2023, 40(12): 61-69. DOI: 10.19304/J.ISSN1000-7180.2022.0868

融合自监督和自注意力的输电线语义分割网络

Semantic segmentation networks for powerlines incorporating self-supervision and self-attention

  • 摘要: 要解决无人机在空中飞行过程中遭遇输电线时存在的避障难的问题,关键之一是要解决对输电线的语义分割中存在的长距离图像分割不连续的问题. 为此,提出了一种添加自注意力模块来改进U-Net的语义分割算法,用于输电线的语义分割. 通过自注意力模块提取U-Net不同尺度上的全局特征,提高对跨越全局的输电线特征的捕捉能力. 为进一步优化训练过程,提出最大池化标签下采样,增强对不平衡类别输电线的学习能力;提出卷积神经网络图像掩码建模自监督预训练,提高预训练权重的质量. 此外,为在大规模的输电线数据集上进行验证,对TTPLA输电线输电塔数据集实例分割标签进行处理,制作了TTPLA输电线语义分割数据集. 实验表明,改进的网络通过捕捉全局特征的自注意力机制、优化的深度监督过程和自监督预训练,对比原版U-Net具有更高的分割精度. 在TTPLA输电线语义分割数据集的测试中,与原版U-Net相比,其IoU指标提高了2.32%,达到了71.45%. 证明算法增强了图像中长距离输电线语义特征之间的联系,提高了输电线语义分割的完整性,提升了无人机的避障能力.

     

    Abstract: The obstacle avoidance problem is critical when Unmanned Aerial Vehicles(UAVs) encounter transmission lines during aerial flight. One key in semantic segmentation of transmission lines is to solve the problem of long-distance image segmentation discontinuity problem. To this end, we propose a semantic segmentation algorithm with the addition of a self-attention module to improve the U-Net for the semantic segmentation of transmission lines. The self-attention module extracts the global features on different scales of U-Net, which enhances the ability to capture the global features of transmission lines. To further optimize the training process, maximum pooled label downsampling is proposed to enhance the learning ability of unbalanced classes of transmission lines. Moreover, a self-supervised pre-training of convolutional neural network image mask modeling is proposed to improve the quality of pre-training weights. In addition, for validation on a large-scale transmission line dataset, the instance segmentation labels of the TTPLA transmission line and tower dataset are processed to produce a novel semantic segmentation dataset. Experiments show that the improved network has higher segmentation accuracy than the original U-Net through the self-attention mechanism for capturing global features, the optimized depth-supervised process, and the self-supervised pre-training. On the TTPLA transmission line semantic segmentation dataset, the metric IoU increases by 2.32% compared to the original U-Net, reaching 71.45%. Our results prove that the proposed algorithm enhances the association among the semantic features of long-distance transmission lines in images and improves the integrity of transmission line semantic segmentation. The approach can enhance the obstacle avoidance capability of UAVs.

     

/

返回文章
返回