基于图神经网络的百万数据人脸聚类

何川; 侯进; 李金彪

doi:10.19304/J.ISSN1000-7180.2022.0027

基于图神经网络的百万数据人脸聚类

Face clustering based on graph neural network for millions of data

摘要

摘要: 人脸聚类是利用未标记人脸数据的重要工具，在人脸标注和检索等方面有着广泛的应用.如何有效地聚类，特别是在大规模(如百万级或以上)数据集上，是一个悬而未决的问题. 最近的研究表明，基于图卷积神经网络(GCN)的聚类可以显著提高性能.然而这些方法需要生成大量的重叠子图，严重限制了模型的精度和效率.由于这些GCN算法没有分析过不同数据特征对模型的影响，通常仅在特定的数据集上表现出优异的性能.本文综合分析了距离、实例个数分布差异对模型的影响，提出了一种基于DBSCAN的图卷积网络模型.通过两段距离形成二次聚类模型，消除了DBSCAN对距离的依赖，提高了模型精度，在多个数据集中最高提升了20%；通过探索融合one-hot特征编码方式、多种邻接矩阵构图方法，进一步提升了模型的鲁棒性；通过邻接矩阵稀疏化算法解决了人群数量动态变化问题.在多个大型基准上的实验表明，相较于现有GCN算法，所提算法精度提高了2%~7%，并降低了对硬件的要求，提升了运行效率，可以应用于百万级的人脸聚类场景.

Abstract: Face clustering is an important tool for utilizing unlabeled face data and has a wide range of applications in face annotation and retrieval, etc. How to cluster effectively, especially on large-scale (e.g. millions or more) datasets, is an open problem. Recent studies have shown that graph convolutional neural network (GCN)-based clustering can significantly improve performance. However, these methods require the generation of a large number of overlapping subgraphs, which severely limits the accuracy and efficiency of the model. Since these GCN algorithms have not analyzed the impact of different data features on the models, they usually show excellent performance only on specific datasets. In this paper, a DBSCAN-based graph convolutional network model is proposedby comprehensively analyzing the effects of differences in the distribution of distances and the number of instances on the model. By forming a quadratic clustering model through two distances, the dependence of DBSCAN on distance is eliminated, and the model accuracy is improved by up to 20% in multiple datasets.The robustness of the model is further improved by exploring the fusion of one-hot feature encoding method and multiple adjacency matrix composition methods. The problem of dynamic changes in population size is solved by the adjacency matrix sparsification algorithm. Experiments on several large benchmarks show that our algorithm improves accuracy by 2%~7% compared with existing GCN algorithms, and reduces hardware requirements and improves operational efficiency, which can be applied to million-level face clustering scenarios.

HTML全文

参考文献(26)

施引文献

资源附件(0)