TY - JOUR
T1 - Nonnegative matrix factorization constrained by multiple labelled spanning trees for label propagation
AU - Deng, Furong
AU - Zhao, Yang
AU - Pei, Jihong
AU - Wang, Shiqi
AU - Yang, Xuan
N1 - Publisher Copyright:
© 2023 Elsevier Inc.
PY - 2023/11
Y1 - 2023/11
N2 - Label propagation is an important semi-supervised learning method that generalizes the attributes of labelled samples to unlabelled samples based on the correlation of the data distribution. However, handling data containing imbalanced labels, multiple subclasses, bridging points, and nonlinear distributions is challenging. These challenges often make it difficult for existing methods to learn accurate relationships between labelled and unlabelled samples. In this paper, we propose multiple labelled spanning trees (MLSTs). Each labelled sample is initialized as the root node of a tree, and all trees are grown asynchronously according to the manifold structure of their neighbours such that all of the unlabelled samples can establish a connection with the labelled samples. The MLST construction method can better adapt to different disturbances. In addition, MLSTs-constrained nonnegative matrix factorization (MLSTs-NMF) is proposed for label propagation; it constructs a global relationship by using nonlinear representation with labelled samples and combines the manifold structure information from multiple labelled spanning trees to comprehensively estimate the labels of unlabelled samples. The results of the experiments demonstrate that the prediction accuracy of the proposed method is higher than that of the existing methods, and it exhibits better robustness to various interferences.
AB - Label propagation is an important semi-supervised learning method that generalizes the attributes of labelled samples to unlabelled samples based on the correlation of the data distribution. However, handling data containing imbalanced labels, multiple subclasses, bridging points, and nonlinear distributions is challenging. These challenges often make it difficult for existing methods to learn accurate relationships between labelled and unlabelled samples. In this paper, we propose multiple labelled spanning trees (MLSTs). Each labelled sample is initialized as the root node of a tree, and all trees are grown asynchronously according to the manifold structure of their neighbours such that all of the unlabelled samples can establish a connection with the labelled samples. The MLST construction method can better adapt to different disturbances. In addition, MLSTs-constrained nonnegative matrix factorization (MLSTs-NMF) is proposed for label propagation; it constructs a global relationship by using nonlinear representation with labelled samples and combines the manifold structure information from multiple labelled spanning trees to comprehensively estimate the labels of unlabelled samples. The results of the experiments demonstrate that the prediction accuracy of the proposed method is higher than that of the existing methods, and it exhibits better robustness to various interferences.
KW - Graph construction
KW - Label propagation
KW - Multiple labelled spanning trees
KW - Nonnegative matrix factorization
UR - https://www.scopus.com/pages/publications/85169937411
U2 - 10.1016/j.ins.2023.119616
DO - 10.1016/j.ins.2023.119616
M3 - 文章
AN - SCOPUS:85169937411
SN - 0020-0255
VL - 648
JO - Information Sciences
JF - Information Sciences
M1 - 119616
ER -