Skip to main navigation Skip to search Skip to main content

Imbalanced networked multi-label classification with active learning

  • City University of Hong Kong
  • Hefei University of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rapid development of social networks, the networked multi-label classification algorithms have gained wide attention. The existing networked multi-label classification algorithms mostly only consider the homogeneity or heterogeneity of the network without taking the imbalance of the network into account, and this is actually pretty common in real network environments, which deserves more attention. Moreover, the selection strategy of training set is very critical for multi-label classification algorithm, because it will directly affect both the parameter updating inside the classifier and the precision of the classifier. The application of active learning to the selection of training set can effectively improve the precision of the classifier. Similarly, the application of imbalanced data processing strategies to the selection of training sets also makes classifiers more suitable for imbalanced data networks. Thereout, we propose an algorithm BSHD (Block Sampling with selecting the Highest Degree nodes), which is an active learning based imbalanced networked multi-label classification algorithm. In this algorithm, we divide the network according to the edge density and utilize the oversampling and undersampling to dispose each block. Then we select the nodes with the highest degree from each block to form the training set. Experimental results show that our proposed BSHD outperforms other state-of-arts approaches.

Original languageEnglish
Title of host publicationProceedings - 9th IEEE International Conference on Big Knowledge, ICBK 2018
EditorsXindong Wu, Ong Yew Soon, Charu Aggarwal, Huanhuan Chen
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages290-297
Number of pages8
ISBN (Electronic)9781538691243
DOIs
StatePublished - 24 Dec 2018
Externally publishedYes
Event9th IEEE International Conference on Big Knowledge, ICBK 2018 - Singapore, Singapore
Duration: 17 Nov 201818 Nov 2018

Publication series

NameProceedings - 9th IEEE International Conference on Big Knowledge, ICBK 2018

Conference

Conference9th IEEE International Conference on Big Knowledge, ICBK 2018
Country/TerritorySingapore
CitySingapore
Period17/11/1818/11/18

Keywords

  • Active learning
  • Imbalanced data
  • Multi-label classification algorithm
  • Oversampling
  • Undersampling

Fingerprint

Dive into the research topics of 'Imbalanced networked multi-label classification with active learning'. Together they form a unique fingerprint.

Cite this