Self-training semi-supervised classification based on density peaks of data

doi:10.1016/j.neucom.2017.05.072

CSpace > 大数据挖掘及应用中心

	Self-training semi-supervised classification based on density peaks of data
	Wu, Di1,2 ; Shang, Mingsheng1 ; Luo, Xin1 ; Xu, Ji 1,3; Yan, Huyong 1; Deng, Weihui 1; Wang, Guoyin 1
	2018-01-31
摘要	Having a multitude of unlabeled data and few labeled ones is a common problem in many practical applications. A successful methodology to tackle this problem is self-training semi-supervised classification. In this paper, we introduce a method to discover the structure of data space based on find of density peaks. Then, a framework for self-training semi-supervised classification, in which the structure of data space is integrated into the self-training iterative process to help train a better classifier, is proposed. A series of experiments on both artificial and real datasets are run to evaluate the performance of our proposed framework. Experimental results clearly demonstrate that our proposed framework has better performance than some previous works in general on both artificial and real datasets, especially when the distribution of data is non-spherical. Besides, we also find that the support vector machine is particularly suitable for our proposed framework to play the role of base classifier. (C) 2017 Elsevier B.V. All rights reserved.
关键词	Density peaks Self-training Semi-supervised classification Supervised learning
DOI	10.1016/j.neucom.2017.05.072
发表期刊	NEUROCOMPUTING
ISSN	0925-2312
卷号	275 页码:180-191
收录类别	SCI
WOS记录号	WOS:000418370200018
语种	英语

中国科学院重庆绿色智能技术研究院机构知识库