KMS Chongqing Institute of Green and Intelligent Technology, CAS
A Latent Factor Analysis-Based Approach to Online Sparse Streaming Feature Selection | |
Wu, Di1,2; He, Yi3; Luo, Xin1,2,4; Zhou, MengChu5,6,7 | |
2021-08-03 | |
摘要 | Online streaming feature selection (OSFS) has attracted extensive attention during the past decades. Current approaches commonly assume that the feature space of fixed data instances dynamically increases without any missing data. However, this assumption does not always hold in many real applications. Motivated by this observation, this study aims to implement online feature selection from sparse streaming features, i.e., features flow in one by one with missing data as instance count remains fixed. To do so, this study proposes a latent-factor-analysis-based online sparse-streaming-feature selection algorithm (LOSSA). Its main idea is to apply latent factor analysis to pre-estimate missing data in sparse streaming features before conducting feature selection, thereby addressing the missing data issue effectively and efficiently. Theoretical and empirical studies indicate that LOSSA can significantly improve the quality of OSFS when missing data are encountered in target instances. |
关键词 | Big data computational intelligence latent factor analysis (LFA) missing data online algorithm online feature selection sparse streaming feature streaming feature |
DOI | 10.1109/TSMC.2021.3096065 |
发表期刊 | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS |
ISSN | 2168-2216 |
页码 | 15 |
通讯作者 | Luo, Xin(luoxin21@cigit.ac.cn) |
收录类别 | SCI |
WOS记录号 | WOS:000732087500001 |
语种 | 英语 |