CSpace
An Instance-Frequency-Weighted Regularization Scheme for Non-Negative Latent Factor Analysis on High-Dimensional and Sparse Data
Luo, Xin1; Wang, Zidong2; Shang, Mingsheng3,4
2021-06-01
摘要High-dimensional and sparse (HiDS) data with non-negativity constraints are commonly seen in industrial applications, such as recommender systems. They can be modeled into an HiDS matrix, from which non-negative latent factor analysis (NLFA) is highly effective in extracting useful features. Preforming NLFA on an HiDS matrix is ill-posed, desiring an effective regularization scheme for avoiding overfitting. Current models mostly adopt a standard L-2 scheme, which does not consider the imbalanced distribution of known data in an HiDS matrix. From this point of view, this paper proposes an instancefrequency-weighted regularization (IR) scheme for NLFA on HiDS data. It specifies the regularization effects on each latent factors with its relevant instance count, i.e., instance-frequency, which clearly describes the known data distribution of an HiDS matrix. By doing so, it achieves finely grained modeling of regularization effects. The experimental results on HiDS matrices from industrial applications demonstrate that compared with an L-2 scheme, an IR scheme enables a resultant model to achieve higher accuracy in missing data estimation of an HiDS matrix.
关键词High-dimensional and sparse (HiDS) data industrial application instance-frequency non-negative latent factor analysis (NLFA) recommender system regularization
DOI10.1109/TSMC.2019.2930525
发表期刊IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
ISSN2168-2216
卷号51期号:6页码:3522-3532
通讯作者Shang, Mingsheng(msshang@cigit.ac.cn)
收录类别SCI
WOS记录号WOS:000652103000018
语种英语