Large–scale Sparse Tensor Decomposition Using a Damped Gauss–Newton Method
Teresa M. Ranadive, Muthu M. Baskaran
Publication Source: IEEE High Performance Extreme Computing Conference (HPEC) 2020
CANDECOMP/PARAFAC (CP) tensor decomposition is a popular unsupervised machine learning method with numerous applications. This process involves modeling a high–dimensional, multi–modal array (a tensor) as the sum of several low–dimensional components. In order to decompose a tensor, one must solve an optimization problem, whose objective is often given by the sum of the squares of the tensor and decomposition model entry differences. One algorithm occasionally utilized to solve such problems is CP–OPT–DGN, a damped Gauss–Newton all–at–once optimization method for CP tensor decomposition. However, there are currently no published results that consider the decomposition of large–scale (with up to billions of non– zeros), sparse tensors using this algorithm. This work considers the decomposition of large–scale tensors using an efficiently implemented CP–OPT–DGN method. It is observed that CP– OPT–DGN significantly outperforms CP–ALS (CP–Alternating Least Squares) and CP–OPT–QNR (a quasi–Newton–Raphson all–at–once optimization method for CP tensor decomposition), two other widely used tensor decomposition algorithms, in terms of accuracy and latent behavior detection.