Erschienen in:

2006 | OriginalPaper | Buchkapitel

Subspace Sampling and Relative-Error Matrix Approximation: Column-Row-Based Methods

verfasst von : Petros Drineas, Michael W. Mahoney, S. Muthukrishnan

Erschienen in: Algorithms – ESA 2006

Verlag: Springer Berlin Heidelberg

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Much recent work in the theoretical computer science, linear algebra, and machine learning has considered matrix decompositions of the following form: given an

matrix

, decompose it as a product of three matrices,

, and

, where

consists of a small number of columns of

consists of a small number of rows of

, and

is a small carefully constructed matrix that guarantees that the product

CUR

is “close” to

. Applications of such decompositions include the computation of matrix “sketches”, speeding up kernel-based statistical learning, preserving sparsity in low-rank matrix representation, and improved interpretability of data analysis methods. Our main result is a randomized, polynomial algorithm which, given as input an

matrix

, returns as output matrices

such that

$$\|{A-CUR}\|_F \leq (1+\epsilon)\|{A-A_k}\|_F$$

with probability at least 1–

. Here,

is the “best” rank-

approximation (provided by truncating the Singular Value Decomposition of

), and ||

is the Frobenius norm of the matrix

. The number of columns in

and rows in

is a low-degree polynomial in

, 1/

, and log(1/

). Our main result is obtained by an extension of our recent relative error approximation algorithm for ℓ

regression from overconstrained problems to general ℓ

regression problems. Our algorithm is simple, and it takes time of the order of the time needed to compute the top

right singular vectors of

. In addition, it samples the columns and rows of

via the method of “subspace sampling,” so-named since the sampling probabilities depend on the lengths of the rows of the top singular vectors, and since they ensure that we capture entirely a certain subspace of interest.

Springer Professional

Subspace Sampling and Relative-Error Matrix Approximation: Column-Row-Based Methods

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner