The Rank of a Matrix

You can think of an r×cr \times c matrix as a set of rr row vectors, each having cc elements; or you can think of it as a set of cc column vectors, each having rr elements.

The rank of a matrix is defined as (a) the maximum number of linearly independent column vectors in the matrix or (b) the maximum number of linearly independent rowrow vectors in the matrix. Both definitions are equivalent.

For an r×cr \times c matrix,

  • If rr is less than cc, then the maximum rank of the matrix is rr.
  • If rr is greater than cc, then the maximum rank of the matrix is cc.

The rank of a matrix would be zero only if the matrix had no elements. If a matrix had even one element, its minimum rank would be one.

For example, the following matrix has rank of 2.

X=(12443480) X = \begin{pmatrix} 1 & 2 & 4 & 4 \\ 3 & 4 & 8 & 0 \end{pmatrix}

Singular Value Decomposition

Formula

A[m×n]=U[m×r]Σ[r×r](V[n×r])TA_{[m \times n]}=U_{[m \times r]} \Sigma_{[r \times r]} (V_{[n \times r]})^T

  • A: Input data matrix
    • m×nm \times n matrix (e.g. m documents, n terms)
  • U: left singular vectors
    • m×rm \times r matrix (m documents, r concepts)
  • Σ\Sigma: Singular values
    • r×rr \times r diagonal matrix (strength of each concept) (r: rank of matrix A)
  • V: Right singular vectors
    • n×rn \times r matrix (n terms, r concepts)

Properties

It is always possible to decompose a real matrix A into A=UΣVTA=U \Sigma V^T, where

  • U,Σ,VU, \Sigma, V: unique
  • U,VU, V: column orthonormal
    • UTU=IU^T U = I; VTV=IV^T V = I (I: identity matrix)
    • (Columns are orthogonal unit vectors)
  • Σ\Sigma: diagonal
    • Entries (singular values) are positive, and sorted in decreasing order (σ1σ20\sigma_1 \geq \sigma_2 \geq \cdots \geq 0)

Interpretation

  • U: user-to-concept similarity matrix
  • V: movie-to-concept similarity matrix
  • Σ\Sigma: its diagonal elements strength of each concept