Singular Value Decomposition (SVD)

Theorem

Let be a matrix, there always exists orthonormal matrices and such that

For some diagonal matrix .

Geometric Interpretation

What this means, is that the transformation described by will:

  1. Take a set of vectors in
  2. Map them to a set of vectors in
  3. And scale them by the corresponding entries in .

But wait! What happened to the extra dimensions in ? They get mapped to the zero vector!

Proof

1

Given a matrix , consider the eigenvalues of .

First observation is that is a symmetric matrix. Thus, by EVD. it must have real eigenvalues. Let’s call them .

Along with the corresponding eigenvectors .

As we’re only interested in non-zero eigenvalues, let’s assume that are the non-zero eigenvalues, where . Of course, it’s possible that , but we’ll keep it general for now. This also means that we will only consider the eigenvectors .

Now, this means we can rewrite our eigenvalues as , where .

2

Now let’s define the following

Additionally, we’ll arbitrarily pick such that the full set is an orthonormal basis for .

Note: now we see why we only looked at the non-zero eigenvalues, because we are dividing by

Now, let’s define our matrix as

We can also define

Where for .

Once again, we arbitrarily picked such that the full set is an orthonormal basis for .

Thus we can now compute

Now let’s analyze and consider the first entry

But if we go further we see

But recall, that is an eigenvector of with eigenvalue . Thus, we have

As, is orthogonal to .

Thus, if we perform the composition, we will get

Where the diagonal is simply followed by s.

Example

Consider the matrix

Thus we then have

Then if we compute the eigenvalues of this matrix, we will get

We then define

Corresponding to the eigenvalues are the eigenvectors, note that we need to normalize these vectors.

We actually don’t need to compute as will become obvious later.

Let’s build out matrix now

Additionally, let’s compute our vectors

Thus, our matrix is

We are now done! So we can assert

It follows, that we could’ve just dropped the last column of and the last row and column of as they don’t contribute to the final product. This gives us a simplified form.

Simplified SVD

This is a corollary to the above, but assuming the we can write

This is assuming .