Singular Value Decomposition (SVD)
Theorem
Let be a matrix, there always exists orthonormal matrices and such that
For some diagonal matrix .
Geometric Interpretation
What this means, is that the transformation described by will:
- Take a set of vectors in
- Map them to a set of vectors in
- And scale them by the corresponding entries in .
But wait! What happened to the extra dimensions in ? They get mapped to the zero vector!
Proof
1
Given a matrix , consider the eigenvalues of .
First observation is that is a symmetric matrix. Thus, by EVD. it must have real eigenvalues. Let’s call them .
Along with the corresponding eigenvectors .
As we’re only interested in non-zero eigenvalues, let’s assume that are the non-zero eigenvalues, where . Of course, it’s possible that , but we’ll keep it general for now. This also means that we will only consider the eigenvectors .
Now, this means we can rewrite our eigenvalues as , where .
2
Now let’s define the following
Additionally, we’ll arbitrarily pick such that the full set is an orthonormal basis for .
Note: now we see why we only looked at the non-zero eigenvalues, because we are dividing by
Now, let’s define our matrix as
We can also define
Where for .
Once again, we arbitrarily picked such that the full set is an orthonormal basis for .
Thus we can now compute
Now let’s analyze and consider the first entry
But if we go further we see
But recall, that is an eigenvector of with eigenvalue . Thus, we have
As, is orthogonal to .
Thus, if we perform the composition, we will get
Where the diagonal is simply followed by s.
Example
Consider the matrix
Thus we then have
Then if we compute the eigenvalues of this matrix, we will get
We then define
Corresponding to the eigenvalues are the eigenvectors, note that we need to normalize these vectors.
We actually don’t need to compute as will become obvious later.
Let’s build out matrix now
Additionally, let’s compute our vectors
Thus, our matrix is
We are now done! So we can assert
It follows, that we could’ve just dropped the last column of and the last row and column of as they don’t contribute to the final product. This gives us a simplified form.
Simplified SVD
This is a corollary to the above, but assuming the we can write
This is assuming .