We saw that converges to , and converges to an upper triangular matrix.
The rate of convergence is not straightforward to derive.
Here is a sketch of a proof. Start with
Span of converges to span of = span of The convergence rate is given by
Proposition 1: Convergence of orthogonal iteration
The norm of the block decays like
Proposition 2: Convergence of the Ritz eigenvalues
Assume that we start with a random with columns. In that case, has dimension . The eigenvalues of are called Ritz eigenvalues. The th Ritz eigenvalue converges to with rate
Letβs go back to the case of and of size . Consider a case where . Then we have the following block structure for .
We converge quickly to a block upper triangular matrix.
More generally if we have sufficient separation between eigenvalues, i.e., then convergence to an upper triangular matrix is very fast.