We saw that converges to , and converges to an upper triangular matrix.

The rate of convergence is not straightforward to derive.

Here is a sketch of a proof. Start with

Span of converges to span of = span of The convergence rate is given by

Proposition 1: Convergence of orthogonal iteration

The norm of the block decays like

Proposition 2: Convergence of the Ritz eigenvalues

Assume that we start with a random with columns. In that case, has dimension . The eigenvalues of are called Ritz eigenvalues. The th Ritz eigenvalue converges to with rate

Let’s go back to the case of and of size . Consider a case where . Then we have the following block structure for .

We converge quickly to a block upper triangular matrix.

More generally if we have sufficient separation between eigenvalues, i.e., then convergence to an upper triangular matrix is very fast.