Convergence of the orthogonal iteration

We saw that $Q_{k}$ converges to $Q$, and $T_{k} = Q_{k}^{H} A Q_{k}$ converges to an upper triangular matrix.

The rate of convergence is not straightforward to derive.

Here is a sketch of a proof. Start with

$$A^{k} = X Λ^{k} X^{-1}.$$

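With the eigenvalues ordered by decreasing magnitude, the span of $Q_{k} [1 : i]$ (the first $i$ columns of $Q_{k}$) converges to the span of $X [1 : i]$, which equals the span of $Q [1 : i]$. The convergence rate is given by

$$\left| \frac{λ_{i + 1}}{λ_{i}} \right|.$$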
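
To see where this ratio comes from, the sketch can be expanded a little. The notation below is introduced only for this illustration and is an assumption, not part of the notes: $A$ is diagonalizable and nonsingular with $|λ_{1}| \ge \dots \ge |λ_{n}|$ and $|λ_{i}| > |λ_{i + 1}|$, $Q_{0} = I$, $X = [X_{1} \; X_{2}]$ and $Λ = \mathrm{diag}(Λ_{1}, Λ_{2})$ with $X_{1}$ holding the first $i$ columns, and $Y_{1}$ (of size $i \times i$, assumed invertible) and $Y_{2}$ are the top and bottom blocks of the first $i$ columns of $X^{-1}$. With $Q_{0} = I$, the first $i$ columns of $Q_{k}$ span the same space as the first $i$ columns of $A^{k}$, and

$$
\begin{aligned}
A^{k} [1 : i] &= X_{1} Λ_{1}^{k} Y_{1} + X_{2} Λ_{2}^{k} Y_{2} \\
&= \left( X_{1} + X_{2} \, Λ_{2}^{k} Y_{2} Y_{1}^{-1} Λ_{1}^{-k} \right) Λ_{1}^{k} Y_{1},
\end{aligned}
$$

$$
\left\| Λ_{2}^{k} \, Y_{2} Y_{1}^{-1} \, Λ_{1}^{-k} \right\|_{2} \le \left\| Y_{2} Y_{1}^{-1} \right\|_{2} \left| \frac{λ_{i + 1}}{λ_{i}} \right|^{k}.
$$

The right factor $Λ_{1}^{k} Y_{1}$ is invertible, so it does not change the column span. The span is therefore that of $X_{1}$ plus a perturbation that shrinks by a factor of roughly $|λ_{i + 1} / λ_{i}|$ per iteration.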

Proposition 1: Convergence of orthogonal iteration

The norm of the block $T_{k} [i + 1 : n, 1 : i]$ decays like

$$\left| \frac{λ_{i + 1}}{λ_{i}} \right|^{k}.$$

$□$
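
As a rough numerical check of Proposition 1, the sketch below runs orthogonal iteration from $Q_{0} = I$ and compares the per-iteration reduction of the block norm with $|λ_{i + 1} / λ_{i}|$. The test matrix, its eigenvalues (8, 4, 2, 1), the eigenvector matrix `X`, and the helper name `block_norms` are all assumptions made for this illustration.

```python
import numpy as np

# Illustrative test matrix A = X Λ X^{-1} with known eigenvalues 8 > 4 > 2 > 1.
# X is an arbitrary non-orthogonal eigenvector matrix chosen for this sketch.
lam = np.array([8.0, 4.0, 2.0, 1.0])
X = np.array([[1.0, 0.4, 0.2, 0.1],
              [0.3, 1.0, 0.4, 0.2],
              [0.2, 0.3, 1.0, 0.4],
              [0.1, 0.2, 0.3, 1.0]])
A = X @ np.diag(lam) @ np.linalg.inv(X)


def block_norms(A, i, num_steps):
    """Run orthogonal iteration from Q_0 = I; return ||T_k[i+1:n, 1:i]||_F per step."""
    Q = np.eye(A.shape[0])
    norms = []
    for _ in range(num_steps):
        Q, _ = np.linalg.qr(A @ Q)               # A Q_k = Q_{k+1} R_{k+1}
        T = Q.conj().T @ A @ Q                   # T_k = Q_k^H A Q_k
        norms.append(np.linalg.norm(T[i:, :i]))  # lower-left block (0-based slices)
    return np.array(norms)


i = 2
norms = block_norms(A, i, 25)
observed = norms[1:] / norms[:-1]     # reduction factor per iteration
predicted = abs(lam[i] / lam[i - 1])  # |λ_{i+1} / λ_i| in the 1-based notation of the text
print(f"observed factor: {observed[-1]:.4f}, predicted: {predicted:.4f}")
```

Changing `i` selects a different block, and the observed factor should then track the corresponding eigenvalue ratio once the iteration has reached its asymptotic regime.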

Proposition 2: Convergence of the Ritz eigenvalues

Assume that we start with a random $Q_{0}$ with $r$ columns. In that case, $T_{k}$ has dimension $r \times r$. The eigenvalues of $T_{k}$ are called Ritz eigenvalues. The $i$-th Ritz eigenvalue converges to $λ_{i}$ with rate

$$\left| \frac{λ_{r + 1}}{λ_{i}} \right|^{k}.$$

$□$
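
Proposition 2 can be probed numerically in the same way. The sketch below starts from a random orthonormal $Q_{0}$ with $r = 2$ columns and tracks the error of each Ritz eigenvalue. The matrix, the seed, and the choice of a non-symmetric $A$ are assumptions for this illustration. A non-symmetric matrix is used deliberately, since for symmetric $A$ the Ritz values typically converge faster than the stated rate.

```python
import numpy as np

# Illustrative non-symmetric test matrix with known eigenvalues 8 > 4 > 2 > 1.
lam = np.array([8.0, 4.0, 2.0, 1.0])
X = np.array([[1.0, 0.4, 0.2, 0.1],
              [0.3, 1.0, 0.4, 0.2],
              [0.2, 0.3, 1.0, 0.4],
              [0.1, 0.2, 0.3, 1.0]])
A = X @ np.diag(lam) @ np.linalg.inv(X)

r = 2
rng = np.random.default_rng(0)                              # fixed seed, for reproducibility only
Q, _ = np.linalg.qr(rng.standard_normal((A.shape[0], r)))   # random orthonormal Q_0 (n x r)

errors = []
for _ in range(14):
    Q, _ = np.linalg.qr(A @ Q)               # reduced QR keeps Q of size n x r
    T = Q.conj().T @ A @ Q                   # r x r matrix T_k
    ritz = np.linalg.eigvals(T)              # Ritz eigenvalues
    ritz = ritz[np.argsort(-np.abs(ritz))]   # order by decreasing magnitude
    errors.append(np.abs(ritz - lam[:r]))    # error of the i-th Ritz value, i = 1..r
errors = np.array(errors)

observed = errors[-1] / errors[-2]           # per-iteration error reduction for each Ritz value
predicted = np.abs(lam[r] / lam[:r])         # |λ_{r+1} / λ_i| for i = 1, ..., r
print("observed :", observed)
print("predicted:", predicted)
```

Note that the leading Ritz eigenvalue is expected to converge faster than the last one, because its rate involves $λ_{1}$ rather than $λ_{r}$ in the denominator.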

Let’s go back to the case of $Q_{0} = I$ and $T_{k}$ of size $n \times n$. Consider a case where $|λ_{j}| ≫ |λ_{j + 1}|$. Then we have the following block structure for $T_{k}$, where the leading diagonal block is $j \times j$ and $ϵ$ denotes a block of small norm.

$$\begin{pmatrix} * & * \\ ϵ & * \end{pmatrix}$$

We converge quickly to a $2 \times 2$ block upper triangular matrix.

More generally, if we have sufficient separation between consecutive eigenvalues, i.e., $|λ_{i}| ≫ |λ_{i + 1}|$, then convergence to an upper triangular matrix is very fast.
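
The block structure can be observed on a small example. In the sketch below, the eigenvalues (100, 90, 1, 0.9) are an assumption chosen so that there is one large gap, between $λ_{2}$ and $λ_{3}$, while the eigenvalues inside each group are close. After a few iterations the lower-left block should be tiny, while the subdiagonal entries inside the two diagonal blocks are still far from zero.

```python
import numpy as np

# Illustrative eigenvalues with a single large gap between λ_2 and λ_3 (j = 2).
lam = np.array([100.0, 90.0, 1.0, 0.9])
X = np.array([[1.0, 0.4, 0.2, 0.1],
              [0.3, 1.0, 0.4, 0.2],
              [0.2, 0.3, 1.0, 0.4],
              [0.1, 0.2, 0.3, 1.0]])
A = X @ np.diag(lam) @ np.linalg.inv(X)

Q = np.eye(4)                                # Q_0 = I
for _ in range(6):
    Q, _ = np.linalg.qr(A @ Q)               # one step of orthogonal iteration
T = Q.conj().T @ A @ Q                       # T_k after 6 steps

j = 2
eps_block = np.linalg.norm(T[j:, :j])        # norm of the ϵ block T_k[j+1:n, 1:j]
slow_entries = np.abs([T[1, 0], T[3, 2]])    # subdiagonal entries inside the diagonal blocks

np.set_printoptions(precision=2, linewidth=120)
print(T)
print("norm of the ϵ block:", eps_block)
print("subdiagonal entries inside the diagonal blocks:", slow_entries)
```

The entries inside the diagonal blocks decay with the slow ratio 0.9 in this example, so $T_{k}$ is block upper triangular long before it is upper triangular.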

📓 CME 302
