We know how to compute p(k+1). We can now calculate the optimal step sizes ΞΌk+1β using the orthogonality relations.
Recall the basic definitions:
Ξx(k)=ΞΌk+1βp(k+1)x(k+1)βx(k)=ΞΌk+1βp(k+1)ββ
Multiply by A
Ax(k+1)βAx(k)=ΞΌk+1βAp(k+1)
We use the definition of the residual: r(k)=bβAx(k). So we get:
βr(k+1)+r(k)=ΞΌk+1βAp(k+1)
Recall that r(k+1)β₯p(k+1). Letβs multiply by (p(k+1))T to the left:
0+(p(k+1))Tr(k)=ΞΌk+1β(p(k+1))TAp(k+1)ΞΌk+1β=(p(k+1))TAp(k+1)(p(k+1))Tr(k)βββ
We can simplify it a bit more using our three-term recurrence:
p(k+1)=r(k)+Οkβp(k),andr(k)β₯p(k).
Take a dot product with r(k):
[r(k)]Tp(k+1)=β₯r(k)β₯22β+Οkβ0
We have proved that:
Theorem. The optimal step-size in CG is given by:
Ξx(k)=x(k+1)βx(k)=ΞΌk+1βp(k+1)ΞΌk+1β=(p(k+1))TAp(k+1)β₯r(k)β₯22ββββ