Errata for Optimization Algorithms on Matrix Manifolds / Absil, Mahony, and Sepulchre
If you find an error that is not in the list below, please let us know about it.
P.-A. Absil, R. Mahony, & R. Sepulchre
- On page 8, replace the 2nd and 3rd sentences by this: "In view of the properties of matrix trace and inversion, one has the invariance property that $f(YN) = f(Y)$ for all invertible $p\times p$ matrices $N$. Each equivalence class $\{YN: N\in\mathbb{R}^{p\times p} \text{ invertible}\}$, which gathers all the $n\times p$ matrices with the same column space as $Y$, contains matrices $YN$ such that $(YN)^T\,B\,YN = I_p$. (Take $N=(Y^TBY)^{-1/2}$.) Thus, without loss of generality, we can restrict the search space to matrices $Y$ such that $Y^TBY=I_p$. Moreover, since $V$ is invertible, we can set $Y=VM$. Since $V^TBV = I_p$, the condition $Y^TBY=I_p$ is equivalent to $M^TM=I_p$."
- page 25, Proposition 3.3.2: the symbol "x" is used to mean two different things. To fix this, replace, e.g., "x" by "\bar{x}" in the 3rd line of the proposition.
- page 34, line -11: also require that \phi(x)=0.
- page 43, line 15: "T_{\bar{x}}\mathcal{M}" should be "T_{\bar{x}}\bar{\mathcal{M}}".
- page 48, line -1: insert "=\xi_x" after "\mathrm{D}\pi(\bar{x})[\bar{\xi}_\bar{x}]".
- page 59, two lines below (4.8): replace "n \times p" be "p \times p".
- page 62, Definition 4.2.1: insert "to $\{x_k\}$" after "gradient-related".
- pages 62-63: In Algorithm 1, without further assumptions, $t^A_k$ may not exist. In that case, Algorithm 1 produces a finite sequence. This situation is explicitly dismissed in subsequent results that assume an infinite sequence of iterates. Nevertheless, in a practical implementation of Algorithm 1, care must be taken to ensure that the inner iteration that computes $t^A_k$ terminates. This can be done as follows. In Definition 4.2.2 (Armijo point), insert the assumption that "$\eta$ is a descent direction (i.e., $\langle \grad\,f(x), \eta \rangle_x < 0$)". In Algorithm 1, as first line in the for loop, insert: "\STATE If $\grad\,f(x_k) = 0$ then Break." Still in Algorithm 1, replace line "2:" by this: "\STATE Pick $\eta_k$ in $T_{x_k}\calM$ by a procedure guaranteeing that (i) $\langle \grad\,f(x_k),\eta_k \rangle < 0$ and (ii) if the sequence $\left\{\eta_i\right\}_{i=0,1,\ldots}$ is infinite (i.e., the above Break is never reached) then it is gradient-related to $\{x_k\}$ (Definition~\ref{def:grad-rel}). (Note that $\eta_k := -\grad\,f(x_k)$ is such a procedure.)"
- page 68, line 13: "\delta>0" should be "\delta\in(0,\epsilon)".
- page 75, table 4.1: the normal space to R^n should rather be $\{0\}$ (seeing R^n as its own embedding space).
- page 80, line 4: "(local)" can be deleted.
- page 80, line below (4.32): replace all "\leq" signs by "<".
- page 86, Algorithm 3, step 2: "Y^T" is missing in -Y(Y^TY)^{-1}Y^TAY.
- page 92, first line below Figure 5.1: replace "\mathbb{R}^n\to\mathbb{R}^n" by "\mathbb{R}\to\mathbb{R}".
- page 108, line 5: replace "T_x" by "T_v"
- page 119, line 9: delete the factor "2" in the first equation.
- page 123, line -6: "span(V)" should be "the orthogonal complement of span(V)".
- page 124, two lines below (6.35): insert "=0" after "Y_k^T B_k Z_k".
- page 130, (6.46): On the right-hand side of the equation, replace "y" by "x".
- page 143, section 7.3.2, second paragraph, 4th line: insert minus sign, yielding \eta_k^N = - (H_k)^{-1} grad f(x_k).
- page 144, Algorithm 10, lines 4, 9, 10: remplace "\Delta" (but not "\delta") by "\Delta_k".
- page 147, line 6: replace "=" by "\leq".
- page 148, line -8: definition of i_x: remove "for all x \in \mathcal{M}".
- page 150, line 4: replace "B_k" by "H_k".
- page 152, line -2: replace "minimizing geodesic" by "geodesic in $\mathcal{V}$".
- page 153, second line of the proof of Lemma 7.4.8: replace "grad f(v)" by "grad f(x)".
- page 154, first line of the proof of Lemma 7.4.10: insert "\setminus\{v\}" at the end of the sentence.
- page 164, Algorithm 12, lines 4, 9, and 10: replace \Delta by \Delta^2, and replace the Frobenius metric by the nonstandard metric (7.55), i.e., insert (Y^TBY)^{-1} in the trace.
- page 171, equation (8.4): replace x^T(0) by u^T (in the middle term).
- page 174, section 8.1.3: replace the two occurences of \mathcal{N} by \mathcal{M}.
- page 174: second displayed equation: replace the subscript R_{\mathcal{Y}}(\eta) in the left-hand side by Y+\overline{\eta}_Y.
- page 174: at the end of the page, add: provided that the right-hand side is a bona-fide horizontal lift. Since horizontality is guaranteed by construction, it remains to require that $\mathrm{D} \pi (\bar{x}+\bar\eta_{\bar{x}}) [\mathrm{P}^h_{\bar{x}+\bar\eta_{\bar{x}}} \bar{\xi}_{\bar{x}}]$ does not depend on the choice of $\bar{x}$ in the fiber $\pi^{-1}(x)$. Equivalently stated, for all real-valued functions $f$ on the quotient manifold,
$\frac{\mathrm{d}}{\mathrm{d}t} f( \bar{x}+\bar\eta_{\bar{x}} + t \mathrm{P}^h_{\bar{x}+\bar\eta_{\bar{x}}} \bar{\xi}_{\bar{x}} )|_{t=0}$
must not depend on the choice of $\bar{x}$ in the fiber $\pi^{-1}(x)$.
- page 179, line -15: "for all p" should be "for all \eta".
- page 181, line 6: remove the minus sign in the formula of alpha_k. (Observe that r_k is defined as b-Ax_k.)
- page 185, line -8: replace "\langle \xi, (DF (x))^* [F (x)] \rangle" by "g(\xi, (DF (x))^* [F (x)])".
- page 189, line 4: n rows
- page 196, line -4: D qf(X)[Z] = Q \rho_{skew}(Q^TZR^{-1}) + (I-QQ^T)ZR^{-1}