# [SOLVED] Advanced Multivariable Calculus: Constrained Extrema Test

• October 9th 2008, 12:20 AM
itai
[SOLVED] Advanced Multivariable Calculus: Constrained Extrema Test
Hello,

I'm reading C.H. Edwards' "Advanced Calculus of Several Variables". I don't understand completely his proof of theorem II.8.9, concerning a "second derivative test" for constrained (Lagrange multiplier) maximum-minimum problems. Specifically, i don't understand the last paragraph of the proof, explaining why the condition on delta can be satisfied. I'd appreciate a more detailed and more formal treatment of this point.

This theorem, in its general form as presented in Edwards, is hard to find in other books. So i've emailed Prof. Edwards, asking him for a clarification of the point stated above, offering in exchange a list of typos and other errors i'd found in the book. But it's been several days now, and i haven't received any answer from him.
• October 13th 2008, 12:39 AM
itai
The relevant pages
Attached are the relevant pages from C. H. Edwards, Jr.'s "Advanced Calculus of Several Variables", Dover 1994 (an unabridged, corrected republication of the work first published by Academic Press, New York, 1973). The book can be purchased online in Amazon and Barnes & Noble, for instance.
• October 20th 2008, 02:59 AM
itai
My book review on Amazon
If anyone's interested, i've just posted a book review of this title on Amazon.
• October 20th 2008, 09:26 AM
Laurent
Quote:

Originally Posted by itai
Attached are the relevant pages from C. H. Edwards, Jr.'s "Advanced Calculus of Several Variables", Dover 1994 (an unabridged, corrected republication of the work first published by Academic Press, New York, 1973). The book can be purchased online in Amazon and Barnes & Noble, for instance.

Here's a (very detailed) explaination for the anotated paragraph (which indeed the author should have developped further, I think):

Recall $m$ is the minimum value of $q$ on the unit ball of $T_a$. Notice this is also the minimum of $q\left(\frac{h}{|h|}\right)$ over the non-zero $h$ in $T_a$. Since $q$ is positive definite, we have $m>0$.

We can also define $M$ as the maximum of $|q|$ on the unit sphere of $\mathbb{R}^d$. Then for every $h\in\mathbb{R}^d$, $|q(h)|\leq M|h|^2$.

Let $\varepsilon_0>0$ (we'll choose its value later).

The "tough" part is to show that if $h$ is in a neighboorhood of zero and $a+h\in M$, then $\frac{h}{|h|}$ is uniformly close to the tangent space $T_a$. Then, we need to manipulate the quadratic form with care to estimate $q(h)$.

The tangent space $T_a$ is the orthogonal subspace of the subspace $(T_a)^\perp$ spanned by the vectors $\nabla G_1(a),\ldots,\nabla G_m(a)$.
As a consequence, there exist real numbers $\lambda_1,\ldots,\lambda_m$ such that the orthogonal projection on $(T_a)^\perp$ writes, for all $h\in \mathbb{R}^n$, $p_{(T_a)^\perp}(h)=\lambda_1 (\nabla G_1(a)\cdot h)\nabla G_1(a) + \cdots + \lambda_m (\nabla G_m(a)\cdot h) \nabla G_m(a)$.

On the other hand, for every $i$, $G_i(a+h)=\nabla G_i(a) \cdot h + o(|h|)$ and, because $G_i$ is $\mathcal{C}^2$, the small o can be made uniform: for every $\varepsilon>0$, there exists $\delta>0$ such that, if $|h|<\delta$, $|G_i(a+h)-\nabla G_i(a)\cdot h|< \varepsilon |h|$, and hence, if in addition $a+h\in M$, then $|\nabla G_i(a)\cdot h|<\varepsilon |h|$.

As a consequence of the last two paragraphs, there exists $\delta>0$ such that, for every $h\in\mathbb{R}^n$, if $|h|<\delta$ and $a+h\in M$, then $|p_{(T_a)^\perp}(h)|< \varepsilon_0 |h|$. As a consequence, $|q(p_{(T_a)^\perp}(h))|.

We also deduce that if $|h|<\delta$ and $a+h\in M$, then $|p_{T_a}(h)|=|h-p_{(T_a)^\perp}(h)|\geq |h|-|p_{(T_a)^\perp}(h)| > (1- \varepsilon_0)|h|$, so that, since $p_{T_a}(h)\in T_a$, $q(p_{T_a}(h))=|p_{T_a}(h)|^2 q\left(\frac{p_{T_a}(h)}{|p_{T_a}(h)|}\right) > (1-\varepsilon_0)^2 |h|^2 m$.

Now, $q(h)=q\left(p_{T_a}(h)+p_{(T_a)^\perp}(h)\right)$. We would like to use the two previous bounds. There is no triangular inequality for $q$ on $\mathbb{R}^n$, but we can write, for every $h,k\in\mathbb{R}^n$, $q(h+k)=q(h)+q(k)+\varphi(h,k)\geq q(h)-|q(k)|-C |h||k|$ where $\varphi$ is the bilinear form associated with $q$ and $C$ is a positive constant (this is the maximum of $\varphi(h,k)$ for $h,k$ on the unit sphere of $\mathbb{R}^n$). We deduce, in our situation: if $|h|<\delta$ and $a+h\in M$, then
$q(h)\geq q(p_{T_a}(h))-|q(p_{(T_a)^\perp}(h))| - C |p_{T_a}(h)||p_{(T_a)^\perp}(h)|$ $\geq (1-\varepsilon_0)^2 m|h|^2 - M \varepsilon_0^2 |h|^2 - C\varepsilon_0^2 |h|^2$
(I used the fact that $|p_{T_a}(h)|\leq |h|$ (by Pythagoras'theorem)), hence:
$q\left(\frac{h}{|h|}\right)\geq (1-\varepsilon_0)^2 m - M \varepsilon_0^2 - C\varepsilon_0^2.$
If $\varepsilon_0$ is chosen small enough, the latter quantity can be made arbitrarily close to $m$ (from below).

As a conclusion: choose $\varepsilon_0>$ such that $(1-\varepsilon_0)^2 > \frac{3}{4}$ and $(M + C)\varepsilon_0^2<\frac{m}{4}$ (these quantities do only depend on $q$, not on $\delta$ of course). Then, if $|h|<\delta$ and $a+h\in M$,
$q\left(\frac{h}{|h|}\right)\geq \frac{3}{4}m - \frac{1}{4}m = \frac{m}{2}$. This is what we wanted.

(and the first condition on $\delta$ is just a consequence of the Taylor expansion)
• October 20th 2008, 11:48 AM
itai
Thank you very much!
Hi Laurent,

Thank you very much. That's marvellous!

There's just one proposition in your proof i fail to understand, namely:
"there exist real numbers http://www.mathhelpforum.com/math-he...3ee78c6c-1.gif such that the orthogonal projection on http://www.mathhelpforum.com/math-he...4740d23b-1.gif writes, for all http://www.mathhelpforum.com/math-he...8d748843-1.gif, http://www.mathhelpforum.com/math-he...6fb6fdfb-1.gif."

The left side of the equation is an n-dimentional vector (in an m-dimensional vector subspace), while the right side is a real number.
• October 20th 2008, 01:04 PM
Laurent
Quote:

Originally Posted by itai
There's just one proposition in your proof i fail to understand, namely:
"there exist real numbers http://www.mathhelpforum.com/math-he...3ee78c6c-1.gif such that the orthogonal projection on http://www.mathhelpforum.com/math-he...4740d23b-1.gif writes, for all http://www.mathhelpforum.com/math-he...8d748843-1.gif, http://www.mathhelpforum.com/math-he...6fb6fdfb-1.gif."

The left side of the equation is an n-dimentional vector (in an m-dimensional vector subspace), while the right side is a real number.

Thank you, I fix it right now. It should be: $p_{(T_a)^\perp}(h)=\lambda_1 (\nabla G_1(a)\cdot h)\nabla G_1(a) + \cdots + \lambda_m (\nabla G_m(a)\cdot h) \nabla G_m(a)$.

If the $\nabla G_i(a)$ are orthonormal, the projection on $(T_a)^\perp$ is just the sum of the projections $h\mapsto (\nabla G_i(a)\cdot h)\nabla G_i(a)$. Otherwise, we can apply a Gram-Schmidt orthonormalization and we get $p_{(T_a)^\perp}$ as a linear combination of the previous projections.