Hi

I am really confused with the "transpose" related stuff.

Can anyone work me through it?

and the paritial differentiation with respect to it

===========================

why is

$\displaystyle (Y-X\beta)^T(Y-X\beta)=Y^TY-2\beta^TX^TY^T+\beta^TX^TX\beta$

instead of

$\displaystyle (Y-X\beta)^T(Y-X\beta)

=(Y^T-\beta^TX^T)(Y-X\beta)

=Y^TY-Y^TX\beta-\beta^TX^TY+\beta^TX^TX\beta$

and differentiate with respect to beta is

$\displaystyle =-2X^TY+X^TX\beta$