Expected distance between two random vectors in n-dimensional space
Hi, Any help with this appreciated.
If I have two column vectors
and
, where the
elements of each are drawn independently from a Guassian
, then the distance,
,between them is given by
^\mathsf{T}(\mathbf{a} - \mathbf{b})})
But what is the expected value of
?
I reckon, in the case where
, it's ![\mathbb{E}[d] = \sqrt{2n\sigma^2}](http://latex.codecogs.com/png.latex?\mathbb{E}[d] = \sqrt{2n\sigma^2})
My feeling is that a non-zero
should make no difference, as it's just shifting the origin, so to speak, and the distance is determined by the relative positions of the vectors.
Can anyone prove the the general case (for
) or show it to be wrong, and if wrong, say what it is in fact in the general case?
I notice also that my expression bears resemblance to the denominator in the normalising term in the Gaussian pdf
, except that n takes the place of
. Is this coincidence, or does it reflect something deeper?
Thanks in advance. MD
Re: Expected distance between two random vectors in n-dimensional space
Hey Mathsdog.
What did you get for the distribution for the distance d? (Hint: think about the sum of products of normal first and then the square root of that final answer).
Re: Expected distance between two random vectors in n-dimensional space
Right, sorted I reckon. Good hint Chiro. It all comes down to the
distribution
For the square of the norm of
, i.e.
^{\mathsf{T}}(\mathbf{a}- \mathbf{b}) = \mathbf{c}^{\mathsf{T}}\mathbf{c}= d^2)
we first note that for the elements of
denoted 
)
where

i.e. the sum of the variances of the elements of
and 
From the fact that the elements of
are distributed as follows
it follows then that
)
![So, \mathbb{E}[1/\sigma^{2}_{c} \cdot \mathbf{c}^{\mathsf{T}}\mathbf{c}] =1/\sigma^{2}_{c} \cdot \mathbb{E}[\mathbf{c}^{\mathsf{T}}\mathbf{c}] = n](http://latex.codecogs.com/png.latex?So, \mathbb{E}[1/\sigma^{2}_{c} \cdot \mathbf{c}^{\mathsf{T}}\mathbf{c}] =1/\sigma^{2}_{c} \cdot \mathbb{E}[\mathbf{c}^{\mathsf{T}}\mathbf{c}] = n)
So, ![\mathbb{E}[\mathbf{c}^{\mathsf{T}}\mathbf{c}]=n \sigma^{2}_{c}= n(\sigma^{2}_{a}+\sigma^{2}_{b})](http://latex.codecogs.com/png.latex?\mathbb{E}[\mathbf{c}^{\mathsf{T}}\mathbf{c}]=n \sigma^{2}_{c}= n(\sigma^{2}_{a}+\sigma^{2}_{b}))
Where 
then
And since the distance between the two vectors
and
is always just the (positive) square root of this squared norm, d is, as previously hypothesised, given by the following
![\mathbb{E}[d] = \sqrt{2n\sigma_{ab}^2}](http://latex.codecogs.com/png.latex?\mathbb{E}[d] = \sqrt{2n\sigma_{ab}^2})
Does that look right to you Chiro?
I think the general case for a non-0 mean would follow from a similar treatment using the noncentral chi squared distribution, but I haven't worked the details out yet.
Thanks again. MD