Originally Posted by

**MikeChch** I studied this a while ago and am a bit rusty on exactly how to get the results I need. I basically have two sets of data and I want to see whether an increase in one is related to an increase in the other. Do I use a regression formula or a different test?

I have attached the data, graph and my conclusion. Please could someone have a look at it and either confirm I'm correct or shoot me down in flames, tell me I have got it totally wrong, and point me in the right direction. I used the Data Analysis tool in Excel rather than working the equations out myself.

Any help is greatly appreciated.

How you do this depends on what you know.

You start with the null hypothesis that there is no connection between the contribution and the starting total.

Now you have a data set $\displaystyle (ST_i,CT_i),\ i=1,\dots,n$ and you need to construct a test to see if the null hypothesis can be rejected (I would use a non-parametric test myself, but what you use depends on what you know).

Example: The data below is sorted by starting total:

Code:

Contribution    Jackpot starting total
       88385                     28383
       58365                     36949
       71006                     45247
       96025                     48976
       75226                     64880
      100266                    131099
      109887                    138474
      108119                    142141
      102729                    154863
      127109                    155973

If there is no relation between the two columns then the number $\displaystyle N$ of entries in the first five positions of the first column that are less than 100000 is a binomial RV $\displaystyle B(5,0.5)$. We actually observe $\displaystyle N=5$, so we can ask what the probability of observing $\displaystyle 5$ or more is under the null hypothesis: $\displaystyle P(N\ge 5)=\sum_{n\ge 5} b(n,5,0.5) = b(5,5,0.5)=0.5^5\approx 0.031$. This is below $\displaystyle 0.05$, so the null hypothesis would be rejected at the 5% level.
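If you would rather check the arithmetic than trust it, here is a minimal Python sketch of the count and the binomial tail described above. The data are copied from the table; the names `THRESHOLD`, `binomial_pmf` and `upper_tail` are my own labels for illustration, and the $\displaystyle B(5,0.5)$ model is simply the one assumed in this post.

```python
from math import comb

# (contribution, jackpot starting total) pairs copied from the table above
data = [
    (88385, 28383),
    (58365, 36949),
    (71006, 45247),
    (96025, 48976),
    (75226, 64880),
    (100266, 131099),
    (109887, 138474),
    (108119, 142141),
    (102729, 154863),
    (127109, 155973),
]

THRESHOLD = 100000  # cut-off on the contribution used in the post


def binomial_pmf(k, n, p):
    """b(k, n, p): probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def upper_tail(k, n, p):
    """P(N >= k) for N ~ B(n, p)."""
    return sum(binomial_pmf(j, n, p) for j in range(k, n + 1))


# Sort by starting total and look at the five smallest starting totals
rows = sorted(data, key=lambda row: row[1])
first_five = rows[:5]

# Count how many of those five contributions fall below the threshold
observed = sum(1 for contribution, _ in first_five if contribution < THRESHOLD)

# Probability of seeing at least that many under the B(5, 0.5) null model
p_value = upper_tail(observed, 5, 0.5)

print(observed, round(p_value, 3))  # prints: 5 0.031
```

The same two helper functions can be reused with a different threshold or split point if you want to see how sensitive the result is to those choices.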

CB