
Originally Posted by
ballofpopculture
Hi all,
I hope this is the right forum, it is certainly my best guess.
My problem is with a data set (I may come here with a lot of questions based on the research I am trying to do, so I will say this is the first).
Say I have a data set of 30 values. The first 28 values (for simplicity; the order in which they were received is random) all fall between 10 and 14, the last two values are greater than 200. I know these last two values are mistakes, and I think if you handed anyone this data set they could discern that the values seemed very off, but how do I prove it mathematically? Furthermore how do I get an average of the values, while putting as little weight as possible on the extreme outliers?
Thank You muchly.