help needed from a fellow mathematician who has a much wider understanding of stats than myself: when calculating the std dev of a sample rather than the whole population, WHY on earth do I divide by n-1 rather than n? I know it's a newbie question, but I avoided stats like the plague all of my life, and now I'm forced to teach it... I'd like not
(
Read more... )
Comments 3
Intuitively, it is because the smaller the sample the less likely you are to see an extreme outlier. Therefore a smaller sample will tend to underestimate the mean distance from the mean for the whole population.
More technically, it is because the expected mean-difference-squared is a biased estimator for the variance.
If you want to do the working on this my hint is,
[ Xi - m(X) ] = Xi - (1/n)*(X1 + X2 + .... ) = (n-1/n)*Xi + (1/n)*X1 + ... (1/n)*X(i-1) + (1/n)*X(i+1) + ...
Hopefully, that's some help - although perhaps not a lot.
Reply
Reply
How is little Xavier doing after his escapades at the wedding?
Reply
Leave a comment