Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Standard deviation
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
===Corrected sample standard deviation=== If the ''biased [[sample variance]]'' (the second [[central moment]] of the sample, which is a downward-biased estimate of the population variance) is used to compute an estimate of the population's standard deviation, the result is <math display="block">s_N = \sqrt{\frac{1}{N} \sum_{i=1}^N \left(x_i - \bar{x}\right)^2}.</math> Here taking the square root introduces further downward bias, by [[Jensen's inequality]], due to the square root's being a [[concave function]]. The bias in the variance is easily corrected, but the bias from the square root is more difficult to correct, and depends on the distribution in question. An unbiased estimator for the ''variance'' is given by applying [[Bessel's correction]], using ''N'' β 1 instead of ''N'' to yield the ''unbiased sample variance,'' denoted ''s''<sup>2</sup>: <math display="block">s^2 = \frac{1}{N - 1} \sum_{i=1}^N \left(x_i - \bar{x}\right)^2.</math> This estimator is unbiased if the variance exists and the sample values are drawn independently with replacement. ''N'' β 1 corresponds to the number of [[Degrees of freedom (statistics)|degrees of freedom]] in the vector of deviations from the mean, <math>\textstyle(x_1 - \bar{x},\; \dots,\; x_n - \bar{x}).</math> Taking square roots reintroduces bias (because the square root is a nonlinear function which does not [[Commutative property|commute]] with the expectation, i.e. often <math display="inline">E[\sqrt{X}]\neq \sqrt{E[X]}</math>), yielding the ''corrected sample standard deviation,'' denoted by ''s:'' <math display="block">s = \sqrt{\frac{1}{N-1} \sum_{i=1}^N \left(x_i - \bar{x}\right)^2}.</math> As explained above, while ''s''<sup>2</sup> is an unbiased estimator for the population variance, ''s'' is still a biased estimator for the population standard deviation, though markedly less biased than the uncorrected sample standard deviation. This estimator is commonly used and generally known simply as the "sample standard deviation". The bias may still be large for small samples (''N'' less than 10). As sample size increases, the amount of bias decreases. We obtain more information and the difference between <math>\frac{1}{N}</math> and <math>\frac{1}{N-1}</math> becomes smaller.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)