Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Consistency (statistics)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
In [[statistics]], '''consistency''' of procedures, such as computing [[confidence interval]]s or conducting [[hypothesis test]]s, is a desired property of their behaviour as the number of items in the data set to which they are applied increases indefinitely. In particular, consistency requires that as the dataset size increases, the outcome of the procedure approaches the correct outcome.<ref name=Dodge>{{cite book |last=Dodge |first=Y. |year=2003 |title=The Oxford Dictionary of Statistical Terms |publisher=OUP |isbn=0-19-920613-9 }} (entries for consistency, consistent estimator, consistent test)</ref> Use of the term in statistics derives from Sir [[Ronald Fisher]] in 1922.<ref>{{cite book |last1=Upton |first1=G. |last2=Cook |first2=I. |year=2006 |title=Oxford Dictionary of Statistics |edition=2nd |publisher=OUP |isbn=978-0-19-954145-4 }}</ref> Use of the terms ''consistency'' and ''consistent'' in statistics is restricted to cases where essentially the same procedure can be applied to any number of data items. In complicated applications of statistics, there may be several ways in which the number of data items may grow. For example, records for rainfall within an area might increase in three ways: records for additional time periods; records for additional sites with a fixed area; records for extra sites obtained by extending the size of the area. In such cases, the property of consistency may be limited to one or more of the possible ways a sample size can grow. ==Estimators== A [[consistent estimator]] is one for which, when the estimate is considered as a [[random variable]] indexed by the number ''n'' of items in the data set, as ''n'' increases the estimates [[Convergence of random variables|converge in probability]] to the value that the estimator is designed to estimate. An estimator that has [[Fisher consistency]] is one for which, if the estimator were applied to the entire population rather than a sample, the true value of the estimated parameter would be obtained. ==Tests== {{Main|Statistical hypothesis testing}} A [[Statistical hypothesis testing#Definition of terms|consistent test]] is one for which the [[statistical power|power]] of the test for a fixed untrue hypothesis increases to one as the number of data items increases.<ref name=Dodge/> ==Classification== In [[statistical classification]], a consistent classifier is one for which the probability of correct classification, given a training set, approaches, as the size of the training set increases, the best probability theoretically possible if the population distributions were fully known. == Relationship to unbiasedness == An estimator or test may be consistent without being unbiased.<ref>{{Cite book |last=Vaart |first=A. W. van der |url=http://dx.doi.org/10.1017/cbo9780511802256 |title=Asymptotic Statistics |date=1998-10-13 |publisher=Cambridge University Press |isbn=978-0-511-80225-6}}</ref> A classic example is the [[sample standard deviation]] which is a biased estimator, but converges to the expected [[standard deviation]] almost surely by the [[law of large numbers]]. Phrased otherwise, unbiasedness is not a requirement for consistency, so [[Biased estimator|biased estimators]] and tests may be used in practice with the expectation that the outcomes are reliable, especially when the sample size is large (recall the definition of consistency). In contrast, an estimator or test which is not consistent may be difficult to justify in practice, since gathering additional data does not have the asymptotic guarantee of improving the quality of the outcome. ==See also== * [[Consistent estimator]] * [[Homogeneity (statistics)]] * [[Internal consistency]] * [[Reliability (statistics)]] ==References== {{reflist}} [[Category:Asymptotic theory (statistics)]]
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)
Pages transcluded onto the current version of this page
(
help
)
:
Template:Cite book
(
edit
)
Template:Main
(
edit
)
Template:Reflist
(
edit
)