Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
F-test
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Formula and calculation== Most ''F''-tests arise by considering a decomposition of the [[variance|variability]] in a collection of data in terms of [[Partition of sums of squares|sums of squares]]. The [[test statistic]] in an ''F''-test is the ratio of two scaled sums of squares reflecting different sources of variability. These sums of squares are constructed so that the statistic tends to be greater when the null hypothesis is not true. In order for the statistic to follow the [[F-distribution|''F''-distribution]] under the null hypothesis, the sums of squares should be [[independence (probability theory)|statistically independent]], and each should follow a scaled [[chi-squared distribution|ΟΒ²-distribution]]. The latter condition is guaranteed if the data values are independent and [[normal distribution|normally distributed]] with a common [[variance]]. === One-way analysis of variance === The formula for the one-way '''ANOVA''' ''F''-test [[test statistic|statistic]] is :<math>F = \frac{\text{explained variance}}{\text{unexplained variance}} ,</math> or :<math>F = \frac{\text{between-group variability}}{\text{within-group variability}}.</math> The "explained variance", or "between-group variability" is :<math> \sum_{i=1}^{K} n_i(\bar{Y}_{i\cdot} - \bar{Y})^2/(K-1) </math> where <math>\bar{Y}_{i\cdot}</math> denotes the [[average|sample mean]] in the ''i''-th group, <math>n_i</math> is the number of observations in the ''i''-th group, <math>\bar{Y}</math> denotes the overall mean of the data, and <math>K</math> denotes the number of groups. The "unexplained variance", or "within-group variability" is :<math> \sum_{i=1}^{K}\sum_{j=1}^{n_{i}} \left( Y_{ij}-\bar{Y}_{i\cdot} \right)^2/(N-K), </math> where <math>Y_{ij}</math> is the ''j''<sup>th</sup> observation in the ''i''<sup>th</sup> out of <math>K</math> groups and <math>N</math> is the overall sample size. This ''F''-statistic follows the [[F-distribution|''F''-distribution]] with degrees of freedom <math>d_1=K-1</math> and <math>d_2=N-K</math> under the null hypothesis. The statistic will be large if the between-group variability is large relative to the within-group variability, which is unlikely to happen if the [[expected value|population means]] of the groups all have the same value. [[File:5% F table.jpg|thumb|F Table: Level 5% Critical values, containing degrees of freedoms for both denominator and numerator ranging from 1-20]] The result of the F test can be determined by comparing calculated F value and critical F value with specific significance level (e.g. 5%). The F table serves as a reference guide containing critical F values for the distribution of the F-statistic under the assumption of a true null hypothesis. It is designed to help determine the threshold beyond which the F statistic is expected to exceed a controlled percentage of the time (e.g., 5%) when the null hypothesis is accurate. To locate the critical F value in the F table, one needs to utilize the respective degrees of freedom. This involves identifying the appropriate row and column in the F table that corresponds to the significance level being tested (e.g., 5%).<ref>{{Citation |last=Siegel |first=Andrew F. |title=Chapter 15 - ANOVA: Testing for Differences Among Many Samples and Much More |date=2016-01-01 |url=https://www.sciencedirect.com/science/article/pii/B9780128042502000158 |work=Practical Business Statistics (Seventh Edition) |pages=469β492 |editor-last=Siegel |editor-first=Andrew F. |access-date=2023-12-10 |publisher=Academic Press |doi=10.1016/b978-0-12-804250-2.00015-8 |isbn=978-0-12-804250-2|url-access=subscription }}</ref> How to use critical F values: If the F statistic < the critical F value * Fail to reject null hypothesis * Reject alternative hypothesis * There is no significant differences among sample averages * The observed differences among sample averages could be reasonably caused by random chance itself * The result is not statistically significant If the F statistic > the critical F value * Accept alternative hypothesis * Reject null hypothesis * There is significant differences among sample averages * The observed differences among sample averages could not be reasonably caused by random chance itself * The result is statistically significant Note that when there are only two groups for the one-way ANOVA ''F''-test, <math>F = t^{2}</math>where ''t'' is the [[Student's t-test|Student's <math>t</math> statistic]]. ==== Advantages ==== * Multi-group comparison efficiency: facilitating simultaneous comparison of multiple groups, enhancing efficiency particularly in situations involving more than two groups. * Clarity in variance comparison: offering a straightforward interpretation of variance differences among groups, contributing to a clear understanding of the observed data patterns. * Versatility across disciplines: demonstrating broad applicability across diverse fields, including social sciences, natural sciences, and engineering. ==== Disadvantages ==== * Sensitivity to assumptions: the F-test is highly sensitive to certain assumptions, such as homogeneity of variance and normality which can affect the accuracy of test results. * Limited scope to group comparisons: the F-test is tailored for comparing variances between groups, making it less suitable for analyses beyond this specific scope. * Interpretation challenges: the F-test does not pinpoint specific group pairs with distinct variances. Careful interpretation is necessary, and additional post hoc tests are often essential for a more detailed understanding of group-wise differences. ===Multiple-comparison ANOVA problems=== The ''F''-test in one-way analysis of variance ([[ANOVA]]) is used to assess whether the [[expected value]]s of a quantitative variable within several pre-defined groups differ from each other. For example, suppose that a medical trial compares four treatments. The ANOVA ''F''-test can be used to assess whether any of the treatments are on average superior, or inferior, to the others versus the null hypothesis that all four treatments yield the same mean response. This is an example of an "omnibus" test, meaning that a single test is performed to detect any of several possible differences. Alternatively, we could carry out pairwise tests among the treatments (for instance, in the medical trial example with four treatments we could carry out six tests among pairs of treatments). The advantage of the ANOVA ''F''-test is that we do not need to pre-specify which treatments are to be compared, and we do not need to adjust for making [[multiple comparisons]]. The disadvantage of the ANOVA ''F''-test is that if we reject the [[null hypothesis]], we do not know which treatments can be said to be significantly different from the others, nor, if the ''F''-test is performed at level Ξ±, can we state that the treatment pair with the greatest mean difference is significantly different at level Ξ±. ===Regression problems=== {{further|Stepwise regression}} Consider two models, 1 and 2, where model 1 is 'nested' within model 2. Model 1 is the restricted model, and model 2 is the unrestricted one. That is, model 1 has ''p''<sub>1</sub> parameters, and model 2 has ''p''<sub>2</sub> parameters, where ''p''<sub>1</sub> < ''p''<sub>2</sub>, and for any choice of parameters in model 1, the same regression curve can be achieved by some choice of the parameters of model 2. One common context in this regard is that of deciding whether a model fits the data significantly better than does a naive model, in which the only explanatory term is the intercept term, so that all predicted values for the dependent variable are set equal to that variable's sample mean. The naive model is the restricted model, since the coefficients of all potential explanatory variables are restricted to equal zero. Another common context is deciding whether there is a structural break in the data: here the restricted model uses all data in one regression, while the unrestricted model uses separate regressions for two different subsets of the data. This use of the F-test is known as the [[Chow test]]. The model with more parameters will always be able to fit the data at least as well as the model with fewer parameters. Thus typically model 2 will give a better (i.e. lower error) fit to the data than model 1. But one often wants to determine whether model 2 gives a ''significantly'' better fit to the data. One approach to this problem is to use an ''F''-test. If there are ''n'' data points to estimate parameters of both models from, then one can calculate the ''F'' statistic, given by :<math>F=\frac{\left(\frac{\text{RSS}_1 - \text{RSS}_2 }{p_2 - p_1}\right)}{\left(\frac{\text{RSS}_2}{n - p_2}\right)} = \frac{\text{RSS}_1 - \text{RSS}_2 }{\text{RSS}_2} \cdot \frac{n - p_2}{p_2 - p_1},</math> where RSS<sub>''i''</sub> is the [[residual sum of squares]] of model ''i''. If the regression model has been calculated with weights, then replace RSS<sub>''i''</sub> with Ο<sup>2</sup>, the weighted sum of squared residuals. Under the null hypothesis that model 2 does not provide a significantly better fit than model 1, ''F'' will have an ''F'' distribution, with (''p''<sub>2</sub>β''p''<sub>1</sub>, ''n''β''p''<sub>2</sub>) [[Degrees of freedom (statistics)|degrees of freedom]]. The null hypothesis is rejected if the ''F'' calculated from the data is greater than the critical value of the [[F-distribution|''F''-distribution]] for some desired false-rejection probability (e.g. 0.05). Since ''F'' is a monotone function of the likelihood ratio statistic, the ''F''-test is a [[likelihood ratio test]].
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)