Page 107 - 4660
P. 107

Fisher’s F-test


               where we have introduced the variable

                                                       Ns  2   ∑  (x i − ¯x) 2
                                                   u =       =    i         .                             (3.13)
                                                        σ 2         σ 2
                                                          0           0
                   We see that the rejection region λ < λ crit corresponds to a two-tailed rejection region on u
               given by 0 < u < a and b < u < ∞, where a and b are such that λ crit (a) = λ crit (b). In practice,
               however, it is difficult to determine a and b for a given significance level α, so a slightly different
               rejection region, which we now describe, is usually adopted. The sampling distribution P(u|H 0 )
                                                                                                          2
               maybefoundstraightforwardlyfromthesamplingdistributionofs.LetusfirstdetermineP(s |H 0 )
               by demanding that
                                                                   2
                                                                           2
                                                 P(s|H 0 )ds = P(s |H 0 )d(s ),
               from which we find
                                                              2 (N−1)/2
                                           P(s|H 0 )   (N/2σ )                        (  Ns  2  )
                                  2
                                                                         2 (N−2)/2
                              P(s |H 0 ) =          =         0        (s )       exp −         .         (3.14)
                                                           1
                                              2s        Γ( (N − 1))                       2σ 2
                                                           2                                0
                                                             2
                                                         2
               Thus, the sampling distribution of u = Ns /σ is given by
                                                             0
                                                                                 (      )
                                                          1                          1
                                    P(u|H 0 ) =                      u (N−3)/2  exp − u .
                                                           1
                                                2 (N−1)/2 Γ( (N − 1))                2
                                                           2
               We note, in passing, that the distribution of u is precisely that of an (N − 1)-th order chi-squared
               variable, i.e. u ∼ χ 2 N−1 . Although it does not give quite the best test, one then takes the rejection
               region to be 0 < u < a and b < u < ∞, with a and b chosen such that the two tails have equal
               areas; the advantage of this choice is that tabulations of the chi-squared distribution make the size
               of this region relatively easy to estimate. Thus, for a given significance level α, we have
                                        ∫  a               α     ∫  ∞                α
                                            P(u|H 0 )du =    and      P(u|H 0 )du =   .
                                          0                2       b                 2



               Example 3.4. Ten independent sample values x i , i = 1, 2, . . . , 10, are drawn at
               random from a Gaussian distribution with unknown mean µ and standard deviation σ.
               The sample values are as follows:


                                    2.22 2.56 1.07 0.24 0.18 0.95 0.73 − 0.79 2.09 1.81

               Test the null hypothesis H 0 : σ = 2 at the 10% significance level.                            ,

                                                     2
               Solution. For our null hypothesis σ = 2. Since for this sample s = 1.01 and N = 10, from
                                                     0
               (3.13) we have u = 5.10. For α = 0.1 we find, either numerically or using tables, that a = 3.30
               and b = 16.92. Thus, our rejection region is 0 < u < 3.33 and 16.92 < u < ∞. The value
               u = 5.10 from our sample does not lie in the rejection region, and so we cannot reject the null
                                 2
               hypothesis H 0 : σ = 2.

               We now turn to Fisher’s F-test. Let us suppose that two independent samples of sizes N 1 and N 2
                                                                                                  2
                                                                                       2
               are drawn from Gaussian distributions with means and variances µ 1 , σ and µ 2 , σ respectively,
                                                                                       1
                                                                                                  2
                                                                                                       2
                                                                                     2
                                                                                                 2
               and we wish to distinguish between the two hypotheses H 0 : σ = σ and H 1 : σ = σ . In this
                                                                               2
                                                                               1     2           1     2
               case, the generalised likelihood ratio is found to be
                                      (N 1 + N 2 ) (N 1 +N 2 )/2  [F(N 1 − 1)/(N 2 − 1)] N 1 /2
                                 λ =                     ·                                    ,
                                         N  N 1 /2 N N 2 /2  [1 + F(N 1 − 1)/(N 2 − 1)] (N 1 +N 2 )/2
                                           1     2
                                                              107
   102   103   104   105   106   107   108   109   110   111   112