Page 44 - 4660
P. 44

Discrete random variables


                                 p = 0.2
                                                                                         p = 0.5
                                                                       P(X = x)
                     P(X = x)
                                                                        0.40
                     0.40

                     0.20                                               0.20

                     0.00                              x                0.00                             x
                         0    1    2     3    4     5                      0    1     2    3    4     5
                     P(X = x)
                     0.80


                     0.60
                               p = 0.8

                     0.40


                     0.20


                     0.00                              x
                         0    1    2     3    4     5

                            Figure 5.5 – Some typical geometric distributions with various values p.


               The hypergeometric distribution. We saw that the probability of obtaining x successes in n
               independent trials was given by the binomial distribution. Suppose that these n ’trials’ actually
               consist of drawing at random n balls, from a set of N such balls of which M are red and the rest
               white. Let us consider the random variable X = number of red balls drawn.
                   On the one hand, if the balls are drawn with replacement then the trials are independent and
               the probability of drawing a red ball is p = M/N each time. Therefore, the probability of drawing
               x red balls in n trials is given by the binomial distribution as

                                                               n!
                                                                      x
                                             P(X = x) =              p (1 − p) n−x .
                                                           x!(n − x)!
               On the other hand, if the balls are drawn without replacement the trials are not independent and
               the probability of drawing a red ball depends on how many red balls have already been drawn.
               We can, however, still derive a general formula for the probability of drawing x red balls in n trials,
                                                                                  x
               as follows. The number of ways of drawing x red balls from M is C , and the number of ways of
                                                                                  M
               drawing n − x white balls from N − M is C   n−x  . Therefore, the total number of ways to obtain x
                                                           N−M
                                          n−x
                                      x
               red balls in n trials is C C N−M . However, the total number of ways of drawing n objects from N
                                      M
                          n
               is simply C . Hence the probability of obtaining x red balls in n trials is
                          N
                                    x
                                   C C  n−x         M!              (N − M)!           n!(N − n)!
                     P(X = x) =     M   N−M  =                                                     =      (5.16)
                                        n
                                      C N       x!(M − x)! (n − x)!(N − M − n + x)!        N!
                                                     (Np)!(Nq)!n!(N − n)!
                                           =                                       ,                      (5.17)
                                              x!(Np − x)!(n − x)!(Nq − n + x)!N!
               where in the last line p = M/N and q = 1 − p. This is called the hypergeometric distribution.
                   By performing the relevant summations directly, it may be shown that the hypergeometric
               distribution has mean E(x) = n   M  = np and variance
                                                N
                                                   nM(N − M)(N − n)         N − n
                                        Var(X) =                         =         npq.
                                                          2
                                                        N (N − 1)           N − 1
                                                              44
   39   40   41   42   43   44   45   46   47   48   49