Corporate Profile

Download Report

Transcript Corporate Profile

PBG 650 Advanced Plant Breeding
Mathematical Statistics Concepts
–Probability laws
–Binomial distributions
–Mean and Variance of Linear Functions
Probability laws
•
the probability that either A or B occurs (union)
Pr( A  B)  Pr( A )  Pr(B)  Pr( A  B)
•
•
the probability that both A and B occur (intersection)
the joint probability of A and B
Pr( A  B)  Pr( A,B)
•
the conditional probability of B given A
Pr( A,B)
Pr( B A ) 
Pr( A )
Statistical Independence
• If events A and B are independent, then
Pr( B A )  Pr( B)
Pr( A B)  Pr( A )
Pr( A,B)  Pr( A ) Pr( B)
Bayes’ Theorum (Bayes’ Rule)
•
•
Pr( A B ) 
Conditional probability
Bayes’ Theorum
Pr( A B) 
Pr( A, B )
Pr(B )
Pr(B A)Pr  A 
Pr(B)
.
Pr(A) is called the prior probability
Pr(A|B) is called the posterior probability
Pr( Aj B ) 
Pr(B Aj )Pr  Aj 
k
Pr(B A )Pr  A 
i 1
i
i
Probabilities
Plant Height
A1A1
A1A2
A2A2
Marginal
Prob.
height ≤ 50 cm
0.10
0.14
0.06
0.30
50 < height ≤ 75
0.04
0.18
0.10
0.32
height > 75 cm
0.02
0.16
0.20
0.38
Marginal Prob.
0.16
0.48
0.36
1.00
Marginal Probability:
Pr(Genotype= A1A1) = 0.16
Joint Probability:
Pr(Genotype= A1A1, height ≤ 50) = 0.10
Conditional Probability:
Pr height  50 Genotype  A1A1 

Pr (height  50, Genotype  A1A1 ) 0.10

 0.625
Pr( Genotype  A1A1 )
0.16
Statistical Independence
If X is statistically independent of Y, then their joint
probability is equal to the product of the marginal
probabilities of X and Y
A1A1
A1A2
A2A2
Marginal
Prob.
height ≤ 50 cm
0.0480
0.1440
0.1080
0.30
50 < height ≤ 75
0.0512
0.1536
0.1152
0.32
height > 75 cm
0.0608
0.1824
0.1368
0.38
Marginal Prob.
0.16
0.48
0.36
1.00
Plant Height
Discrete probability distributions
•
Let x be a discrete random variable that can take on
a value Xi, where i = 1, 2, 3,…
•
The probability distribution of x is described by
specifying Pi = Pr(Xi) for every possible value of Xi
•
•
•
0 ≤ Pr(Xi) ≤ 1 for all values of Xi
ΣiPi = 1
The expected value of X is
E(X) = ΣiXiPr(Xi) =X
Binomial Probability Function
•
A Bernoulli random variable can have a value of one or zero.
The Pr(X=1) = p, which can be viewed as the probability of
success. The Pr(X=0) is 1-p.
•
A binomial distribution is derived from a series of
independent Bernouli trials. Let n be the number of trials and
y be the number of successes.
– Calculate the number of ways to obtain that result:
n!
n 
y
y! (n  y )!
– Calculate the probability of that result:


n!
n y
Pr( y ) 
p y 1  p
y! (n  y )!

Probability Function
Binomial Distribution
Probability
Binomial Distribution (n=20, p=0.5)
0.20
0.18
0.16
0.14
0.12
0.10
0.08
0.06
0.04
0.02
0.00
0
1
2
3
4
5
6
7
8
9 10 11 12 13 14 15 16 17 18 19 20
Number of successes
Average = np = 20*0.5 = 10
Variance = np(1-p) = 20*0.5*(1-0.5) = 5
For a normal distribution, the variance is independent of the mean
For a binomial distribution, the variance changes with the mean
Mean and variance of linear functions
•
Mean and variance of a constant (c)
E (c )  c
 0
2
c
•
Adding a constant (c) to a random variable Xi
E( X  c )  E( X )  c
the mean increases by the
value of the constant

the variance remains the same
2
X c

2
X
Mean and variance of linear functions
•
Multiplying a random variable by a constant
E(cX )  c[E( X )]
multiply the mean by the constant

multiply the variance by the
square of the constant
2
cX
c 
2
2
X
Adding two random variables X and Y
E( X  Y )  E( X )  E(Y )
mean of the sum is the sum
of the means
(2X Y )  VAR( X )  VAR(Y )  2COVXY
variance of the sum  the sum of the variances if the variables
are independent
Variance - definition
• The variance of variable X
V ( X )  E ( X i  X )2   E ( X i2 )  X2
• Usual formula
X


 
2
X
i
 X
n
2
2


X


 i  n
   X i2 
n




• Formula for frequency data (weighted)
V ( X )   X2   f i X i -  X2
Covariance - definition
• The covariance of variable X and variable Y
Cov ( X ,Y )  E ( X  X )(Y  Y )  E ( XY )  X Y
• Usual formula
X


XY 
i
 X  Yi  Y 
n

X i  Yi 

   Xi Yi 
 n
n


• Formula for frequency data (weighted)
Cov ( X , Y )   XY   f i X iYi -  X Y