Hypothesis testing for population proportion using one sample.
Here we are concerned with making an inference about a population proportion consisting of two categories(like Success or Failures, Pro-Villar or Anti-Villar) from data obtained in a sample of size
.
The null hypothesis is that the population proportion
.
Mean and variance of sample proportion
From the binomial distribution, we know that the mean would be
and the variance is 
In terms of counts X, the mean is
and the variance is
.
An estimate obtained from a sample of size $n$ would be denoted by p (computed as X/n where X is the number of successes).
Standard error of estimated p
The standard error or standard deviation of the estimate of proportion is given by 
The normalized test statistic.
Assuming n is large enough, the normal approximation can be used and the test statistic would be given by

In terms of counts, it is 
Confidence Intervals for population proportions.
Let
be the critical value(s) according to the level of significance
. For example the lower and critical critical values for a two sided test at the 5 percent level of significance is -1.96, and 1.96 but is 1.64 for a right single sided test.
Then for a two-sided test, we accept
depending on whether
or not respectively.
For convenience, here are the acceptance regions or intervals for various common values of
.
![]() |
left sided test : |
double sided test: |
right sided test: |
| 0.01 | [-2.36, ) |
[-2.58, 2.58] | ( , 2.36] |
| 0.05 | [-1.64, ) |
[-1.96, 1.96] | ( , 1.64] |
| 0.10 | [-1.28, ) |
[-1.64, 1.64] | ( , 1.28] |
The p-value of the test.
Suppose that we obtained the sample proportion p. Convert this to a test statistic
. Then the p-value would be given as
| Condition | Formula |
|---|---|
|
) |
|
) |
Here pnorm(
) is the area from (
) using the standard normal distribution. The double-sided test would have a p-value twice the p-values obtained for the single-sided tests.
The margin of error
The confidence interval corresponding to
confidence level coefficient is centered on p, and has upper and lower confidence limits of
where se is the standard error of the sample proportion. The margin of error is
.
Examples here!! TBD.
Had a tough time sorting out the latex problems! It basically involves the ‘<‘ sign inside latex formulas. changed it to \le so the html display would be all right.
)
, 2.36]
)
) 






