Skip to main content

Confidence Interval: Basic Statistics Lecture Series Lecture #11

You'll remember last time, I covered hypothesis testing of proportions and the time before that, hypothesis testing of a sample with a mean and standard deviation.  This time, I'll cover the concept of confidence intervals.

Confidence intervals are of the form μ1-α ∈ (a, b)1-α, where a and b are two numbers such that a<b, α is the significance level as covered in hypothesis testing, and μ is the actual population mean (not the sample mean). This is a the statement of there being a [(1-α)*100]% probability that the true population mean will be somewhere between a and b.  The obvious question is "How do we find a and b?".  Here, I will describe the process.

Step 1. Find the Fundamental Statistics

The first thing we need to find the fundamental statistics, the mean, standard deviation, and the sample size.  The sample mean is typically referred to as the point estimate by most statistics text books.  This is because the point estimate of the population mean is the sample mean.  These will eventually be vital to finding the interval.  It is also important to find the z-value for the particular confidence level 1-α for which we are interested in finding the confidence interval.  Remember from the hypothesis testing entries that the z-value (without use of calculus or statistics software) comes from the z-table.  Since this is an interval where there will be two tails, we need to find the z-values for 1-α2 ​.  Here is a table of z-values for common confidence levels:

Confidence Level
z
0.7
1.04
0.75
1.15
0.8
1.28
0.85
1.44
0.9
1.645
0.92
1.75
0.95
1.96
0.96
2.05
0.98
2.33
0.99
2.58

Step 2. Calculating the Standard Error and Margin of Error

The next step is to calculate the standard error of the sample.  For the case where we have the sampel mean and standard deviation, it is simple $\sigma_{E}=\frac{s}{\sqrt{n}}$, and for the proportion case, it is $\sigma_{E}=\sqrt{\frac{p*(1-P)}{n}}$.  This is the deviation of the data points per data point.  In order to go from the standard error to the margin of error, we simply multiply the standard error by the z-value for the confidence level, $\sigma_{M}=z*\sigma_{E}$.

Step 3. Getting the Confidence Interval

This is simple enough of a calculation to perform.  You'll notice that in the second paragraph that I displayed the confidence interval as (a, b)1-α.  The value for a is subtracting the margin of error from the mean (point estimate), $a=\overline{x}-\sigma_{M}$, where b is taking the sample mean and to it, add the margin of error, $b=\overline{x}+\sigma_{M}$.  So the confidence interval is given by: $$\mu_{1-\alpha} \in (\overline{x}-\sigma_{M}, \overline{x}+\sigma_{M})$$This means that there is a [(1-α)*100]% probability that the true population mean falls between $\overline{x}-\sigma_{M}$ and $\overline{x}+\sigma_{M}$.

You'll notice that the point estimate is a single value and not a confidence interval.  This stems from the notion that the value of α approaches 1 as our confidence approaches 0%.  This means that the z-value approaches 0 as α approaches 1.  This also means that the value of the Margin of Error approaches 0, since it is the z value times a value.  At 0% confidence, z=0, so the margin of error is zero, since zero times anything is zero.  This means that the population mean is "between" $\overline{x}-$ and $\overline{x}+0$, which means that the population mean at a 0% confidence is the population mean, which is the statement of the point estimate.

That's it for confidence interval.  If you have any questions, please leave a comment.  Next time, I will be covering types of statistical error.  Until then, stay curious.

K. "Alan" Eister has his bacholers of Science in Chemistry. He is also a tutor for Varsity Tutors.  If you feel you need any further help on any of the topics covered, you can sign up for tutoring sessions here.

Comments

  1. Lucky Club Casino Site - Lucky Club
    Our site offers you the chance to win millions with the best casino games for players all over the world and the internet. luckyclub.live Join today! Rating: 4.6 · ‎24 votes · ‎Free · ‎iOS · ‎Entertainment

    ReplyDelete

Post a Comment

Popular posts from this blog

Basic Statistics Lecture #3: Normal, Binomial, and Poisson Distributions

As I have mentioned last time , the uniform continuous distribution is not the only form of continuous distribution in statistics.  As promised, here are the three most common continuous distribution types.  As a side note, all sampling distributions are relative to the algebraic mean. Normal Distribution: I think most people are familiar with the concept of a normal distribution.  If you've ever seen a bell curve, you've seen the normal distribution.  If you've begun from the first lecture of this lecture series, you've also seen the normal distribution. This type of distribution is where the data points follow a continuous curve, is non-uniform, has a mean (algebraic average) equal to the median (the exact middle value), falls from highest probability at the mean to (for all practical purposes) zero as the x-values approach $\pm \infty$, and therefor has equal number of data points to the left and to the right of the mean, and has the domain of $(\pm \i

Basic Statistics Lecture #5: Baye's Theorem

As promised last time , I am going to cover Baye's Theorem. If Tree diagram is the common name for Bayes Theorem.  Recall that conditional probability is given by $P(A \mid B) = \frac{P(A \wedge B)}{P(B)}$.   For tree diagrams, let's say that we have events A, B 1 , B 2 , B 3 , … (the reason we have multiple B's is because they all are within the same family of events) such that the events in the family of B are mutually exclusive and the sum of the probabilities of the events in the family of B are equal to 1. Then we have $$P(B_i \mid A)= \frac{P(B_i)*P(A \mid B_i)}{\sum_{m=1}^{n}[P(B_m)*P(A \mid B_m)]}$$  What this means is reliant on the tree diagram. If we are only looking at the sub-items of A, this is what the tree diagram would look like. If J has a probability of 100%, and P(C) and P(D) are not 0, then when we are trying to find the probability of any of the B's being true given that A is true, we have to set the probability of A to be the entir