Skip to main content

Basic Statistics Lecture #1: Basic Concepts

Probability is the foundation of everything we do in Statistics.  There are two types of probability: discrete and continuous.  Discrete probability is when there are distinct probabilities for different values (represented by bars), while continuous probability is where the probabilities change smoothly with the value (represented by a smooth curve).  This is better described by images.  These are two types of probability models and are therefor treated different.
Discrete Probabilities of Dice Rolls.

Continuous Probability Distribution
of Pounds of Waste Produced.
We have what is called Probability Models.  Probability models have a sample space (denoted by an S).  Probabilities are assigned to sample points in sample space.  For the continuous Standard Normal Distribution, the x axis is the sample point and the y axis (denoted P(E)) is the probability axis.  In the graphs above, the values we want to get a probability for is on the horizontal axis (which is typically denoted as the x axis and is defined as a single statistical event) and the probability of that particular value occurring is the area of the distribution between the curve and the x axis.

There are basic rules which all statistical analysis probability models must follow:
1) The probability of a single event must be between 0 and 1, $0 \leq E \leq 1$.  This means that the probability of an event occurring can't exceed 100% and can't be less than 0%.  It can be exactly those percents, or in between those percents, just not outside those amounts.
2) The probability of all events must equal exactly 1, $\sum_{i=1}^{n} P(E_i)=1$.  This means that if you take every single possible event, add all of their probabilities together, the answer you get should be exactly and precisely 100%, no more, no less.
3) The probability of an event occuring is given by the sum of all sample point probabilities in the event, $\sum_{i=1}^{n} P(p_i)=P(E_i)$.  This means that if you want the probability of working less than 40 hours next week, you should add up the probability of working every possible increment of 0.25 hr up to 40 hrs.
4) The probability of an event E not occurring is given by the probability of it occurring taken from 1, $P(E^{c})=1-P(E)$.  Here, the "exponent" of c denotes that it is the compliment of E, or the part of the data set which not covered in the event E.  This means that if you want the probability of you working any amount of hours next week other than 40 hours, you would find the probability of working exactly 40 hours, and subtract it from 100%.

If these rules hold, than it is a statistical model.  If these rules do NOT hold, however, than it is definitely NOT statistical.  It may or may not be a model, but if it is, the model is NOT a statistical model.

These are the basics of probability models and discrete and continuous probability distributions.  That's it for this time.  Next time, I'll cover examples of discrete and continuous probabilities.  Until then, stay curious.

K. "Alan" Eister has his bacholers of Science in Chemistry. He is also a tutor for Varsity Tutors.  If you feel you need any further help on any of the topics covered, you can sign up for tutoring sessions here.

Comments

Popular posts from this blog

Basic Statistics Lecture #3: Normal, Binomial, and Poisson Distributions

As I have mentioned last time , the uniform continuous distribution is not the only form of continuous distribution in statistics.  As promised, here are the three most common continuous distribution types.  As a side note, all sampling distributions are relative to the algebraic mean. Normal Distribution: I think most people are familiar with the concept of a normal distribution.  If you've ever seen a bell curve, you've seen the normal distribution.  If you've begun from the first lecture of this lecture series, you've also seen the normal distribution. This type of distribution is where the data points follow a continuous curve, is non-uniform, has a mean (algebraic average) equal to the median (the exact middle value), falls from highest probability at the mean to (for all practical purposes) zero as the x-values approach $\pm \infty$, and therefor has equal number of data points to the left and to the right of the mean, and has the domain of $(\pm \i

Confidence Interval: Basic Statistics Lecture Series Lecture #11

You'll remember last time , I covered hypothesis testing of proportions and the time before that , hypothesis testing of a sample with a mean and standard deviation.  This time, I'll cover the concept of confidence intervals. Confidence intervals are of the form μ 1-α ∈ (a, b) 1-α , where a and b are two numbers such that a<b, α is the significance level as covered in hypothesis testing, and μ is the actual population mean (not the sample mean). This is a the statement of there being a [(1-α)*100]% probability that the true population mean will be somewhere between a and b.  The obvious question is "How do we find a and b?".  Here, I will describe the process. Step 1. Find the Fundamental Statistics The first thing we need to find the fundamental statistics , the mean, standard deviation, and the sample size.  The sample mean is typically referred to as the point estimate by most statistics text books.  This is because the point estimate of the populati

Basic Statistics Lecture #5: Baye's Theorem

As promised last time , I am going to cover Baye's Theorem. If Tree diagram is the common name for Bayes Theorem.  Recall that conditional probability is given by $P(A \mid B) = \frac{P(A \wedge B)}{P(B)}$.   For tree diagrams, let's say that we have events A, B 1 , B 2 , B 3 , … (the reason we have multiple B's is because they all are within the same family of events) such that the events in the family of B are mutually exclusive and the sum of the probabilities of the events in the family of B are equal to 1. Then we have $$P(B_i \mid A)= \frac{P(B_i)*P(A \mid B_i)}{\sum_{m=1}^{n}[P(B_m)*P(A \mid B_m)]}$$  What this means is reliant on the tree diagram. If we are only looking at the sub-items of A, this is what the tree diagram would look like. If J has a probability of 100%, and P(C) and P(D) are not 0, then when we are trying to find the probability of any of the B's being true given that A is true, we have to set the probability of A to be the entir