HTML Notes on the Laws of Probability
Laws of Probability
Probability Theory studies the assignment of probabilities to Events. Events are statements like propositions for which it is not immediately clear whether they are true or false. For example:
It will be cold in March in Mohali
Arul likes Mathematics
Beena will attend all lectures in MTH202
Most of such statements are about things in the future which have not yet happened, so we cannot be certain that they will happen. However, we can also assign probabilities to statements about events in the past as we are no longer sure that they happened!
We would like to extend the notion of true/false to such statements (to the extent possible) as well. To do this, we can replace true by “certain” and give it the value 1, and we can replace false by “unlikely” and give it the value 0. To various shades of certainty we can give values between 0 and 1. (At this point it is worth pointing out the distinction between “improbable” and “impossible”1).
Events
The algebra of events will be a Boolean Algebra just like the algebra of propositions. In particular, we will have the notions:
is the event that both
and
will occur
is the event that at least one of
and
will occur
which the event that
will not occur
By defining and
, we require that the algebraic system obtained is a Boolean algebra. For completeness, we will also require the notion of an event
which is the union of all possible events and
as the intersection of all possible events. (Since
we can see that
.)
Laws of Probability
To each event we assign a probability
which is a real number between 0 and 1 called the probability of (occurrence of) the event
. This satisfies the following laws:
We have . It follows that
. Moreover, since any
is a subset of
, we obtain an important rule (which we should never forget!):
Any calculation that purports to give a probability where the answer does not satisfy this is obviously wrong!
We note that and so it follows that
.
Mutual Exclusivity
The above formulas and rules do not really give too much of a handle on the probability or
in terms of
and
. However, there are some cases when we are certain that
and
cannot happen together. In other words
. Such events
and
are called "mutually exclusive". The terminology indicates that if
occurs, it excludes
from occurring and vice versa.
For mutually exclusive events, we have . Conversely, when this holds, we have
by the law of addition of probabilities.
The simplest case then this holds is when since, in this case,
so
and equivalently
.
Another important realisation is that if , then
and
cannot be mutually exclusive since
which means that
by the law of addition of probabilities.
A simple case of exclusive events would be the following in the case of the flip of a single coin. The event would be the assertion that this particular flip resulted in a Head and the event B would be the assertion that this particular flip resulted in a Tail. The probability of getting both a Head and a Tail in the same flip is evidently 0 (under normal conditions!).
Conditional Probability
While giving the basic rules governing probability, we have said nothing about how to assign probabilities other than to say that any such assignment should be consistent with the laws of probability!
In practice, we assign probabilities based on information about events that has already been gathered.
Let’s take a specific and common example. When we flip a coin, we have no information about whether it will come heads or tails. So we can assign an equal probability (of half!) to each event since we do not expect it to stand on its edge! Similarly, when we first go to a new city, we can assign an equal probability of finding the food nice or not nice!
However, the two events appear to us differently after we have made a number of observations.
In the case of coin flips, we generally have the feeling (especially if the coin and the person flipping it has changed!) that the knowledge of one 100 coin flips gives us no information about the result of the 101st coin flip.
On the other hand after eating in the mess for 100 days, we have a rather good idea about whether we will like the food on the 101st day or not!
We use the notion to denote the probability that the event
occurs if we are given that
has occurred. We can also think of
as the data gathered about the universe and
as the prediction based on this data. In that case
can be seen as the probability of correct-ness of our prediction
given the data
already gathered. (Intelligence can be seen as the capacity to gather information and convert it into conditional probabilities!)
Note that it does not make sense to think about when
. Basically, how can we determine the probability of an event
occurring, when the improbable (
!) has already happened. We will see a more mathematical justification below.
Since treats
as "background data", we can think of it as another way of assigning probability to the same events. Hence, it satisfies:
Kolmogorov's Formula and Independence
A very important rule that allows us to link conditional probability to the probability of individual events is Kolmogorov's formula:
We can also write this as
We could treat this as “defining” provided that we note that
is not defined if
. In fact, if
, then
since
is contained in
. Thus,
is like
and is undefined. Conceptually, we can also see this it as a way to determine
using
when
.
As a special case, we note that . This might seem confusing since one might believe that
is the probability when we have no information. However, we note that saying that
has occurred gives no information since we know that
, so it has to happen!
There are cases where and
. In other words, where the knowledge that the event
has occurred tells us nothing about the event
and vice versa. In this case we have
. If the latter identity holds, we say that
and
are independent. Consequently, we have
Deciding whether events are independent is not always easy in a practical situation. In fact, ensuring that certain physical quantities can be measured independently is impossible according to quantum mechanics! However, in the cases we will study it will mostly be evident.
Mathematical Summary
We can approach probability as a formal theory without worrying about its interpretation. This can help us avoid confusion during calculations. This formal structure is given below.
In probability theory we study an algebra of events, which is a Boolean Algebra.
To each event we assign a probability
, which is a number between 0 and 1. More generally, for events
and
, we define
which is a number between 0 and 1.
This satisfies the following rules:
If is an event for which
then we define
by the identity
. When
we can define
.
We say that is independent of
if
.
In “Leave it to Psmith” by P G Wodehouse, Psmith says: ‘Comrade Spiller, never confuse the unusual with the impossible.’↩