In this section, we shall present the mathematical, or axiomatic, definition of probability. In a given experiment, it is necessary to assign to each event A in the sample space S a number Pr(A) that indicates the probability that A will occur. In order to satisfy the mathematical definition of probability, the number Pr(A) that is assigned must satisfy three specific axioms. These axioms ensure that the number Pr(A) will have certain properties that we intuitively expect a probability to have under each of the various interpretations described in 1.2 Interpretations of Probability.
The first axiom states that the probability of every event must be nonnegative.
The second axiom states that if an event is certain to occur, then the probability of that event is 1.
Before stating Axiom 3, we shall discuss the probabilities of disjoint events. If two events are disjoint, it is natural to assume that the probability that one or the other will occur is the sum of their individual probabilities. In fact, it will be assumed that this additive property of probability is also true for every finite collection of disjoint events and even for every infinite sequence of disjoint events. If we assume that this additive property is true only for a finite number of disjoint events, we cannot then be certain that the property will be true for an infinite sequence of disjoint events as well. However, if we assume that the additive property is true for every infinite sequence of disjoint events, then (as we shall prove) the property must also be true for every finite number of disjoint events. These considerations lead to the third axiom.
We are now prepared to give the mathematical definition of probability.
We shall now derive two important consequences of Axiom 3. First, we shall show that if an event is impossible, its probability must be 0.
We can now show that the additive property assumed in Axiom 3 for an infinite sequence of disjoint events is also true for every finite number of disjoint events.
From the axioms and theorems just given, we shall now derive four other general properties of probability measures. Because of the fundamental nature of these four properties, they will be presented in the form of four theorems, each one of which is easily proved.
The proof of the following useful result is left to Exercise 1.5.13.
Note: Probability Zero Does Not Mean Impossible. When an event has probability 0, it does not mean that the event is impossible. In Example 1.5.4, there are many events with 0 probability, but they are not all impossible. For example, for every x, the event that water demand equals x corresponds to a line segment in Figure 1.5. Since line segments have 0 area, the probability of every such line segment is 0, but the events are not all impossible. Indeed, if every event of the form {water demand equals x} were impossible, then water demand could not take any value at all. If ϵ>0, the event
{water demand is between x−ϵ and x+ϵ}
will have positive probability, but that probability will go to 0 as ϵ goes to 0.
We have presented the mathematical definition of probability through the three axioms. The axioms require that every event have nonnegative probability, that the whole sample space have probability 1, and that the union of an infinite sequence of disjoint events have probability equal to the sum of their probabilities. Some important results to remember include the following:
If A1,…,Ak are disjoint, Pr(⋃i=1kAi)=∑i=1kPr(Ai).
Pr(Ac)=1−Pr(A).
A⊂B implies that Pr(A)≤Pr(B).
Pr(A∪B)=Pr(A)+Pr(B)−Pr(A∩B).
It does not matter how the probabilities were determined. As long as they satisfy the three axioms, they must also satisfy the above relations as well as all of the results that we prove later in the text.