# Probability axioms

The probability [itex]\mathbb{P}[itex] of some event [itex]E[itex] (denoted [itex]\mathbb{P}(E)[itex]) is defined with respect to a "universe" or sample space [itex]\Omega[itex] of all possible elementary events in such a way that [itex]\mathbb{P}[itex] must satisfy the Kolmogorov axioms.

Alternatively, a probability can be interpreted as a measure on a σ-algebra of subsets of the sample space, those subsets being the events, such that the measure of the whole set equals 1. This property is important, since it gives rise to the natural concept of conditional probability. Every set [itex]A[itex] with non-zero probability defines another probability

[itex]\mathbb{P}(B \vert A) = {\mathbb{P}(B \cap A) \over \mathbb{P}(A)}[itex]

on the space. This is usually read as "probability of [itex]B[itex] given [itex]A[itex]". If the conditional probability of [itex]B[itex] given [itex]A[itex] is the same as the probability of [itex]B[itex], then [itex]B[itex] and [itex]A[itex] are said to be independent.

In the case that the sample space is finite or countably infinite, a probability function can also be defined by its values on the elementary events [itex]\{e_1\}, \{e_2\}, ...[itex] where [itex]\Omega = \{\,e_1, e_2, ...\,\}.\,[itex]

 Contents

## Kolmogorov axioms

The following three axioms are known as the Kolmogorov axioms, after Andrey Kolmogorov who developed them. We have an underlying set Ω, a sigma-algebra [itex]\mathcal{F}[itex] of subsets of Ω, and a function P assigning real numbers to members of F. The members of F are those subsets of Ω that are called "events".

### First axiom

For any set [itex]E\in F,[itex] i.e., for any event, [itex]0 \leq P(E). \,[itex]

That is, the probability of an event is a non-negative real number.

### Second axiom

[itex]P(\Omega) = 1.\,[itex]

That is, the probability that some elementary event in the entire sample set will occur is 1. More specifically, there are no elementary events outside the sample set.

This is often overlooked in some mistaken probability calculations; if you cannot precisely define the whole sample set, then the probability of any subset cannot be defined either.

### Third axiom

Any countable sequence of pairwise disjoint events [itex]E_1, E_2, ...[itex] satisfies [itex]P(E_1 \cup E_2 \cup \cdots) = \sum P(E_i)[itex].

That is, the probability of an event set which is the union of other disjoint subsets is the sum of the probabilities of those subsets. This is called σ-additivity. If there is any overlap among the subsets this relation does not hold.

For an algebraic alternative to Kolmogorov's approach, see algebra of random variables.

## Lemmas in probability

From the Kolmogorov axioms one can deduce other useful rules for calculating probabilities:

[itex]P(A \cup B) = P(A) + P(B) - P(A \cap B).\,[itex]

That is, the probability that A or B will happen is the sum of the probabilities that A will happen and that B will happen, minus the probability that A and B will happen. This can be extended to the inclusion-exclusion principle.

[itex]P(\Omega - E) = 1 - P(E).\,[itex]

That is, the probability that any event will not happen is 1 minus the probability that it will.

Using conditional probability as defined above, it also follows immediately that

[itex]P(A \cap B) = P(A) \cdot P(B \vert A).\,[itex]

That is, the probability that A and B will happen is the probability that A will happen, times the probability that B will happen given that A happened; this relationship gives Bayes' theorem. It then follows that A and B are independent if and only if

[itex]P(A \cap B) = P(A) \cdot P(B).\,[itex]

• Art and Cultures
• Countries of the World (http://www.academickids.com/encyclopedia/index.php/Countries)
• Space and Astronomy