Statistical independence

In probability theory, when we assert that two events are independent, we intuitively mean that knowing whether or not one of them occurred makes it neither more probable nor less probable that the other occurred. For example, the events "today is Tuesday" and "it rains today" are independent.

Similarly, when we assert that two random variables are independent, we intuitively mean that knowing something about the value of one of them does not yield any information about the value of the other. For instance, the height of a person and their IQ are independent random variables. Another typical example of two independent variables is given by repeating an experiment: roll a die twice, let X be the number you get the first time, and Y the number you get the second time. These two variables are independent.

Independent events

We define two events E₁ and E₂ of a probability space to be independent iff

P(E₁ ∩ E₂) = P(E₁) · P(E₂).

Here E₁ ∩ E₂ (the intersection of E₁ and E₂) is the event that E₁ and E₂ both occur; P denotes the probability of an event.

If P(E₂) ≠ 0, then the independence of E₁ and E₂ can also be expressed with conditional probabilities:

P(E₁ | E₂) = P(E₁)

which is closer to the intuition given above: the information that E₂ happened does not change our estimate of the probability of E₁.

If we have more than two events, then pairwise independence is insufficient to capture the intuitive sense of independence. So a set S of events is said to be independent if every finite nonempty subset { E₁, ..., E_n } of S satisfies

P(E₁ ∩ ... ∩ E_n) = P(E₁) · ... · P(E_n).

This is called the multiplication rule for independent events.

Independent random variables

We define random variables X and Y to be independent if

Pr[(X in A) & (Y in B)] = Pr[X in A] · Pr[Y in B]

for A and B any Borel subsets of the real numbers.

If X and Y are independent, then the expectation operator has the nice property

E[X· Y] = E[X] · E[Y]

and for the variance we have

Var(X + Y) = Var(X) + Var(Y).

Furthermore, if X and Y are independent and have probability densities f_X(x)and f_Y(y), then (X,Y) has a joint density of

f_XY(x,y)dx dy = f_X(x)dx f_Y(y)dy.

Still need to deal with independence of sets of more than 2 random variables.