# 15.4. Exponential Distribution¶

A random variable \(T\) has the *exponential distribution with parameter \(\lambda\)* if the density of \(T\) is given by

The graph below shows the density \(f_T\) for \(\lambda = 5\).

## 15.4.1. CDF and Survival Function¶

The exponential distribution is often used as a model for random lifetimes, in settings that we will study in greater detail below. For now, just think of \(T\) as the lifetime of an object like a lightbulb, and note that the cdf at time \(t\) can be thought of as the chance that the object dies before time \(t\):

The complementary event is that the object survives past time \(t\), and therefore its probability defines the *survival function* \(S_T\):

## 15.4.2. Mean and SD¶

To find \(E(T)\), you have two options: \($ E(T) ~ = ~ \int_0^{\infty} t\lambda e^{-\lambda t} dt ~ = ~ \frac{1}{\lambda} $\)

either by integration by parts or by recognizing the indefinite integral of \(\lambda te^{-\lambda t}\) . Since \(T\) is a non-negative variable, an easier way is to use the *tail integral*, analogous to the tail sum formula for the expectation of a non-negative integer valued random variable.

To find \(E(T^2)\), you can use integration by parts, or you can accept for now that \(E(T^2) = 2/\lambda^2\) and therefore

Later in the course we will see how to find \(E(T^2)\) without integration.

The graph below shows the density \(f_T\) with the labeled points on the horizontal axis corresponding to standard units of -1, 0, 1, 2, and 3. The random variable \(T\) can’t be negative, and the density doesn’t go further than 1 SD below the mean. The spread comes from the long right hand tail.

## 15.4.3. Median¶

Here are graphs of the cdf and the survival function.

Notice that the two curves intersect at the vertical level 0.5. If \(t_{0.5}\) is the value of \(t\) at which the curves intersect, then

and therefore \($ P(T > t_{0.5}) = S_T(t_{0.5}) = 0.5 = F_T(t_{0.5}) = P(T \le t_{0.5}) $\)

The point \(t_{0.5}\) is called the *median* of the distribution. We can find \(t_{0.5}\) in terms of \(\lambda\) by using the formula for the survival function.

Because \(\log(2) < 1\), the median lifetime \(t_{0.5}\) is less than the mean lifetime \(E(T) = 1/\lambda\) as you can see on the graph. This is consistent with an observation you made in Data 8: if a distribution has a right hand tail, the median is less than the mean.

The exponential distribution is often used to model lifetimes of objects like radioactive atoms that undergo *exponential decay*. The *half life* of a radioactive isotope is defined as the time by which half of the atoms of the isotope will have decayed. That is, the half life is the median of the exponential lifetime of the atom. The parameter \(\lambda\) is called the *decay rate* of the atom. By the property of the median \(t_{0.5}\) derived above, the relation between \(\lambda\) and the half life is

## 15.4.4. Memoryless Property¶

Let \(s\) and \(t\) be positive, and let’s find the conditional probability that the object survives a further \(s\) units of time given that it has already survived \(t\).

Notice that \(t\) does not appear in the answer. So for example the chance that the object survives an additional year given that it has been alive for 50 years is the same as the chance that is survives a year when it starts out brand new. It *forgets* that it has already lived 50 years.

This is called the *memoryless property* of the exponential distribution. It can be shown that the exponential and the geometric are the only two distributions that have the memoryless property. As you can see, the graph of the exponential density resembles the geometric probability histogram. It can be thought of as a continuous limit of the geometric, as we will see later.

The memoryless property is an excellent reason *not* to use the exponential distribution to model the lifetimes of people or of anything that ages. For lifetimes of things like lightbulbs or radioactive atoms, the exponential distribution often does fine.

## 15.4.5. The Rate¶

If \(\lambda\) is large, then \(E(T) = 1/\lambda\) is small. If you think of \(T\) as a lifetime then large \(\lambda\) implies that an early death is expected. To formalize the notion of \(\lambda\) as a rate, let \(\Delta_t\) be a tiny increment of time and use the memoryless property:

The left hand side is the chance that the object dies immediately after time \(t\), given that it was still alive at time \(t\). We say that \(\lambda\) is the *instantaneous death rate*, because