Q

Standard Deviation Calculation for Common Probability Distributions

2024-10-21


1. Binomial Distribution

Definition:

The binomial distribution applies to independent repeated trials where each trial has only two possible outcomes (success or failure). Let the probability of success in each trial be ( p ), the probability of failure be ( 1 - p ), and the number of trials be ( n ).

Standard Deviation Formula:

$$
\sigma = \sqrt{n \cdot p \cdot (1 - p)}
$$

Explanation:

Example:

Suppose you toss a fair coin ( n ) times and calculate the standard deviation for the number of heads (success).

Calculation:

$$
\sigma = \sqrt{10 \cdot 0.5 \cdot 0.5} = \sqrt{2.5} \approx 1.58
$$

2. Hypergeometric Distribution

Definition:

The hypergeometric distribution describes sampling without replacement from a finite population. The population size is ( N ), where ( K ) items are considered successes (e.g., red balls), and the remaining ( N - K ) are failures (e.g., blue balls). A sample of ( n ) is drawn, and we focus on the number of successes in the sample.

Standard Deviation Formula:

$$
\sigma = \sqrt{n \cdot \frac{K}{N} \cdot \frac{N - K}{N} \cdot \frac{N - n}{N - 1}}
$$

Explanation:

Example:

Suppose there are ( N = 20 ) balls, of which ( K = 5 ) are red, and the remaining are blue. You draw ( n = 4 ) balls. The standard deviation of the number of red balls in the sample is:

Calculation:

$$
\sigma = \sqrt{4 \cdot \frac{5}{20} \cdot \frac{15}{20} \cdot \frac{16}{19}} \approx 0.87
$$

3. Normal Distribution

Definition:

The normal distribution is a key continuous probability distribution, and many natural phenomena approximately follow it. The standard deviation measures the spread of data around the mean.

Standard Deviation Formula for Sample Data:

$$ \sigma = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \mu)^2}{n}}\ $$

Explanation:

Example:

Suppose you have a set of measurements: $$ x_1 = 1, x_2 = 2, x_3 = 3, x_4 = 4, x_5 = 5 $$

Mean Calculation:

$$
\mu = \frac{1 + 2 + 3 + 4 + 5}{5} = 3
$$

Standard Deviation Calculation: $$
\sigma = \sqrt{\frac{(1-3)^2 + (2-3)^2 + (3-3)^2 + (4-3)^2 + (5-3)^2}{5}} = \sqrt{2} \approx 1.41
$$

4. Poisson Distribution

Definition:

The Poisson distribution is used to describe the number of events occurring within a fixed interval of time or space, with the parameter λ , which represents the average number of occurrences.

Standard Deviation Formula:

$$
\sigma = \sqrt{\lambda}
$$

Explanation:

Example:

If a store receives an average of λ = 5 customers per hour, the standard deviation for the number of customers is:

$$
\sigma = \sqrt{5} \approx 2.24
$$

5. Geometric Distribution

Definition:

The geometric distribution describes the number of failures before the first success, with the probability of success ( p ).

Standard Deviation Formula:

$$
\sigma = \frac{\sqrt{1 - p}}{p}
$$

Explanation:

Example:

If the probability of making a successful basketball shot is ( p = 0.3 ), the standard deviation for the number of attempts before the first success is:

$$
\sigma = \frac{\sqrt{1 - 0.3}}{0.3} \approx 2.19
$$

6. Exponential Distribution

Definition:

The exponential distribution describes the time between events, with the parameter λ , representing the rate of occurrence.

Standard Deviation Formula:

$$
\sigma = \frac{1}{\lambda}
$$

Explanation:

Example:

If a device fails on average λ = 2 times per hour, the standard deviation of the time between failures is:

$$
\sigma = \frac{1}{2} = 0.5 \text{ hours}
$$