Additional Concepts in Probability Distributions: Cheat Sheet

Download a pdf of this page here.

Essential Concepts

  • The probability distribution of a discrete random variable describes all possible values of the random variable, as well as the probability associated with each value.
    • The values associated with the random variable of interest are numerical and discrete.
    • All possible values of the random variable are listed in a table or graph, with each value having an associated probability greater than or equal to 0 and less than or equal to 1.
    • The sum of all probabilities in the table or graph equals 1.
  • A Bernoulli trial is a chance experiment with the following three properties:
    1. There are exactly two possible outcomes of the chance experiment. We label one of them as a success and the other as a failure.
    2. The probability of success is the same for every trial. We call the probability of success [latex]p[/latex]. Since the only two outcomes are success and failure, the probability of failure is the probability that the trial does not result in a success, so we can use the NOT probability rule to find that the probability of failure is [latex]1-p[/latex]
    3. The trials are independent from one another.
  • A binomial experiment is an experiment consisting of a fixed number, [latex]n[/latex], of independent Bernoulli trials that counts the number of successes out of [latex]n[/latex] trials. Notice that the number of successes in a binomial experiment is a discrete random variable. The distribution of this random variable is modeled with the binomial distribution.
  • The formula for the probability of obtaining [latex]x[/latex] successes from [latex]n[/latex] independent trials where the probability of success is [latex]p[/latex] is:

[latex]P(X = x) = (\text{number of ways to obtain }x\text{ successes in }n\text{ trials}) \cdot p^{x} \cdot (1-p)^{n-x}[/latex]

where [latex]p^{x}[/latex] occurs because there are [latex]x[/latex] successes, and [latex](1-p)^{n-x}[/latex] occurs because if there are [latex]x[/latex] successes and [latex]n[/latex] trials total, there must be [latex]n-x[/latex] failures.

  • Acceptance Sampling is a quality control technique used to assess the quality of a product or a batch of products. It involves inspecting a random sample from the batch and deciding whether to accept or reject the entire batch based on the quality of the sampled items. Acceptance sampling is commonly used in manufacturing, especially when testing the entire batch would be time-consuming or costly.
  • For a binomial experiment with a probability of success [latex]p[/latex] on [latex]n[/latex] trials, the mean [latex]\mu[/latex] and standard deviation [latex]\sigma[/latex] are defined as follows:

    • The mean of the number of successes is [latex]\mu = np[/latex].
    • The standard deviation of the number of successes is [latex]\sigma = \sqrt{np(1-p)}[/latex].
  • Oftentimes, it is adequate to use the mean and standard deviation to describe the most likely values for the number of successes. For large [latex]n~(\mbox{when}~np\geq 10 \mbox{ and } n(1 – p) \geq 10)[/latex], the binomial distribution has an approximate bell shape. So, we can use the normal distribution to approximate the binomial distribution and conclude that nearly all possibilities for the number of successes fall between the mean and [latex]±3[/latex] standard deviations.
  • The binomial distribution can be approximated well by the normal distribution when n is large enough so that the expected number of successes, [latex]np[/latex], and the expected number of failures, [latex]n(1-p)[/latex], are both at least [latex]10[/latex]. That is: The probability distribution will be approximately symmetric and bell shaped if

    • [latex]np \geq 10[/latex] AND
    • [latex]n(1-p) \geq 10[/latex]
  • In probability theory, a continuity correction is an adjustment that is made when a discrete distribution is approximated by a continuous distribution. We do this by adjusting the discrete whole numbers used in a binomial distribution so that any individual value, [latex]X[/latex], is represented in the normal distribution by the interval from [latex]X - 0.5 \mbox{ to } X + 0.5[/latex] or [latex]X ±0.5[/latex].

Key Equations

conditions for normal distribution

  • [latex]np \geq 10[/latex] AND
  • [latex]n(1-p) \geq 10[/latex]

probability for independent trials

[latex]P(A \cap B) = P(A)\cdot P(B)[/latex]

binomial distribution formula

[latex]P(X=x) = (\text{number of ways to obtain }x\text{ successes in }n\text{ trials}) \cdot p^{x} \cdot (1-p)^{n-x}  = \dfrac{n!}{x!(n-x)!} \cdot p^{x} \cdot (1-p)^{n-x}[/latex]

formula for the probability of obtaining `x` successes from `n` independent trials

[latex]P(X = x) = (\text{number of ways to obtain }x\text{ successes in }n\text{ trials}) \cdot p^{x} \cdot (1-p)^{n-x}[/latex]

binomial mean

[latex]\mu = np[/latex]

binomial standard deviation

[latex]\sigma = \sqrt{np(1-p)}[/latex]

Glossary

acceptance sampling

a quality control technique used to assess the quality of a product or a batch of products

bernoulli trial

a chance experiment with two possible outcomes: success and failure

binomial experiment

an experiment consisting of a fixed number, [latex]n[/latex], of independent Bernoulli trials that counts the number of successes out of [latex]n[/latex] trials

continuity correction

an adjustment that is made when a discrete distribution is approximated by a continuous distribution

discrete probability distribution

all possible values of a discrete random variable, as well as the probability associated with each value