What is Binomial Distribution?

A Binomial Distribution describes the probability of an event that only has 2 possible outcomes. For Example, Heads or tails.

It can also be used to describe the probability of a series of independent events that only have 2 possible outcomes occurring. For Example. Flipping a coin 10 times and having it land with 5 on heads exactly 5 times. Or, flipping a coin 10 times and having at least 5 land on heads.

Furthermore, the probability of an event with n trials and f failures follows a binomial distribution.

Visualizing Binomial Distribution

The Binomial Distribution looks like so when graphed:

Probability mass function for the binomial distribution
By Tayste – Own work, Public Domain, Link

When Would You Use Binomial Distribution?

Requirements and Conditions for a Binomial Distribution

  • Must be a fixed number of trials.
  • Continuous data are not binomial.
  • Probability of success should be the same on every trial, and also probability of success is constant.
  • Two state. Two possible outcomes.
    • (true or false, hot or cold, success or failure, defective or not defective.)
  • Independent trials – trials are statistically independent.
    • (For example: The flip of one coin means nothing to the results of the next coin flip.)
  • Use Binomial Distribution when you are sampling with replacement.

You Cannot Use Binomial Distribution On:

  • When the probability of success is not constant for an event.
    • Ex. The probability of it snowing or not snowing in NYC would not fit the criteria for a Binomial Distribution because the probability of success is not constant. The chance of snow on winter days is higher than summer days.
  • When you sample WITHOUT replacement.
    • In that case, use Hypergeometric.

Cumulative vs Non-Cumulative

There are (2) ways I've seen Binomial Distribution Problems be represented in Six Sigma Exams:

  • Non-cumulative questions
  • Cumulative questions (with or without a chart)

The questions can either be about the actual equations and translating a word problem into an actual solution. Or they can be about the necessary scenarios when you would apply binomial probability – in other words, what MUST be true to use it, or if an event is binomial, what can we infer about it's properties.

Non Cumulative Probability

This is just a fancy statistics term of single events taken on their own.

Example: a single flip of a coin

Cumulative Probability

Another fancy statistics term for multiple events. I like to remember that Cumulative events literally accumulate multiple probabilities.

One thing to be aware of is that you can be asked to either manually calculate a binomial cumulative probability question or you could be used to leverage a chart.

Notes About Sampling With Replacement

Binomial is for N trials and F failures and describes the probability of k successes in n draws with replacement from a finite population of size N containing exactly K successes.

Remember, binomial is for independent events where the likelihood of each event occurring is the same as the others. If you sample without replacing, you're affecting the likelihood of independence.

This is in contrast to the hypergeometric distribution, which describes the probability of k successes in n draws when sampling without replacement.

Examples of Binomial Distribution Problems

Non Cumulative Binomial Probability

Non-Cumulative Calculation Example 1: "Contains Exactly"

If a coin is tossed 10 times, what is the probability of obtaining exactly four heads? (If an honest coin is tossed 10 times, what is the probability of 4 heads?)

With a coin toss there are only 2 possible outcomes; heads or tails. So you should be on alert for using binomial.

binomial successes
binomial successes

Non-Cumulative Calculation Example 2: "Contains Exactly"

A manufacturing process creates 3.4% defective parts. A sample of 10 parts from the production process is selected. What is the probability that the sample contains exactly 3 defective parts?

As soon as you see the word defective, you should be alert to using the Binomial equation. Since defect in this sense means that a part is in a binary state, either functioning or defective, it meets our criteria.

binomial successes 2
binomial successes 2

Non-Cumulative Calculation Example 3: "Contains Exactly"

Forty-five percent of all registered voters in a national election are female. A random sample of 8 voters is selected. The probability that the sample contains 2 males is:

Since in this question the assumption is that there are (2) biological genders measured, male or female, you should be on alert to use the binomial distribution.

binomial successes 3
binomial successes 3

Non-Cumulative Chart Example:

You can see that the same percentage was found on the chart. (In this case, x is the probability of the event happening instead of p.)

Examples of Cumulative Binomial Probability Questions

Cumulative Calculation Example: "At least" or "Fewer Than" or "At Most"

79% percent of the students of a large class passed the final exam. A random sample of 4 students are selected to be analyzed by the school. What is the probability that the sample contains fewer than 2 students that passed the exam?

Since we are examining data that only has a binary state, pass or fail, you should be on alert to using the binomial equation.

Also, since you're asked for 'x or fewer', you have to calculate the probability of ALL POSSIBLE events.

In this example we must calculate the odds of EXACTLY one pass PLUS the odds of EXACTLY no pass in the sample.

binomial successes 4
binomial successes 4

Cumulative Calculation Example: "More than"

An example of this kind of question is "Test the probability of the number of dropped calls will exceed a certain number."

Here you'd do the opposite of the "Fewer" example above and count the probabilities above.

So, if we re-wrote the question above to be 'What's the probability of sampling 10 students and 8 or more passed' it would be the probability of EXACTLY 8 passing, PLUS EXACTLY 9 passing, PLUS EXACTLY 10 passing.

Binomial Probability Questions Short Cuts

This can be a LOT of calculations. Fortunately, there are some short cuts:

Factorial Math short cuts:

I find that a lot of these binomial questions can be made easier by taking advantage of reducing the factorials by cancelling out terms before you plug them into the calculator.

Just for sake of simplicity let's say that you have 5! / 3! If you write that out by hand it's essentially (5*4*3*2*1 / 3*2*1).. Also, you can eliminate a lot of the terms by recognizing this and rewriting 5! / 3! as 5*4.

Distribution table short cuts:

Binomial Probability Charts

Here are some good references:

  • https://www.statisticshowto.datasciencecentral.com/tables/binomial-distribution-table/
  • http://www.math.hawaii.edu/~xander/fa12_471/Binomial_Probabilities.pdf

Binomial Chart Question Example: (Without Calculations)

  • If I were using a Binomial Distribution Table, which values of X would I add together to get the probability of at most N defective?
    • You would add the probabilities of N, n-1, n-2… all the way to zero.
    • For Example, "which values of X would I add together to get the probability of at most 4 defective?"
      • Add the probabilities of 4 + 3 + 2 +1 + 0.

Binomial Chart Question Example: ("More Than")

Let's assume that 20% of drivers on the road have NO car insurance. What is the probability that more than 7 out of a sample of 15 people have no car insurance?

  1. Find n = 15  in the first column on the left.
  2. And then find the column containing p = 0.20 .
  3. Find the7 in the second column on the left, since we want to findF(7) =P(X ≤ 7).

Now, all we need to do is read the probability value where thep = 0.20 column and also the (n = 15,x = 7) row intersect. What do you get?

So, the cumulative binomial probability table tells us thatP(X ≤ 7) = 0.9958. Therefore: P(X > 7) = 1 − 0.9958 = 0.0042

That is, the probability that more than 7 in a random sample of 15 would have no car insurance is 0.0042

Example of Using Binomial Probability in a Six Sigma Project

Typically in a Six Sigma project you are using DMAIC methodology. Let's say during the Define Phase you chartered a project to see if you could improve a process that had a binomial outcome. You even might use the Binomial Distribution to articulate the business case for why you should do the project in the first place – for instance, event A should be binomial, but it's clearly not.

You might use the Binomial distribution to see if samples of the process outcome were in fact following a binomial distribution during the Analysis phase.

If you are running a pilot implementation during the Improve Phase, you might make the assumption that some events that are inputs to your process in fact will follow a binomial distribution.

Also, In the Control Phase you might monitor that a specific process is generating Binomially distributed results if it should.

What You Need to Know for Certification

Six Sigma Green Belts

The ASQ Six Sigma Green Belt BOK requires:

Probability distributions
Describe and interpret normal, binomial, and Poisson, chi square, Student's t, and also F distributions. (Apply)

Six Sigma Black Belts

The ASQ Six Sigma Black Belt BOK Requires:

Commonly used distributions

Describe, apply and interpret the following distributions: normal, Poisson, binomial, chi square, Student's t and F distributions. (Evaluate)

The Villanova Six Sigma Black Belt BOK requires the following

Advanced statistical process control

Distributions commonly used by black belts. Describe and apply binomial, Poisson, normal, chi-square, Student's t, and F distributions.

Videos on Binomial Distributions

Additional Helpful Resources

Hypothesis Testing with Binomial Distribution

  • https://internal.ncl.ac.uk/ask/numeracy-maths-statistics/statistics/hypothesis-testing/hypothesis-testing-with-the-binomial-distribution.html
  • https://www.real-statistics.com/binomial-and-related-distributions/hypothesis-testing-binomial-distribution/