• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar
  • My Store
  • Glossary
  • Home
  • About Me
  • Contact Me

Statistics By Jim

Making statistics intuitive

  • Graphs
  • Basics
  • Hypothesis Testing
  • Regression
  • ANOVA
  • Probability
  • Time Series
  • Fun

Bimodal Distribution: Definition, Examples & Analysis

By Jim Frost 1 Comment

A bimodal distribution has two peaks. In the context of a continuous probability distribution, modes are peaks in the distribution. The graph below shows a bimodal distribution.

Graph of a bimodal distribution.

When the peaks have unequal heights, the higher apex is the major mode, and the lower is the minor mode.

What Causes Bimodal Distributions?

Recognizing that your data follow a bimodal distribution will help you better understand your study topic. This type of distribution usually has an explanation for its existence. Here are several examples.

Merging Two Processes or Populations

In some cases, combining two processes or populations in one dataset will produce a bimodal distribution. Each of the underlying conditions has its own mode. Combine them and, voilà, two modes!

For example, imagine you measure the weights of adult black bears. When you graph the data, you see a distribution with two peaks. It turns out that female black bears have an average weight of 175 pounds while the males average 400 pounds. Each average corresponds to a peak in the distribution. Hence, one peak is for females and the other for males.

Differences between genders frequently produce bimodal distributions.

Or suppose you measure the strength of products from an assembly line and notice a bimodal distribution. After further investigation, you find that one shift uses a slightly different procedure that produces a weaker product. The two processes create dual peaks.

We’ll come back to this example when we analyze an example dataset.

Natural Bimodal Distributions

In other cases, the distribution of the phenomenon you’re studying is naturally bimodal. For example, the size of Weaver ants and the age of onset for Hodgkin’s Lymphoma follow a bimodal distribution. The graph below displays the body lengths of 300 Weaver worker ants from a field study.

Chart showing the bimodal distribution of Weaver ants.

In these cases, you can’t separate the bimodal distribution into separate unimodal distributions. However, understanding the bimodal nature will help you better grasp your study area and accurately identify the more common values occurring near the two peaks.

For example, the Weaver ants researcher* hypothesized that the different sized ants had different roles. The smaller variety are caretakers while the larger ones build and defend the nest.

Analyzing Bimodal Distributions

While bimodal distributions occur less frequently, they’re essential to identify when they occur.

Discovering that you’re working with combined populations, conditions, or processes that cause your data to follow a bimodal distribution is a valuable finding. You’ve identified a factor that affects the outcome. You and other scientists can include this variable in future research.

Typically, you’ll want to assess the subpopulations separately to understand their individual distributions better. That process helps you obtain better measures of central tendency and more precise measures of variability for each group. Failure to account for the different distributions can cause unreliable results. For instance, the mean and mode for a bimodal distribution often won’t be near the most common values.

Let’s see this in action!

Example Analysis

Imagine that we gather a random sample from an assembly line and measure the product’s strength. First, we’ll calculate the descriptive statistics for this sample. The results below look relatively straightforward. Here’s the CSV dataset for this example: Bimodal.

Descriptive statistics for our example dataset.

Now, let’s graph the data in a histogram.

Histogram of our bimodal distribution.

The statistical summary did not suggest that the data follow a bimodal distribution. Never rely solely on statistical summaries. Always graph your data!

See what else you can learn from histograms.

In the descriptive statistics, notice how the mean and median (both near 60) lie between modes where there are relatively few observations. Typically, these measures find where most values fall, but that’s not the case here, reducing their usefulness in bimodal distributions.

After seeing the histogram, we investigate and find that the shifts use a slightly different procedure. Let’s divide the data by shifts to see the impact of the procedural difference. The histogram below highlights the effect!

Histogram displaying two distributions.

Now, let’s calculate the descriptive statistics for each shift.

Descriptive statistics for each of the two groups.

Notice how the mean and median for groups A and B are roughly 50 and 70 and now fall near the peaks in the histogram above. Also, the standard deviation and range, which are measures of variability, are about half the scale relative to the combined dataset shown earlier. Lower variability means we have a more precise understanding of where the values from each process fall in contrast to the broader bimodal distribution.

Related post: Descriptive Statistics in Excel

Reference

Weber, NA (1946). “Dimorphism in the African Oecophylla worker and an anomaly (Hym.: Formicidae),” Annals of the Entomological Society of America. 39: 7–10.

Share this:

  • Tweet

Related

Filed Under: Basics Tagged With: conceptual, distributions, graphs

Reader Interactions

Comments

  1. Kanchan Singh says

    March 21, 2022 at 2:17 am

    Prof. Jim,
    I appreciate your efforts and guidance to make statistical learning easy and comprehensible.I am gaining a lot from going through your posts. Please keep on helping us. If I have any question any time, I shall let you know.
    Kanchan S.

    Reply

Comments and Questions Cancel reply

Primary Sidebar

Meet Jim

I’ll help you intuitively understand statistics by focusing on concepts and using plain English so you can concentrate on understanding your results.

Read More...

Buy My Introduction to Statistics Book!

Cover of my Introduction to Statistics: An Intuitive Guide ebook.

Buy My Hypothesis Testing Book!

Cover image of my Hypothesis Testing: An Intuitive Guide ebook.

Buy My Regression Book!

Cover for my ebook, Regression Analysis: An Intuitive Guide for Using and Interpreting Linear Models.

Subscribe by Email

Enter your email address to receive notifications of new posts by email.

    I won't send you spam. Unsubscribe at any time.

    Follow Me

    • FacebookFacebook
    • RSS FeedRSS Feed
    • TwitterTwitter

    Top Posts

    • How to Interpret P-values and Coefficients in Regression Analysis
    • How To Interpret R-squared in Regression Analysis
    • Mean, Median, and Mode: Measures of Central Tendency
    • Multicollinearity in Regression Analysis: Problems, Detection, and Solutions
    • How to Interpret the F-test of Overall Significance in Regression Analysis
    • Choosing the Correct Type of Regression Analysis
    • How to Find the P value: Process and Calculations
    • Interpreting Correlation Coefficients
    • How to do t-Tests in Excel
    • Z-table

    Recent Posts

    • Fishers Exact Test: Using & Interpreting
    • Percent Change: Formula and Calculation Steps
    • X and Y Axis in Graphs
    • Simpsons Paradox Explained
    • Covariates: Definition & Uses
    • Weighted Average: Formula & Calculation Examples

    Recent Comments

    • Dave on Control Variables: Definition, Uses & Examples
    • Jim Frost on How High Does R-squared Need to Be?
    • Mark Solomons on How High Does R-squared Need to Be?
    • John Grenci on Normal Distribution in Statistics
    • Jim Frost on Normal Distribution in Statistics

    Copyright © 2023 · Jim Frost · Privacy Policy