• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar
  • My Store
  • Glossary
  • Home
  • About Me
  • Contact Me

Statistics By Jim

Making statistics intuitive

  • Graphs
  • Basics
  • Hypothesis Testing
  • Regression
  • ANOVA
  • Probability
  • Time Series
  • Fun

distributions

F-table

By Jim Frost Leave a Comment

These F-tables provide the critical values for right-tail F-tests. Your F-test results are statistically significant when its test statistic is greater than this value. [Read more…] about F-table

Filed Under: Hypothesis Testing Tagged With: conceptual, distributions, graphs

Sampling Distribution: Definition, Formula & Examples

By Jim Frost 5 Comments

What is a Sampling Distribution?

A sampling distribution of a statistic is a type of probability distribution created by drawing many random samples of a given size from the same population. These distributions help you understand how a sample statistic varies from sample to sample. [Read more…] about Sampling Distribution: Definition, Formula & Examples

Filed Under: Hypothesis Testing Tagged With: conceptual, distributions, graphs

Critical Value: Definition, Finding & Calculator

By Jim Frost Leave a Comment

What is a Critical Value?

A critical value defines regions in the sampling distribution of a test statistic. These values play a role in both hypothesis tests and confidence intervals. In hypothesis tests, critical values determine whether the results are statistically significant. For confidence intervals, they help calculate the upper and lower limits. [Read more…] about Critical Value: Definition, Finding & Calculator

Filed Under: Hypothesis Testing Tagged With: conceptual, distributions, graphs

Chi-Square Table

By Jim Frost 2 Comments

This chi-square table provides the critical values for chi-square (χ2) hypothesis tests. The column and row intersections are the right-tail critical values for a given probability and degrees of freedom. [Read more…] about Chi-Square Table

Filed Under: Hypothesis Testing Tagged With: distributions, graphs

Z-table

By Jim Frost 1 Comment

Z-Score Table

A z-table, also known as the standard normal table, provides the area under the curve to the left of a z-score. This area represents the probability that z-values will fall within a region of the standard normal distribution. Use a z-table to find probabilities corresponding to ranges of z-scores and to find p-values for z-tests. [Read more…] about Z-table

Filed Under: Hypothesis Testing Tagged With: distributions, graphs

T-Distribution Table of Critical Values

By Jim Frost 5 Comments

This t-distribution table provides the critical t-values for both one-tailed and two-tailed t-tests, and confidence intervals. Learn how to use this t-table with the information, examples, and illustrations below the table. [Read more…] about T-Distribution Table of Critical Values

Filed Under: Hypothesis Testing Tagged With: distributions

5 Number Summary: Definition, Finding & Using

By Jim Frost 6 Comments

What is the 5 Number Summary?

The 5 number summary is an exploratory data analysis tool that provides insight into the distribution of values for one variable. Collectively, this set of statistics describes where data values occur, their central tendency, variability, and the general shape of their distribution. [Read more…] about 5 Number Summary: Definition, Finding & Using

Filed Under: Basics Tagged With: analysis example, distributions, interpreting results

Lognormal Distribution: Uses, Parameters & Examples

By Jim Frost 2 Comments

What is the Lognormal Distribution?

The lognormal distribution is a continuous probability distribution that models right-skewed data. The shape of the lognormal distribution is comparable to the Weibull and loglogistic distributions. [Read more…] about Lognormal Distribution: Uses, Parameters & Examples

Filed Under: Probability Tagged With: distributions, graphs

A Statistical Thanksgiving: Global Income Distributions

By Jim Frost 11 Comments

In the United States, our Thanksgiving holiday is fast approaching. On this day, we give thanks for the good things in our lives.

Painting of the first Thanksgiving by Brownscombe.For this post, I wanted to quantify how thankful we should be. Ideally, I’d quantify something truly meaningful, like happiness. Unfortunately, most countries are not like Bhutan, which measures the gross national happiness and incorporates it into their five-year development plans.

Instead, I’ll focus on something that is more concrete and regularly measured around the world—income. By examining income distributions, I’ll show that you have much to be thankful for, and so does most of the world! [Read more…] about A Statistical Thanksgiving: Global Income Distributions

Filed Under: Probability Tagged With: distributions, graphs

Variance: Definition, Formulas & Calculations

By Jim Frost 2 Comments

Variance is a measure of variability in statistics. It assesses the average squared difference between data values and the mean. Unlike some other statistical measures of variability, it incorporates all data points in its calculations by contrasting each value to the mean. [Read more…] about Variance: Definition, Formulas & Calculations

Filed Under: Basics Tagged With: conceptual, distributions, interpreting results

Uniform Distribution: Definition & Examples

By Jim Frost 4 Comments

What is a Uniform Distribution?

The uniform distribution is a symmetric probability distribution where all outcomes have an equal likelihood of occurring. All values in the distribution have a constant probability, making them uniformly distributed. This distribution is also known as the rectangular distribution because of its shape in probability distribution plots, as I’ll show you below. [Read more…] about Uniform Distribution: Definition & Examples

Filed Under: Probability Tagged With: conceptual, distributions, graphs

Frequency Table: How to Make & Examples

By Jim Frost Leave a Comment

What is a Frequency Table?

A frequency table lists a set of values and how often each one appears. Frequency is the number of times a specific data value occurs in your dataset. These tables help you understand which data values are common and which are rare. These tables organize your data and are an effective way to present the results to others. Frequency tables are also known as frequency distributions because they allow you to understand the distribution of values in your dataset. [Read more…] about Frequency Table: How to Make & Examples

Filed Under: Basics Tagged With: conceptual, distributions

Mean Absolute Deviation: Definition, Finding & Formula

By Jim Frost 4 Comments

What is the Mean Absolute Deviation?

The mean absolute deviation (MAD) is a measure of variability that indicates the average distance between observations and their mean. MAD uses the original units of the data, which simplifies interpretation. Larger values signify that the data points spread out further from the average. Conversely, lower values correspond to data points bunching closer to it. The mean absolute deviation is also known as the mean deviation and average absolute deviation. [Read more…] about Mean Absolute Deviation: Definition, Finding & Formula

Filed Under: Basics Tagged With: choosing analysis, conceptual, distributions

Stem and Leaf Plot: Making, Reading & Examples

By Jim Frost 2 Comments

What is a Stem and Leaf Plot?

Stem and leaf plots display the shape and spread of a continuous data distribution. These graphs are similar to histograms, but instead of using bars, they show digits. It’s a particularly valuable tool during exploratory data analysis. They can help you identify the central tendency, variability, skewness of your distribution, and outliers. Stem and leaf plots are also known as stemplots. [Read more…] about Stem and Leaf Plot: Making, Reading & Examples

Filed Under: Graphs Tagged With: choosing analysis, distributions, interpreting results

Skewed Distribution: Definition & Examples

By Jim Frost 2 Comments

What is a Skewed Distribution?

A skewed distribution occurs when one tail is longer than the other. Skewness defines the asymmetry of a distribution. Unlike the familiar normal distribution with its bell-shaped curve, these distributions are asymmetric. The two halves of the distribution are not mirror images because the data are not distributed equally on both sides of the distribution’s peak. [Read more…] about Skewed Distribution: Definition & Examples

Filed Under: Basics Tagged With: conceptual, distributions, graphs

Range of a Data Set

By Jim Frost 1 Comment

The range of a data set is the difference between the maximum and the minimum values. It measures variability using the same units as the data. Larger values represent greater variability.

The range is the easiest measure of dispersion to calculate and interpret in statistics, but it has some limitations. In this post, I’ll show you how to find the range mathematically and graphically, interpret it, explain its limitations, and clarify when to use it. [Read more…] about Range of a Data Set

Filed Under: Basics Tagged With: conceptual, distributions, graphs, interpreting results

Z-score: Definition, Formula, and Uses

By Jim Frost 9 Comments

A z-score measures the distance between a data point and the mean using standard deviations. Z-scores can be positive or negative. The sign tells you whether the observation is above or below the mean. For example, a z-score of +2 indicates that the data point falls two standard deviations above the mean, while a -2 signifies it is two standard deviations below the mean. A z-score of zero equals the mean. Statisticians also refer to z-scores as standard scores, and I’ll use those terms interchangeably. [Read more…] about Z-score: Definition, Formula, and Uses

Filed Under: Basics Tagged With: conceptual, distributions, Excel, probability

Relative Frequencies and Their Distributions

By Jim Frost 5 Comments

A relative frequency indicates how often a specific kind of event occurs within the total number of observations. It is a type of frequency that uses percentages, proportions, and fractions.

In this post, learn about relative frequencies, the relative frequency distribution, and its cumulative counterpart. [Read more…] about Relative Frequencies and Their Distributions

Filed Under: Basics Tagged With: conceptual, distributions, graphs

Empirical Rule: Definition & Formula

By Jim Frost 1 Comment

What is the Empirical Rule?

The empirical rule in statistics, also known as the 68 95 99 rule, states that for normal distributions, 68% of observed data points will lie inside one standard deviation of the mean, 95% will fall within two standard deviations, and 99.7% will occur within three standard deviations. [Read more…] about Empirical Rule: Definition & Formula

Filed Under: Probability Tagged With: conceptual, distributions, graphs

Interquartile Range (IQR): How to Find and Use It

By Jim Frost 19 Comments

What is the Interquartile Range (IQR)?

The interquartile range (IQR) measures the spread of the middle half of your data. It is the range for the middle 50% of your sample. Use the IQR to assess the variability where most of your values lie. Larger values indicate that the central portion of your data spread out further. Conversely, smaller values show that the middle values cluster more tightly.

In this post, learn what the interquartile range means and the many ways to use it! I’ll show you how to find the interquartile range, use it to measure variability, graph it in boxplots to assess distribution properties, use it to identify outliers, and test whether your data are normally distributed.

The interquartile range is one of several measures of variability. To learn about the others and how the IQR compares, read my post, Measures of Variability.

Interquartile Range Definition

To visualize the interquartile range, imagine dividing your data into quarters. Statisticians refer to these quarters as quartiles and label them from low to high as Q1, Q2, Q3, and Q4. The lowest quartile (Q1) covers the smallest quarter of values in your dataset. The upper quartile (Q4) comprises the highest quarter of values. The interquartile range is the middle half of the data that lies between the upper and lower quartiles. In other words, the interquartile range includes the 50% of data points that are above Q1 and below Q4. The IQR is the red area in the graph below, containing Q2 and Q3 (not labeled).

Graph the illustrates the interquartile range (IQR) as a measure of variability.

When measuring variability, statisticians prefer using the interquartile range instead of the full data range because extreme values and outliers affect it less. Typically, use the IQR with a measure of central tendency, such as the median, to understand your data’s center and spread. This combination creates a fuller picture of your data’s distribution.

Unlike the more familiar mean and standard deviation, the interquartile range and the median are robust measures. Outliers do not strongly influence either statistic because they don’t depend on every value. Additionally, like the median, the interquartile range is superb for skewed distributions. For normal distributions, you can use the standard deviation to determine the percentage of observations that fall specific distances from the mean. However, that doesn’t work for skewed distributions, and the IQR is an excellent alternative.

Related posts: Quartiles: Definition, Finding, and Using, Median: Definition and Uses, and What are Robust Statistics?

How to Find the Interquartile Range (IQR) by Hand

The formula for finding the interquartile range takes the third quartile value and subtracts the first quartile value.

IQR = Q3 – Q1

Equivalently, the interquartile range is the region between the 75th and 25th percentile (75 – 25 = 50% of the data).

Using the IQR formula, we need to find the values for Q3 and Q1. To do that, simply order your data from low to high and split the value into four equal portions.

I’ve divided the dataset below into quartiles. The interquartile range extends from the Q1 value to the Q3 value. For this dataset, the interquartile range is 39 – 20 = 19.

Dataset that shows how to find the interquartile range (IQR)

Note that different methods and statistical software programs will find slightly different Q1 and Q3 values, which affects the interquartile range. These variations stem from alternate ways of finding percentiles. For details about that, read my post about Percentiles: Interpretations and Calculations.

How to Find the Interquartile Range using Excel

All statistical software packages will identify the interquartile range as part of their descriptive statistics. Here, I’ll show you how to find it using Excel because most readers can access this application.

To follow along, download the Excel file: IQR. This dataset is the same as the one I use in the illustration above. This file also includes the interquartile range calculations for finding outliers and the IQR normality test described later in this post.

In Excel, you’ll need to use the QUARTILE.EXC function, which has the following arguments: QUARTILE.EXC(array, quart)

  • Array: Cell range of numeric values.
  • Quart: Quartile you want to find.

In my spreadsheet, the data are in cells A2:A20. Consequently, I’ll use the following syntax to find Q1 and Q3, respectively:

  • =QUARTILE.EXC(A2:A20,1)
  • =QUARTILE.EXC(A2:A20,3)

As with my example of finding the interquartile range by hand, Excel indicates that Q3 is 39 and Q1 is 20. IQR = 39 – 20 = 19

Related post: Descriptive Statistics in Excel

Using Boxplots to Graph the Interquartile Range

Boxplots are a great way to visualize interquartile ranges and their relation to the median and the overall distribution. These graphs display ranges of values based on quartiles and show asterisks for outliers that fall outside the whiskers. Boxplots work by splitting your data into quarters.

Let’s look at the boxplot anatomy before getting to the example. Notice how it divides your data into quartiles.

Diagram of boxplots that displays the interquartile range (IQR).

The box in the boxplot is your interquartile range! It contains 50% of your data. By comparing the size of these boxes, you can understand your data’s variability. More dispersed distributions have wider boxes.

Additionally, find where the median line falls within each interquartile box. If the median is closer to one side or the other of the box, it’s a skewed distribution. When the median is near the center of the interquartile range, your distribution is symmetric.

For example, in the boxplot below, method 3 has the highest variability in scores and is left-skewed. Conversely, method 2 has a tighter distribution that is symmetrical, although it also has an outlier—read the next section for more about that!

Example of a boxplot that displays scores by teaching method.

Related post: Boxplots versus Individual Value Plots

Using the IQR to Find Outliers

The interquartile range can help you identify outliers. For other methods of finding outliers, the outliers themselves influence the calculations, potentially causing you to miss them. Fortunately, interquartile ranges are relatively robust against outlier influence and can avoid this problem. This method also does not assume the data follow the normal distribution or any other distribution. That’s why using the IQR to find outliers is one of my favorite methods!

To find outliers, you’ll need to know your data’s IQR, Q1, and Q3 values. Take these values and input them into the equations below. Statisticians call the result for each equation an outlier gate. I’ve included these calculations in the IQR example Excel file.

Q1 − 1.5 * IQR: Lower outlier gate.

Q3 + 1.5 * IQR: Upper outlier gate.

Using the same example dataset, I’ll calculate the two outlier gates. For that dataset, the interquartile range is 19, Q1 = 20, and Q3 = 39.

Lower outlier gate: 20 – 1.5 * 19 = -8.5

Upper outlier gate: 39 + 1.5 * 19 = 67.5

Then look for values in the dataset that are below the lower gate or above the upper gate. For the example dataset, there are no outliers. All values fall between these two gates.

Boxplots typically use this method to identify outliers and display asterisks when they exist. In the teaching method boxplot above, notice that the Method 2 group has an outlier. The researchers should investigate that value.

Related post: Five Ways to Find Outliers

Using the Interquartile Range to Test Normality

You can even use the interquartile range as a simple test to determine whether your data are normally distributed. When data follow a normal distribution, the interquartile range will have specific properties. The image below highlights these properties. Specifically, in our calculations below, we’ll use the standard deviations (σ) that correspond to the interquartile range, -0.67 and 0.67.

Image shows how a probability distribution function relates to a boxplot.
By Jhguch at en.wikipedia, CC BY-SA 2.5, Link

You can assess whether your IQR is consistent with a normal distribution. However, this test should not replace a formal normality hypothesis test.

To perform this test, you’ll need to know the sample standard deviation (s) and sample mean (x̅). Input these values into the formulas for Q1 and Q3 below.

  • Q1 = x̅ − (s * 0.67)
  • Q3 = x̅ + (s * 0.67)

Compare these calculated values to your data’s actual Q1 and Q3 values. If they are notably different, your data might not follow the normal distribution.

We’ll return to our example dataset from before. Our actual Q1 and Q3 are 20 and 39, respectively.

The sample average is 31.3, and its standard deviation is 14.1. I’ll input those values into the equations.

Q1 = 31.3 – (14.1 * 0.67) = 21.9

Q3 = 31.3 + (14.1 * 0.67) = 40.7

The calculated values are pretty close to the actual data values, suggesting that our data follow the normal distribution. I’ve included these calculations in the IQR example spreadsheet.

Related posts: Understanding the Normal Distribution and How to Identify the Distribution of Your Data

Filed Under: Basics Tagged With: conceptual, distributions, Excel

  • « Go to Previous Page
  • Go to page 1
  • Go to page 2
  • Go to page 3
  • Go to page 4
  • Go to Next Page »

Primary Sidebar

Meet Jim

I’ll help you intuitively understand statistics by focusing on concepts and using plain English so you can concentrate on understanding your results.

Read More...

Buy My Introduction to Statistics Book!

Cover of my Introduction to Statistics: An Intuitive Guide ebook.

Buy My Hypothesis Testing Book!

Cover image of my Hypothesis Testing: An Intuitive Guide ebook.

Buy My Regression Book!

Cover for my ebook, Regression Analysis: An Intuitive Guide for Using and Interpreting Linear Models.

Subscribe by Email

Enter your email address to receive notifications of new posts by email.

    I won't send you spam. Unsubscribe at any time.

    Follow Me

    • FacebookFacebook
    • RSS FeedRSS Feed
    • TwitterTwitter

    Top Posts

    • How to Interpret P-values and Coefficients in Regression Analysis
    • How To Interpret R-squared in Regression Analysis
    • Z-table
    • How to do t-Tests in Excel
    • Multicollinearity in Regression Analysis: Problems, Detection, and Solutions
    • How to Find the P value: Process and Calculations
    • F-table
    • How to Interpret the F-test of Overall Significance in Regression Analysis
    • Mean, Median, and Mode: Measures of Central Tendency
    • One-Tailed and Two-Tailed Hypothesis Tests Explained

    Recent Posts

    • Sampling Frame: Definition & Examples
    • Probability Mass Function: Definition, Uses & Example
    • Using Scientific Notation
    • Selection Bias: Definition & Examples
    • ANCOVA: Uses, Assumptions & Example
    • Fibonacci Sequence: Formula & Uses

    Recent Comments

    • Jim Frost on Beta Distribution: Uses, Parameters & Examples
    • Norman Abraham on Beta Distribution: Uses, Parameters & Examples
    • Morris on Validity in Research and Psychology: Types & Examples
    • Jim Frost on What are Robust Statistics?
    • Allan Fraser on What are Robust Statistics?

    Copyright © 2023 · Jim Frost · Privacy Policy