Distribution

« Back to Glossary Index

A distribution in statistics describes how the values of a variable are spread or arranged across possible outcomes. They show which values occur more frequently, which occur less frequently, and whether the data are clustered, symmetric, or skewed.

Distributions can be represented with frequency or probability distributions, depending on whether you are summarizing actual data or modeling possible outcomes mathematically. Understanding distributions is essential for describing data, making predictions, and selecting appropriate statistical methods.

Frequency Distributions

A frequency distribution is an empirical summary of data. It shows how often values occur within a dataset, typically organized into a table or displayed in a graph such as a histogram.

There are several types of frequency distributions:

Simple frequency: Lists each value (or interval of values) and the number of times it occurs.
Relative frequency: Shows frequencies as percentages or proportions, which makes it easier to compare datasets of different sizes.
Cumulative frequency: Adds up frequencies across intervals, helping identify medians, quartiles, or percentiles.

Example: A teacher recording test scores for a class might use a grouped frequency distribution to show how many students scored in each range (60–69, 70–79, 80–89, 90–100). This makes overall performance patterns easy to see.

Benefit: Frequency distributions simplify raw data into a form that highlights trends, clusters, and outliers. They are often the first step in exploratory data analysis.

Probability Distributions

A probability distribution is a mathematical model that describes how a random variable behaves. Instead of summarizing observed data, they define the likelihood of different outcomes. They can be represented by probability mass functions (for discrete variables) or probability density functions (for continuous variables).

Two of the most widely used one are the following:

Normal: Continuous and shaped like a symmetric bell curve. Many natural and social phenomena approximate a normal distribution, such as adult heights or measurement errors. It is central to many statistical methods because of the central limit theorem.
Binomial: A discrete distribution that models the number of successes in a fixed number of trials, each with the same probability of success. An example is counting the number of heads in 10 coin flips.

Analysts use many other probability distributions in specialized areas, such as the Poisson distribution for modeling event counts, the exponential for modeling waiting times, and the t-distribution for small-sample inference. The choice of which one to use depends on the type of data and the question being studied.

Benefit: Probability distributions allow statisticians to estimate probabilities, test hypotheses, and make predictions beyond the observed data. They provide a theoretical framework for inference and decision-making under uncertainty.

In short, a distribution describes how values are arranged, either through a frequencies summarizing observed data or by modeling outcome probabilities mathematically. Both forms are fundamental for understanding data and applying statistics effectively.

« Back to Glossary Index

Distribution

Frequency Distributions

Probability Distributions

Related

Subscribe by Email