• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar
  • My Store
  • Glossary
  • Home
  • About Me
  • Contact Me

Statistics By Jim

Making statistics intuitive

  • Graphs
  • Basics
  • Hypothesis Testing
  • Regression
  • ANOVA
  • Probability
  • Time Series
  • Fun
  • Calculators

Indicator Variable

By Jim Frost

« Back to Glossary Index

An indicator variable is a binary variable that identifies whether an observation belongs to a specific category. It takes the value 1 if the condition is true and 0 otherwise. Indicator variables are essential for including categorical predictors in models that require numeric inputs, such as linear regression. Statisticians also refer to them less formally as dummy variables.

For a categorical variable with n distinct levels, only n–1 indicator variables are created for use in a regression model. One category is designated as the reference (or baseline) level. The remaining categories each receive their own indicator variable, allowing the model to estimate their effects relative to the baseline. Leaving the baseline out of the model prevents perfect collinearity while still capturing the full set of group comparisons.

Any level can serve as the baseline. Choose the one that makes the most sense for your research question or comparison of interest.

Imagine a variable representing weather type with three levels: Sunny, Cloudy, and Rainy. The indicator variable coding with two variables looks like the following:

Weather Type Sunny Indicator Cloudy Indicator
Sunny 1 0
Cloudy 0 1
Rainy 0 0

In this example, Rainy serves as the baseline level. Only Sunny and Cloudy receive indicator variables. When used in regression, the model’s intercept captures the expected value for rainy days, and the coefficients on the indicators show how the other conditions differ relative to the baseline value. Most statistical software automatically performs this indicator variable encoding when you include a categorical variable in a regression model.

Suppose the coefficient for the Cloudy indicator is –3.50 in a model predicting daily coffee sales in dollars. This means that, on average, sales on cloudy days are expected to be $3.50 lower than on rainy days, holding all other variables constant.

Related

Related Articles:
  • Glossary: Dummy Variable
  • Glossary: Dichotomous Variable
  • Nominal, Ordinal, Interval, and Ratio Scales
  • Scatterplots: Using, Examples, and Interpreting
  • Guide to Data Types and How to Graph Them in Statistics
« Back to Glossary Index

Primary Sidebar

Meet Jim

I’ll help you intuitively understand statistics by focusing on concepts and using plain English so you can concentrate on understanding your results.

Read More...

Buy My Introduction to Statistics Book!

Cover of my Introduction to Statistics: An Intuitive Guide ebook.

Buy My Hypothesis Testing Book!

Cover image of my Hypothesis Testing: An Intuitive Guide ebook.

Buy My Regression Book!

Cover for my ebook, Regression Analysis: An Intuitive Guide for Using and Interpreting Linear Models.

Subscribe by Email

Enter your email address to receive notifications of new posts by email.

    I won't send you spam. Unsubscribe at any time.

    Buy My Thinking Analytically Book!

    Cover for my book, Thinking Analytically: An Guide for Making Data-Driven Decisions.

    Top Posts

    • F-table
    • Cronbach’s Alpha: Definition, Calculations & Example
    • Z-table
    • How To Interpret R-squared in Regression Analysis
    • Cohens D: Definition, Using & Examples
    • Box Plot Explained with Examples
    • Multicollinearity in Regression Analysis: Problems, Detection, and Solutions
    • Interpreting Correlation Coefficients
    • Root Mean Square Error (RMSE)
    • T-Distribution Table of Critical Values

    Recent Posts

    • Data Collection Methods: Step-By-Step Guide with Examples
    • ANOVA Calculator
    • Positive Predictive Value: Meaning, Formula, and Interpretation
    • Median Absolute Deviation Calculator
    • Median Absolute Deviation: Definition, Finding & Formula
    • Outlier Calculator

    Recent Comments

    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Pareto Chart: Making, Reading & Examples

    Copyright © 2026 · Jim Frost · Privacy Policy