• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar
  • My Store
  • Glossary
  • Home
  • About Me
  • Contact Me

Statistics By Jim

Making statistics intuitive

  • Graphs
  • Basics
  • Hypothesis Testing
  • Regression
  • ANOVA
  • Probability
  • Time Series
  • Fun
  • Calculators

Dummy Variable

By Jim Frost

« Back to Glossary Index

A dummy variable is a binary variable indicating the presence or absence of a condition. It takes the value 1 if the observation belongs to a specific category and 0 otherwise. Dummy variables allow you to include categorical variables in regression models by translating qualitative groupings into a numeric format. Statisticians also refer to them more formally as indicator variables.

To include a categorical variable with n levels in a regression model, you create n–1 dummy variables. Each represents one of the categories except for the baseline level, which is absorbed into the model intercept. Leaving the baseline out of the model prevents perfect collinearity while still capturing the full set of group comparisons.

Any level can serve as the baseline. Choose the one that makes the most sense for your research question or comparison of interest.

For example, suppose you have a categorical variable for job type with three levels: Manager, Technician, and Clerk. The dummy coding with two variables looks like the following:

Job Type Manager Dummy Technician Dummy
Manager 1 0
Technician 0 1
Clerk 0 0

Here, Clerk is the baseline category. It is not assigned its own dummy variable because its effects are captured by the model intercept. When used in regression, the coefficients on the indicators show how those conditions differ relative to the baseline value. Most statistical software automatically performs this indicator variable encoding when you include a categorical variable in a regression model.

Suppose a regression model predicting salary includes these dummy variables, and the Manager dummy variable has a coefficient of 5,000. This result indicates that managers earn a mean of $5,000 more than clerks, holding all other variables constant.

Related

Related Articles:
  • Glossary: Indicator Variable
  • Glossary: Dichotomous Variable
  • Glossary: Mutually Exclusive
« Back to Glossary Index

Primary Sidebar

Meet Jim

I’ll help you intuitively understand statistics by focusing on concepts and using plain English so you can concentrate on understanding your results.

Read More...

Buy My Introduction to Statistics Book!

Cover of my Introduction to Statistics: An Intuitive Guide ebook.

Buy My Hypothesis Testing Book!

Cover image of my Hypothesis Testing: An Intuitive Guide ebook.

Buy My Regression Book!

Cover for my ebook, Regression Analysis: An Intuitive Guide for Using and Interpreting Linear Models.

Subscribe by Email

Enter your email address to receive notifications of new posts by email.

    I won't send you spam. Unsubscribe at any time.

    Buy My Thinking Analytically Book!

    Cover for my book, Thinking Analytically: An Guide for Making Data-Driven Decisions.

    Top Posts

    • F-table
    • Cronbach’s Alpha: Definition, Calculations & Example
    • Z-table
    • How To Interpret R-squared in Regression Analysis
    • Accuracy vs Precision: Differences & Examples
    • Box Plot Explained with Examples
    • Interpreting Correlation Coefficients
    • How to Interpret P-values and Coefficients in Regression Analysis
    • Multicollinearity in Regression Analysis: Problems, Detection, and Solutions
    • Cohens D: Definition, Using & Examples

    Recent Posts

    • Data Collection Methods: Step-By-Step Guide with Examples
    • ANOVA Calculator
    • Positive Predictive Value: Meaning, Formula, and Interpretation
    • Median Absolute Deviation Calculator
    • Median Absolute Deviation: Definition, Finding & Formula
    • Outlier Calculator

    Recent Comments

    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Pareto Chart: Making, Reading & Examples

    Copyright © 2026 · Jim Frost · Privacy Policy