• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar
  • My Store
  • Glossary
  • Home
  • About Me
  • Contact Me

Statistics By Jim

Making statistics intuitive

  • Graphs
  • Basics
  • Hypothesis Testing
  • Regression
  • ANOVA
  • Probability
  • Time Series
  • Fun
  • Calculators

Propensity Score Matching [PSM]

By Jim Frost

« Back to Glossary Index

Propensity score matching (PSM) is a method analysts use in observational studies to reduce confounder bias when comparing groups. It addresses a fundamental problem with observational studies: researchers do not randomly assign people to treatment or control groups. Instead, people have already chosen for themselves. Their decisions about whether to receive the treatment are often driven by their characteristics, such as age, income, education, or health. This self-selection creates differences between groups at the beginning of the study that can distort the results at the end.

To account for these initial differences, researchers use a propensity score, which is the estimated probability that a person would have chosen to receive the treatment based on their observed characteristics. Researchers typically calculate these scores using a logistic regression model that includes covariates which theory suggests will influence treatment decisions.

After the study estimates propensity scores, researchers match treated and untreated individuals with similar scores. The goal is to create a treatment group that is more comparable to the control group in terms of the background variables. Participants in either group who are too different from the other group might be dropped from the study. This propensity score matching process helps reduce confounding and makes the treatment effect estimate more credible—more like what you’d get if the groups had been assigned randomly.

Alternatively, researchers can include the propensity score directly as a covariate in a regression model. This approach adjusts for differences in the likelihood of treatment without discarding data. Both strategies aim to control for bias due to observed covariates, though they cannot address bias from unmeasured factors.

PSM Example

For example, suppose researchers want to study the effect of a new educational program in an observational study. Families have already made their decisions about whether to participate in the new program. Participation isn’t random—students with higher motivation or more support at home may be more likely to enroll. Simply comparing outcomes between those who did and didn’t participate could be misleading.

Using propensity score matching, researchers estimate each student’s likelihood of enrolling in the program based on characteristics like prior grades, school type, and family income. They then match participants to similar non-participants based on those scores, creating a fairer comparison to estimate the program’s impact.

Related

Related Articles:
  • What is an Observational Study: Definition & Examples
« Back to Glossary Index

Primary Sidebar

Meet Jim

I’ll help you intuitively understand statistics by focusing on concepts and using plain English so you can concentrate on understanding your results.

Read More...

Buy My Introduction to Statistics Book!

Cover of my Introduction to Statistics: An Intuitive Guide ebook.

Buy My Hypothesis Testing Book!

Cover image of my Hypothesis Testing: An Intuitive Guide ebook.

Buy My Regression Book!

Cover for my ebook, Regression Analysis: An Intuitive Guide for Using and Interpreting Linear Models.

Subscribe by Email

Enter your email address to receive notifications of new posts by email.

    I won't send you spam. Unsubscribe at any time.

    Buy My Thinking Analytically Book!

    Cover for my book, Thinking Analytically: An Guide for Making Data-Driven Decisions.

    Top Posts

    • F-table
    • Z-table
    • Cronbach’s Alpha: Definition, Calculations & Example
    • How To Interpret R-squared in Regression Analysis
    • Interpreting Correlation Coefficients
    • Cohens D: Definition, Using & Examples
    • Multicollinearity in Regression Analysis: Problems, Detection, and Solutions
    • How to Interpret P-values and Coefficients in Regression Analysis
    • T-Distribution Table of Critical Values
    • Box Plot Explained with Examples

    Recent Posts

    • Data Collection Methods: Step-By-Step Guide with Examples
    • ANOVA Calculator
    • Positive Predictive Value: Meaning, Formula, and Interpretation
    • Median Absolute Deviation Calculator
    • Median Absolute Deviation: Definition, Finding & Formula
    • Outlier Calculator

    Recent Comments

    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Pareto Chart: Making, Reading & Examples

    Copyright © 2026 · Jim Frost · Privacy Policy