• Skip to secondary menu
  • Skip to main content
  • Skip to primary sidebar
  • My Store
  • Glossary
  • Home
  • About Me
  • Contact Me

Statistics By Jim

Making statistics intuitive

  • Graphs
  • Basics
  • Hypothesis Testing
  • Regression
  • ANOVA
  • Probability
  • Time Series
  • Fun
  • Calculators

Classification

By Jim Frost

« Back to Glossary Index

What is Classification?

Classification is a type of predictive modeling that assigns items or observations to one of several predefined categories based on input data. It is used when the outcome variable is categorical. For example, predicting whether an email is spam or not spam, whether a transaction is fraudulent or legitimate, or whether a patient’s test result is positive or negative.

The goal of a classification model is to learn patterns from a labeled dataset, where the correct category for each observation is already known. The model then applies what it has learned to new, unlabeled data to predict the most likely category. This process falls under supervised learning, because the model is trained using known outcomes.

Classification tasks can involve just two categories (binary classification), or more than two (multiclass classification). Some common examples include:

  • Determining whether a customer will churn or stay
  • Categorizing loan applications as high risk, medium risk, or low risk
  • Identifying species of plants or animals based on physical traits

Methods

Many different methods can be used for classification, including the following:

  • Logistic regression
  • Decision trees and random forests
  • Support vector machines (SVM)
  • Naive Bayes classifiers
  • K-nearest neighbors (KNN)
  • Neural networks, including deep learning models
  • Gradient boosting machines (e.g., XGBoost)

Analysts use various metrics to evaluate classification model performance including accuracy, precision, recall, F1 score, and the confusion matrix. The choice depends on the goals of the analysis and the consequences of different types of classification errors. In some cases, especially in medical or diagnostic contexts, related metrics like sensitivity, specificity, positive predictive value, negative predictive value, and likelihood ratios may also be used to assess model performance.

Classification is a core task in machine learning and predictive analytics, and it plays a key role in applications such as image recognition, medical diagnosis, spam detection, and customer segmentation.

Related

Related Articles:
  • Logistic Regression Overview with Example
  • Glossary: Supervised Learning
  • Glossary: Mutually Exclusive
  • Glossary: Prevalence
  • Principal Component Analysis Guide & Example
« Back to Glossary Index

Primary Sidebar

Meet Jim

I’ll help you intuitively understand statistics by focusing on concepts and using plain English so you can concentrate on understanding your results.

Read More...

Buy My Introduction to Statistics Book!

Cover of my Introduction to Statistics: An Intuitive Guide ebook.

Buy My Hypothesis Testing Book!

Cover image of my Hypothesis Testing: An Intuitive Guide ebook.

Buy My Regression Book!

Cover for my ebook, Regression Analysis: An Intuitive Guide for Using and Interpreting Linear Models.

Subscribe by Email

Enter your email address to receive notifications of new posts by email.

    I won't send you spam. Unsubscribe at any time.

    Buy My Thinking Analytically Book!

    Cover for my book, Thinking Analytically: An Guide for Making Data-Driven Decisions.

    Top Posts

    • F-table
    • Cronbach’s Alpha: Definition, Calculations & Example
    • Z-table
    • How To Interpret R-squared in Regression Analysis
    • Accuracy vs Precision: Differences & Examples
    • Box Plot Explained with Examples
    • Interpreting Correlation Coefficients
    • How to Interpret P-values and Coefficients in Regression Analysis
    • Multicollinearity in Regression Analysis: Problems, Detection, and Solutions
    • Cohens D: Definition, Using & Examples

    Recent Posts

    • Data Collection Methods: Step-By-Step Guide with Examples
    • ANOVA Calculator
    • Positive Predictive Value: Meaning, Formula, and Interpretation
    • Median Absolute Deviation Calculator
    • Median Absolute Deviation: Definition, Finding & Formula
    • Outlier Calculator

    Recent Comments

    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Skata na fas on Comparing Regression Lines with Hypothesis Tests
    • Jim Frost on Pareto Chart: Making, Reading & Examples

    Copyright © 2026 · Jim Frost · Privacy Policy