An autoregressive model (AR model) is a statistical model that analyzes and forecasts time series data. These models express the current value of a time series as a weighted sum of its previous values. The term “autoregressive” reflects this idea of self-reference—each value is regressed on past values of the same variable. After fitting the model, analysts can use it to forecast future values by applying the same structure to the most recent data.
Analysts commonly use AR models in economics, environmental science, and engineering for forecasting trends in time series where past behavior carries forward. For example, an economist might use an AR model to forecast next month’s unemployment rate based on previous months’ rates, assuming the pattern persists over time.
How an AR Model Works
An AR model is written as AR(p), where p is the number of lagged observations included in the model. For example, an AR(2) model uses the two previous time points to predict the current one. The general form looks like this:
Yₜ = ϕ₁Yₜ₋₁ + ϕ₂Yₜ₋₂ + … + ϕₚYₜ₋ₚ + εₜ
Where:
- Yₜ is the value at time t,
- ϕ₁ through ϕₚ are the model coefficients (weights),
- εₜ is the random error at time t.
Autoregressive models assume the data are stationary, meaning its statistical properties like mean and variance don’t change over time. If the data aren’t stationary, it often needs to be transformed (e.g., by differencing) before fitting an AR model.
When fitting an autoregressive model, the goal is to estimate the coefficients (ϕ-values) that determine how much weight to assign to each previous value in the series. The model chooses these coefficients by minimizing the prediction errors—the differences between the actual values and the values predicted by the model. This process ensures that the model closely reflects the time-dependent structure in the data.
Example
For example, suppose an economist fits an AR(2) model to forecast monthly unemployment rates and the estimated model is:
If last month’s unemployment rate was 7.2% and the month before was 7.5%, the model predicts the current month’s rate as:
This predicted value reflects how much influence recent months have on the current estimate, based on the learned weights from past data.
« Back to Glossary Index