A linear regression equation describes the relationship between the independent variables (IVs) and the dependent variable (DV). It can also predict new values of the DV for the IV values you specify.
In this post, we’ll explore the various parts of the regression line equation and understand how to interpret it using an example.
I’ll mainly look at simple regression, which has only one independent variable. These models are easy to graph, and we can more intuitively understand the linear regression equation.
Related post: Independent and Dependent Variables
Deriving the Linear Regression Equation
Least squares regression produces a linear regression equation, providing your key results all in one place. How does the regression procedure calculate the equation?
The process is complex, and analysts always use software to fit the models. For this post, I’ll show you the general process. However, you can also use my free online Linear Regression Calculator! It analyzes the relationship between two variables using simple linear, quadratic, or cubic models. It also graphs the data with the best fit line, displays the regression equation, and provides key model statistics.
Consider this height and weight dataset. There is clearly a relationship between these two variables. But how would you draw the best line to represent it mathematically?

Regression analysis draws a line through these points that minimizes their overall distance from the line. More specifically, least squares regression minimizes the sum of the squared differences between the data points and the line, which statisticians call the sum of squared errors (SSE).
To learn how least squares regression calculates the coefficients and y-intercept with a worked example, read my post Least Squares Regression: Definition, Formulas & Example.
Let’s fit the model!
Now, we can see the line and its corresponding linear regression equation, which I’ve circled.

This graph shows the observations with a line representing the regression model. Following the practice in statistics, the Y-axis (vertical) displays the dependent variable, weight. The X-axis (horizontal) shows the independent variable, which is height.
The line is also known as the fitted line, and it produces a smaller SSE than any other line you can draw through these observations.
Like all lines you’ve studied in algebra, you can describe them with an equation. For this analysis, we call it the equation for the regression line. So, let’s quickly revisit algebra!
Learn more about the X and Y Axis.
Equation for a Line
Think back to algebra and the equation for a line: y = mx + b.
In the equation for a line,
- Y = the vertical value.
- M = slope (rise/run).
- X = the horizontal value.
- B = the value of Y when X = 0 (i.e., y-intercept).

So, if the slope is 3, then as X increases by 1, Y increases by 1 X 3 = 3. Conversely, if the slope is -3, then Y decreases as X increases. Consequently, you need to know both the sign and the value to understand the direction of the line and how steeply it rises or falls. Higher absolute values correspond to steeper slopes.
B is the Y-intercept. It tells you the value of Y for the line as it crosses the Y-axis, as indicated by the red dot below.

Using the intercept and slope together allows you to calculate any point on a line. The example below displays lines for two equations.

If you need a refresher, read my Guide to the Slope Intercept Form of Linear Equations.
Applying these Ideas to a Linear Regression Equation
A regression line equation uses the same ideas. Here’s how the regression concepts correspond to algebra:
- Y-axis represents values of the dependent variable.
- X-axis represents values of the independent variable.
- Sign of coefficient indicates whether the relationship is positive or negative.
- Coefficient value is the slope.
- Constant or Y-intercept is B.
In regression analysis, the procedure estimates the best values for the constant and coefficients. Typically, regression models switch the order of terms in the equation compared to algebra by displaying the constant first and then the coefficients. It also uses different notation, as shown below for simple regression.
![]()
Using this notation, β0 is the constant, while β1 is the coefficient for X. Multiple regression just adds more βkXk terms to the equation up to K independent variables (Xs).
On the fitted line plots below, I’ve circle portions of the linear regression equation to identify its components.
Coefficient = Slope
Visually, the fitted line has a positive slope corresponding to the positive coefficient sign we obtained earlier. The slope of the line equals the +106.5 coefficient that I circled in the equation for the regression line. This coefficient indicates how much mean weight increases as we increase height by one unit.

Constant = Y-intercept
You’ll notice that the previous graph doesn’t display the regression line crossing the Y-axis. The constant is so negative that the default axes settings don’t show it. I’ve adjusted the axes in the chart below to include the y-intercept and illustrate the constant more clearly.

If you extend the regression line downwards until it reaches the Y-axis, you’ll find that the y-intercept value is -114.3—just as the equation indicates.
Interpreting the Regression Line Equation
Let’s combine all these parts of a linear regression equation and see how to interpret them.
- Coefficient signs: Indicates whether the dependent variable increases (+) or decreases (-) as the IV increases.
- Coefficient values: Represents the average change in the DV given a one-unit increase in the IV.
- Constant: Value of the DV when the IV equals zero.
Next, we’ll apply that to the linear regression equation from our model.
Weight kg = -114.3 + 106.5 Height M
The coefficient sign is positive, meaning that weight tends to increase as height increases. Additionally, the coefficient is 106.5. This value indicates that if you increase height by 1m, weight increases by an average of 106.5kg. However, our data have a range of only 0.4M. So, we can’t use a full meter but a proportion of one. For example, with an additional 0.1m, you’d expect a 10.65kg increase.
Learn more about How to Interpret Coefficients and Their P-Values.
For our model, the constant in the linear regression equation technically indicates the mean weight is -114.3kg when height equals zero. However, that doesn’t make intuitive sense, especially because we have a negative constant. A negative weight? How do we explain that?
Notice in the graph that the data are far from the y-axis. Our model doesn’t apply to that region. You can only interpret the model for your data’s observation space.
Hence, we can’t interpret the constant for our model, which is fairly typical in regression analysis for multiple reasons. Learn more about that by reading about The Constant in Regression Analysis.
Using a Linear Regression Equation for Predictions
You can enter values for the independent variables in a regression line equation to predict the mean of the dependent variable. For our model, we can enter a height to predict the average weight.
A couple of caveats. The equation only applies to the range of the data. So, we need to stick with heights between 1.3 – 1.7m. Also, the data are for pre-teen girls. Consequently, the regression model is valid only for that population.
With that in mind, let’s calculate the mean height for a girl who is 1.6m tall by entering that value into our linear regression equation:
Weight kg = 114.3 + 106.5 * 1.6 = 56.1.
The average weight for someone 1.6m tall in this population is 56.1kg. On the chart below, notice how the line passes through this point (1.6, 56.1). Also, note that you’ll see data points above and below the line. The prediction gives the average, but that range of data points represents the prediction’s precision.

Learn more about Using Regression to Make Predictions.
When you know how to interpret a linear regression equation, you gain valuable insights into the model. However, for your conclusions to be trustworthy, be sure that you satisfy the least squares regression assumptions.

Nice, well explained
Hello Jim. I appreciate your website. I have a question that is, perhaps, only tangential to the topic at hand, but would appreciate your feedback. While not a mathematician myself, I work for a county and develop multiplicative linear regression models for the purpose of valuing real property. As were are preparing for our next reappraisal, I am reviewing and attempting to automate an older practice which uses output from the regression model (specifically the prediction equation) to develop remodel value. In dissecting the existing process, I realized that the method of record truncates the predication statement (thus using only the constant and 5 of the many independent variables). My understanding is that this effectively violates the integrity of the prediction statement and will produce results that may look good, but are invalid, especially since the constant is used without the context of the entire prediction statement. I quizzed Chat GPT on this, specifically asking for academic resources/references and among its responses was: “Frost notes that while individual predictor variables might sometimes be removed or adjusted, the constant is essential because it stabilizes the equation and ensures that estimates remain unbiased, preserving the model’s overall accuracy. Removing the constant or applying it outside of the full prediction equation undermines its intended function and may result in misinterpretations or invalid predictions.” Chat GPT linked this to your site, but I am unable to find an article that addresses this directly. Is there such an article? Or, in responding (perhaps by email?) would your agree that Chat GPT has accurately summarized your perspective? Essentially, I’m looking for an academic reference to back up my own observations and experience because there will be some institutional inertia to overcome and some academic clout/cred would be useful. Thank you.
Hi Matthew,
There are several issues here. For starters, no, ChatGPT is not accurately summarizing what I’ve written. Or at least not entirely.
Removing the constant from a regression equation is almost never a good idea. It will almost always bias the results. Including the constant does guarantee that your model as a whole is unbiased (doesn’t systematically predict too high or low). It forces the residual mean to equal zero. However, contrary to what ChatGPT says, including the constant doesn’t “ensure” the model is good. Including the constant is generally necessary but not sufficient by itself. While the overall model is unbiased with the constant, there might ranges where it does over and/or under predict systematically.
So, including the constant is crucial but it isn’t a cure all. For more information on that issue, read my article about the Constant in Regression Analysis where I write about this in detail. That article is the one ChatGPT was trying to summarize (or possibly my discussion of the issue in my Regression Analysis book).
However, your more serious problem is that you’re only using 5 variables out of what sounds like many variables. Yes, you are correct. You have good reason to be suspicious of the predicted values by not using all the variables. To make valid predictions, you really need to use the complete model, which includes the constant and all the predictors. And that’s assuming the model itself is valid (i.e., that is satisfies all the assumptions). The less bad scenario for using a truncated model is that predictions are unbiased (correct on average) but their precision is reduced (larger spread around the correct value). However, it is very possible that the predictions are also biased (incorrect on average), which would be worse.
Without evaluating both models, you have no way of knowing which of those two cases relating to the predictions (bad or worse) is applicable. (Although, you can assume the chance of bias increases with each additional missing variable.)
I hope that helps!
Thanks so much. Really helpful.
Hello Jim,
first of all, I have to say, it is a pleasure to read your books!
I’m focused on the linear relationship and the different statistical tests, interpretation etc.
The final goal is a paper concerning the estimation of investments in the area of plant construction, on the basis of historical data and a recommended non- linear relationship. (y=X^n)
The accuracy of the mathematical model will be described with a confidence interval for the expected values and a prediction interval for individuals. An addition interesting point is the calculation of a prediction interval for future observations.
My question based on a book which describes the confidence interval of a linear relationship for the expected values as a linear function! This is contrary to my understanding, statistic books and the law of large numbers.
I would greatly appreciate to get an information with regard to that discrepancy.
Best wishes from Germany and thank you in advance
Klaus
I was searching during last 4 days for regression calculation & interpretation, I’ve learnt a lot. Still I’m searching to interpret the calculated data by MINITAB.
Hi Jim
Wondering what regression (or other) test we would run if we had a research question that looked at somethin like : does work out intensity (low, medium, high) and self reporting of sleep quality (good vs. bad) – our independent variables – affect the BMI % of respondents (where BMI % will be our dependent variable). Would multiple regression be the appropriate test? Multiple discriminant analysis perhaps? MANOVA?
Thanks Jim, looking forward to your thoughts