Mastering Regression Evaluation Metrics: A Comprehensive Guide to Measuring Model Performance

MD TAHSEEN EQUBAL
5 min readAug 24, 2024

--

This blog is all about making regression evaluation metrics easy to understand. We’ll break down MSE, MAD, RMSE, R² score, and adjusted R² score, helping you learn how to accurately judge your model’s performance and make meaningful improvements.

Regression Metrics

Overview

This article is structured to provide a comprehensive understanding of Regression Metric, covering the following key aspects:

  1. Metric Name
  2. Definition
  3. Formula
  4. Example
  5. Advantage
  6. Disadvantage
  7. Python Implementation

Regression Metrics

Types of Regression Metrics

1. MAD Mean Absolute Deviation (MAD)

Definition

The Mean Absolute Deviation (MAD) measures the average magnitude of errors in a set of predictions, without considering their direction. It’s the average over the test sample of the absolute differences between prediction and actual observation.

Formula

MAD

Example

Example
Solution

Advantages

  • Easy to understand and calculate.
  • Provides a clear interpretation in the same units as the data.
  • Units will not changed.

Disadvantages

  • Does not consider the magnitude of errors; only provides an average absolute deviation.
  • It is not good evaluation metrics for optimization algorithms.

Python Implementation

import numpy as np

def mean_absolute_deviation(y_true, y_pred):
return np.mean(np.abs(y_true - y_pred))

# Example usage
y_true = np.array([3.0, 4.0, 5.0])
y_pred = np.array([2.5, 4.1, 5.2])
mad = mean_absolute_deviation(y_true, y_pred)
print("MAD:", mad)
Output
MAD: 0.2666666666666666

2. Mean Squared Error (MSE)

Definition

Mean Squared Error (MSE) is the average of the squares of the errors, where the error is the difference between the actual value and the predicted value.

It penalizes larger errors more than smaller ones.

Formula

MSE

Example

Advantages

  • Penalizes larger errors more significantly.
  • Useful for evaluating models with larger errors.

Disadvantages

  • Sensitive to outliers; large errors have a disproportionate effect.
  • Unit get squared

Python Implementation

from sklearn.metrics import mean_squared_error

y_true = [3.0, 4.0, 5.0]
y_pred = [2.5, 4.1, 5.2]
mse = mean_squared_error(y_true, y_pred)
print("MSE:", mse)
Output
MSE: 0.10

3.Root Mean Squared Error (RMSE)

Definition

Root Mean Squared Error (RMSE) is the square root of the mean of the squared errors.

It provides an estimate of the standard deviation of the prediction errors.

Formula

RMSE

Example

RMSE

Advantages

  • Provides error magnitude in the same units as the original data.
  • Useful for evaluating the precision of predictions.
  • Differentiable for any values.

Disadvantages

  • Sensitive to outliers and large errors.

Python Implementation

from sklearn.metrics import mean_squared_error
import numpy as np

mse = mean_squared_error(y_true, y_pred)
rmse = np.sqrt(mse)
print("RMSE:", rmse)
Output
RMSE: 0.316

4.R² Score

Definition

The R² score, or coefficient of determination, indicates how well the model’s predictions approximate the actual values.

It measures the proportion of the variance in the dependent variable that is predictable from the independent variables.

Formula

R2 Score

Example

Advantages

  • Provides an overall measure of how well the model explains the variability of the data.
  • Ranges from 0 to 1, where 1 indicates a perfect fit.

Disadvantages

  • May be misleading if used with models that have many predictors or if the data is not linear.

Python Implementation

from sklearn.metrics import r2_score

y_true = [3.0, 4.0, 5.0]
y_pred = [2.5, 4.1, 5.2]
r2 = r2_score(y_true, y_pred)
print("R2 Score:", r2)
Output
R2 Score: 0.60

5. Adjusted R² Score

Definition

The Adjusted R² Score adjusts the R² score for the number of predictors in the model.

It provides a more accurate measure of model performance when multiple predictors are used.

Formula

Adj R2 Score

Example

Assuming:

Advantages

  • Provides a more accurate measure of goodness-of-fit for models with multiple predictors.
  • Adjusts for the complexity of the model.

Disadvantages

  • Can be complex to calculate manually for large datasets.

Python Implementation

from sklearn.metrics import r2_score
import numpy as np

def adjusted_r2_score(y_true, y_pred, n, p):
"""
Calculate the Adjusted R² Score.

Parameters:
y_true (list or array): Actual values
y_pred (list or array): Predicted values
n (int): Number of observations
p (int): Number of predictors

Returns:
float: Adjusted R² Score
"""
r2 = r2_score(y_true, y_pred)
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - p - 1)
return adj_r2

# Example data
y_true = [3.0, 4.0, 5.0]
y_pred = [2.5, 4.1, 5.2]

# Number of observations and predictors
n = len(y_true)
p = 1 # Assuming 1 predictor

# Calculate Adjusted R² Score
adj_r2 = adjusted_r2_score(y_true, y_pred, n, p)
print("Adjusted R² Score:", adj_r2)
Output
Adjusted R² Score: 0.6000000000000001

Desirable Value of all Regression Evaluation Metric for Better Model

— — — — — — — — — -Thank You! — — — — — — — — —

Thank you for taking the time to read my article. I hope you found it useful and informative. Your support means a lot, and I appreciate you joining me on this journey of exploration and learning. If you have any questions or feedback, feel free to reach out!

— — — — — — — — — Contact — — — — — — — — — — —

Linkdein -https://www.linkedin.com/in/md-tahseen-equbal-/
Github -https://github.com/Md-Tahseen-Equbal
Kaggle- https://www.kaggle.com/mdtahseenequbal

--

--

MD TAHSEEN EQUBAL
MD TAHSEEN EQUBAL

Written by MD TAHSEEN EQUBAL

I write to help make sense of Data Science

No responses yet