
All MAD predictions should be positive. #50

Closed
JulianHidalgo opened this issue Mar 4, 2024 · 5 comments · Fixed by #53
Assignees: M-Mouhcine
Labels: bug (Something isn't working)

Comments

JulianHidalgo commented Mar 4, 2024

Hi!

Thank you for creating Puncc. I'm trying to use LocallyAdaptiveCP as described here: https://deel-ai.github.io/puncc/regression.html#deel.puncc.regression.LocallyAdaptiveCP

import xgboost as xgb

from deel.puncc.api.prediction import MeanVarPredictor
from deel.puncc.regression import LocallyAdaptiveCP

mu_model = xgb.XGBRegressor()
sigma_model = xgb.XGBRegressor()
# Wrap models in a mean/variance predictor
mean_var_predictor = MeanVarPredictor(models=[mu_model, sigma_model])
cp = LocallyAdaptiveCP(mean_var_predictor)
cp.fit(X_fit=X_train, y_fit=y_train, X_calib=X_test, y_calib=y_test)

But I get an error: "All MAD predictions should be positive." Any idea what I'm missing?
I think the error comes from this check in the nonconformity score:

mean_absolute_deviation = absolute_difference(y_pred, y_true)
if np.any(sigma_pred < 0):
    raise RuntimeError("All MAD predictions should be positive.")
return mean_absolute_deviation / (sigma_pred + EPSILON)

But I don't know how to avoid it. Any pointers would be greatly appreciated!

M-Mouhcine (Collaborator) commented Mar 5, 2024

Hi @JulianHidalgo !

Thanks for opening this issue. I could indeed reproduce the error when using xgboost models with LocallyAdaptiveCP.

Actually, sigma_model is trained to predict the absolute residual $|y-\mu(X)|$, where $\mu$ is the trained mu_model and $X$ and $y$ are respectively a feature vector and its associated target. The output of sigma_model should be positive; otherwise it breaks the conformal prediction algorithm. In your case, however, some of these predictions are negative, which is not allowed.
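To see why this can happen even though the training targets are nonnegative: a gradient-boosted regressor's output is an unconstrained sum of tree leaf values, so nothing forces it to stay above zero. Here is a minimal self-contained sketch (synthetic data, not your setup):

import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y_abs = np.abs(rng.normal(size=500))  # nonnegative training target

model = xgb.XGBRegressor(n_estimators=100)
model.fit(X, y_abs)

# The fitted ensemble is free to overshoot below zero on some inputs.
print("min prediction:", model.predict(X).min())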

I've noticed that this behavior happens when the number of estimators n_estimators of the xgboost model is high (by default, it is 100). I've tried lower values, for example 5 or 10, and it works fine:

mu_model = xgb.XGBRegressor()
sigma_model = xgb.XGBRegressor(n_estimators=5)
# Wrap models in a mean/variance predictor
mean_var_predictor = MeanVarPredictor(
    models=[mu_model, sigma_model]
)
cp = LocallyAdaptiveCP(mean_var_predictor)
cp.fit(X_fit=X_train, y_fit=y_train, X_calib=X_test, y_calib=y_test)

Can you see if that works for you?

PS: we will look into a suitable solution to "correct" models that predict negative values. We could simply take the absolute value of sigma_model predictions, but we will explore more options and pick the least problematic.
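In the meantime, a possible stopgap on your side, in the spirit of the absolute-value idea, is to wrap the dispersion model so its predictions are clamped before puncc sees them. This wrapper is only a hypothetical sketch, not part of puncc, and it assumes MeanVarPredictor only needs fit/predict on the wrapped object:

import numpy as np
import xgboost as xgb

class AbsolutePredictor:
    # Hypothetical wrapper: forwards fit, returns |predictions|.
    def __init__(self, model):
        self.model = model

    def fit(self, X, y, **kwargs):
        self.model.fit(X, y, **kwargs)
        return self

    def predict(self, X, **kwargs):
        return np.abs(self.model.predict(X, **kwargs))

sigma_model = AbsolutePredictor(xgb.XGBRegressor())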

M-Mouhcine self-assigned this Mar 5, 2024
M-Mouhcine added the bug (Something isn't working) label Mar 5, 2024
JulianHidalgo (Author) commented

Thanks for checking this out! Reducing the number of estimators helps, but it also decreases the accuracy of the model, and it's not reliable: the same number of estimators works fine on one dataset and fails on another. I noticed LightGBM sometimes generates negative values too, but less often than XGBoost. At least now I know it's not something in the way I'm using the library or my datasets in particular. I will be watching the issue, thank you again!

jdalch (Collaborator) commented Mar 7, 2024

Hello @JulianHidalgo, thanks again for using PUNCC and for raising this issue! After discussing it with the team, we have decided to take the following steps to fix it:

  1. Increase the value of the threshold EPSILON in the scaled_ad nonconformity score.
  2. Add the threshold EPSILON to the scaled_interval prediction set.
  3. Modify the scaled_ad nonconformity score: compute residuals only for calibration points such that sigma + EPSILON > 0, and raise a warning that some calibration data is not used (see the sketch at the end of this comment).
  4. Modify the scaled_interval prediction set: return an infinite-sized prediction set if sigma + EPSILON <= 0, and raise a warning.

We hope this fixes the issue for you. We expect negative values of sigma to be rare, so the procedure should not have a big impact on the size of the prediction sets. Of course, the probabilistic guarantees given by conformal prediction will remain valid after this modification.
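Roughly, the guarded score and interval would look like the sketch below; the function signatures, the quantile argument, and the EPSILON value are illustrative placeholders rather than the final implementation:

import warnings
import numpy as np

EPSILON = 1e-12  # placeholder value; step 1 increases this threshold

def scaled_ad(y_pred, sigma_pred, y_true):
    # Step 3: keep only calibration points with sigma + EPSILON > 0
    # and warn that the rest are discarded.
    mean_absolute_deviation = np.abs(y_true - y_pred)
    valid = sigma_pred + EPSILON > 0
    if not np.all(valid):
        warnings.warn(
            f"{np.count_nonzero(~valid)} calibration points with "
            "nonpositive sigma predictions were discarded."
        )
    return mean_absolute_deviation[valid] / (sigma_pred[valid] + EPSILON)

def scaled_interval(y_pred, sigma_pred, quantile):
    # Steps 2 and 4: half-widths use sigma + EPSILON; where it is still
    # nonpositive, fall back to an infinite prediction set and warn.
    half_width = quantile * (sigma_pred + EPSILON)
    invalid = sigma_pred + EPSILON <= 0
    if np.any(invalid):
        warnings.warn("Returning infinite prediction sets where sigma + EPSILON <= 0.")
        half_width = np.where(invalid, np.inf, half_width)
    return y_pred - half_width, y_pred + half_width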

JulianHidalgo (Author) commented

Hey @jdalch!
Thank you so much to you and the team for designing a solution 😊.

jdalch linked a pull request Mar 14, 2024 that will close this issue
M-Mouhcine (Collaborator) commented

Hey @JulianHidalgo,

@jdalch has implemented his solution to address the problem. Could you please test it and let us know if it works?
