A quick (maybe naive) question about feature importance. #783
-
Hello, thanks for sharing this awesome work! I have a question about how to "read" the results of permutation feature importance for a classification task. According to the documentation: "The permutation feature or step importance is defined as the decrease in a model score when a single feature or step value is randomly shuffled. So if you are using accuracy (higher is better), the most important features or steps will be those with a lower value on the chart (as randomly shuffling them reduces performance)." I'm using accuracy as the metric. Therefore, if a feature's value on the chart is smaller, that feature is more important to the model. For instance, var_4, var_6, var_7, var_23, var_28, and var_35 might be very relevant to the model because of their small values. Is that right? However, what happens when we have negative/red values? I have attached a figure showing where this happens. Thank you so much for your help and awesome work!
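For reference, the mechanics behind the chart can be reproduced in a few lines. This is a minimal, framework-agnostic sketch, not the library's actual implementation; it assumes a 2-D feature matrix, and `model`, `X_valid`, and `y_valid` are placeholder names. scikit-learn also ships a ready-made version as `sklearn.inspection.permutation_importance`.

```python
import numpy as np
from sklearn.metrics import accuracy_score

def perm_importance(model, X_valid, y_valid, n_repeats=5, seed=0):
    """Drop in accuracy after shuffling each feature column, averaged over repeats."""
    rng = np.random.default_rng(seed)
    baseline = accuracy_score(y_valid, model.predict(X_valid))
    importances = []
    for col in range(X_valid.shape[1]):
        scores = []
        for _ in range(n_repeats):
            X_perm = X_valid.copy()
            rng.shuffle(X_perm[:, col])  # break the link between this feature and y
            scores.append(accuracy_score(y_valid, model.predict(X_perm)))
        # positive = accuracy fell when the feature was shuffled (important feature);
        # negative = accuracy rose after shuffling (a "red bar" in the chart)
        importances.append(baseline - np.mean(scores))
    return np.array(importances)
```

A negative value simply means the model scored slightly *better* once that feature's values were scrambled, which is why such bars show up below zero.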
-
I might be wrong, but the way I see it, large green bars represent important features (because shuffling them caused a large accuracy decrease), while red bars represent misleading features (shuffling them actually made the accuracy higher).
-
Hi @jorgpg5, Victor's answer is correct. Features in red or close to 0 are good candidates to be removed. You can drop them and retrain the model.
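As a hypothetical continuation of the sketch above, dropping the red/near-zero features and retraining could look like this; `importances`, `X_train`, and `X_valid` are the assumed names from that sketch, and the threshold is something you would tune yourself.

```python
import numpy as np

threshold = 0.0                       # a small positive value is stricter
keep = np.where(importances > threshold)[0]
X_train_red = X_train[:, keep]        # reduced training features
X_valid_red = X_valid[:, keep]        # reduced validation features
# refit the model on X_train_red, then check that validation accuracy
# holds or improves compared to the full feature set
```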