Dataset with floating point values #12

itlchriss · 2024-08-18T10:29:46Z

Hi,

The work is great and I want to explore the possibility of using it on some complicated dataset. I have tried to use it on the Wisconsin breast cancer dataset. However, as the dataset contains quite a lot of different floating point values, there are many feature names appended with these values during the get_dummies. I have tried to remove the checking (the one in explain.py:90). There are no rules found. Are there any limitations in using this work on datasets with floating point values?

groshanlal · 2024-08-23T18:25:37Z

TE2Rules can handle both continuous and categorical features. Regarding the Wisconsin breast cancer dataset, most of the features are continuous. Please use get_dummies only to transform categorical features into one-hot encoded features. Do not use it on all features, since it would make the continuous features unusable.

If you are using get_dummies, make sure that the transformed feature names do not have hyphens ("-"). TE2Rules expects feature to contain only alphanumeric characters and underscores are allowed in feature names. Replace hyphens ("-") with underscores("_") in feature names.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset with floating point values #12

Dataset with floating point values #12

itlchriss commented Aug 18, 2024

groshanlal commented Aug 23, 2024 •

edited

Loading

Dataset with floating point values #12

Dataset with floating point values #12

Comments

itlchriss commented Aug 18, 2024

groshanlal commented Aug 23, 2024 • edited Loading

groshanlal commented Aug 23, 2024 •

edited

Loading