You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have been having some success with categorical state inputs. Since my categories are represented as strings I ended up implementing a string hashcode for state inputs since I quickly learned that state inputs needed to be numeric. My string hashcodes are integers in the thousands and they seem to be working with the library, so I think your suggested [0, 1) range for inputs is overly constrained.
I have been providing integer rewards between -100 and 100, and these seem to be working too, however I have been encountering NaN in the net weights after calls to learn(), so maybe I should try a smaller range?
The text was updated successfully, but these errors were encountered: