act and learn input ranges #7

salamanders · 2015-10-21T17:32:46Z

Should all state inputs to act be 0<=stateX<1?
Should all reward inputs be 0<=reward<1?
Is there any way to get out "nope, that wasn't a good reply. I want a second opinion!" (second place answer, etc)

gb96 · 2019-03-17T09:43:58Z

I have been having some success with categorical state inputs. Since my categories are represented as strings I ended up implementing a string hashcode for state inputs since I quickly learned that state inputs needed to be numeric. My string hashcodes are integers in the thousands and they seem to be working with the library, so I think your suggested [0, 1) range for inputs is overly constrained.
I have been providing integer rewards between -100 and 100, and these seem to be working too, however I have been encountering NaN in the net weights after calls to learn(), so maybe I should try a smaller range?

Provide feedback