What's Changed
- `normalize_grads` option added to normalize incoming grads to unit norm; can help with grads that have a poor distribution
- Get rid of trust region in favor of `normalize_grads`
- Deterministic preconditioner update
- Damping based on machine precision added to handle singular or near-singular g g^T properly
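
As a rough illustration of two of the items above, the sketch below shows what unit-norm gradient normalization and machine-precision damping of g g^T could look like in NumPy. It is not the code from this release: the helper names `normalize_to_unit_norm` and `damped_outer_product` are hypothetical, and the damping form (machine epsilon times the trace, added to the diagonal) is an assumption chosen for clarity. Inputs are assumed to be floating-point arrays.

```python
import numpy as np


def normalize_to_unit_norm(grad):
    """Scale a gradient array to unit L2 norm (hypothetical helper)."""
    eps = np.finfo(grad.dtype).eps
    norm = np.linalg.norm(grad)
    # Floor the norm at machine epsilon so an all-zero gradient stays zero
    # instead of producing NaNs.
    return grad / max(norm, eps)


def damped_outer_product(grad):
    """Return g g^T plus a tiny multiple of the identity (assumed damping form)."""
    g = grad.reshape(-1)
    eps = np.finfo(g.dtype).eps
    outer = np.outer(g, g)  # rank-1, hence singular whenever g has more than one entry
    # Scale the damping by the trace so it is relative to the magnitude of g g^T.
    damping = eps * np.trace(outer)
    return outer + damping * np.eye(g.size, dtype=g.dtype)


g = np.array([3.0, 4.0])
unit_g = normalize_to_unit_norm(g)
damped = damped_outer_product(unit_g)
```

Tying the damping to `np.finfo(dtype).eps` rather than a fixed constant keeps it proportionate across precisions: it is larger for `float32` than for `float64`, and in both cases it lifts the zero eigenvalues of g g^T just above zero without noticeably changing the dominant one.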