Attempt to do better at estimating playouts remaining, while requiring less time to do so. #582

Tilps · 2018-05-11T07:18:37Z

Implementing the idea in #581. A key aspect here is that we shift the start time later, but we don't decrement the playouts by how many there are at that start time. This means we generally shift from an underestimate that converges upwards, to an overestimate which mostly converges downwards and so inaccuracy due to early calculation is less likely to cause us to prune early.

I don't actually know if this provides an Elo win yet, but it does seem to provide an improvement in ability to estimate playouts at short time scales. I'm running a self-play tournament on 1+1 to start.

I think the logic may not currently be very sound with large thread count of slow evals though, if each of your threads can only do 50nps, but you have 40+ of them (aka TCEC), 10 playouts/10ms is basically not going to limit anything. I need to think more about how to scale those constants with thread count.

…ing playouts remaining.

Tilps · 2018-05-11T08:30:00Z

Currently +100 Elo after 25 games with network 245 at 1+1. (Large error bars.)

dubslow · 2018-05-11T16:14:31Z

src/UCTSearch.cpp

-        // Until we reach 1 second or 100 playouts playout_rate
-        // is not reliable, so just return max.
+    } else if (elapsed_millis < 10 || playouts < 10) {
+        // Until we reach 10 millisecond and 10 playouts playout_rate


you changed the comment from "or" to "and" but the logic remains ||, not &&

a or b == !(!a and !b)

Yes, the existing comment was wrong.

Tilps · 2018-05-11T21:38:31Z

Final results after 150 games not very convincing. Only +15 Elo, with error bars which include no improvement.

Tilps · 2018-05-12T00:13:12Z

I suspect that this approach has improved the estimate too well, and that it might need a reduction multiplier like jjoshua2 has in his tuning PRs to compensate for the unlikelyhood that every visit goes to one of the trailing ones.

jjoshua2 · 2018-05-12T02:18:49Z

@zz tried clop tuning a combination of my patch and this, and it came out with 100 for slow mover 1.4 for time multipler and 1.0 for pruning factor. Which I was very happy with because it makes a lot of sense.

Tilps · 2018-05-12T03:42:27Z

Sounds good. I think this PR will probably be good to go if I add a multiplier to the minimum playouts equal to the number of threads. (I assume no one will ever set the threads at ridiculously large levels compared to their actual hardware capacity.)

Tilps added 2 commits May 11, 2018 15:50

Merge remote-tracking branch 'refs/remotes/glinscott/next' into next

c3a41c3

Start timing after a few playouts rather than immediately for estimat…

815615d

…ing playouts remaining.

dubslow reviewed May 11, 2018

View reviewed changes

Increase the minimum playouts to use pruning when multi-threading.

e46f88a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attempt to do better at estimating playouts remaining, while requiring less time to do so. #582

Attempt to do better at estimating playouts remaining, while requiring less time to do so. #582

Tilps commented May 11, 2018

Tilps commented May 11, 2018

dubslow May 11, 2018

jjoshua2 May 11, 2018 •

edited

Loading

Tilps May 12, 2018

Tilps commented May 11, 2018

Tilps commented May 12, 2018

jjoshua2 commented May 12, 2018

Tilps commented May 12, 2018

Attempt to do better at estimating playouts remaining, while requiring less time to do so. #582

Are you sure you want to change the base?

Attempt to do better at estimating playouts remaining, while requiring less time to do so. #582

Conversation

Tilps commented May 11, 2018

Tilps commented May 11, 2018

dubslow May 11, 2018

Choose a reason for hiding this comment

jjoshua2 May 11, 2018 • edited Loading

Choose a reason for hiding this comment

Tilps May 12, 2018

Choose a reason for hiding this comment

Tilps commented May 11, 2018

Tilps commented May 12, 2018

jjoshua2 commented May 12, 2018

Tilps commented May 12, 2018

jjoshua2 May 11, 2018 •

edited

Loading