-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Detect starting ratings with an initial pass through data #67
base: master
Are you sure you want to change the base?
Conversation
Add option `--detect-starting-ratings`, which does an initial pass through games and sets starting ratings in many cases. Everything is just in the one-game-at-a-time data, and it's not at all configurable, but it at least allow some hardcoded experiments.
Initial results look not ideal, since all ratings are depressed. Need to at least fine-tune the auto-detection. |
Numbers look kind of okay the full 30M data set after 23248a0 (I was tuning for the first 1-2M games, then did a long run with all 30M). There are some anomalies... but I'm not sure how to interpret the results anyway... I'm not sure this auto-detection experiment is going to tell us what we want to know. These two scenarios seem quite different:
For (1), you have the advantage that the existing ratings strongly influence new ratings, so new players have very little effect on the rating pool. For (2), you can/should recalibrate ratings and the constants in This experiment is (2), but without recalibrating (There could be other flaws in the experiment as well.) |
@anoek, let me know if you have specific ideas about the data you're hoping for here, and I can play around more. I'm hesitant to clean it up and make it ready for review until it's more clear what we're trying to learn.
|
@dexonsmith @anoek I guess the other "thing we need to learn" is "are the chosen self-select starting ranks going to be OK" Is that something this is shooting for ? |
Right; I think @anoek is hoping to learn that, but I don't think this experiment as-is will tell us that information... right now, it's better set up to answer what would happen if we back-filled starting ratings as part of v6 (but I think that requires recalibration of I'm not quite sure how to nudge it toward answering "the select-select ranks in the context of v5" question. One idea is to auto-detect only for new-players from the last 1-2 years of data. |
Add option
--detect-starting-ratings
, which does an initial pass through games and sets starting ratings in many cases.Everything is just in the one-game-at-a-time data, and it's not at all configurable, but it at least allow some hardcoded experiments.