Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

move to cuda 11.8 with install script #2009

Draft
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

borg323
Copy link
Member

@borg323 borg323 commented Apr 4, 2024

Turns out the following analysis is not correct - the latest version seems to work after compiling both common and fp16 only cuda code with -arch=all-major.

Made draft as #2015 has the bits to fix the NaN issues (a bit more refined) without the cuda version update. Will make this into just a cuda version update PR at a later stage.

The recent rc1 issues seem to go away if we compile the cuda fp16 code with -arch=all-major, so I added it to meson.build. It is added unconditionally since the default is still to use -arch=native and the alternative is an attempt to do the equivalent to -arch=all-major for cuda versions that don't support it.
This requires at least cuda version 11.5, but as we tested with 11.8 I used this
Updates cuda to 11.8 for the appveyor cuda builds (cudnn unchanged), and given the huge size of the dlls I added an install script based on the directml one.
While testing I also found a bug with the directml install script, probably some recent windows security change makes executables in the same directory unavailable when running it by double clicking, so I removed the lc0.exe check and it will directly install the dlls in the script's directory.

@borg323 borg323 added this to the v0.31.0-rc2 milestone Apr 10, 2024
@borg323 borg323 removed this from the v0.31.0-rc2 milestone Apr 13, 2024
@borg323 borg323 marked this pull request as draft April 13, 2024 19:37
@borg323 borg323 mentioned this pull request Apr 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant