-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
why use darknet train command still cause 'CUDA status Error' #7
Comments
What GPU are you using? The images in this repository were built using multi stage build to reduce image size. So the image doesn’t contain build environment, only contain runtime. So I think your second error is correct behavior. In your third image, CUDA outputs message about compatibility. cc80 tag expects your GPU supports compute capability 8.0. Does your GPU satisfy it? You can check GPU’s compute capability in this page |
Thanks for your reply. I'm using RTX 2080Ti , nvidia-smi display cuda version =11 , but no nvcc . If you mean these images only for runtime but no for training? |
RTX 2080 Ti ’s compute capability is 7.5 which is described GeForce and TITAN Products section in this page. So could you try gpu-cv-cc75? |
I don't know what the problem is from your image. |
I might be something stupid, but, who knows, it might be a reason. I actually tried to use the See below: #0 41.72 /usr/bin/ld: warning: libcuda.so.1, needed by /usr/local/lib/libdarknet.so, not found (try using -rpath or -rpath-link)
#0 41.72 /usr/bin/ld: /usr/local/lib/libdarknet.so: undefined reference to `cuCtxGetCurrent'
#0 41.72 collect2: error: ld returned 1 exit status @daisukekobayashi : I was also on the gpu cc 75. I used your repo as a base, but modified the build & runtime to copy the libdarknet.so and darknet.h. And I added a runtime where the go could also build/run (just for the sake of testing basic code). Maybe when we install the cuda libraries, the libs are not added in the user/local folders. |
@Nordes I guess you want to build some program using libdarknet and cuda. So I think it's easy to use nvidia-cuda-devel image instead of my image. If you reuse my dockerfile, you should try removing stage two from dockerfile. You can use libdarknet.so based on cuda-devel based image. |
I adapt tag: gpu-cv-cc80 as yolo train image,
I fisrt trid use train command :
but it seems just display model summary, and not start training.
Secondly, I tried to compile darknet file by make command , but it threw out error:
Finally , I also try use detect command , no exception , it threw a error:
So I think it is due to missing cuda file, but I think doesn't it should be a completely darknet environment ? Could you give some idea to solve this problem
The text was updated successfully, but these errors were encountered: