-
Hello all, So today finally we have GGUF support ! Quite exciting and many thanks to @PromtEngineer ! At the moment I run the default model llama 7b with --device_type cuda, and I can see some GPU memory being used but the processing at the moment goes only to the CPU. Does anyone have experience with GGUF's + GPU ? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
This was the solution for me, I am running windows with a conda env :
Now GPU + CPU works toether. Thanks @PromtEngineer |
Beta Was this translation helpful? Give feedback.
This was the solution for me, I am running windows with a conda env :
Now GPU + CPU works toether. Thanks @PromtEngineer