Inference problem #9

Open
maciekpoplawski opened this issue Nov 4, 2024 · 7 comments

Comments

@maciekpoplawski

Hi! Good job on the model.

But I'm having trouble testing it.
Setup: RTX 4090 + 64 GB RAM (while loading the models I'm touching 63.9 GB :) )

Tested on Windows - can't launch with the default code because of missing support for FLASH_ATTENTION.
Exchanged it for EFFICIENT_ATTENTION, and I hear the initial prompt with "bob how's it going bob" and then silence.
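(For reference, a minimal sketch of that kind of backend swap, assuming the model routes attention through torch's scaled_dot_product_attention and you're on a torch version that has torch.nn.attention.sdpa_kernel; names below are illustrative.)

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import sdpa_kernel, SDPBackend

# Dummy query/key/value tensors just to exercise the kernel selection.
q = k = v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.bfloat16)

# FlashAttention kernels are often unavailable in Windows torch builds,
# so restrict SDPA to the memory-efficient backend instead.
with sdpa_kernel(SDPBackend.EFFICIENT_ATTENTION):
    out = F.scaled_dot_product_attention(q, k, v)
```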

Unfortunately it's the same on Linux (a setup with no errors). Only the initial prompt and nothing more. Silence :(

Torch installed with this command:
pip3 install torch torchaudio --index-url https://download.pytorch.org/whl/cu118

Any tips on how to go further with this?

@maciekpoplawski
Author

In case somebody ends up here with the same problem - for inference_client.py I was missing this to launch it on Linux (Ubuntu):
sudo apt-get install libportaudio2
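A quick way to verify the fix, assuming inference_client.py talks to the audio devices through the sounddevice package (which wraps PortAudio):

```python
# If libportaudio2 is installed correctly, this should list devices
# instead of raising a PortAudio/library-not-found error.
import sounddevice as sd

print(sd.query_devices())   # all available input/output devices
print(sd.default.device)    # default (input, output) device indices
```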

@wpq3142

wpq3142 commented Nov 6, 2024

same problem!!

@calculating
Contributor

@wpq3142 did the libportaudio2 fix work for you? I've added it to the readme

@maciekpoplawski
Author

Sorry, I mixed two things in one issue.
The original issue from the first post is not resolved.
The libportaudio2 fix was needed on Ubuntu to be able to select audio devices. And it WORKS.

@KadirErturk4r

How much VRAM does it require?
I have a 16GB 3060 and got CUDA out of memory.

@calculating
Contributor

> How much VRAM does it require? I have a 16GB 3060 and got CUDA out of memory.

With our current bfloat16 implementation, 24GB.
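(Rough sizing only, not an official number: bfloat16 stores 2 bytes per parameter, so the weights alone of an N-parameter model take about 2 × N bytes, before activations and the KV cache.)

```python
# Back-of-the-envelope VRAM estimate for bfloat16 weights (illustrative numbers only).
params = 8e9                       # hypothetical parameter count
weight_gib = params * 2 / 1024**3  # bfloat16 = 2 bytes per parameter
print(f"~{weight_gib:.1f} GiB for weights alone, before activations and the KV cache")
```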

@robonxt-ai

robonxt-ai commented Nov 18, 2024

> With our current bfloat16 implementation, 24GB.

Will there be a quantized or optimized build in the future?
