What's the largest PPO model size and context length that have been trained successfully with this library? Can you also share some performance metrics (e.g., GPU count and training time) if possible?

Originally posted by @panyi121 in #70 (comment)

Replies: 1 comment

Response from @HeyyyyyyG: Hi, we ran PPO on a Llama-70B model with a 4k context length. In terms of GPU count, we used 32x8 GPUs for the actor and 8x8 GPUs for the critic. Try our NV-Llama2-70B-RLHF model on NVIDIA AI Foundation for free.
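As a rough illustration of the resource split described in that reply, here is a minimal Python sketch. The `PPOClusterConfig` class and its field names are hypothetical, not the library's actual configuration API; only the node counts, GPUs per node, and context length are taken from the reply above.

```python
from dataclasses import dataclass


@dataclass
class PPOClusterConfig:
    """Hypothetical resource plan mirroring the reply above:
    actor on 32 nodes x 8 GPUs, critic on 8 nodes x 8 GPUs,
    with a 4k (4096-token) context length."""
    actor_nodes: int = 32
    critic_nodes: int = 8
    gpus_per_node: int = 8
    max_seq_len: int = 4096

    @property
    def actor_gpus(self) -> int:
        return self.actor_nodes * self.gpus_per_node

    @property
    def critic_gpus(self) -> int:
        return self.critic_nodes * self.gpus_per_node

    @property
    def total_gpus(self) -> int:
        return self.actor_gpus + self.critic_gpus


if __name__ == "__main__":
    cfg = PPOClusterConfig()
    print(f"actor:  {cfg.actor_gpus} GPUs")   # 32 x 8 = 256
    print(f"critic: {cfg.critic_gpus} GPUs")  # 8 x 8 = 64
    print(f"total:  {cfg.total_gpus} GPUs")   # 320
```

Note the roughly 4:1 GPU budget in favor of the actor: in PPO the actor both generates rollouts and trains, so it typically needs far more compute than the critic.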