result of mmbench after dpo #17

luohao123 · 2024-11-13T11:26:30Z

Will the mmbench test set score drop after dpo? Does this repo supports dpo without another reward model loaded?

wangclnlp · 2024-11-13T13:35:42Z

Thanks for your attention! We never test on the MMbench. I believe this performance may be related to the vision-LLM and the preference data used for DPO training. Also, this repo supports DPO training without needing to load a reward model (please take a look at this script).

luohao123 · 2024-11-14T03:26:31Z

i think it might dropped on mmbench, which is a critical leaderboard in terms of real use applications.

wangclnlp · 2024-11-15T02:51:17Z

Thanks for your suggestion! We will also test the performance of dpo on this benchmark.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

result of mmbench after dpo #17

result of mmbench after dpo #17

luohao123 commented Nov 13, 2024

wangclnlp commented Nov 13, 2024 •

edited

Loading

luohao123 commented Nov 14, 2024

wangclnlp commented Nov 15, 2024 •

edited

Loading

result of mmbench after dpo #17

result of mmbench after dpo #17

Comments

luohao123 commented Nov 13, 2024

wangclnlp commented Nov 13, 2024 • edited Loading

luohao123 commented Nov 14, 2024

wangclnlp commented Nov 15, 2024 • edited Loading

wangclnlp commented Nov 13, 2024 •

edited

Loading

wangclnlp commented Nov 15, 2024 •

edited

Loading