reward-modeling

Here are 6 public repositories matching this topic...

sileod / tasksource

Dataset collection and preprocessing framework for NLP extreme multitask learning

Updated Dec 20, 2024
Python

YangLing0818 / IterComp

IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

text-to-image dpo rlhf reward-modeling

Updated Nov 1, 2024
Python

VectorInstitute / vector-inference

Efficient LLM inference on Slurm clusters using vLLM.

inference vlm text-embedding llm vllm llm-inference reward-modeling

Updated Dec 23, 2024
Python

quanshr / DMoERM

[ACL 2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

rlhf large-language-model reward-modeling

Updated Jun 6, 2024
Python

allenai / hybrid-preferences

Learning to route instances for Human vs AI Feedback

language-model dpo rlhf reward-modeling

Updated Nov 25, 2024
Python

MiuLab / DogeRM

The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"

large-language-models rlhf model-merging reward-modeling

Updated Oct 8, 2024
Python
