Extend 'create_judge' to handle chat evaluations (prompt is a list) #14

rchan26 · 2024-04-17T13:37:45Z

For some models, you can pass a list in the prompt field; the items in the list are sent sequentially as separate messages in a chat interface. The response field is therefore also a list.

Currently, the create_judge command only works for a single prompt and a single response. We can implement a simple chat evaluation by presenting the whole conversation to the judge LLM.
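As a rough illustration of what "presenting the conversation" could look like, here is a minimal sketch that flattens a prompt/response pair (single strings or lists) into one block of text for a judge prompt. The function name `format_conversation`, the turn labels, and the assumption that the response list has exactly one entry per prompt are all hypothetical and not taken from the existing create_judge code.

```python
from typing import List, Union


def format_conversation(
    prompt: Union[str, List[str]],
    response: Union[str, List[str]],
) -> str:
    """Render a single exchange or a multi-turn chat as one text block.

    Hypothetical helper: assumes each prompt in the list has exactly one
    matching response at the same index.
    """
    # Normalise the single-prompt case so both cases share one code path.
    prompts = [prompt] if isinstance(prompt, str) else list(prompt)
    responses = [response] if isinstance(response, str) else list(response)

    if len(prompts) != len(responses):
        raise ValueError(
            f"Expected one response per prompt, got {len(prompts)} prompts "
            f"and {len(responses)} responses"
        )

    lines = []
    for turn, (user_msg, model_msg) in enumerate(zip(prompts, responses), start=1):
        lines.append(f"[Turn {turn}] User: {user_msg}")
        lines.append(f"[Turn {turn}] Assistant: {model_msg}")
    return "\n".join(lines)


# Example: a two-turn chat rendered for insertion into a judge template.
conversation = format_conversation(
    ["What is 2 + 2?", "And doubled?"],
    ["4", "8"],
)
```

The resulting string could then be substituted into the judge's prompt template in place of the single prompt/response pair, so the single-prompt path keeps working unchanged.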
