Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixing a few bugs + use None when missing predictions #3

Merged
merged 2 commits into from
Jun 11, 2024

Conversation

mrm1001
Copy link

@mrm1001 mrm1001 commented Jun 11, 2024

  • If the LLM cannot predict and returns the string "None", we should treat this as None prediction, and ideally this should not affect the scores downstream.
  • Random sampling fixed
  • The csv for the scores missed the names of the metrics
  • Added the contexts to the csv file

@mrm1001 mrm1001 requested a review from davidsbatista June 11, 2024 14:47
@@ -153,15 +160,20 @@ def create_args():


def main():
t1 = datetime.datetime.now()
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

there's a decorator for that @timeit

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please use the decorator instead

@@ -101,7 +104,10 @@ def run_evaluation(sample_questions, sample_answers, retrieved_contexts, predict
"sas": eval_pipeline_results['sas']
}

inputs = {'questions': sample_questions, 'true_answers': sample_answers, 'predicted_answers': predicted_answers}
inputs = {'questions': sample_questions,
'contexts': retrieved_contexts,
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@davidsbatista this is where the contexts are added

@davidsbatista davidsbatista merged commit 2a93649 into main Jun 11, 2024
@davidsbatista davidsbatista deleted the bug-fixes branch June 20, 2024 16:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants