Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create evaluation datasets for various GoLLM Tasks #27

Closed
j2whiting opened this issue Apr 3, 2024 · 1 comment
Closed

Create evaluation datasets for various GoLLM Tasks #27

j2whiting opened this issue Apr 3, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@j2whiting
Copy link
Contributor

Problem

  • We do not have large scale datasets that we can use to evaluate GoLLM tasks
  • We do not have a method in place for sourcing or creating datasets for new GoLLM tasks

Approach

  • Create distributions that we can sample from to create synthetic datasets. For example, we can likely create an arbitrary AMR, stratify it and then create synthetic interaction matrices which have cells that map to the newly created AMR.
  • A dataset of real data will be better where applicable, but we will have significant hurdles to overcome in terms of annotation costs, licensing, and time spent.

Tasks

TBD

@j2whiting j2whiting added the enhancement New feature or request label Apr 3, 2024
@j2whiting j2whiting self-assigned this Apr 3, 2024
@j2whiting
Copy link
Contributor Author

duplicate

@j2whiting j2whiting closed this as not planned Won't fix, can't repro, duplicate, stale Apr 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant