New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Create evaluation datasets for various GoLLM Tasks #27

Closed

j2whiting opened this issue Apr 3, 2024 · 1 comment

Assignees

Labels

Contributor

j2whiting commented Apr 3, 2024

Problem

We do not have large scale datasets that we can use to evaluate GoLLM tasks
We do not have a method in place for sourcing or creating datasets for new GoLLM tasks

Approach

Create distributions that we can sample from to create synthetic datasets. For example, we can likely create an arbitrary AMR, stratify it and then create synthetic interaction matrices which have cells that map to the newly created AMR.
A dataset of real data will be better where applicable, but we will have significant hurdles to overcome in terms of annotation costs, licensing, and time spent.

Tasks

TBD

j2whiting added the enhancement label

j2whiting self-assigned this

Contributor Author

j2whiting commented Apr 11, 2024

duplicate

j2whiting closed this as not planned

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment