readme
lostfictions committed Aug 14, 2023
1 parent a866bb1 commit 2043b85
Showing 1 changed file with 13 additions and 0 deletions.
13 changes: 13 additions & 0 deletions README.md
@@ -0,0 +1,13 @@
# conceptnet-trim

trim [conceptnet](https://conceptnet.io/)'s ~34,000,000 assertions (about 10 GB of
tab-separated values) into a tidy ~3,400,000 english-language assertions (in json
format).

1. clone this repo
2. [download the latest version of
conceptnet](https://github.com/commonsense/conceptnet5/wiki/Downloads) (5.7.0
at the time of writing)
3. extract it to `/data/assertions.csv`
4. run `cargo run -r` to run in release mode. the trimmed assertions will be
written to `/data/trimmed.json`
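
as a rough illustration of the trimming step, here is a minimal sketch of an english-only filter. it is not this repo's actual code. it assumes the layout of the conceptnet assertions dump (five tab-separated columns: assertion uri, relation, start concept, end concept, json metadata) and assumes that "english-language" means both concepts start with `/c/en/`. the names `Edge`, `is_english`, `parse_english_edge` and `trim` are hypothetical.

```rust
/// One edge from a ConceptNet assertions dump, borrowing from the input line.
/// (Hypothetical type for illustration; not taken from this repo.)
#[derive(Debug, PartialEq)]
struct Edge<'a> {
    relation: &'a str,
    start: &'a str,
    end: &'a str,
}

/// Returns true when a concept URI such as `/c/en/hot` is in English.
fn is_english(concept: &str) -> bool {
    concept.starts_with("/c/en/")
}

/// Parses one tab-separated line and keeps it only if both concepts are English.
/// Assumed columns: assertion URI, relation, start, end, JSON metadata.
fn parse_english_edge(line: &str) -> Option<Edge<'_>> {
    let mut cols = line.split('\t');
    let _uri = cols.next()?; // assertion URI, unused in this sketch
    let relation = cols.next()?;
    let start = cols.next()?;
    let end = cols.next()?;
    if is_english(start) && is_english(end) {
        Some(Edge { relation, start, end })
    } else {
        None
    }
}

/// Filters a whole dump held in memory, skipping malformed and non-English lines.
fn trim(input: &str) -> Vec<Edge<'_>> {
    input.lines().filter_map(parse_english_edge).collect()
}
```

calling `trim` on the file's contents yields only the english edges. a real run over ~10 GB would more likely stream the file line by line than hold it all in memory, and serializing the result to json is left out of this sketch.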
