Frequently Asked Questions (FAQ)
Nikos Giarelis edited this page Aug 1, 2023 · 4 revisions
LMRank currently supports 14 languages in total, as listed in the table below:
Language | Code |
---|---|
English 🇬🇧 | en |
Greek 🇬🇷 | el |
Danish 🇩🇰 | da |
Catalan | ca |
Dutch 🇳🇱 | nl |
Finnish 🇫🇮 | fi |
French 🇫🇷 | fr |
German 🇩🇪 | de |
Italian 🇮🇹 | it |
Japanese 🇯🇵 | ja |
Norwegian 🇳🇴 (Bokmål) | nb |
Portuguese 🇵🇹 | pt |
Spanish 🇪🇸 | es |
Swedish 🇸🇪 | sv |
LMRank forms candidate keyphrases through dependency parsing, relying on spaCy's noun chunks.
Once spaCy ships a small model (sm) with noun-chunk support for a language, support for that language can easily be added.
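As a rough illustration of the idea above, the following sketch loads a spaCy small model for a given language code and lists the noun chunks found in a text. It is not LMRank's actual implementation. The names `SPACY_SMALL_MODELS`, `small_model_name`, and `candidate_noun_chunks` are hypothetical, and the model names are assumed from spaCy's usual naming convention, so check them against your installed spaCy version.

```python
from typing import List

# Assumed mapping from the language codes in the table above to spaCy
# small-model package names (spaCy's usual naming convention).
SPACY_SMALL_MODELS = {
    "en": "en_core_web_sm",
    "el": "el_core_news_sm",
    "da": "da_core_news_sm",
    "ca": "ca_core_news_sm",
    "nl": "nl_core_news_sm",
    "fi": "fi_core_news_sm",
    "fr": "fr_core_news_sm",
    "de": "de_core_news_sm",
    "it": "it_core_news_sm",
    "ja": "ja_core_news_sm",
    "nb": "nb_core_news_sm",
    "pt": "pt_core_news_sm",
    "es": "es_core_news_sm",
    "sv": "sv_core_news_sm",
}


def small_model_name(lang_code: str) -> str:
    """Return the assumed spaCy small-model name for a language code."""
    try:
        return SPACY_SMALL_MODELS[lang_code]
    except KeyError:
        raise ValueError(f"Unsupported language code: {lang_code!r}") from None


def candidate_noun_chunks(text: str, lang_code: str = "en") -> List[str]:
    """List the noun chunks spaCy finds in `text` (hypothetical helper)."""
    # Imported lazily so the mapping above can be used without spaCy installed.
    import spacy

    # The model must be downloaded first, e.g.:
    #   python -m spacy download en_core_web_sm
    nlp = spacy.load(small_model_name(lang_code))
    doc = nlp(text)
    # doc.noun_chunks is only available when the language defines
    # noun-chunk rules and the pipeline includes a dependency parser.
    return [chunk.text for chunk in doc.noun_chunks]
```

With the English model installed, calling `candidate_noun_chunks("LMRank extracts keyphrases from long documents.")` returns the noun phrases that a keyphrase extractor could then rank.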
You can see some examples on Google Colab or GitHub.
Yes, see the relevant section in the examples linked above.
The datasets are available at this link.
Set `base_path` in `main.py` to the dataset directory and run `main()`.
Set `output_path` in `main.py` for the `lmrank_timings.csv` file and run `benchmark()`.