A question about adding a custom dict. #675
Replies: 1 comment 3 replies
-
|
Beta Was this translation helpful? Give feedback.
3 replies
Answer selected by
bact
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
hello,
I want to receive a compound word as one token, and tag the token I want with the part-of-speech.
For example, when the film title is "ปกป้องโลก", "ปกป้องโลก" is taken as one token and tagged as a "noun".
I understand that I can use pythainlp's custom_dict to solve this.
But I have two questions.
=============[question]===============
1. I want to use "attacut" among the tokenizer engines,
but if custom_dict and engine=attacut are set together, the custom dictionary is not applied.
How do I apply a custom dict while using attacut?
2. If there is a movie title called "ทะเลาะกัน", It's a verb, but I want it to be tagged as a noun.
Can you add not only words but also POS to the custom dict so that it is reflected in the result?
Beta Was this translation helpful? Give feedback.
All reactions