Does Rucene support Chinese character indexing and searching #7

Open
qiqi-arch opened this issue Mar 17, 2020 · 1 comment

Comments

@qiqi-arch

This might be a silly question, but does Rucene support indexing and searching Chinese text?

I don't see any tokenizer under https://github.com/zhihu/rucene/tree/master/src/core/analysis

@sunxiaoguang
Contributor

Hi Qiqi, sorry for the delay. The short answer is that Rucene does not include a tokenizer at this time. Instead, we rely on our data pipeline to tokenize the input before it reaches Rucene.
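
For readers wondering what tokenizing in a pipeline ahead of indexing might look like, here is a minimal sketch in Rust. It is an assumption for illustration only, not the pipeline described above and not a Rucene API: `pre_tokenize`, `is_cjk`, and the helper functions are hypothetical names, and overlapping character bigrams (the approach used by Lucene's CJK analyzer) stand in for whatever segmenter a real pipeline would use.

```rust
/// Returns true for characters in the main CJK Unified Ideographs block.
/// Assumption: extension blocks are ignored to keep the sketch short.
fn is_cjk(c: char) -> bool {
    ('\u{4E00}'..='\u{9FFF}').contains(&c)
}

/// Pushes the pending non-CJK word (if any) onto the token list.
fn flush_word(word: &mut String, tokens: &mut Vec<String>) {
    if !word.is_empty() {
        tokens.push(std::mem::take(word));
    }
}

/// Pushes the pending CJK run as overlapping bigrams.
/// A run of a single character is emitted as a unigram.
fn flush_cjk(run: &mut Vec<char>, tokens: &mut Vec<String>) {
    match run.len() {
        0 => {}
        1 => tokens.push(run[0].to_string()),
        _ => {
            for pair in run.windows(2) {
                tokens.push(pair.iter().collect());
            }
        }
    }
    run.clear();
}

/// Splits text into tokens before it is handed to an indexer:
/// overlapping bigrams for runs of CJK characters, and lowercased
/// words for runs of other alphanumeric characters.
fn pre_tokenize(text: &str) -> Vec<String> {
    let mut tokens = Vec::new();
    let mut cjk_run: Vec<char> = Vec::new();
    let mut word = String::new();

    for c in text.chars() {
        if is_cjk(c) {
            // A CJK character ends any pending non-CJK word.
            flush_word(&mut word, &mut tokens);
            cjk_run.push(c);
        } else if c.is_alphanumeric() {
            // A non-CJK letter or digit ends any pending CJK run.
            flush_cjk(&mut cjk_run, &mut tokens);
            word.extend(c.to_lowercase());
        } else {
            // Whitespace and punctuation end both kinds of run.
            flush_word(&mut word, &mut tokens);
            flush_cjk(&mut cjk_run, &mut tokens);
        }
    }
    flush_word(&mut word, &mut tokens);
    flush_cjk(&mut cjk_run, &mut tokens);
    tokens
}
```

Calling `pre_tokenize("Rucene 支持中文搜索")` yields the tokens `rucene`, `支持`, `持中`, `中文`, `文搜`, `搜索`. Query strings would need to go through the same function so that search terms line up with the indexed terms. Bigrams need no dictionary but produce more terms than a dictionary-based segmenter would.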
