-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Updated Chinese stop word authorization. #776
Closed
Closed
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
pyhanlp is One of the most powerful natural language processing libraries in Chinese today, and it's extremely easy to use.You can use 'PIP install pyhanlp'. To install it,like Jieba. Its level of identity of named entity,word segmentation was better than jieba, and has more ways to do it.You'll save a lot of time when you use it. And thanks to its excellent performance, when we handle a large number of Chinese texts We don't have to use the User-defined dictionaries.
Wordcloud is a very good tools, but if you want to create Chinese wordcloud only wordcloud is not enough. The file shows how to use wordcloud with Chinese. First, you need a Chinese word segmentation library jieba or HanLp.You can use 'PIP install jieba' or 'PIP install pyhanlp' or to install it.As you can see,at the same time using wordcloud with jieba or HanLP very convenient.While jieba is lighter, hanlp requires more downloads, but is more powerfulHanLP's level of identity of named entity,word segmentation was better than jieba,and has more ways to do it.You'll save a lot of time when you use it.
now, you can use pyhanlp to create wordcloud with Chinese.
Simplified part of the code Removed some code that references the class library Modified some code that references the class library
删除冲突代码以合并源代码
修改中文词云授权文件
Updated Chinese stop word authorization, giving higher authorization without signature, can be used for personal development or commercial purposes. Also, since I changed the name of GitHub, I need to synchronize it.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Updated Chinese stop word authorization, giving higher authorization without signature, can be used for personal development or commercial purposes. Also, since I changed the name of GitHub, I need to synchronize it.