Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

幫助更新 simp.txt 與 trad.txt #55

Open
raymondjou opened this issue Aug 16, 2023 · 0 comments
Open

幫助更新 simp.txt 與 trad.txt #55

raymondjou opened this issue Aug 16, 2023 · 0 comments

Comments

@raymondjou
Copy link

NLPchina/nlp-lang/src/main/resources/simp.txt 與 NLPchina/nlp-lang/src/main/resources/trad.txt 自 2015年7月之後未再更新。

我發現了一些問題. 例如,藴 (以下註為 T2) 是 蘊 (以下註為 T1) 的異體字。而 蕴 (以下註為 S) 是 蘊 (T1) 及 藴 (T2) 的簡體字. 請參見 http://www.cojak.org/index.php?function=code_lookup&term=8574

因此, 基於為了符合上述的異體與簡繁體關係
14197 蘊 (T1) 14197 蕴 (S)
14325 藴 (T2) 14325 藴 (T2)
14347 蘊 (T1) 14347 蕴 (S)
應修正為
14197 蘊 (T1) 14197 蕴 (S)
14325 藴 (T2) 14325 蕴 (S)
14347 蘊 (T1) 14347 藴 (T2)

此問題已在 Unihan 15.1.0 > Unihan_Variants.txt 中更正。 除此之外,我還發現 Unihan 15.0.0 > Unihan_Variants.txt 和 Unihan 15.1.0 > Unihan_Variants.txt 之間還有的其他修正需要合併到 simp.txt 和 trad.txt 裏。

您是否同意讓我根據 Unihan 15.1.0 > Unihan_Variants.txt 來更新 trad.txt 和 simp.txt 呢?
如同意,煩請告知 trad.txt 和 simp.txt 的編纂規則。 多謝。

Raymond Jou 周永瑞
FamilySearch International
[email protected]
(801)240-3871 office 辦公室
(408)568-8989 mobile & text 手機 及 簡訊

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant