-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open work items for 5.12.5 #1454
Comments
For 6.00 I have many PRs that I would like to include at least some of them:
|
Re tokenization speed: In one of my atomese use-cases on and older slower machine, I see the following performance:
The above was obtained using sentences that are all exactly 12 words long. Dictionary lookup times not included in the tokenization. Linkages limit = 15K |
More about tokenization. With the atomese dicts, the dict can grow after every sentence. Thus, I call I need to find some way of doing what it does incrementally. Possibly by telling it exactly what expressions were added. -- fixed in #1459 |
I published version 5.12.1 -- I couldn't wait, certain automation scripts depend on the published tarballs. |
hi @linas
which we haven't seen in 5.12.0 |
I'm looking. Recommended fix is to disable the build of the sat-solver code. Since it's disabled by default, your build scripts must have turned it on. (Just run The recommendation is to disable, because the SAT parser is slower, in all situations, than the regular parser; in some cases, it is 10x or 20x slower. I've been considering deleting it permanently, although Amir convinced me that it can be fixed up. And so .. its in limbo ... @SoapGentoo If you are willing to carry patches, I just pushed a fix here: ffdf5d8 Otherwise, wait for 5.12.3 ... which might appear in a few weeks(? I have plans for "urgent" Atomese fixes which necessitate an LG release.) |
@SoapGentoo Version 5.12.3 is now out, with the fix you reported above. |
@linas after confirming that 5.12.3 works indeed, I proceeded to pass |
Cool. OK. FWIW. the SAT solver is already disabled by default ( |
in general, we like to specify all options to |
* Upstream recommends not using the sat solver anymore: opencog/link-grammar#1454 (comment) Bug: https://bugs.gentoo.org/593662 Signed-off-by: David Seifert <[email protected]>
Hm. OK. SAT was disabled to discourage it's use. In all situations, it is always slower, sometimes slower by factors of 10x or 100x. Amir says that, in fact, this can be fixed up and repaired, which might make SAT faster than the regular parser, maybe. Whether this is worth the effort, or not, depends mostly on future applications, rather than on the current situation. For the present English, russian, Thai, etc. dictionaries, reviving SAT seems pointless: the current parser is good enough. However, I'm working with brand-new dicts which have radically different structure, and different performance profiles, and make different demands on the parser. And for those, maybe the SAT parser could be faster or more space-efficient. Maybe, or maybe not. Unexplored. |
See comment in #1446 (comment) for pending work items for 5.12.1
I think it makes sense to also start a 5.13.0 branch that will include proposals #1450, and #1453 and #1452 and maybe #1449 depending on how that goes. And if #1449 can happen easily, then it would be version 6.0
The text was updated successfully, but these errors were encountered: