typescript bindings updates, cuda support #2403
base: main
build-bindings-backend-windows-msvc (now just called build-bindings-backend-windows) uses a deprecated image and will need to be updated to use the |
Signed-off-by: limez <[email protected]>
Signed-off-by: jacob <[email protected]> update typings Signed-off-by: jacob <[email protected]> Signed-off-by: limez <[email protected]>
Signed-off-by: jacob <[email protected]>
…arts Signed-off-by: limez <[email protected]>
… in ci Signed-off-by: limez <[email protected]>
…fallback easier testable Signed-off-by: limez <[email protected]>
…the others Signed-off-by: limez <[email protected]>
Signed-off-by: Jacob Nguyen <[email protected]>
Co-authored-by: Jacob Nguyen <[email protected]> Signed-off-by: limez <[email protected]>
build-bindings-backend-windows:
  machine:
    image: 'windows-server-2022-gui:2023.03.1'
    image: 'windows-server-2019-vs2019:2022.08.1'
For consistency with the other builds on main, please use VS2022, Enter-VsDevShell, Ninja, and ccache. Fix the merge conflicts first.
@@ -27,7 +23,8 @@
'NAPI_CPP_EXCEPTIONS',
],
'cflags_cc': [
"-fexceptions"
"-fexceptions",
"-std=c++20"
The backend on main is configured in CMake to use C++23.
await createCompletion(chat, "Are you sure?", {
verbose: true,
});
);
Not a big deal, but I'd appreciate if you committed all text files with the trailing newline as per POSIX.
This needs conflict resolution.
Describe your changes
- `nCtx` on completions
- `.so` files with the correct build type
- `Release` build type
- `LLModel:load` and `LLModel:initGpu` methods, so it's easier to copy the loading and backend-picking logic of the CUDA Backend PR. I also renamed a few things for consistency.
- `nThreads` in the `LoadModelOptions`; previously it had to be configured via setThreadCount.
- `nGpuLayers` as alias for `ngl` to the `LoadModelOptions`.
- `test:ci` script. Can switch CI config back to `yarn test` if inference in CI is possible.
- `node-gyp-build` is still working. Changed the build setup a bit to include an `assert-backend-sources.js` script that makes it easier to test the fallback during development. This also allows us to get rid of the second bindings.gyp config.

Breaking changes
- `CompletionOptions` `onResponseToken(tokenId, text)` callback has been changed to `onResponseTokens({ tokenIds, text })` (note the plural). This is related to the streamed emoji decoding / multi-token character issue.

Issue ticket number and link
#2326
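
To illustrate why the breaking change passes several token IDs with one piece of text, here is a minimal sketch of the multi-token character problem. It is not the bindings' implementation: `RawToken`, `TokenBatch`, and `batchTokens` are hypothetical names made up for this example, and only the `{ tokenIds, text }` callback shape is taken from the description above. The sketch assumes each token carries raw UTF-8 bytes and uses the standard `TextDecoder` in streaming mode to hold back incomplete characters.

```typescript
// Hypothetical shapes for illustration only; not the bindings' actual types.
interface TokenBatch {
  tokenIds: number[];
  text: string;
}

interface RawToken {
  id: number;
  bytes: Uint8Array; // raw UTF-8 bytes produced for this token (assumed)
}

// Collects token IDs until their combined bytes decode to visible text,
// then reports them together in one callback invocation.
function batchTokens(
  tokens: RawToken[],
  onResponseTokens: (batch: TokenBatch) => void,
): void {
  const decoder = new TextDecoder("utf-8");
  let pendingIds: number[] = [];
  for (const token of tokens) {
    pendingIds.push(token.id);
    // stream: true keeps an incomplete byte sequence buffered in the decoder
    // instead of emitting a replacement character.
    const text = decoder.decode(token.bytes, { stream: true });
    if (text.length > 0) {
      onResponseTokens({ tokenIds: pendingIds, text });
      pendingIds = [];
    }
  }
  // A real implementation would also need to flush bytes left over at the end.
}

// "Hi" followed by U+1F600 (bytes F0 9F 98 80) split across two tokens.
const batches: TokenBatch[] = [];
batchTokens(
  [
    { id: 1, bytes: new Uint8Array([0x48, 0x69]) },
    { id: 2, bytes: new Uint8Array([0xf0, 0x9f]) },
    { id: 3, bytes: new Uint8Array([0x98, 0x80]) },
  ],
  (batch) => batches.push(batch),
);
console.log(batches);
```

With a per-token callback, token 2 would have to be reported with either empty or broken text. Grouping lets the emoji arrive once, attributed to both of the tokens that produced it.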