chatllm.rs

Rust API wrapper for the LLM inference library chatllm.cpp

All credit goes to the original repo, https://github.com/foldl/chatllm.cpp, and to Qwen 2.5 32B Coder Instruct, which did 99% of the work. I only guided it with prompts.

To compile the project into an executable, open a terminal and navigate to the project directory.

Then run the following command:

cargo build --release

Once the executable is built, launch it with a model file, for example: main.exe -m qwen2.5-1.5b.bin
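For readers curious how a `-m` flag like the one above could be read in Rust, here is a minimal sketch using only the standard library. It is not taken from this repo's main.rs: the helper name `parse_model_path` is hypothetical, and the real program may handle its arguments differently (for example by passing them straight through to chatllm.cpp).

```rust
use std::path::PathBuf;

/// Returns the path that follows the first `-m` flag, if there is one.
/// Hypothetical helper for illustration; not part of chatllm.rs.
fn parse_model_path<I>(args: I) -> Option<PathBuf>
where
    I: IntoIterator<Item = String>,
{
    let mut args = args.into_iter();
    while let Some(arg) = args.next() {
        if arg == "-m" {
            // The value is the next argument; this is None when `-m` is last.
            return args.next().map(PathBuf::from);
        }
    }
    // No `-m` flag was given at all.
    None
}
```

In a real binary this would typically be called as `parse_model_path(std::env::args().skip(1))`, skipping the program name.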

Links for quantized models:

QWen-2.5 1.5B - https://modelscope.cn/api/v1/models/judd2024/chatllm_quantized_qwen2.5/repo?Revision=master&FilePath=qwen2.5-1.5b.bin

Gemma-2 2B - https://modelscope.cn/api/v1/models/judd2024/chatllm_quantized_gemma2_2b/repo?Revision=master&FilePath=gemma2-2b.bin
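The two links above share the same shape, so a small helper can rebuild them from a repository name and a file name. This is only a sketch: the pattern is inferred from these two URLs alone, `modelscope_url` is a hypothetical name, and other models may not follow the same layout.

```rust
/// Builds a download URL in the same shape as the two links above.
/// Hypothetical helper; the pattern is inferred from those two links only.
fn modelscope_url(repo: &str, file: &str) -> String {
    format!(
        "https://modelscope.cn/api/v1/models/judd2024/{}/repo?Revision=master&FilePath={}",
        repo, file
    )
}
```

For example, `modelscope_url("chatllm_quantized_qwen2.5", "qwen2.5-1.5b.bin")` reproduces the QWen-2.5 1.5B link.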

If you need more quantized models, use this Python model downloader: https://github.com/foldl/chatllm.cpp/blob/master/scripts/model_downloader.py

You can convert a custom safetensors model to chatllm.cpp's internal format with this script: https://github.com/foldl/chatllm.cpp/blob/master/convert.py

Conversion tutorial: https://github.com/foldl/chatllm.cpp?tab=readme-ov-file#quantize-model

List of supported LLM architectures suitable for conversion: https://github.com/foldl/chatllm.cpp/blob/master/docs/models.md
