llmtools

Some small and simple tools to work with llms

add_chatml_tokens.py

This small script adds tokens needed for chatml to a huggingface model.

python add_chatml_tokens.py --model MY_MODEL_NaME --output_dir FOLDER_TO_STORE_MODIFIED_MODEL

This small script converts existing models using fp32 or fp16 as the dtype to bf16.

python create_bf16.py --model MY_MODEL_NaME

This small script quantizes existing models to GPTQ using AutoGPTQ.

python create_gptq.py --model MY_MODEL_NaME

This small script quantizes existing models to AWQ using AutoAWQ.

python create_awq.py --model MY_MODEL_NaME

This small script shows the evaluation results for the mt-bench-de benchmark to do a quick check for a new model.

python show_mtbench_results.py --model MY_MODEL_NaME

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
add_chatml_tokens.py		add_chatml_tokens.py
create_awq.py		create_awq.py
create_exl2.py		create_exl2.py
create_gguf.py		create_gguf.py
create_gptq.py		create_gptq.py
create_hf.py		create_hf.py
create_hqq.py		create_hqq.py
create_mlx.py		create_mlx.py
show_mtbench_results.json		show_mtbench_results.json
show_mtbench_results.py		show_mtbench_results.py