Releases: 3Simplex/llama.cpp

b3531

06 Aug 14:04
efda90c
[Vulkan] Fix compilation of `vulkan-shaders-gen` on w64devkit after `…

b3504

02 Aug 13:01
e09a800
cann: Fix ggml_cann_im2col for 1D im2col (#8819)

* fix ggml_cann_im2col for 1D im2col

* fix build warning

b3501

01 Aug 19:03
b7a08fd
Build: Only include execinfo.h on linux systems that support it (#8783)

* Only enable backtrace on GLIBC linux systems

* fix missing file from copy

* use glibc macro instead of defining a custom one
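
A minimal sketch of the glibc-guarded include described above, assuming a `__GLIBC__` macro check (the exact guard and surrounding code in llama.cpp may differ):

```cpp
// Include backtrace support only where glibc provides <execinfo.h>.
// __GLIBC__ is defined by glibc's <features.h>, which the standard
// headers pull in, so no custom feature-detection macro is needed.
#if defined(__GLIBC__)
#include <execinfo.h>   // backtrace, backtrace_symbols_fd

static void print_backtrace(void) {
    void * trace[32];
    const int n = backtrace(trace, 32);
    backtrace_symbols_fd(trace, n, 2 /* stderr */);
}
#else
static void print_backtrace(void) {
    // no-op on platforms without glibc backtrace support (musl, Windows, ...)
}
#endif
```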

b3494

31 Jul 13:13
268c566
nix: cuda: rely on propagatedBuildInputs (#8772)

Listing individual outputs is no longer necessary to reduce the runtime closure size after https://github.com/NixOS/nixpkgs/pull/323056.

b3472

27 Jul 14:12
b5e9546
llama : add support for llama 3.1 rope scaling factors (#8676)

* Add llama 3.1 rope scaling factors to llama conversion and inference

This commit generates the rope scaling factors at conversion time and adds them to the resulting model as a tensor. At inference time, these factors are passed to the `ggml_rope_ext` rope operation, improving results for context windows above 8192 tokens (a sketch of the factor computation follows this entry).

* Update convert_hf_to_gguf.py

Co-authored-by: compilade <[email protected]>

* address comments

* address comments

* Update src/llama.cpp

Co-authored-by: compilade <[email protected]>

* Update convert_hf_to_gguf.py

Co-authored-by: compilade <[email protected]>

---------

Co-authored-by: compilade <[email protected]>
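
A rough C++ sketch of the per-frequency factor computation referenced above, assuming the reference Llama 3.1 rope-scaling parameters (`factor`, `low_freq_factor`, `high_freq_factor`, original context length); the function name and defaults are illustrative, not the exact conversion code:

```cpp
#include <cmath>
#include <vector>

// Illustrative sketch of Llama 3.1-style rope frequency factors.
// Defaults mirror the published rope_scaling config: factor = 8,
// low_freq_factor = 1, high_freq_factor = 4, original context = 8192.
static std::vector<float> rope_freq_factors(int head_dim,
                                            float theta            = 500000.0f,
                                            float factor           = 8.0f,
                                            float low_freq_factor  = 1.0f,
                                            float high_freq_factor = 4.0f,
                                            float old_ctx          = 8192.0f) {
    const float two_pi            = 6.283185307f;
    const float low_freq_wavelen  = old_ctx / low_freq_factor;
    const float high_freq_wavelen = old_ctx / high_freq_factor;

    std::vector<float> factors;
    for (int i = 0; i < head_dim; i += 2) {
        const float freq    = 1.0f / std::pow(theta, (float) i / head_dim);
        const float wavelen = two_pi / freq;
        if (wavelen < high_freq_wavelen) {
            factors.push_back(1.0f);        // high-frequency dims: unscaled
        } else if (wavelen > low_freq_wavelen) {
            factors.push_back(factor);      // low-frequency dims: fully scaled
        } else {                            // smooth interpolation in between
            const float smooth = (old_ctx / wavelen - low_freq_factor) /
                                 (high_freq_factor - low_freq_factor);
            factors.push_back(1.0f / ((1.0f - smooth) / factor + smooth));
        }
    }
    return factors;   // one factor per rotated dimension pair
}
```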

b3468

27 Jul 05:16
2b1f616
ggml : reduce hash table reset cost (#8698)

* ggml : reduce hash table reset cost

* fix unreachable code warnings after GGML_ASSERT(false)

* GGML_ASSERT(false) -> GGML_ABORT("fatal error")

* GGML_ABORT use format string
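
The entry above does not spell out the mechanism; one plausible shape of the optimization, sketched here under the assumption that slot occupancy is tracked in a compact bitset (the actual ggml hash-set code may differ), is to clear only the bitset on reset instead of rewriting every key slot:

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Sketch: a hash set whose reset clears a small occupancy bitset rather
// than zeroing the (much larger) array of key pointers.
struct hash_set {
    size_t      size;   // number of slots
    uint32_t *  used;   // 1 bit per slot, rounded up to 32-bit words
    void     ** keys;   // only meaningful where the matching bit is set
};

static inline size_t bitset_words(size_t n) { return (n + 31) / 32; }

static void hash_set_reset(hash_set * hs) {
    // touches size/8 bytes instead of size * sizeof(void *) bytes
    std::memset(hs->used, 0, bitset_words(hs->size) * sizeof(uint32_t));
}

static bool hash_set_used(const hash_set * hs, size_t i) {
    return (hs->used[i / 32] >> (i % 32)) & 1u;
}

static void hash_set_mark(hash_set * hs, size_t i) {
    hs->used[i / 32] |= 1u << (i % 32);
}
```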

b3467

26 Jul 13:36
01245f5
llama : fix order of parameters (#8706)

The corrected call to `aclrtGetMemInfo` follows the documented parameter order:

https://www.hiascend.com/doc_center/source/zh/canncommercial/63RC2/inferapplicationdev/aclcppdevg/aclcppdevg_03_0103.html

Co-authored-by: Judd <[email protected]>

b3463

25 Jul 18:40
4226a8d
llama : fix build + fix fabs compile warnings (#8683)

ggml-ci
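
The commit message does not show the warning itself, but a common source of `fabs` warnings in float-heavy code is the implicit promotion to double that `fabs` implies on `float` arguments; a hedged example of the usual fix:

```cpp
#include <cmath>

static bool nearly_equal(float a, float b, float eps = 1e-6f) {
    // fabs(a - b) on floats promotes to double (flagged by -Wdouble-promotion);
    // fabsf, or std::fabs with its float overload, keeps the math in float.
    return fabsf(a - b) < eps;
}
```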

b3460

25 Jul 12:57
ed67bcb
[SYCL] fix multi-gpu issue on sycl (#8554)



---------

Signed-off-by: Chen Xi <[email protected]>
Co-authored-by: Meng, Hengyu <[email protected]>

b3455

24 Jul 22:31
68504f0
readme : update games list (#8673)

Added a link to a game I made that depends on llama.