Releases: 3Simplex/llama.cpp

b3432

21 Jul 23:54
45f2c19
flake.lock: Update (#8610)

b3390

14 Jul 21:16
aaab241
flake.lock: Update (#8475)

Flake lock file updates:

• Updated input 'nixpkgs':
    'github:NixOS/nixpkgs/9f4128e00b0ae8ec65918efeba59db998750ead6?narHash=sha256-rwz8NJZV%2B387rnWpTYcXaRNvzUSnnF9aHONoJIYmiUQ%3D' (2024-07-03)
  → 'github:NixOS/nixpkgs/7e7c39ea35c5cdd002cd4588b03a3fb9ece6fad9?narHash=sha256-EYekUHJE2gxeo2pM/zM9Wlqw1Uw2XTJXOSAO79ksc4Y%3D' (2024-07-12)

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

b3384

12 Jul 16:48
4e24cff
server : handle content array in chat API (#8449)

* server : handle content array in chat API

* Update examples/server/utils.hpp

Co-authored-by: Xuan Son Nguyen

b3373

11 Jul 16:20
808aba3
CUDA: optimize and refactor MMQ (#8416)

* CUDA: optimize and refactor MMQ

* explicit q8_1 memory layouts, add documentation

b3368

10 Jul 22:18
dd07a12
Name Migration: Build the deprecation-warning 'main' binary every tim…