b4164

github-actions released this 25 Nov 16:58
9ca2e67
server : add speculative decoding support (#10455)

* server : add speculative decoding support

ggml-ci

* server : add helper function slot.can_speculate()

ggml-ci