Georgi Gerganov
0e89203b51
speculative : add tree-based sampling example (#3624)
* sampling : one sequence per sampling context
ggml-ci
* speculative : add tree-based sampling support
ggml-ci
* speculative : reuse the n_parallel CLI param
* speculative : refactor sampling
* examples : fix build after sampling refactoring
ggml-ci
* batched : fix n_seq_id
* sampling : fix malloc
ggml-ci
* swift : fix build
ggml-ci
* swift : try to fix build
ggml-ci
* prompts : add assistant.txt
* common : add llama_batch_add() and llama_batch_clear() helpers
* speculative : minor refactor
ggml-ci
* minor : comments + rename
ggml-ci
* speculative : fix off-by-one for n_drafted
* speculative : fix the n_drafted fix + p constants
2023-10-18 16:21:57 +03:00