Georgi Gerganov
6562e5a4d6
context : allow cache-less context for embeddings (#13108)
* context : allow cache-less context for embeddings
ggml-ci
* context : enable reranking with encode()
ggml-ci
* context : encode() clears embd_seq
ggml-ci
* examples : use llama_encode() when appropriate
ggml-ci
* models : nomic bert moe does not require KV cache
* llama : update comments for llama_decode/llama_encode
ggml-ci
* context : update warning log [no ci]
2025-05-08 14:28:33 +03:00