Georgi Gerganov
6562e5a4d6
context : allow cache-less context for embeddings (#13108)
* context : allow cache-less context for embeddings
ggml-ci
* context : enable reranking with encode()
ggml-ci
* context : encode() clears embd_seq
ggml-ci
* examples : use llama_encode() when appropriate
ggml-ci
* models : nomic bert moe does not require KV cache
* llama : update comments for llama_decode/llama_encode
ggml-ci
* context : update warning log [no ci]
2025-05-08 14:28:33 +03:00
..
2025-04-24 16:00:10 +03:00
2025-03-27 08:24:10 +02:00
2025-03-13 12:35:44 +02:00
2025-04-28 22:52:15 +03:00
2025-04-28 22:52:15 +03:00
2025-05-02 17:48:36 +03:00
2025-05-02 17:48:36 +03:00
2025-05-02 11:06:09 +02:00
2025-04-28 10:11:58 +02:00
2025-05-08 14:28:33 +03:00
2025-05-08 14:26:50 +03:00
2025-01-03 10:18:53 +02:00
2025-03-14 13:47:05 +01:00
2025-03-05 13:05:13 +00:00
2025-03-05 13:05:13 +00:00
2025-05-06 14:25:40 +02:00
2025-05-02 17:48:36 +03:00
2025-03-14 09:03:24 +02:00
2025-04-28 22:52:15 +03:00
2025-01-07 18:01:58 +01:00
2025-02-12 10:06:53 -04:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-05-02 17:48:36 +03:00
2025-05-02 17:48:36 +03:00
2025-03-13 12:35:44 +02:00
2025-05-02 17:48:36 +03:00
2025-03-24 12:17:10 +02:00
2025-02-10 20:58:18 +02:00
2025-04-02 16:38:54 +03:00
2025-04-02 14:52:01 +02:00
2025-05-08 14:28:33 +03:00
2025-05-03 17:39:51 +02:00
2025-04-13 21:29:28 +03:00
2025-01-03 10:18:53 +02:00
2025-05-06 22:36:24 +02:00
2025-01-12 11:32:42 +02:00
2025-04-23 20:21:59 +02:00
2025-01-12 12:15:53 +02:00
2025-04-02 14:52:01 +02:00
2024-10-08 13:27:04 +02:00
2024-10-02 15:49:55 +02:00
2025-02-15 16:40:57 +02:00
2024-12-16 12:31:45 +02:00