Georgi Gerganov
e0dbec0bc6
llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)
* llama : refactor llama_context, llama_kv_cache, llm_build_context
ggml-ci
* graph : don't mutate the KV cache during defrag
ggml-ci
* context : reduce virtuals + remove test function
ggml-ci
* context : move interface implementation to source file + factory
ggml-ci
* graph : move KV cache build functions to llama_context impl
ggml-ci
* graph : remove model reference from build_pooling
ggml-ci
* graph : remove llama_model reference
ggml-ci
* kv_cache : provide rope factors
ggml-ci
* graph : rework inputs to use only unique_ptr, remove attn input abstraction
ggml-ci
* context : remove llama_context_i abstraction
ggml-ci
* context : clean-up
ggml-ci
* graph : clean-up
ggml-ci
* llama : remove redundant keywords (struct, enum)
ggml-ci
* model : adapt gemma3
ggml-ci
* graph : restore same attention ops as on master
ggml-ci
* llama : remove TODO + fix indent
ggml-ci
2025-03-13 12:35:44 +02:00