HimariO
ba1cb19cdd
llama : add Qwen2VL support + multimodal RoPE (#10361)
* Barebone Qwen2VL LLM convertor
* Add Qwen2VL cli entrypoint
* [WIP] add qwen2vl arch
* Verify m-rope output
* Add vl-rope/2d-rope support for qwen2vl ViT
* update qwen2vl cli tool
* update 5D tensor op workaround
* [WIP] qwen2vl vision model
* make batch and clip utils compatible with qwen2vl
* [WIP] create inference workflow, gguf convert script but fix
* correcting vision-rope behavior, add the missing last layer back to ViT
* add arg parser to qwen2vl_surgery
* replace variable size array with vector
* cuda-gdb cmake preset
* add fp32 mrope, vision rope kernel
* add fp16 support for qwen2vl and m-rope
* add `GGML_ROPE_TYPE_MROPE`, `GGML_ROPE_TYPE_VISION`
* fix rope op mode switching, out dated func args
* update `llama_hparams`
* update to keep up stream changes
* resolve linter, test errors
* add makefile entry, update speical image padding token
* add mrope unit test, fix few compiler warnings
* rename `mrope` related function, params
* minor updates on debug util, bug fixs
* add `m-rope` testcase to `test-backend-ops`
* Apply suggestions from code review
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* fix traililng whitespce
* store `llama_hparams.rope_sections` with fixed size array
* update position id tensor size check in GGML_OP_ROPE
* minor updates
* update `ggml_backend_*_supports_op` of unsupported backends
* remote old `rope_section` compare operator
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2024-12-14 14:43:46 +02:00
..
2024-07-20 22:25:26 +02:00
2024-11-27 17:10:08 +01:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-11-21 18:18:50 +01:00
2024-10-03 21:17:26 +03:00
2024-07-27 04:41:55 +02:00
2024-06-26 18:33:02 +03:00
2024-09-20 21:15:05 +03:00
2024-09-20 21:15:05 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-11-26 14:18:08 +02:00
2024-12-10 18:23:24 +01:00
2024-12-12 19:09:50 +01:00
2024-06-26 18:33:02 +03:00
2024-07-08 12:23:00 +03:00
2024-07-08 12:23:00 +03:00
2024-12-10 18:23:24 +01:00
2024-06-26 18:33:02 +03:00
2024-11-09 08:35:46 +01:00
2024-10-03 21:17:26 +03:00
2024-09-24 02:14:24 +02:00
2024-10-26 10:33:56 +03:00
2024-09-20 21:15:05 +03:00
2024-09-20 21:15:05 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-11-14 18:04:35 +01:00
2024-11-14 18:04:35 +01:00
2024-06-26 18:33:02 +03:00
2024-11-14 18:04:35 +01:00
2024-06-26 18:33:02 +03:00
2024-12-03 20:04:49 +02:00
2024-12-03 20:04:49 +02:00
2024-11-14 18:04:35 +01:00
2024-12-10 18:23:24 +01:00
2024-06-26 18:33:02 +03:00
2024-07-27 04:41:55 +02:00
2024-06-26 18:33:02 +03:00
2024-12-10 18:23:24 +01:00
2024-10-23 16:50:02 +03:00
2024-06-26 18:33:02 +03:00
2024-12-10 18:23:24 +01:00
2024-12-10 18:23:24 +01:00
2024-12-10 18:23:24 +01:00
2024-12-09 20:07:12 +01:00
2024-11-17 09:09:55 +01:00
2024-12-10 18:23:24 +01:00
2024-06-26 18:33:02 +03:00
2024-08-06 10:26:46 +03:00
2024-06-26 18:33:02 +03:00
2024-11-17 08:30:29 +02:00
2024-09-20 21:15:05 +03:00
2024-09-20 21:15:05 +03:00
2024-09-20 21:15:05 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-11-21 18:18:50 +01:00
2024-07-11 16:47:47 +02:00
2024-12-14 14:43:46 +02:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-12-10 18:23:24 +01:00
2024-09-08 11:05:55 +03:00
2024-08-27 22:41:27 +03:00
2024-08-27 22:41:27 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-09-22 04:29:12 +02:00
2024-09-22 04:29:12 +02:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-07-20 22:25:26 +02:00
2024-11-07 15:19:10 +08:00
2024-11-07 15:19:10 +08:00