Xuan-Son Nguyen
92ecdcc06a
mtmd : add vision support for llama 4 (#13282)
* wip llama 4 conversion
* rm redundant __init__
* fix conversion
* fix conversion
* test impl
* try this
* reshape patch_embeddings_0
* fix view
* rm ffn_post_norm
* cgraph ok
* f32 for pos embd
* add image marker tokens
* Llama4UnfoldConvolution
* correct pixel shuffle
* fix merge conflicts
* correct
* add debug_graph
* logits matched, but it still perceives the image incorrectly
* fix style
* add image_grid_pinpoints
* handle llama 4 preprocessing
* rm load_image_size
* rm unused line
* fix
* small fix 2
* add test & docs
* fix llava-1.6 test
* test: add notion of huge models
* add comment
* add warn about degraded quality
2025-05-19 13:04:14 +02:00