Xuan-Son Nguyen
92ecdcc06a
mtmd : add vision support for llama 4 (#13282)
* wip llama 4 conversion
* rm redundant __init__
* fix conversion
* fix conversion
* test impl
* try this
* reshape patch_embeddings_0
* fix view
* rm ffn_post_norm
* cgraph ok
* f32 for pos embd
* add image marker tokens
* Llama4UnfoldConvolution
* correct pixel shuffle
* fix merge conflicts
* correct
* add debug_graph
* logits matched, but it still preceives the image incorrectly
* fix style
* add image_grid_pinpoints
* handle llama 4 preprocessing
* rm load_image_size
* rm unused line
* fix
* small fix 2
* add test & docs
* fix llava-1.6 test
* test: add notion of huge models
* add comment
* add warn about degraded quality
2025-05-19 13:04:14 +02:00
..
2025-05-13 18:01:53 +03:00
2025-05-02 20:27:13 +02:00
2025-05-02 20:27:13 +02:00
2025-05-02 20:27:13 +02:00
2025-05-09 11:53:58 +02:00
2025-05-15 15:46:55 +02:00
2025-05-09 13:02:07 +02:00
2025-05-19 13:04:14 +02:00
2025-05-08 14:26:50 +03:00
2025-05-13 19:12:31 +02:00
2025-05-09 13:02:07 +02:00
2025-05-09 10:25:50 +01:00
2025-05-17 23:59:48 +02:00
2025-05-02 20:27:13 +02:00
2025-05-02 20:27:13 +02:00
2025-05-05 16:02:55 +02:00