enginex-ascend-910-llama.cpp

EngineX-Ascend/enginex-ascend-910-llama.cpp

Files

Xuan-Son Nguyen 92ecdcc06a mtmd : add vision support for llama 4 (#13282)

* wip llama 4 conversion

* rm redundant __init__

* fix conversion

* fix conversion

* test impl

* try this

* reshape patch_embeddings_0

* fix view

* rm ffn_post_norm

* cgraph ok

* f32 for pos embd

* add image marker tokens

* Llama4UnfoldConvolution

* correct pixel shuffle

* fix merge conflicts

* correct

* add debug_graph

* logits matched, but it still perceives the image incorrectly

* fix style

* add image_grid_pinpoints

* handle llama 4 preprocessing

* rm load_image_size

* rm unused line

* fix

* small fix 2

* add test & docs

* fix llava-1.6 test

* test: add notion of huge models

* add comment

* add warn about degraded quality

2025-05-19 13:04:14 +02:00

batched-bench

batched-bench : fix pp batch contents (#13492)

2025-05-13 18:01:53 +03:00

cvector-generator

llama : move end-user examples to tools directory (#13249)

2025-05-02 20:27:13 +02:00

export-lora

llama : move end-user examples to tools directory (#13249)

2025-05-02 20:27:13 +02:00

gguf-split

llama : move end-user examples to tools directory (#13249)