commit 51e4bf9961435ed5a61740fe23b9ccb469aa9eca Author: ModelHub XC Date: Mon May 11 00:03:23 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..4e04a02 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.imatrix.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-8B-Medical-Extraction.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_M.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_M.gguf new file mode 100644 index 0000000..8540895 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c70f6a9b3f6e6167f649bdae41712af39dafd9b07f65b914f97d12663955da16 +size 2256149472 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_S.gguf new file mode 100644 index 0000000..e046509 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:36a9e65c35e8f02a03c69c2da2b5983b29109db1baa28cdafa0a850a454b32e1 +size 2115771360 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_M.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_M.gguf new file mode 100644 index 0000000..8e6080f --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b735c2ef49e891e3c65892ca9f06ad65042c1db305b0b17a6a0823bab54c0dd7 +size 3051916256 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_S.gguf new file mode 100644 index 0000000..d78156f --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f0ce45deee781cf300f4dedec9acffc2d89441dda86c0d89f5e6e58a654067bd +size 2864745440 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XS.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XS.gguf new file mode 100644 index 0000000..73a5729 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:65fdd5289ae1313dbfd97f8afd619e26744fead9567a10967e0e12960644e0ee +size 2696158176 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XXS.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..26af163 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:619ba14e1809049fc055fe621f9dc1c44bec18131950f8d2ca8a09b63d73b11a +size 2490112992 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_M.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_M.gguf new file mode 100644 index 0000000..98d78d2 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a72ee93e4def7eb314920cc3b9fa15d5b853e9b9a1abe7211e1702fbe0f7972f +size 3896622048 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_S.gguf new file mode 100644 index 0000000..c1dce5c --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8e5160943bc720f07b234d514f081d53844c9f447ef830b8eebc3b6208ec6534 +size 3789667296 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XS.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XS.gguf new file mode 100644 index 0000000..8e578b0 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1118ebbf6469249289f64844f37ce8ce6adcdcac4bfd2892f46744257193668e +size 3626875872 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XXS.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..477cfab --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:24308b036216d3dc9e6605340cb9c3d568cee17b224a6fe3d883fa24e4eae456 +size 3369634784 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_NL.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_NL.gguf new file mode 100644 index 0000000..4f1c762 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8e978675bf897dbe55d47a2058cf273884f90bc455c2297c44b06012f0b9e903 +size 4793625568 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_XS.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_XS.gguf new file mode 100644 index 0000000..ae45c95 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:185b46aaa8c0ea2e409c31246764a055a8645678e2a21fef99f975ef7629ee56 +size 4561841120 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K.gguf new file mode 100644 index 0000000..92f3d52 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d6f188fa5fde403e1517f252bf7842c011558745380def9a9be5950a084570ba +size 3281734624 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K_S.gguf new file mode 100644 index 0000000..23c4d37 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b2b6daf05569ce7a7fd814f1110bea563b7143e29e5fda1aa3365c301ad8ea5b +size 3083553760 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_L.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_L.gguf new file mode 100644 index 0000000..9b13bdf --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ecbdb5e0bf96f1134e4437078bcd81a933dc27ac89999e2c9fac7f81b0a72a29 +size 4431395808 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_M.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_M.gguf new file mode 100644 index 0000000..03f7258 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9bd2c16290a208674df8147f0903d9143e497256e3db0682d9855d46a14fbf26 +size 4124163040 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_S.gguf new file mode 100644 index 0000000..28c1fa2 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b85df1fa5a2b4300a2726c106e6b04ceda6f8a24f928231cc21e33f727d7e405 +size 3769613280 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q4_0.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_0.gguf new file mode 100644 index 0000000..f9939fb --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f0632c9b985cae7ed90dfce851177daa86766f11680d7a1dcdf6ef8f02c722d7 +size 4787334112 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q4_1.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_1.gguf new file mode 100644 index 0000000..c6c98dc --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:30329a23e3908d06a7bc600bc5207a76287dc5c32f606eca92ea2f742a035d63 +size 5247757280 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_M.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_M.gguf new file mode 100644 index 0000000..fd160c0 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f2d1840b9c3ccda2cf9ac54bfa285a01b447b8445cd35706a7412c61149757a6 +size 5027785696 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_S.gguf new file mode 100644 index 0000000..57e513d --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2b0f47960a41c5256bb256a146c01db993a8fad60c4dd8cc4c328a76975c7779 +size 4802014176 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_M.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_M.gguf new file mode 100644 index 0000000..e88dfed --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c3fef08a674901de19a334b6da5bd5eb774e0194b6bdf255b9475970f122720a +size 5851114464 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_S.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_S.gguf new file mode 100644 index 0000000..79d0a49 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cf0fb53421e07d27abf625357d35417e5446edd4e7f7f9d52e3b64c713ea554a +size 5720763360 diff --git a/Qwen3-VL-8B-Medical-Extraction.i1-Q6_K.gguf b/Qwen3-VL-8B-Medical-Extraction.i1-Q6_K.gguf new file mode 100644 index 0000000..7677c2e --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:71a945b25bf0e8f6ea9e7c04eedb8fbe6be2863ce3eccab10c9badc465f48db3 +size 6725901280 diff --git a/Qwen3-VL-8B-Medical-Extraction.imatrix.gguf b/Qwen3-VL-8B-Medical-Extraction.imatrix.gguf new file mode 100644 index 0000000..b3b6261 --- /dev/null +++ b/Qwen3-VL-8B-Medical-Extraction.imatrix.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3900db292cbd4a9c2fff655aa57918a0f5a6041b93fb33b996ff8dc10844d130 +size 5347200 diff --git a/README.md b/README.md new file mode 100644 index 0000000..599b44c --- /dev/null +++ b/README.md @@ -0,0 +1,85 @@ +--- +base_model: Zaynoid/Qwen3-VL-8B-Medical-Extraction +language: +- en +library_name: transformers +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: [] +--- +## About + + + + + + + + + +weighted/imatrix quants of https://huggingface.co/Zaynoid/Qwen3-VL-8B-Medical-Extraction + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Qwen3-VL-8B-Medical-Extraction-i1-GGUF).*** + +static quants are available at https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_S.gguf) | i1-IQ1_S | 2.2 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ1_M.gguf) | i1-IQ1_M | 2.4 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.6 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.8 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_S.gguf) | i1-IQ2_S | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ2_M.gguf) | i1-IQ2_M | 3.2 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K_S.gguf) | i1-Q2_K_S | 3.2 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q2_K.gguf) | i1-Q2_K | 3.4 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.5 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.7 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.9 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_S.gguf) | i1-IQ3_S | 3.9 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ3_M.gguf) | i1-IQ3_M | 4.0 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.2 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.5 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.7 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q4_0.gguf) | i1-Q4_0 | 4.9 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.9 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.9 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q4_1.gguf) | i1-Q4_1 | 5.3 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.8 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q5_K_M.gguf) | i1-Q5_K_M | 6.0 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-8B-Medical-Extraction-i1-GGUF/resolve/main/Qwen3-VL-8B-Medical-Extraction.i1-Q6_K.gguf) | i1-Q6_K | 6.8 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + +