commit f695cff3c1164e2ef28b34816686334d24234145 Author: ModelHub XC Date: Fri May 8 19:15:31 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..1130ec4 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_M.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_M.gguf new file mode 100644 index 0000000..7af8b47 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:71704230b41264c33903ff2e7c3718cd02c02bf002da260eb3864e012fa0d787 +size 2042196960 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_S.gguf new file mode 100644 index 0000000..c3b784c --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:509c62804cf00052e70dff19262726a71f2c14aa4f994256175890749e0236e0 +size 1903668192 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_M.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_M.gguf new file mode 100644 index 0000000..c0e71c5 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:317a8f3f1de54b983c9a19d53d55074adfa15aa3f2440c0995986eacdf332d29 +size 2780343264 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_S.gguf new file mode 100644 index 0000000..5285ec0 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:301760d4d481ec25ba971baf16474789fcc5b3068f114f02289d1eef9e491d54 +size 2595638240 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XS.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XS.gguf new file mode 100644 index 0000000..6aff307 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1efd93d0550d952d0cbd4cf07fb88d95fccd95457f174d5de84b20bc545dc0b3 +size 2469022688 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XXS.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..ea7687c --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:070ed81f703309361029da70f498e9ca5c573ad7078a91b552b8d0002affa10b +size 2273078240 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_M.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_M.gguf new file mode 100644 index 0000000..5c8d0c1 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8e9bc31c815cb3912600ce3d962fd35a3400bdc65f52660aa73cddaf49d0fc84 +size 3574012896 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_S.gguf new file mode 100644 index 0000000..e45f61f --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:34db6f8b235c72440ecd5293df3f9ae30f4f37e9536825295dae9ca353782aff +size 3499193312 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XS.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XS.gguf new file mode 100644 index 0000000..08a377c --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:197e6a77d0fc66a71974e23493ebf3809a155a53495e95344f943bb34f506ec5 +size 3346256864 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XXS.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..974088e --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3ddaef4a2b4c9d4ca994487fc4f7cc3dc8f86e94e555917dd9d3100673afb7fa +size 3114515424 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_NL.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_NL.gguf new file mode 100644 index 0000000..4d6e2bc --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:63c56642de70a9777920b34ad728520879b8295e5e2f462da390a954bb9f0fe1 +size 4437814240 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_XS.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_XS.gguf new file mode 100644 index 0000000..8adf0e1 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9c242688e6035df3ec7f62d1827db9e1c88c1f935bf524e983f094898746cd19 +size 4218473440 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K.gguf new file mode 100644 index 0000000..2b1257e --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:477f8bd7c667f9f4b64ba2c90c7f635f1b9563e5705930b8dc739e48b3f8b888 +size 3015941088 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K_S.gguf new file mode 100644 index 0000000..f57e69e --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c474578f46621ad7c0d3c5d782f3f6c916bb14058084d80fa035be1849b89a1e +size 2834074592 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_L.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_L.gguf new file mode 100644 index 0000000..6e529bb --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b401f9f6505956fc76b611efd1c14431aa9d3fe925e4b0658e141dc0a9421486 +size 4088460256 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_M.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_M.gguf new file mode 100644 index 0000000..9c54c2c --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bc7c4bc9ad4ebc963677c09f373fcba4e061c7a4adcf0c598f28d9c18d1c01c5 +size 3808392160 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_S.gguf new file mode 100644 index 0000000..0b76670 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8f1fcba6e3cf37d498166be40d4c1c9378eae8ff3ca3afb1877e8c26fa39c4c6 +size 3492369376 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_0.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_0.gguf new file mode 100644 index 0000000..d37f7f7 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4115ad1db2ad5edd0217087c7f203985491c41921c75830efae5a1b0e96f2522 +size 4444122080 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_1.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_1.gguf new file mode 100644 index 0000000..510490d --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9882838fc2fc2963e59294fb7edf9f67f90c5e8c4d7578d26494bfffa3e54b98 +size 4873284576 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_M.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_M.gguf new file mode 100644 index 0000000..75da8de --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:858a02ebcba94fdf533a7fde4bc55f6bd62bd26adef303f67dcec98ca0e0f200 +size 4683074528 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_S.gguf new file mode 100644 index 0000000..93aff3b --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b6ac9cf7b57374f165cd5aef2f63d2863d975fd099e9a46b3ec882b9a9a35f45 +size 4457769952 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_M.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_M.gguf new file mode 100644 index 0000000..fbe5cc3 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c2ebb3226583f72af0473ffe92a46aea8fd40a8339bd74d72d8299458adaa6d6 +size 5444832224 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_S.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_S.gguf new file mode 100644 index 0000000..1cda4be --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1dec0bf359ec77827ca254c42b8c938d8b96792d8023810bf6be717fa2257d57 +size 5315177440 diff --git a/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q6_K.gguf b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q6_K.gguf new file mode 100644 index 0000000..bc4bd40 --- /dev/null +++ b/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2679ad4564f6078101324548b459e133cb72948f4654f8ba3ae9d3aff294617d +size 6254199776 diff --git a/README.md b/README.md new file mode 100644 index 0000000..aff2e98 --- /dev/null +++ b/README.md @@ -0,0 +1,83 @@ +--- +base_model: amadeusai/Amadeus-Verbo-BI-Qwen-2.5-7B-PT-BR-Instruct-Experimental +language: +- pt +library_name: transformers +license: apache-2.0 +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- text-generation-inference +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/amadeusai/Amadeus-Verbo-BI-Qwen-2.5-7B-PT-BR-Instruct-Experimental + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF).*** + +static quants are available at https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_S.gguf) | i1-IQ1_S | 2.0 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ1_M.gguf) | i1-IQ1_M | 2.1 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.6 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_S.gguf) | i1-IQ2_S | 2.7 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ2_M.gguf) | i1-IQ2_M | 2.9 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.9 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q2_K.gguf) | i1-Q2_K | 3.1 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.2 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.4 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.6 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_S.gguf) | i1-IQ3_S | 3.6 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ3_M.gguf) | i1-IQ3_M | 3.7 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.9 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.2 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.3 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.5 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_0.gguf) | i1-Q4_0 | 4.5 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.6 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.8 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q4_1.gguf) | i1-Q4_1 | 5.0 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.4 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.5 | | +| [GGUF](https://huggingface.co/mradermacher/AV-BI-Qwen2.5-7B-PT-BR-Instruct-i1-GGUF/resolve/main/AV-BI-Qwen2.5-7B-PT-BR-Instruct.i1-Q6_K.gguf) | i1-Q6_K | 6.4 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..99aae44 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:47fe83bb616e1e24b37af7336a3171aa814997782cc3debab45d5e6b4450cd87 +size 4536665