初始化项目,由ModelHub XC社区提供模型
Model: mradermacher/next-32b-i1-GGUF Source: Original Platform
This commit is contained in:
59
.gitattributes
vendored
Normal file
59
.gitattributes
vendored
Normal file
@@ -0,0 +1,59 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
next-32b.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
128
README.md
Normal file
128
README.md
Normal file
@@ -0,0 +1,128 @@
|
||||
---
|
||||
base_model: thelamapi/next-32b
|
||||
datasets:
|
||||
- mlabonne/FineTome-100k
|
||||
- CognitiveKernel/CognitiveKernel-Pro-SFT
|
||||
- OpenSPG/KAG-Thinker-training-dataset
|
||||
- Gryphe/ChatGPT-4o-Writing-Prompts
|
||||
- QuixiAI/dolphin-r1
|
||||
- uclanlp/Brief-Pro
|
||||
language:
|
||||
- tr
|
||||
- en
|
||||
- de
|
||||
- es
|
||||
- fr
|
||||
- ru
|
||||
- zh
|
||||
- ja
|
||||
- ko
|
||||
library_name: transformers
|
||||
license: mit
|
||||
mradermacher:
|
||||
readme_rev: 1
|
||||
quantized_by: mradermacher
|
||||
tags:
|
||||
- turkish
|
||||
- türkiye
|
||||
- reasoning
|
||||
- ai
|
||||
- lamapi
|
||||
- gemma3
|
||||
- next
|
||||
- next-x1
|
||||
- text-generation
|
||||
- open-source
|
||||
- 32b
|
||||
- large-language-model
|
||||
- llm
|
||||
- transformer
|
||||
- artificial-intelligence
|
||||
- machine-learning
|
||||
- nlp
|
||||
- multilingual
|
||||
- instruction-tuned
|
||||
- chat
|
||||
- generative-ai
|
||||
- optimized
|
||||
- trl
|
||||
- sft
|
||||
- cognitive
|
||||
- analytical
|
||||
- enterprise
|
||||
- industrial
|
||||
---
|
||||
## About
|
||||
|
||||
<!-- ### quantize_version: 2 -->
|
||||
<!-- ### output_tensor_quantised: 1 -->
|
||||
<!-- ### convert_type: hf -->
|
||||
<!-- ### vocab_type: -->
|
||||
<!-- ### tags: nicoboss -->
|
||||
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
|
||||
<!-- ### quants_skip: -->
|
||||
<!-- ### skip_mmproj: -->
|
||||
weighted/imatrix quants of https://huggingface.co/thelamapi/next-32b
|
||||
|
||||
<!-- provided-files -->
|
||||
|
||||
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#next-32b-i1-GGUF).***
|
||||
|
||||
static quants are available at https://huggingface.co/mradermacher/next-32b-GGUF
|
||||
## Usage
|
||||
|
||||
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
||||
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
||||
more details, including on how to concatenate multi-part files.
|
||||
|
||||
## Provided Quants
|
||||
|
||||
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
|
||||
|
||||
| Link | Type | Size/GB | Notes |
|
||||
|:-----|:-----|--------:|:------|
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ1_S.gguf) | i1-IQ1_S | 7.4 | for the desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ1_M.gguf) | i1-IQ1_M | 8.1 | mostly desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 9.1 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ2_XS.gguf) | i1-IQ2_XS | 10.1 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ2_S.gguf) | i1-IQ2_S | 10.6 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ2_M.gguf) | i1-IQ2_M | 11.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q2_K_S.gguf) | i1-Q2_K_S | 11.6 | very low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q2_K.gguf) | i1-Q2_K | 12.4 | IQ3_XXS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 12.9 | lower quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ3_XS.gguf) | i1-IQ3_XS | 13.8 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q3_K_S.gguf) | i1-Q3_K_S | 14.5 | IQ3_XS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ3_S.gguf) | i1-IQ3_S | 14.5 | beats Q3_K* |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ3_M.gguf) | i1-IQ3_M | 15.0 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q3_K_M.gguf) | i1-Q3_K_M | 16.1 | IQ3_S probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q3_K_L.gguf) | i1-Q3_K_L | 17.4 | IQ3_M probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-IQ4_XS.gguf) | i1-IQ4_XS | 17.8 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q4_0.gguf) | i1-Q4_0 | 18.8 | fast, low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q4_K_S.gguf) | i1-Q4_K_S | 18.9 | optimal size/speed/quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q4_K_M.gguf) | i1-Q4_K_M | 19.9 | fast, recommended |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q4_1.gguf) | i1-Q4_1 | 20.7 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q5_K_S.gguf) | i1-Q5_K_S | 22.7 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q5_K_M.gguf) | i1-Q5_K_M | 23.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/next-32b-i1-GGUF/resolve/main/next-32b.i1-Q6_K.gguf) | i1-Q6_K | 27.0 | practically like static Q6_K |
|
||||
|
||||
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
||||
types (lower is better):
|
||||
|
||||

|
||||
|
||||
And here are Artefact2's thoughts on the matter:
|
||||
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
|
||||
|
||||
## FAQ / Model Request
|
||||
|
||||
See https://huggingface.co/mradermacher/model_requests for some answers to
|
||||
questions you might have and/or if you want some other model quantized.
|
||||
|
||||
## Thanks
|
||||
|
||||
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
||||
me use its servers and providing upgrades to my workstation to enable
|
||||
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
|
||||
|
||||
<!-- end -->
|
||||
3
next-32b.i1-IQ1_M.gguf
Normal file
3
next-32b.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:04d41a595d39edf89aae28d7685e3f1f67f76c3ccb89a9d1e9434798b572d7de
|
||||
size 7959871200
|
||||
3
next-32b.i1-IQ1_S.gguf
Normal file
3
next-32b.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:2ab24954738651eb54525efcf20c40b3003e31d54b1d9b940c7dde27484d06c0
|
||||
size 7323844320
|
||||
3
next-32b.i1-IQ2_M.gguf
Normal file
3
next-32b.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e840d962120a3a4e87cde25eb8b5169c87d2f9d4afac8b52e9cc7a520d1f3cbd
|
||||
size 11362863840
|
||||
3
next-32b.i1-IQ2_S.gguf
Normal file
3
next-32b.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:53b58a667366b5188de3dd914d87a2d7417cec7850d2be805b5caa22855e5946
|
||||
size 10514828000
|
||||
3
next-32b.i1-IQ2_XS.gguf
Normal file
3
next-32b.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:6032d778ef19fe093c540d36a47d0268cd4b1b16bbc643ba8298f92d2d6ce871
|
||||
size 9951837920
|
||||
3
next-32b.i1-IQ2_XXS.gguf
Normal file
3
next-32b.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e2ab2d0183f392c8dc385f63be77bc35304f08e7f1b1a1b9145324b08080be85
|
||||
size 9019916000
|
||||
3
next-32b.i1-IQ3_M.gguf
Normal file
3
next-32b.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:97ae483dbf873967323aa8925beca71c7dea8587f502883570890000ea3da840
|
||||
size 14930085600
|
||||
3
next-32b.i1-IQ3_S.gguf
Normal file
3
next-32b.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:8ef7080d613a249b6d07964557007c9fdbd5b0e0631c86bddc8083f64a9916b3
|
||||
size 14434305760
|
||||
3
next-32b.i1-IQ3_XS.gguf
Normal file
3
next-32b.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:36283e6d2e7fd3a03bdd4016c2dacc5e94fb52fa525b851d806f916a3a8f06a7
|
||||
size 13702924000
|
||||
3
next-32b.i1-IQ3_XXS.gguf
Normal file
3
next-32b.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:00c257724e19f5cb28890a85d4ff076c994f5143c8df9958f19e1a3eaaa40058
|
||||
size 12821039840
|
||||
3
next-32b.i1-IQ4_XS.gguf
Normal file
3
next-32b.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4f03cc878b8236d7f0f88f9b3b3c68b6a1f4ab103baf462e58a5b3540569287d
|
||||
size 17690497760
|
||||
3
next-32b.i1-Q2_K.gguf
Normal file
3
next-32b.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:2de7ecba052103e9a70cfa365b04bd2dc82082a6de6b57877fe79a1167bff8fa
|
||||
size 12344654560
|
||||
3
next-32b.i1-Q2_K_S.gguf
Normal file
3
next-32b.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:993c7a3b60b6c38e14f09266e34f071e741c4c51908df139c378afe6093caee4
|
||||
size 11465816800
|
||||
3
next-32b.i1-Q3_K_L.gguf
Normal file
3
next-32b.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a0fb3cc319fdb731ee98499d12aaf8cb5e3499317d4592fe878ccfc17b1a7ac7
|
||||
size 17330996960
|
||||
3
next-32b.i1-Q3_K_M.gguf
Normal file
3
next-32b.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:de82b1a22970f496dfbd6ff9917c43b3d074711b6989c5a5609a66ca453bde3f
|
||||
size 15971780320
|
||||
3
next-32b.i1-Q3_K_S.gguf
Normal file
3
next-32b.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:857a48c66b8f29232eacecf698e312f6593299d86c98d0ce83c55498da426711
|
||||
size 14389741280
|
||||
3
next-32b.i1-Q4_0.gguf
Normal file
3
next-32b.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:976be644b3dc49ffc9477e2d9aa325f28f9a852e2a7c03bae2aca2c0969b1aee
|
||||
size 18703090400
|
||||
3
next-32b.i1-Q4_1.gguf
Normal file
3
next-32b.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d34d3575ed0cc4c32b8c6766119c5da27bbe29a2b57c83a66b2db21e19fbc418
|
||||
size 20636525280
|
||||
3
next-32b.i1-Q4_K_M.gguf
Normal file
3
next-32b.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a8a5a3e2193f162cdb4387a2ed9115b0741c90eab4c677e60a82f5204f8befe9
|
||||
size 19762152160
|
||||
3
next-32b.i1-Q4_K_S.gguf
Normal file
3
next-32b.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:10305d933e2eee854ca9e249312ed48b480dd2c304e29f3ef5ce943c5937df2f
|
||||
size 18771247840
|
||||
3
next-32b.i1-Q5_K_M.gguf
Normal file
3
next-32b.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:bb6dee5e2e4f1083de0c96eab7d1572d0cf65fe0e5896c622870e677e46ae8c5
|
||||
size 23214834400
|
||||
3
next-32b.i1-Q5_K_S.gguf
Normal file
3
next-32b.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4c138bfd6fc73459b590269860331e6681f2138f1f4b8c8ecda2fbf15a85f2c9
|
||||
size 22635496160
|
||||
3
next-32b.i1-Q6_K.gguf
Normal file
3
next-32b.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d9c9d79b811bbb81bfedd91bca03f3c7de316b6a78c658953898a6b853f71b6c
|
||||
size 26883309280
|
||||
3
next-32b.imatrix.gguf
Normal file
3
next-32b.imatrix.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ad70b38e5bd59407f7deaa43364f8dcae82905824d8d225b7a46a0a409935778
|
||||
size 15273216
|
||||
Reference in New Issue
Block a user