初始化项目,由ModelHub XC社区提供模型
Model: QuantFactory/Qwen2.5-Gutenberg-Doppel-14B-GGUF Source: Original Platform
This commit is contained in:
49
.gitattributes
vendored
Normal file
49
.gitattributes
vendored
Normal file
@@ -0,0 +1,49 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen2.5-Gutenberg-Doppel-14B.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q2_K.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:0c16a667af2db044bbc821813d58c6a488f49cbb42e1b8e58db30b4ce6684e88
|
||||
size 5770497728
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_L.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:60bcafd02d4a0ac6e1434ed680c43efa1bfae7b4c4ae1d3f51e6e1a680e3ba72
|
||||
size 7924768448
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_M.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:8ec071f9c87a4ebf5fcbd2090a6e55483654e2375a35ec5dae780d96eb209971
|
||||
size 7339204288
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_S.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:1bd69aaaf36fcdd50989785c83f359d1545ac2213266675f1a531f22dd066d3d
|
||||
size 6659595968
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_0.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:46b3f7733af62606b55191fbf96b7c5a0d023d1a4b748eb3c8476c46910c8fff
|
||||
size 8517725888
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_1.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3f400dbe37656227da24352631c7904174de70ba52223062ff030fff282a5268
|
||||
size 9392139968
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_K_M.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:de39d14735ed39ec19b623477de0ce47010860219d01a2f310faf9e020436e9f
|
||||
size 8988110528
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_K_S.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:59a60b79d7386f510700fd35867cd9279cd3d3ef67af0c7f69a8764b77d48d9e
|
||||
size 8573431488
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_0.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:74cbc85fe285aaae08df5e39cfe51b6197d6777f61ca6dfdc592ec4a22c98845
|
||||
size 10266554048
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_1.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f5a19c7c877465c13f83030c49876157d2de362cc121b496a60aeb5e888ef036
|
||||
size 11140968128
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_K_M.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a821337bd3ae7c031f35b531d8ee4c3dcdaffc5aef98aca287064a906509f5e2
|
||||
size 10508873408
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_K_S.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:946c4c0ed3dfc497164d8fbdf297342af281219fbd9f3b07c8f7ef1365535388
|
||||
size 10266554048
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q6_K.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:63df6f0de90044399b0e99b0c79ee4f1d4263688f5c0f00e07cb6c35d6878b57
|
||||
size 12124683968
|
||||
3
Qwen2.5-Gutenberg-Doppel-14B.Q8_0.gguf
Normal file
3
Qwen2.5-Gutenberg-Doppel-14B.Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c935287c651ef4e8bc468f2f472c5335e1f560cfc49e25d48a5d597d9a400919
|
||||
size 15701597888
|
||||
139
README.md
Normal file
139
README.md
Normal file
@@ -0,0 +1,139 @@
|
||||
|
||||
---
|
||||
|
||||
license: apache-2.0
|
||||
library_name: transformers
|
||||
base_model:
|
||||
- Qwen/Qwen2.5-14B-Instruct
|
||||
model-index:
|
||||
- name: Qwen2.5-Gutenberg-Doppel-14B
|
||||
results:
|
||||
- task:
|
||||
type: text-generation
|
||||
name: Text Generation
|
||||
dataset:
|
||||
name: IFEval (0-Shot)
|
||||
type: HuggingFaceH4/ifeval
|
||||
args:
|
||||
num_few_shot: 0
|
||||
metrics:
|
||||
- type: inst_level_strict_acc and prompt_level_strict_acc
|
||||
value: 80.91
|
||||
name: strict accuracy
|
||||
source:
|
||||
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Qwen2.5-Gutenberg-Doppel-14B
|
||||
name: Open LLM Leaderboard
|
||||
- task:
|
||||
type: text-generation
|
||||
name: Text Generation
|
||||
dataset:
|
||||
name: BBH (3-Shot)
|
||||
type: BBH
|
||||
args:
|
||||
num_few_shot: 3
|
||||
metrics:
|
||||
- type: acc_norm
|
||||
value: 48.24
|
||||
name: normalized accuracy
|
||||
source:
|
||||
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Qwen2.5-Gutenberg-Doppel-14B
|
||||
name: Open LLM Leaderboard
|
||||
- task:
|
||||
type: text-generation
|
||||
name: Text Generation
|
||||
dataset:
|
||||
name: MATH Lvl 5 (4-Shot)
|
||||
type: hendrycks/competition_math
|
||||
args:
|
||||
num_few_shot: 4
|
||||
metrics:
|
||||
- type: exact_match
|
||||
value: 0.0
|
||||
name: exact match
|
||||
source:
|
||||
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Qwen2.5-Gutenberg-Doppel-14B
|
||||
name: Open LLM Leaderboard
|
||||
- task:
|
||||
type: text-generation
|
||||
name: Text Generation
|
||||
dataset:
|
||||
name: GPQA (0-shot)
|
||||
type: Idavidrein/gpqa
|
||||
args:
|
||||
num_few_shot: 0
|
||||
metrics:
|
||||
- type: acc_norm
|
||||
value: 11.07
|
||||
name: acc_norm
|
||||
source:
|
||||
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Qwen2.5-Gutenberg-Doppel-14B
|
||||
name: Open LLM Leaderboard
|
||||
- task:
|
||||
type: text-generation
|
||||
name: Text Generation
|
||||
dataset:
|
||||
name: MuSR (0-shot)
|
||||
type: TAUR-Lab/MuSR
|
||||
args:
|
||||
num_few_shot: 0
|
||||
metrics:
|
||||
- type: acc_norm
|
||||
value: 10.02
|
||||
name: acc_norm
|
||||
source:
|
||||
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Qwen2.5-Gutenberg-Doppel-14B
|
||||
name: Open LLM Leaderboard
|
||||
- task:
|
||||
type: text-generation
|
||||
name: Text Generation
|
||||
dataset:
|
||||
name: MMLU-PRO (5-shot)
|
||||
type: TIGER-Lab/MMLU-Pro
|
||||
config: main
|
||||
split: test
|
||||
args:
|
||||
num_few_shot: 5
|
||||
metrics:
|
||||
- type: acc
|
||||
value: 43.57
|
||||
name: accuracy
|
||||
source:
|
||||
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Qwen2.5-Gutenberg-Doppel-14B
|
||||
name: Open LLM Leaderboard
|
||||
|
||||
---
|
||||
|
||||
[](https://hf.co/QuantFactory)
|
||||
|
||||
|
||||
# QuantFactory/Qwen2.5-Gutenberg-Doppel-14B-GGUF
|
||||
This is quantized version of [nbeerbower/Qwen2.5-Gutenberg-Doppel-14B](https://huggingface.co/nbeerbower/Qwen2.5-Gutenberg-Doppel-14B) created using llama.cpp
|
||||
|
||||
# Original Model Card
|
||||
|
||||
|
||||

|
||||
|
||||
# Qwen2.5-Gutenberg-Doppel-14B
|
||||
|
||||
[Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) finetuned on [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1) and [nbeerbower/gutenberg2-dpo](https://huggingface.co/datasets/nbeerbower/gutenberg2-dpo).
|
||||
|
||||
### Method
|
||||
|
||||
[ORPO tuned](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html) with 4x A40 for 3 epochs.
|
||||
|
||||
Thank you [@ParasiticRogue](https://huggingface.co/ParasiticRogue) for sponsoring.
|
||||
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
||||
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_nbeerbower__Qwen2.5-Gutenberg-Doppel-14B)
|
||||
|
||||
| Metric |Value|
|
||||
|-------------------|----:|
|
||||
|Avg. |32.30|
|
||||
|IFEval (0-Shot) |80.91|
|
||||
|BBH (3-Shot) |48.24|
|
||||
|MATH Lvl 5 (4-Shot)| 0.00|
|
||||
|GPQA (0-shot) |11.07|
|
||||
|MuSR (0-shot) |10.02|
|
||||
|MMLU-PRO (5-shot) |43.57|
|
||||
|
||||
|
||||
1
configuration.json
Normal file
1
configuration.json
Normal file
@@ -0,0 +1 @@
|
||||
{"framework": "pytorch", "task": "others", "allow_remote": true}
|
||||
Reference in New Issue
Block a user