Compare commits

...

10 Commits

Author SHA1 Message Date
Richard Erkhov
c0b29f5816 uploaded readme 2024-10-27 22:24:18 +00:00
Richard Erkhov
c6beae1495 uploaded model 2024-10-27 22:24:16 +00:00
Richard Erkhov
a42c049510 uploaded model 2024-10-27 22:19:47 +00:00
Richard Erkhov
6017e08dce uploaded model 2024-10-27 22:15:36 +00:00
Richard Erkhov
0851a8d05c uploaded model 2024-10-27 22:11:05 +00:00
Richard Erkhov
d614af7fc8 uploaded model 2024-10-27 22:08:05 +00:00
Richard Erkhov
6ae3624442 uploaded model 2024-10-27 22:03:49 +00:00
Richard Erkhov
f544552414 uploaded model 2024-10-27 21:59:24 +00:00
Richard Erkhov
b8b1378463 uploaded model 2024-10-27 21:53:58 +00:00
Richard Erkhov
b642b3910b uploaded model 2024-10-27 21:48:51 +00:00
11 changed files with 123 additions and 0 deletions

9
.gitattributes vendored
View File

@@ -43,3 +43,12 @@ danube-ko-1.8b-base.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q4_K.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q5_K.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
danube-ko-1.8b-base.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

87
README.md Normal file
View File

@@ -0,0 +1,87 @@
Quantization made by Richard Erkhov.
[Github](https://github.com/RichardErkhov)
[Discord](https://discord.gg/pvy7H8DZMG)
[Request more models](https://github.com/RichardErkhov/quant_request)
danube-ko-1.8b-base - GGUF
- Model creator: https://huggingface.co/jjhsnail0822/
- Original model: https://huggingface.co/jjhsnail0822/danube-ko-1.8b-base/
| Name | Quant method | Size |
| ---- | ---- | ---- |
| [danube-ko-1.8b-base.Q2_K.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q2_K.gguf) | Q2_K | 0.68GB |
| [danube-ko-1.8b-base.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q3_K_S.gguf) | Q3_K_S | 0.79GB |
| [danube-ko-1.8b-base.Q3_K.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q3_K.gguf) | Q3_K | 0.87GB |
| [danube-ko-1.8b-base.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q3_K_M.gguf) | Q3_K_M | 0.87GB |
| [danube-ko-1.8b-base.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q3_K_L.gguf) | Q3_K_L | 0.94GB |
| [danube-ko-1.8b-base.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.IQ4_XS.gguf) | IQ4_XS | 0.97GB |
| [danube-ko-1.8b-base.Q4_0.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q4_0.gguf) | Q4_0 | 1.01GB |
| [danube-ko-1.8b-base.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.IQ4_NL.gguf) | IQ4_NL | 1.02GB |
| [danube-ko-1.8b-base.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q4_K_S.gguf) | Q4_K_S | 1.01GB |
| [danube-ko-1.8b-base.Q4_K.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q4_K.gguf) | Q4_K | 1.06GB |
| [danube-ko-1.8b-base.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q4_K_M.gguf) | Q4_K_M | 1.06GB |
| [danube-ko-1.8b-base.Q4_1.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q4_1.gguf) | Q4_1 | 1.11GB |
| [danube-ko-1.8b-base.Q5_0.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q5_0.gguf) | Q5_0 | 1.21GB |
| [danube-ko-1.8b-base.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q5_K_S.gguf) | Q5_K_S | 1.21GB |
| [danube-ko-1.8b-base.Q5_K.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q5_K.gguf) | Q5_K | 1.24GB |
| [danube-ko-1.8b-base.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q5_K_M.gguf) | Q5_K_M | 1.24GB |
| [danube-ko-1.8b-base.Q5_1.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q5_1.gguf) | Q5_1 | 1.32GB |
| [danube-ko-1.8b-base.Q6_K.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q6_K.gguf) | Q6_K | 1.43GB |
| [danube-ko-1.8b-base.Q8_0.gguf](https://huggingface.co/RichardErkhov/jjhsnail0822_-_danube-ko-1.8b-base-gguf/blob/main/danube-ko-1.8b-base.Q8_0.gguf) | Q8_0 | 1.85GB |
Original model description:
---
license: apache-2.0
language:
- ko
- en
tags:
- h2o-danube2
- korean
- sLLM
- llm
---
## Model Details
danube-ko-1.8b-base is a continual pre-trained Korean language model based on [h2oai/h2o-danube2-1.8b-base](https://huggingface.co/h2oai/h2o-danube2-1.8b-base).
## Model Developers
Jinhong Jeong, Ungsang Yoon
## Model Architecture
The vocabulary size was expanded from original 32000 to 40000 to add Korean tokens efficiently. We used the [EEVE](https://arxiv.org/abs/2402.14714) technique for training. The model has sequence length of 2048. Everything else is the same as the original model.
## Training Datasets
We used CulturaX, Common Crawl CC-MAIN-2024-10, AI Hub Data, Korean Wikis, Corpora from National Institute of the Korean Language, Standard Korean Dictionary, etc. About 42GB of data was used for training.
## Model Benchmark
This model is ranked #1 in Ko-MMLU on the [Open Ko-LLM Leaderboard](https://huggingface.co/spaces/upstage/open-ko-llm-leaderboard) among pretrained Korean models of size 2B or smaller as of July 5, 2024.
| Task | Value |
| --- | --- |
| Ko-ARC | 31.74 |
| Ko-HellaSwag | 44.44 |
| Ko-MMLU | 28.06 |
| Ko-TruthfulQA | 41.63 |
| Ko-CommonGen V2 | 32.7 |
| kmmlu_direct | 29.05 |
| kobest | 59.13 |
## Disclaimer
The Model can generate information that is biased, discriminatory, socially inappropriate, etc. The Model can also generate information that is not accurate. The Model is used at your own risk, and the developers are not responsible for the information generated by the model.

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fbba2c4a1f72d6a872fd23c436f8c74cb9b307b62057414b7d46b8c256342892
size 1191448224

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ec76bf80aaf205314dd489eba521807b5d4bebec95d890ff96dcb7c4b867cc3d
size 1140657824

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6c3303645b30662f33bf149b55ef236d0f2ff510411f662437fc445328cc422a
size 1302050464

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9c8bc66bc1f2dcd45032c07a270a98ef8f3bada00e6e0689c76f17051f05c177
size 1412652704

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1b58a94c99c294c84b34a2a259f9a5226ea95fe9f60b5ae4741dffa3638ce9c2
size 1332862624

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1b58a94c99c294c84b34a2a259f9a5226ea95fe9f60b5ae4741dffa3638ce9c2
size 1332862624

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:547e572dde5d84f6c2a2aa862e5df898d8d0d5a7dce8c19e715cddcd8ba059bd
size 1302050464

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cf696f7da792c3bf269134558534b762290b9f236b1331832560f13ea05b4695
size 1537080224

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3ef9a6d24dd1728e19995864d31f6e90f578172b95c7bba75f291714c08f3087
size 1990463904