初始化项目,由ModelHub XC社区提供模型
Model: QuantFactory/llama-3.1-Asian-Bllossom-8B-Translator-GGUF Source: Original Platform
This commit is contained in:
52
.gitattributes
vendored
Normal file
52
.gitattributes
vendored
Normal file
@@ -0,0 +1,52 @@
|
|||||||
|
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.model filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||||
|
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_4_8.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_8_8.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
151
README.md
Normal file
151
README.md
Normal file
@@ -0,0 +1,151 @@
|
|||||||
|
|
||||||
|
---
|
||||||
|
|
||||||
|
library_name: transformers
|
||||||
|
license: llama3.1
|
||||||
|
language:
|
||||||
|
- ko
|
||||||
|
- vi
|
||||||
|
- id
|
||||||
|
- km
|
||||||
|
- th
|
||||||
|
metrics:
|
||||||
|
- bleu
|
||||||
|
- rouge
|
||||||
|
base_model:
|
||||||
|
- meta-llama/Llama-3.1-8B-Instruct
|
||||||
|
|
||||||
|
---
|
||||||
|
|
||||||
|
[](https://hf.co/QuantFactory)
|
||||||
|
|
||||||
|
|
||||||
|
# QuantFactory/llama-3.1-Asian-Bllossom-8B-Translator-GGUF
|
||||||
|
This is quantized version of [MLP-KTLim/llama-3.1-Asian-Bllossom-8B-Translator](https://huggingface.co/MLP-KTLim/llama-3.1-Asian-Bllossom-8B-Translator) created using llama.cpp
|
||||||
|
|
||||||
|
# Original Model Card
|
||||||
|
|
||||||
|
|
||||||
|
# Model Card for Model ID
|
||||||
|
|
||||||
|
This model is a multilingual translation model fine-tuned on LLaMA 3.1 Instruct base model. It enables mutual translation between the following Southeast Asian languages:
|
||||||
|
|
||||||
|
- Korean
|
||||||
|
- Vietnamese
|
||||||
|
- Indonesian
|
||||||
|
- Cambodian (Khmer)
|
||||||
|
- Thai
|
||||||
|
|
||||||
|
## Acknowledgements
|
||||||
|
AICA <img src="https://aica-gj.kr/images/logo.png" width="20%" height="20%">
|
||||||
|
|
||||||
|
## Model Details
|
||||||
|
The model is designed for translating short text segments between any pair of the supported languages.
|
||||||
|
|
||||||
|
Supported language pairs:
|
||||||
|
|
||||||
|
- Korean ↔ Vietnamese
|
||||||
|
- Korean ↔ Indonesian
|
||||||
|
- Korean ↔ Cambodian
|
||||||
|
- Korean ↔ Thai
|
||||||
|
- Vietnamese ↔ Indonesian
|
||||||
|
- Vietnamese ↔ Cambodian
|
||||||
|
- Vietnamese ↔ Thai
|
||||||
|
- Indonesian ↔ Cambodian
|
||||||
|
- Indonesian ↔ Thai
|
||||||
|
- Cambodian ↔ Thai
|
||||||
|
|
||||||
|
### Model Description
|
||||||
|
|
||||||
|
This model is specifically optimized for Southeast Asian language translation needs, focusing on enabling communication between these specific language communities.
|
||||||
|
|
||||||
|
The extensive training data of 20M examples (1M for each translation direction) provides a robust foundation for handling common expressions and basic conversations across these languages.
|
||||||
|
|
||||||
|
### Model Architecture
|
||||||
|
|
||||||
|
Base Model: meta-llama/Llama-3.1-8B-Instruct
|
||||||
|
|
||||||
|
|
||||||
|
## Bias, Risks, and Limitations
|
||||||
|
|
||||||
|
- Performance is limited to short sentences and phrases
|
||||||
|
- May not handle complex or lengthy text effectively
|
||||||
|
- Translation quality may vary depending on language pair and content complexity
|
||||||
|
|
||||||
|
## Evaluation results
|
||||||
|
|
||||||
|
| Source Language | Target Language | BLEU Score | ROUGE-1 | ROUGE-L |
|
||||||
|
|----------------|-----------------|------------|---------|---------|
|
||||||
|
| Korean | Vietnamese | 56.70 | 81.64 | 76.66 |
|
||||||
|
| Korean | Cambodian | 71.69 | 89.26 | 88.20 |
|
||||||
|
| Korean | Indonesian | 58.32 | 80.39 | 76.63 |
|
||||||
|
| Korean | Thai | 63.26 | 78.88 | 72.29 |
|
||||||
|
| Vietnamese | Korean | 49.01 | 75.57 | 72.74 |
|
||||||
|
| Vietnamese | Cambodian | 78.26 | 90.74 | 90.32 |
|
||||||
|
| Vietnamese | Indonesian | 65.96 | 83.08 | 81.46 |
|
||||||
|
| Vietnamese | Thai | 65.93 | 81.09 | 76.57 |
|
||||||
|
| Cambodian | Korean | 49.10 | 72.67 | 69.75 |
|
||||||
|
| Cambodian | Vietnamese | 63.42 | 81.56 | 79.09 |
|
||||||
|
| Cambodian | Indonesian | 61.41 | 79.67 | 77.75 |
|
||||||
|
| Cambodian | Thai | 70.91 | 81.85 | 77.66 |
|
||||||
|
| Indonesian | Korean | 53.61 | 77.14 | 74.29 |
|
||||||
|
| Indonesian | Vietnamese | 68.21 | 85.41 | 83.10 |
|
||||||
|
| Indonesian | Cambodian | 78.84 | 90.81 | 90.35 |
|
||||||
|
| Indonesian | Thai | 67.12 | 81.54 | 77.19 |
|
||||||
|
| Thai | Korean | 45.59 | 72.48 | 69.46 |
|
||||||
|
| Thai | Vietnamese | 61.55 | 81.01 | 78.24 |
|
||||||
|
| Thai | Cambodian | 78.52 | 91.47 | 91.16 |
|
||||||
|
| Thai | Indonesian | 58.99 | 78.56 | 76.40 |
|
||||||
|
|
||||||
|
## Example
|
||||||
|
|
||||||
|
```py
|
||||||
|
from transformers import AutoModelForCausalLM, AutoTokenizer
|
||||||
|
|
||||||
|
model = AutoModelForCausalLM.from_pretrained(
|
||||||
|
"MLP-KTLim/llama-3.1-Asian-Bllossom-8B-Translator",
|
||||||
|
torch_dtype="auto",
|
||||||
|
device_map="auto",
|
||||||
|
)
|
||||||
|
|
||||||
|
tokenizer = AutoTokenizer.from_pretrained(
|
||||||
|
"MLP-KTLim/llama-3.1-Asian-Bllossom-8B-Translator",
|
||||||
|
)
|
||||||
|
|
||||||
|
input_text = "안녕하세요? 아시아 언어 번역 모델 입니다."
|
||||||
|
|
||||||
|
def get_input_ids(source_lang, target_lang, message):
|
||||||
|
assert source_lang in ["Korean", "Vietnamese", "Indonesian", "Thai", "Cambodian"]
|
||||||
|
assert target_lang in ["Korean", "Vietnamese", "Indonesian", "Thai", "Cambodian"]
|
||||||
|
|
||||||
|
input_ids = tokenizer.apply_chat_template(
|
||||||
|
conversation=[
|
||||||
|
{"role": "system", "content": f"You are a useful translation AI. Please translate the sentence given in {source_lang} into {target_lang}."},
|
||||||
|
{"role": "user", "content": message},
|
||||||
|
],
|
||||||
|
tokenize=True,
|
||||||
|
return_tensors="pt",
|
||||||
|
add_generation_prompt=True,
|
||||||
|
)
|
||||||
|
return input_ids
|
||||||
|
|
||||||
|
input_ids = get_input_ids(
|
||||||
|
source_lang="Korean",
|
||||||
|
target_lang="Vietnamese",
|
||||||
|
message=input_text,
|
||||||
|
)
|
||||||
|
|
||||||
|
output = model.generate(
|
||||||
|
input_ids.to(model.device),
|
||||||
|
max_new_tokens=128,
|
||||||
|
)
|
||||||
|
|
||||||
|
print(tokenizer.decode(output[0][len(input_ids[0]):], skip_special_tokens=True))
|
||||||
|
```
|
||||||
|
|
||||||
|
|
||||||
|
## Contributor
|
||||||
|
- 원인호 (wih1226@seoultech.ac.kr)
|
||||||
|
- 김민준 (mjkmain@seoultech.ac.kr)
|
||||||
|
|
||||||
|
|
||||||
1
configuration.json
Normal file
1
configuration.json
Normal file
@@ -0,0 +1 @@
|
|||||||
|
{"framework": "pytorch", "task": "others", "allow_remote": true}
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q2_K.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:8ce6cc376fe14246c75dfc2950f3e2ef7dab02d223120227b8d8c6b261f6435a
|
||||||
|
size 3179136352
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_L.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:a2d63db545c414e0abc5de575204f0167d6724d78a2342350fe809f98feeaa55
|
||||||
|
size 4321961312
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_M.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:45e235d6b8bf298862a031bc1ba238b3436e5ba5de0fe26a91810d54368ea2e5
|
||||||
|
size 4018922848
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_S.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:ebe9c4f006becd324c1559435ebb1fe7c5566d01410f38313c09c7fdc16c350d
|
||||||
|
size 3664504160
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:889b19180b29ca4bb0567426a44d4a0d717e1e09e4593463d03a1b8a24c5570f
|
||||||
|
size 4661216608
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_4_4.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_4_4.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:73be4caaaee2a624fa0155f926513408c59e1d9955d1e98e7285531372e1f011
|
||||||
|
size 4661216608
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_4_8.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_4_8.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:c8cd82427285627c7ede558a36f2c6f90fb4707116091e2df096d823849e449f
|
||||||
|
size 4661216608
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_8_8.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_0_8_8.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:f4544c4505bda00cd930184fcf2bf56fb0d9b343ddac941cd68b0504b0a57d7e
|
||||||
|
size 4661216608
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_1.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5210ebd599dd5968e858b49ccfa2538d6685c5de07d6e32b27cd5157e426ee94
|
||||||
|
size 5130257760
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_K_M.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:e996e6c9da143aa13770925f840b5f150b85bd057fd5c563a7e10ee621c0fe01
|
||||||
|
size 4920739168
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_K_S.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:8c111ab784eb742c1eb27158637ab92441c27ab7ebf4ed1d9948d38a08ef6635
|
||||||
|
size 4692673888
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_0.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5700fa0c48db5e996759ad38c0d0a5ac10d0059e252933f252ad1585d002d565
|
||||||
|
size 5599298912
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_1.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:71de48c86a8233b90e705cef65de2d4affa3d9e8c44ada2111169d36687aee81
|
||||||
|
size 6068340064
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_K_M.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5710fea256f67ce48cb2213b3444224c9521dd33d456b1341a8b53a286ee8828
|
||||||
|
size 5732992352
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_K_S.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5de5422d74760becd20d924ca4f42ecc8699ae0946c45484ff0930cd41215a71
|
||||||
|
size 5599298912
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q6_K.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:2ac05cc8867ae60b8206a2c942d09172d8079bbf658799cd57de6c67c02c80fa
|
||||||
|
size 6596011360
|
||||||
3
llama-3.1-Asian-Bllossom-8B-Translator.Q8_0.gguf
Normal file
3
llama-3.1-Asian-Bllossom-8B-Translator.Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:619ae723e4e3c56c25f04202982e612cf0846e0c2b59295bb91e258241b7fa37
|
||||||
|
size 8540775776
|
||||||
Reference in New Issue
Block a user