初始化项目,由ModelHub XC社区提供模型
Model: Eclipse-Senpai/KeyLM-75M-Instruct-GGUF Source: Original Platform
This commit is contained in:
36
.gitattributes
vendored
Normal file
36
.gitattributes
vendored
Normal file
@@ -0,0 +1,36 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
KeyLM-75M-Instruct.F16.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
3
KeyLM-75M-Instruct.F16.gguf
Normal file
3
KeyLM-75M-Instruct.F16.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4c258bc317340147eb13f7455900560f958ed6f1f723fa58c90090cbad443a56
|
||||
size 150979840
|
||||
52
README.md
Normal file
52
README.md
Normal file
@@ -0,0 +1,52 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
language:
|
||||
- en
|
||||
base_model: Eclipse-Senpai/KeyLM-75M-Instruct
|
||||
base_model_relation: quantized
|
||||
pipeline_tag: text-generation
|
||||
library_name: gguf
|
||||
tags:
|
||||
- keylm
|
||||
- gguf
|
||||
- llama.cpp
|
||||
- small-language-model
|
||||
- instruct
|
||||
---
|
||||
|
||||
# KeyLM-75M-Instruct-GGUF
|
||||
|
||||
GGUF builds of [**KeyLM-75M-Instruct**](https://huggingface.co/Eclipse-Senpai/KeyLM-75M-Instruct) for `llama.cpp`, LM Studio, Ollama, and other GGUF runtimes.
|
||||
|
||||
KeyLM is a 75M-parameter instruction-tuned language model trained from scratch on approximately 18 billion tokens. See the [main model card](https://huggingface.co/Eclipse-Senpai/KeyLM-75M-Instruct) for benchmarks, training details, limitations, and the `transformers` (safetensors) version.
|
||||
|
||||
## Files
|
||||
|
||||
| File | Quant | Size | Notes |
|
||||
|---|---|---|---|
|
||||
| `KeyLM-75M-Instruct.F16.gguf` | F16 | ~144 MB | Full precision and recommended. The model is already tiny, so there is little reason to quantize further. |
|
||||
|
||||
## Run with llama.cpp
|
||||
|
||||
```bash
|
||||
# straight from the Hub
|
||||
llama-cli -hf Eclipse-Senpai/KeyLM-75M-Instruct-GGUF -cnv
|
||||
|
||||
# or a local file
|
||||
llama-cli -m KeyLM-75M-Instruct.F16.gguf -cnv
|
||||
```
|
||||
|
||||
The chat template (`User:` / `Assistant:`, assistant turns ending with `</s>`) is embedded in the GGUF, so conversation mode (`-cnv`) applies it automatically.
|
||||
|
||||
## LM Studio / Ollama
|
||||
|
||||
- **LM Studio:** load the `.gguf`; the embedded chat template is detected automatically.
|
||||
- **Ollama:** `ollama run hf.co/Eclipse-Senpai/KeyLM-75M-Instruct-GGUF`
|
||||
|
||||
## Notes & limitations
|
||||
|
||||
KeyLM is a tiny model: good at simple instruction following and short chat, near random chance on knowledge/reasoning benchmarks. It is not a factual assistant. Full numbers and caveats are on the [main model card](https://huggingface.co/Eclipse-Senpai/KeyLM-75M-Instruct).
|
||||
|
||||
## License
|
||||
|
||||
Apache 2.0.
|
||||
Reference in New Issue
Block a user