初始化项目,由ModelHub XC社区提供模型
Model: YoAbriel/KodaLite-1.3B-mlx Source: Original Platform
This commit is contained in:
54
README.md
Normal file
54
README.md
Normal file
@@ -0,0 +1,54 @@
|
||||
---
|
||||
language: en
|
||||
license: apache-2.0
|
||||
library_name: mlx
|
||||
pipeline_tag: text-generation
|
||||
tags:
|
||||
- text-generation
|
||||
- mlx
|
||||
- apple-silicon
|
||||
base_model: YoAbriel/KodaLite-1.3B
|
||||
---
|
||||
|
||||
# KodaLite-1.3B — MLX (fp16)
|
||||
|
||||
MLX version of [YoAbriel/KodaLite-1.3B](https://huggingface.co/YoAbriel/KodaLite-1.3B), optimized for Apple Silicon (M1/M2/M3/M4).
|
||||
|
||||
**Size**: ~2.5 GB | **Precision**: bfloat16
|
||||
|
||||
## Usage
|
||||
|
||||
```bash
|
||||
pip install mlx-lm
|
||||
```
|
||||
|
||||
```python
|
||||
from mlx_lm import load, generate
|
||||
|
||||
model, tok = load("YoAbriel/KodaLite-1.3B-mlx")
|
||||
prompt = tok.apply_chat_template(
|
||||
[{"role": "user", "content": "What is the capital of France?"}],
|
||||
tokenize=False,
|
||||
add_generation_prompt=True,
|
||||
)
|
||||
print(generate(model, tok, prompt=prompt, max_tokens=80))
|
||||
```
|
||||
|
||||
Or from the command line:
|
||||
|
||||
```bash
|
||||
mlx_lm.generate --model YoAbriel/KodaLite-1.3B-mlx \
|
||||
--prompt "<|user|>\nHello\n<|assistant|>\n" --max-tokens 80
|
||||
```
|
||||
|
||||
## Other quantizations
|
||||
|
||||
- [YoAbriel/KodaLite-1.3B-mlx-8bit](https://huggingface.co/YoAbriel/KodaLite-1.3B-mlx-8bit) — 1.4 GB, 8-bit
|
||||
|
||||
## Limitations
|
||||
|
||||
Small model (1.27B params), undertrained (1.64B tokens). See the [base model card](https://huggingface.co/YoAbriel/KodaLite-1.3B) for full details.
|
||||
|
||||
## License
|
||||
|
||||
Apache 2.0
|
||||
Reference in New Issue
Block a user