KodaLite-1.3B-mlx/README.md

---
language: en
license: apache-2.0
library_name: mlx
pipeline_tag: text-generation
tags:
  - text-generation
  - mlx
  - apple-silicon
base_model: YoAbriel/KodaLite-1.3B
---

# KodaLite-1.3B — MLX (fp16)

MLX version of [YoAbriel/KodaLite-1.3B](https://huggingface.co/YoAbriel/KodaLite-1.3B), optimized for Apple Silicon (M1/M2/M3/M4).

**Size**: ~2.5 GB | **Precision**: bfloat16

## Usage

```bash
pip install mlx-lm
```

```python
from mlx_lm import load, generate

model, tok = load("YoAbriel/KodaLite-1.3B-mlx")
prompt = tok.apply_chat_template(
    [{"role": "user", "content": "What is the capital of France?"}],
    tokenize=False,
    add_generation_prompt=True,
)
print(generate(model, tok, prompt=prompt, max_tokens=80))
```

Or from the command line:

```bash
mlx_lm.generate --model YoAbriel/KodaLite-1.3B-mlx \
  --prompt "<|user|>\nHello\n<|assistant|>\n" --max-tokens 80
```

## Other quantizations

- [YoAbriel/KodaLite-1.3B-mlx-8bit](https://huggingface.co/YoAbriel/KodaLite-1.3B-mlx-8bit) — 1.4 GB, 8-bit

## Limitations

Small model (1.27B params), undertrained (1.64B tokens). See the [base model card](https://huggingface.co/YoAbriel/KodaLite-1.3B) for full details.

## License

Apache 2.0
初始化项目，由ModelHub XC社区提供模型 Model: YoAbriel/KodaLite-1.3B-mlx Source: Original Platform 2026-05-29 23:17:30 +08:00			`---`
			`language: en`
			`license: apache-2.0`
			`library_name: mlx`
			`pipeline_tag: text-generation`
			`tags:`
			`- text-generation`
			`- mlx`
			`- apple-silicon`
			`base_model: YoAbriel/KodaLite-1.3B`
			`---`

			`# KodaLite-1.3B — MLX (fp16)`

			`MLX version of [YoAbriel/KodaLite-1.3B](https://huggingface.co/YoAbriel/KodaLite-1.3B), optimized for Apple Silicon (M1/M2/M3/M4).`

			`Size: ~2.5 GB \| Precision: bfloat16`

			`## Usage`

			```bash
			`pip install mlx-lm`
			```

			```python
			`from mlx_lm import load, generate`

			`model, tok = load("YoAbriel/KodaLite-1.3B-mlx")`
			`prompt = tok.apply_chat_template(`
			`[{"role": "user", "content": "What is the capital of France?"}],`
			`tokenize=False,`
			`add_generation_prompt=True,`
			`)`
			`print(generate(model, tok, prompt=prompt, max_tokens=80))`
			```

			`Or from the command line:`

			```bash
			`mlx_lm.generate --model YoAbriel/KodaLite-1.3B-mlx \`
			`--prompt "<\|user\|>\nHello\n<\|assistant\|>\n" --max-tokens 80`
			```

			`## Other quantizations`

			`- [YoAbriel/KodaLite-1.3B-mlx-8bit](https://huggingface.co/YoAbriel/KodaLite-1.3B-mlx-8bit) — 1.4 GB, 8-bit`

			`## Limitations`

			`Small model (1.27B params), undertrained (1.64B tokens). See the [base model card](https://huggingface.co/YoAbriel/KodaLite-1.3B) for full details.`

			`## License`

			`Apache 2.0`