Files
Mistral-3B-Instruct-v0.2-init/README.md
ModelHub XC ec4e1febed 初始化项目,由ModelHub XC社区提供模型
Model: Aryanne/Mistral-3B-Instruct-v0.2-init
Source: Original Platform
2026-05-30 08:32:17 +08:00

10 lines
371 B
Markdown

---
license: apache-2.0
inference: false
---
# Info
This is the model [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) which I cut all the intermediate(feed_forward_length) size with 14336 down to 3072, resulting in a ~2.81B model.
It's necessary to pre-train this model, cause at the moment is generating just gibberish.