Ministral-3-3B-Instruct-251…/README.md

---
base_model: mistralai/Ministral-3-3B-Instruct-2512-BF16
library_name: transformers
pipeline_tag: text-generation
tags:
- llama
- ministral
- text-only
---

# Ministral-3 3B Instruct BF16 Llama Text

Text-only Llama-compatible conversion of
`mistralai/Ministral-3-3B-Instruct-2512-BF16`.

What changed:

- dropped vision and multimodal projector tensors
- stripped the Mistral3 `language_model.` wrapper from text weights
- wrote a plain `LlamaForCausalLM` config
- kept tokenizer assets and chat template with the corrected regex
- kept chat formatting in `chat_template.jinja`
- removed the strict user/assistant alternation assertion from the template
- left tokenizer loading on the generic fast backend, not `LlamaTokenizerFast`

Verification:

- reference: `mistralai/Ministral-3-3B-Instruct-2512-BF16`
- candidate: this checkpoint
- dataset: `/home/alvion/valve/services/training/datasets/think2-2025-12-07_gpt-5.4_reasoning.jsonl`
- rows: 3
- max length: 512
- tokenizer IDs: identical
- worst KL: 0
- worst logit diff: 0
- plain candidate `AutoTokenizer` matched the fixed reference tokenizer

Conversion tools: https://github.com/cascade-tech-ai/mistral-convert