Continued-SFT of openeurollm/OLMo-3-7B-Instruct-SFT on a 75/25 English:EU
mixture, the headline configuration of the paper Translate, Replay, Mix:
Exploring Multilingual Post-Training for Low-Resource European Languages.
Per-language Elo (cs / de / es / fi / fr / it / sv):
en
cs
de
es
fi
fr
it
sv
950 \pm 14
714 \pm 19
690 \pm 24
746 \pm 18
732 \pm 44
743 \pm 17
\mathbf{820 \pm 15}
722 \pm 35
A-75EN preserves English Elo within CI of baseline and improves on every EU
language except Swedish, with the largest gain on Italian. Full per-language
breakdown and the comparison to A-25EN are in the paper, Tables 2 and 3.
fromtransformersimportAutoTokenizer,AutoModelForCausalLMtok=AutoTokenizer.from_pretrained("openeurollm/OLMo-3-7B-Dolci-Translated-A-75EN")model=AutoModelForCausalLM.from_pretrained("openeurollm/OLMo-3-7B-Dolci-Translated-A-75EN",torch_dtype="bfloat16")# tok.chat_template is set; use tok.apply_chat_template(...) directly
Citation
Please cite the paper and the OLMo-3 family if you use this checkpoint.