Files
qwen2.5-1.5b-ifeval-halfepo…/README.md
ModelHub XC 2026d3165b 初始化项目,由ModelHub XC社区提供模型
Model: SeongryongJung/qwen2.5-1.5b-ifeval-halfepoch-sft
Source: Original Platform
2026-04-22 13:17:07 +08:00

22 lines
369 B
Markdown

---
language:
- en
library_name: transformers
tags:
- qwen
- sft
- ifeval
- retaining-by-doing
---
# Qwen2.5-1.5B Instruct IFEval Half-Epoch SFT
This checkpoint was trained from `Qwen2.5-1.5B-Instruct` on `IFEvalSFTDataset`.
Training setup:
- `num_train_datapoints=4064`
- `num_epochs=1`
- effective half-epoch setting
- learning rate `1e-4`
- total batch size `64`