初始化项目,由ModelHub XC社区提供模型
Model: SeongryongJung/qwen2.5-0.5b-ifeval-halfepoch-sft Source: Original Platform
This commit is contained in:
24
README.md
Normal file
24
README.md
Normal file
@@ -0,0 +1,24 @@
|
||||
---
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
tags:
|
||||
- qwen
|
||||
- sft
|
||||
- ifeval
|
||||
- retaining-by-doing
|
||||
---
|
||||
|
||||
# Qwen2.5-0.5B Instruct IFEval Half-Epoch SFT
|
||||
|
||||
This checkpoint was trained from `Qwen2.5-0.5B-Instruct` on `IFEvalSFTDataset`.
|
||||
|
||||
Training setup:
|
||||
- `num_train_datapoints=4064`
|
||||
- `num_epochs=1`
|
||||
- effective half-epoch setting
|
||||
- learning rate `1e-4`
|
||||
- total batch size `64`
|
||||
|
||||
Observed local IFEval accuracy:
|
||||
- `0.4209445585`
|
||||
Reference in New Issue
Block a user