ModelHub XC b2e95c0e9c Initialize project; model provided by the ModelHub XC community

Model: Gen-Verse/ReasonFlux-Coder-7B
Source: Original Platform

2026-06-24 16:08:13 +08:00

Files (all added in the initial commit, 2026-06-24 16:08:13 +08:00):

.gitattributes
added_tokens.json
config.json
configuration.json
generation_config.json
merges.txt
model-00001-of-00004.safetensors
model-00002-of-00004.safetensors
model-00003-of-00004.safetensors
model-00004-of-00004.safetensors
model.safetensors.index.json
README.md
special_tokens_map.json
tokenizer_config.json
tokenizer.json
vocab.json

README.md

license: mit
library_name: transformers

Introduction to our ReasonFlux-Coders

We introduce ReasonFlux-Coders, trained with CURE, our algorithm for co-evolving an LLM's coding and unit test generation abilities.

ReasonFlux-Coder-7B and ReasonFlux-Coder-14B outperform similarly sized Qwen Coders, DeepSeek Coders, and Seed-Coders, and naturally integrate into common test-time scaling and agentic coding pipelines.
ReasonFlux-Coder-4B is our Long-CoT model, outperforming Qwen3-4B while achieving 64.8% efficiency in unit test generation. We have demonstrated its ability to serve as a reward model for training base models via reinforcement learning (see our paper).
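
To make the test-time scaling use case concrete, here is a minimal sketch of best-of-N selection, where several candidate solutions are scored against generated unit tests and the top scorer is kept. It is a generic illustration, not the authors' pipeline: the names `passes` and `select_best`, and the representation of a unit test as an `(args, expected)` pair, are assumptions made for this example. In a real setup the candidates and tests would come from the model as source text and be executed in a sandbox.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical types for this sketch: a candidate is any callable solution,
# and a unit test is an (args, expected_output) pair.
Candidate = Callable[..., object]
UnitTest = Tuple[tuple, object]


def passes(candidate: Candidate, test: UnitTest) -> bool:
    """Return True if the candidate returns the expected output for this test."""
    args, expected = test
    try:
        return candidate(*args) == expected
    except Exception:
        # A candidate that raises simply fails the test.
        return False


def select_best(candidates: Sequence[Candidate], tests: Sequence[UnitTest]) -> int:
    """Return the index of the candidate that passes the most tests.

    Ties go to the earliest candidate.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    scores = [sum(passes(c, t) for t in tests) for c in candidates]
    return max(range(len(candidates)), key=lambda i: scores[i])


# Toy usage: two hand-written "candidates" for an addition task.
def add_buggy(a, b):
    return a - b


def add_ok(a, b):
    return a + b


tests = [((1, 2), 3), ((0, 0), 0), ((-1, 1), 0)]
best = select_best([add_buggy, add_ok], tests)
```

Here `add_buggy` passes only the `(0, 0)` test while `add_ok` passes all three, so `best` is the index of `add_ok`. The quality of the selection depends directly on the quality of the unit tests, which is why a model that generates both code and tests fits this kind of pipeline.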

Paper | Code

Citation

@article{wang2025cure,
  title={Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning},
  author={Wang, Yinjie and Yang, Ling and Tian, Ye and Shen, Ke and Wang, Mengdi},
  journal={arXiv preprint arXiv:2506.03136},
  year={2025}
}