
ModelHub XC ea1ff964fd Initialize project; model provided by the ModelHub XC community

Model: PKU-Alignment/s1-m_7b_beta
Source: Original Platform

2026-05-27 07:48:13 +08:00



S1-M-7B-Beta

🏠 Homepage | 👍 Our Official Code Repo | 🤗 S1-M Dataset (Beta)

S1-M-7B-Beta is used for developing the algorithm "Simple Test-time Scaling in Multimodal Reasoning". It was obtained by fine-tuning the base model Qwen/Qwen2-VL-7B-Instruct on data containing the thinking tags <think> and </think>. Through this fine-tuning the model acquired a "think first, then respond" paradigm, which makes it suitable for experiments on test-time scaling.
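Because the model is trained to emit its reasoning between <think> and </think> before the final response, downstream experiments usually need to separate the two parts. The following is a minimal sketch of such a post-processing step, not code from the official repository. The helper name `split_thinking` and the sample string are hypothetical, and the sketch assumes the generated text contains at most one <think>...</think> block followed by the response.

```python
import re
from typing import Tuple

# Assumption: the output looks like "<think>reasoning</think>response".
# The exact layout produced by S1-M-7B-Beta is not specified in this card.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(generated: str) -> Tuple[str, str]:
    """Split generated text into (thinking, response).

    If no complete <think>...</think> block is found, the thinking part
    is empty and the whole text is treated as the response.
    """
    match = _THINK_RE.search(generated)
    if match is None:
        return "", generated.strip()
    thinking = match.group(1).strip()
    # Everything after the closing tag is treated as the response.
    response = generated[match.end():].strip()
    return thinking, response


if __name__ == "__main__":
    # Hypothetical model output, used only for illustration.
    sample = (
        "<think>The image shows 3 apples and 2 pears, so 3 + 2 = 5.</think>"
        "There are 5 pieces of fruit."
    )
    thinking, response = split_thinking(sample)
    print("thinking:", thinking)
    print("response:", response)
```

Keeping the thinking span separate makes it possible to measure its length or truncate it, which is the kind of control a test-time scaling experiment typically needs.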

Note: The current model is a development version, not the final official version.
