初始化项目,由ModelHub XC社区提供模型

Model: karthiklnagar16/grpo-Qwen-4B_16bit
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-04-21 22:44:57 +08:00
commit c04ad1fc08
13 changed files with 152251 additions and 0 deletions

14
chat_template.jinja Normal file
View File

@@ -0,0 +1,14 @@
{% if messages[0]['role'] == 'system' %}{{ messages[0]['content'] + eos_token }}{% set loop_messages = messages[1:] %}{% else %}{{ '
Your role as an assistant involves thoroughly exploring questions through a
systematic long thinking process before providing the final precise and
accurate solution.
Structure every response into two sections named Thought and Solution.
In the Thought section, reason inside <start_working_out> and
<end_working_out>.
In the Solution section, present the final answer inside
<SOLUTION> and </SOLUTION>.
Always place the final answer inside \boxed{}.' + eos_token }}{% set loop_messages = messages %}{% endif %}{% for message in loop_messages %}{% if message['role'] == 'user' %}{{ message['content'] }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '<start_working_out>' }}{% endif %}