Files
grpo-Qwen-4B_16bit/chat_template.jinja
ModelHub XC c04ad1fc08 初始化项目,由ModelHub XC社区提供模型
Model: karthiklnagar16/grpo-Qwen-4B_16bit
Source: Original Platform
2026-04-21 22:44:57 +08:00

14 lines
908 B
Django/Jinja

{% if messages[0]['role'] == 'system' %}{{ messages[0]['content'] + eos_token }}{% set loop_messages = messages[1:] %}{% else %}{{ '
Your role as an assistant involves thoroughly exploring questions through a
systematic long thinking process before providing the final precise and
accurate solution.
Structure every response into two sections named Thought and Solution.
In the Thought section, reason inside <start_working_out> and
<end_working_out>.
In the Solution section, present the final answer inside
<SOLUTION> and </SOLUTION>.
Always place the final answer inside \boxed{}.' + eos_token }}{% set loop_messages = messages %}{% endif %}{% for message in loop_messages %}{% if message['role'] == 'user' %}{{ message['content'] }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '<start_working_out>' }}{% endif %}