初始化项目,由ModelHub XC社区提供模型

Model: mit-oasys/rlm-qwen3-8b-v0.1
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-03 05:18:18 +08:00
commit 3655197c5b
16 changed files with 152358 additions and 0 deletions

14
README.md Normal file
View File

@@ -0,0 +1,14 @@
---
license: mit
language:
- en
base_model:
- Qwen/Qwen3-8B
---
## RLM-Qwen3-8B-v0.1
A small, post-trained Qwen3-8B model from the experiments in the "Recursive Language Models" paper: [https://arxiv.org/abs/2512.24601](https://arxiv.org/abs/2512.24601).
The model was trained on trajectories produced using a fixed system prompt. It assumes the environment/scaffold from our RLM repo.
We recommend using vLLM with our inference code at https://github.com/alexzhang13/rlm to use it out of the box.