初始化项目,由ModelHub XC社区提供模型
Model: mit-oasys/rlm-qwen3-8b-v0.1 Source: Original Platform
This commit is contained in:
14
README.md
Normal file
14
README.md
Normal file
@@ -0,0 +1,14 @@
|
||||
---
|
||||
license: mit
|
||||
language:
|
||||
- en
|
||||
base_model:
|
||||
- Qwen/Qwen3-8B
|
||||
---
|
||||
## RLM-Qwen3-8B-v0.1
|
||||
|
||||
A small, post-trained Qwen3-8B model from the experiments in the "Recursive Language Models" paper: [https://arxiv.org/abs/2512.24601](https://arxiv.org/abs/2512.24601).
|
||||
|
||||
The model was trained on trajectories produced using a fixed system prompt. It assumes the environment/scaffold from our RLM repo.
|
||||
|
||||
We recommend using vLLM with our inference code at https://github.com/alexzhang13/rlm to use it out of the box.
|
||||
Reference in New Issue
Block a user