base_model, tags, license, language
base_model tags license language
beyoru/EvolLLM
text-generation-inference
transformers
qwen3
code
tool
agent
evolution
merge
RL
grpo
apache-2.0
en

This model is fine-tuned Qwen model using a custom reinforcement learning (RL) framework that rewards the model for producing solutions passing automated test cases — similar to the process of programming task evaluation on LeetCode.

Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.

Description
Model synced from source: beyoru/MinCoder-4B-Expert
Readme 2 MiB
Languages
Jinja 100%