beyoru/MinCoder-4B-Expert: Model synced from source: beyoru/MinCoder-4B-Expert

ModelHub XC e45c8b6963 Initialize project; model provided by the ModelHub XC community

Model: beyoru/MinCoder-4B-Expert
Source: Original Platform

2026-05-06 15:11:18 +08:00

.gitattributes
added_tokens.json
chat_template.jinja
config.json
merges.txt
model-00001-of-00002.safetensors
model-00002-of-00002.safetensors
model.safetensors.index.json
README.md
special_tokens_map.json
tokenizer_config.json
tokenizer.json
vocab.json

README.md

base_model: beyoru/EvolLLM
tags: text-generation-inference, transformers, qwen3, code, tool, agent, evolution, merge, RL, grpo
license: apache-2.0
language: en

This model is a fine-tuned Qwen model trained with a custom reinforcement learning (RL) framework that rewards the model for producing solutions that pass automated test cases, similar to how programming tasks are evaluated on LeetCode.

Instead of relying on labeled ground-truth answers, the model learns from test-case-based rewards, which promotes generalization and reasoning ability in algorithmic problem-solving.