Gitea: Git with a cup of tea

myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-4-deberta-nli-reward

Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-4-deberta-nli-reward

Updated 2026-04-25 17:28:36 +08:00

fzkun/minimind3-ascend-dense

Model synced from source: fzkun/minimind3-ascend-dense

Updated 2026-04-25 17:28:25 +08:00

DCAgent/d1_trace_hints_top4_seq_glm47

Model synced from source: DCAgent/d1_trace_hints_top4_seq_glm47

Updated 2026-04-25 17:27:10 +08:00

mehuldamani/code_gen_arl-ast-addmultiply-7b-v1

Model synced from source: mehuldamani/code_gen_arl-ast-addmultiply-7b-v1

Updated 2026-04-25 16:54:04 +08:00

myfi/parser_model_ner_4.8

Model synced from source: myfi/parser_model_ner_4.8

Updated 2026-04-25 16:25:04 +08:00

longtermrisk/Qwen3-4B-Base-ftjob-6fd14d9c448d

Model synced from source: longtermrisk/Qwen3-4B-Base-ftjob-6fd14d9c448d

Updated 2026-04-25 16:24:09 +08:00

Huggggooo/ProtoCycle-7B

Model synced from source: Huggggooo/ProtoCycle-7B

Updated 2026-04-25 16:24:09 +08:00

mehuldamani/bug_fixing_arl-7b-addmultiply-v4

Model synced from source: mehuldamani/bug_fixing_arl-7b-addmultiply-v4

Updated 2026-04-25 16:19:14 +08:00

slovak-nlp/Qwen3-14B-sk

Model synced from source: slovak-nlp/Qwen3-14B-sk

Updated 2026-04-25 16:11:05 +08:00

xw1234gan/GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

Updated 2026-04-25 16:11:05 +08:00

haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5

Model synced from source: haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5

Updated 2026-04-25 16:06:14 +08:00

MInAlA/Qwen3-4B-Instruct-2507-GRPO-merged

Model synced from source: MInAlA/Qwen3-4B-Instruct-2507-GRPO-merged

Updated 2026-04-25 16:00:11 +08:00

myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-1-deberta-nli-reward

Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-1-deberta-nli-reward

Updated 2026-04-25 15:58:09 +08:00

hyunseoki/verl-math-transfer-7bi-to-3bi-fix07-pool7to1

Model synced from source: hyunseoki/verl-math-transfer-7bi-to-3bi-fix07-pool7to1

Updated 2026-04-25 15:56:50 +08:00

kdiabagate/qwen-7b-arabic-teaching-merged

Model synced from source: kdiabagate/qwen-7b-arabic-teaching-merged

Updated 2026-04-25 15:56:10 +08:00

Naahraf27/npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1

Model synced from source: Naahraf27/npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1

Updated 2026-04-25 15:30:16 +08:00

mremila/Llama-3.1-8B-precise_if

Model synced from source: mremila/Llama-3.1-8B-precise_if

Updated 2026-04-25 15:26:06 +08:00

psh3333/llama-3.2-3b-grpo-merged

Model synced from source: psh3333/llama-3.2-3b-grpo-merged

Updated 2026-04-25 15:17:04 +08:00

allenai/intent-aware-lfqa-qwen3-4b-intent-explicit

Model synced from source: allenai/intent-aware-lfqa-qwen3-4b-intent-explicit

Updated 2026-04-25 15:14:12 +08:00

haji80mr-uoft/corrected-semi-wtype-Llama-tuned-Lora-merged-gpt5

Model synced from source: haji80mr-uoft/corrected-semi-wtype-Llama-tuned-Lora-merged-gpt5

Updated 2026-04-25 15:03:51 +08:00