初始化项目,由ModelHub XC社区提供模型

Model: jujuloaiza/Qwen3-1.7B-riddles
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-04-14 14:08:06 +08:00
commit 08fc937f0c
17 changed files with 476 additions and 0 deletions

31
README.md Normal file
View File

@@ -0,0 +1,31 @@
---
base_model: Qwen/Qwen3-1.7B
tags:
- peft
- lora
- text-generation
---
# Qwen3-1.7B-riddles
## Model Description
Fine-tuned from `Qwen/Qwen3-1.7B` using QLoRA (4-bit) with supervised fine-tuning.
## Training Details
- Dataset: `jujuloaiza/riddletraining`
- LoRA rank: 16, alpha: 32
- Epochs: 3, Learning rate: 0.0002
## Intended Use
This model is a test model used for the CS-394/594 class at DigiPen. It is still being updated and trained.
The model is designed to generate various riddles out of a list of topics that are targeted for various ages.
Upon generating the riddle, the model will also generate an answer but not state it. The user can input an answer, and the model will respond with Correct/Incorrect.
## Limitations
Limited amount of riddle topics.
Trained only on short answers for the riddle.
Mostly short riddles will be generated.
Model is strict on the answers, and can't evaluate the user's answer respectably.