Update README.md
This commit is contained in:
committed by
system
parent
dc7b958acf
commit
0b8d1279e2
@@ -34,12 +34,7 @@ For question-answering, Jan-v1 shows a significant performance gain from model s
|
||||
|
||||
These benchmarks evaluate the model's conversational and instructional capabilities.
|
||||
|
||||
| Benchmark | JanV1 (Ours) | Qwen3-4B-Thinking-2507 | GPT-OSS-20B (High) | GPT-OSS-20B (Low) |
|
||||
| :--- | :--- | :--- | :--- | :--- |
|
||||
| EQBench | **83.61** | 82.61 | 78.35 | 78.35 |
|
||||
| CreativeWriting | **72.08** | 65.74 | 30.23 | 26.38 |
|
||||
| IFBench | **Prompt:** 0.3537<br>**Instruction:** 0.3910 | Prompt: 0.4490<br>Instruction: **0.4806** | Prompt: 0.5646<br>Instruction: 0.6000 | Prompt: 0.5034<br>Instruction: 0.5403 |
|
||||
| ArenaHardv2 | **25.3** | - | - | - |
|
||||

|
||||
|
||||
## Quick Start
|
||||
|
||||
|
||||
Reference in New Issue
Block a user