Update README.md

Jan (Homebrew Research)
2025-08-12 02:11:15 +00:00
committed by system
parent 18e9c48c70
commit 55f2815b87


@@ -24,17 +24,17 @@ Jan-v1 leverages the newly released [Qwen3-4B-thinking](https://huggingface.co/Q
 ## Performance
 ### Question Answering (SimpleQA)
-For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.2% accuracy.
+For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.1% accuracy.
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/abEitIjvszFm7Z8mRHQz-.png)
-*The 91.2% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.*
+*The 91.1% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.*
 ### Chat Benchmarks
 These benchmarks evaluate the model's conversational and instructional capabilities.
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/fX7BTAHaYdohIj4fbrIhM.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/f3bzNYRuA_iTFQIcvu6Rr.png)
 ## Quick Start