diff --git a/README.md b/README.md
index d99bd1e..b3a7fc0 100644
--- a/README.md
+++ b/README.md
@@ -24,17 +24,17 @@ Jan-v1 leverages the newly released [Qwen3-4B-thinking](https://huggingface.co/Q
 ## Performance
 
 ### Question Answering (SimpleQA)
-For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.2% accuracy.
+For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.1% accuracy.
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/abEitIjvszFm7Z8mRHQz-.png)
 
-*The 91.2% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.*
+*The 91.1% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.*
 
 ### Chat Benchmarks
 
 These benchmarks evaluate the model's conversational and instructional capabilities.
 
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/fX7BTAHaYdohIj4fbrIhM.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/f3bzNYRuA_iTFQIcvu6Rr.png)
 
 ## Quick Start
 