diff --git a/README.md b/README.md index 33cbb2e..066a95b 100644 --- a/README.md +++ b/README.md @@ -87,7 +87,7 @@ pipeline_tag: text-generation - [Models](#models) - [Chat models](#chat-models) - [Base models](#base-models) - - [Other info](#other-info) + - [Model info](#model-info) - [News](#news) - [How to use Yi?](#how-to-use-yi) - [Quick start](#quick-start) @@ -276,11 +276,35 @@ Yi-6B-200K | • [🤗 Hugging Face](https://huggingface.co/01-ai/Yi-6B-200K) - For chat and base models - Model | Intro | Default context window | Pretrained tokens | Training Data Date - |---|---|---|---|--- - 6B series models |They are suitable for personal and academic use. | 4K | 3T | Up to June 2023 - 9B model| It is the best at coding and math in the Yi series models.|4K | Yi-9B is continuously trained based on Yi-6B, using 0.8T tokens. | Up to June 2023 - 34B series models | They are suitable for personal, academic, and commercial (particularly for small and medium-sized enterprises) purposes. It's a cost-effective solution that's affordable and equipped with emergent ability.|4K | 3T | Up to June 2023 +
| Model | +Intro | +Default context window | +Pretrained tokens | +Training Data Date | +
|---|---|---|---|---|
| 6B series models | +They are suitable for personal and academic use. | +4K | +3T | +Up to June 2023 | +
| 9B series models | +It is the best at coding and math in the Yi series models. | +Yi-9B is continuously trained based on Yi-6B, using 0.8T tokens. | +||
| 34B series models | +They are suitable for personal, academic, and commercial (particularly for small and medium-sized enterprises) purposes. It's a cost-effective solution that's affordable and equipped with emergent ability. | +3T | +