初始化项目,由ModelHub XC社区提供模型
Model: migtissera/Tess-XS-v1-3-yarn-128K Source: Original Platform
This commit is contained in:
25
README.md
Normal file
25
README.md
Normal file
@@ -0,0 +1,25 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
---
|
||||
|
||||
# Note:
|
||||
This version is the stable release. The issues that were present in versions 1.0, 1.1 and 1.2 all have been rectified. Thank you for your patience while R&D was conducted. Enjoy!
|
||||
|
||||
This model have been tested on context length up to 16K. Model produced slight repetition around 16K context length. I recommend testing the model to your usecase and limiting the context length.
|
||||
|
||||
Here's my learnings going from Tess-v1.0 to Tess-v1.3: https://migel.substack.com/p/learnings-from-training-tess
|
||||
|
||||
# Tess
|
||||
|
||||

|
||||
|
||||
Tess, short for Tesoro (Treasure in Italian), is a general purpose Large Language Model series. Tess-XS-v1.3 was trained on the Nous Research Mistral-7B-yarn-128K base.
|
||||
|
||||
|
||||
# Prompt Format:
|
||||
|
||||
```
|
||||
SYSTEM: <ANY SYSTEM CONTEXT>
|
||||
USER:
|
||||
ASSISTANT:
|
||||
```
|
||||
Reference in New Issue
Block a user