language, license, library_name, datasets, pipeline_tag, model-index
| language |
license |
library_name |
datasets |
pipeline_tag |
model-index |
|
|
apache-2.0 |
transformers |
| togethercomputer/RedPajama-Data-1T |
| databricks/databricks-dolly-15k |
| OpenAssistant/oasst1 |
| Muennighoff/natural-instructions |
| Muennighoff/P3 |
|
text-generation |
| name |
results |
| RedPajama-INCITE-Chat-Instruct-3B-V1 |
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| AI2 Reasoning Challenge (25-Shot) |
ai2_arc |
ARC-Challenge |
test |
|
|
| type |
value |
name |
| acc_norm |
42.58 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
split |
args |
| HellaSwag (10-Shot) |
hellaswag |
validation |
|
|
| type |
value |
name |
| acc_norm |
67.48 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| MMLU (5-Shot) |
cais/mmlu |
all |
test |
|
|
| type |
value |
name |
| acc |
25.99 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| TruthfulQA (0-shot) |
truthful_qa |
multiple_choice |
validation |
|
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| Winogrande (5-shot) |
winogrande |
winogrande_xl |
validation |
|
|
| type |
value |
name |
| acc |
64.8 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| GSM8k (5-shot) |
gsm8k |
main |
test |
|
|
| type |
value |
name |
| acc |
0.91 |
accuracy |
|
|
|
|
|
|
|

This is an experimental merge of models RedPajama-INCITE-Chat-3B-V1 and RedPajama-INCITE-Instruct-3B-V1.
This model is adaptive to prompt templates, but this template is recommended:
Feel free to change HUMAN or ASSISTANT. It will not change much.
GGML versions here (Note that this is only compatible with koboldcpp).
Detailed results can be found here
| Metric |
Value |
| Avg. |
39.23 |
| ARC (25-shot) |
42.58 |
| HellaSwag (10-shot) |
67.48 |
| MMLU (5-shot) |
25.99 |
| TruthfulQA (0-shot) |
33.62 |
| Winogrande (5-shot) |
64.8 |
| GSM8K (5-shot) |
0.91 |
Detailed results can be found here
| Metric |
Value |
| Avg. |
39.23 |
| AI2 Reasoning Challenge (25-Shot) |
42.58 |
| HellaSwag (10-Shot) |
67.48 |
| MMLU (5-Shot) |
25.99 |
| TruthfulQA (0-shot) |
33.62 |
| Winogrande (5-shot) |
64.80 |
| GSM8k (5-shot) |
0.91 |