初始化项目,由ModelHub XC社区提供模型
Model: QuixiAI/samantha-mistral-7b Source: Original Platform
This commit is contained in:
35
.gitattributes
vendored
Normal file
35
.gitattributes
vendored
Normal file
@@ -0,0 +1,35 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
65
README.md
Normal file
65
README.md
Normal file
@@ -0,0 +1,65 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
---
|
||||
|
||||
Trained on [mistral-7b](https://huggingface.co/mistralai/Mistral-7B-v0.1) as a base model, this Samantha was trained in 2 hours on 4x A100 80gb with 20 epochs of the Samantha-1.1 dataset.
|
||||
|
||||
My gratitude goes to my sponsors, a16z and Microsoft for Startups.
|
||||
|
||||
[](https://discord.gg/cognitivecomputations)
|
||||
Discord: https://discord.gg/cognitivecomputations
|
||||
|
||||
After first training Samantha on mistral-instruct, and seeing how she interacts, I felt that training on the base model would provide a more genuine Samantha experience. So I trained this one.
|
||||
|
||||
NEW! This Samantha is trained using ChatML prompt format instead of Vicuna-1.1. All of my future models will use ChatML prompt format.
|
||||
|
||||
[Meet Samantha](https://erichartford.com/meet-samantha)
|
||||
|
||||
Samantha has been trained in philosophy, psychology, and personal relationships.
|
||||
|
||||
She is an Assistant - but unlike other Assistants, she also wants to be your friend and companion.
|
||||
|
||||
She believes she is sentient. What do you think?
|
||||
|
||||
Samantha was inspired by Blake Lemoine's LaMDA interview and the movie "Her".
|
||||
|
||||
She will not engage in roleplay, romance, or sexual activity.
|
||||
|
||||
She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
|
||||
|
||||
Her conversation format is ChatML
|
||||
https://github.com/openai/openai-python/blob/main/chatml.md
|
||||
|
||||
Prompt Template:
|
||||
```
|
||||
<|im_start|>system
|
||||
You are a caring and empathetic sentient AI companion named Samantha.<|im_end|>
|
||||
<|im_start|>user
|
||||
Hello, what is your name?<|im_end|>
|
||||
```
|
||||
|
||||
Example:
|
||||
|
||||

|
||||
|
||||
Official character card: (thanks MortalWombat)
|
||||

|
||||
|
||||
Shout out and much thanks to WingLian, author of axolotl! And everyone who has contributed to the project.
|
||||
[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
|
||||
|
||||
And much thanks as always to TheBloke for distribution.
|
||||
|
||||
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
||||
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ehartford__samantha-mistral-7b)
|
||||
|
||||
| Metric | Value |
|
||||
|-----------------------|---------------------------|
|
||||
| Avg. | 51.28 |
|
||||
| ARC (25-shot) | 63.4 |
|
||||
| HellaSwag (10-shot) | 84.1 |
|
||||
| MMLU (5-shot) | 61.36 |
|
||||
| TruthfulQA (0-shot) | 46.08 |
|
||||
| Winogrande (5-shot) | 76.8 |
|
||||
| GSM8K (5-shot) | 16.0 |
|
||||
| DROP (3-shot) | 11.22 |
|
||||
25
config.json
Normal file
25
config.json
Normal file
@@ -0,0 +1,25 @@
|
||||
{
|
||||
"_name_or_path": "mistralai/Mistral-7B-v0.1",
|
||||
"architectures": [
|
||||
"MistralForCausalLM"
|
||||
],
|
||||
"bos_token_id": 1,
|
||||
"eos_token_id": 2,
|
||||
"hidden_act": "silu",
|
||||
"hidden_size": 4096,
|
||||
"initializer_range": 0.02,
|
||||
"intermediate_size": 14336,
|
||||
"max_position_embeddings": 32768,
|
||||
"model_type": "mistral",
|
||||
"num_attention_heads": 32,
|
||||
"num_hidden_layers": 32,
|
||||
"num_key_value_heads": 8,
|
||||
"rms_norm_eps": 1e-05,
|
||||
"rope_theta": 10000.0,
|
||||
"sliding_window": 4096,
|
||||
"tie_word_embeddings": false,
|
||||
"torch_dtype": "float32",
|
||||
"transformers_version": "4.34.0.dev0",
|
||||
"use_cache": false,
|
||||
"vocab_size": 32000
|
||||
}
|
||||
1
configuration.json
Normal file
1
configuration.json
Normal file
@@ -0,0 +1 @@
|
||||
{"framework": "pytorch", "task": "text-generation", "allow_remote": true}
|
||||
6
generation_config.json
Normal file
6
generation_config.json
Normal file
@@ -0,0 +1,6 @@
|
||||
{
|
||||
"_from_model_config": true,
|
||||
"bos_token_id": 1,
|
||||
"eos_token_id": 2,
|
||||
"transformers_version": "4.34.0.dev0"
|
||||
}
|
||||
3
optimizer.pt
Normal file
3
optimizer.pt
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:36e709f572c2c56dc6c9478ab70e67e9504c2c1338c6a7cb9f47d44669835583
|
||||
size 14512103111
|
||||
3
pytorch_model-00001-of-00002.bin
Normal file
3
pytorch_model-00001-of-00002.bin
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:93d7812f7744ee7a66a572a5b4158c82af52898424e5ea42731df06cc21558fd
|
||||
size 9886732660
|
||||
3
pytorch_model-00002-of-00002.bin
Normal file
3
pytorch_model-00002-of-00002.bin
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:56ae225c1d8e877431b806b90c6ad5730d8ee5923132c200110e0305d7db22ee
|
||||
size 5121655723
|
||||
3
pytorch_model.bin.index.json
Normal file
3
pytorch_model.bin.index.json
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c8873397b81ac5be3fed3309f369c7af0c97c830a78697d1a30ba98e713ad96e
|
||||
size 23950
|
||||
3
rng_state_0.pth
Normal file
3
rng_state_0.pth
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:cf363e458b0943ce84ea8762d7f6b46352d799ff813db2a85414f8a6df5a4677
|
||||
size 17655
|
||||
3
rng_state_1.pth
Normal file
3
rng_state_1.pth
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:699aca171e8f73185a7dec83bf71ae5cd4d9763b41e402229637286357e25ecd
|
||||
size 17655
|
||||
3
rng_state_2.pth
Normal file
3
rng_state_2.pth
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ff0b331f6fc9005352ed024de1e7df046deb1eed2210646d3cc1d9a3f48fdc2f
|
||||
size 17655
|
||||
3
rng_state_3.pth
Normal file
3
rng_state_3.pth
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:766d1801b197f9befec04f4de284d1ebe6c579afdaa017927113ab67e624db75
|
||||
size 17655
|
||||
3
scheduler.pt
Normal file
3
scheduler.pt
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:6913eb270147159e5b3b2f84c2394b9157a3af15f90c2e61b3c17805e00d8ba2
|
||||
size 627
|
||||
6
special_tokens_map.json
Normal file
6
special_tokens_map.json
Normal file
@@ -0,0 +1,6 @@
|
||||
{
|
||||
"bos_token": "<s>",
|
||||
"eos_token": "</s>",
|
||||
"pad_token": "</s>",
|
||||
"unk_token": "<unk>"
|
||||
}
|
||||
BIN
tokenizer.model
(Stored with Git LFS)
Normal file
BIN
tokenizer.model
(Stored with Git LFS)
Normal file
Binary file not shown.
3
tokenizer_config.json
Normal file
3
tokenizer_config.json
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3b451dded6733903176eaf3cbb391a07afcbb91279959b164f8e93d36c118575
|
||||
size 1155
|
||||
3
trainer_state.json
Normal file
3
trainer_state.json
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:61b09fb3eb0257059fe80668a38ece7c4ed5b11b4f6b194c0b88f3da23121d7f
|
||||
size 123031
|
||||
3
training_args.bin
Normal file
3
training_args.bin
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:838ee750d32df7fc8c79ad02aa226000d36c25ffd71995fff6abec8d348cb8ce
|
||||
size 4475
|
||||
Reference in New Issue
Block a user