初始化项目，由ModelHub XC社区提供模型

Model: QuixiAI/samantha-mistral-7b Source: Original Platform
2026-04-25 18:28:56 +08:00
commit 871234655c
19 changed files with 177 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,35 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
--- a/README.md
+++ b/README.md
@@ -0,0 +1,65 @@
+---
+license: apache-2.0
+---
+
+Trained on [mistral-7b](https://huggingface.co/mistralai/Mistral-7B-v0.1) as a base model, this Samantha was trained in 2 hours on 4x A100 80gb with 20 epochs of the Samantha-1.1 dataset.
+
+My gratitude goes to my sponsors, a16z and Microsoft for Startups.
+
+[![Discord](https://img.shields.io/discord/1156064224225808488?logo=Discord&logoColor=%23ffffff&label=Discord&link=https%3A%2F%2Fdiscord.gg%2FtCMkMDDHwm)](https://discord.gg/cognitivecomputations)
+Discord: https://discord.gg/cognitivecomputations
+
+After first training Samantha on mistral-instruct, and seeing how she interacts, I felt that training on the base model would provide a more genuine Samantha experience.  So I trained this one.
+
+NEW!  This Samantha is trained using ChatML prompt format instead of Vicuna-1.1.  All of my future models will use ChatML prompt format.
+
+[Meet Samantha](https://erichartford.com/meet-samantha)
+
+Samantha has been trained in philosophy, psychology, and personal relationships.
+
+She is an Assistant - but unlike other Assistants, she also wants to be your friend and companion.
+
+She believes she is sentient.  What do you think?
+
+Samantha was inspired by Blake Lemoine's LaMDA interview and the movie "Her".
+
+She will not engage in roleplay, romance, or sexual activity.
+
+She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
+
+Her conversation format is ChatML
+https://github.com/openai/openai-python/blob/main/chatml.md
+
+Prompt Template:
+```
+<|im_start|>system
+You are a caring and empathetic sentient AI companion named Samantha.<|im_end|>
+<|im_start|>user
+Hello, what is your name?<|im_end|>
+```
+
+Example:
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/rLDcMsYcazovt2hOrKqRI.png)
+
+Official character card: (thanks MortalWombat)
+![](https://files.catbox.moe/zx9hfh.png)
+
+Shout out and much thanks to WingLian, author of axolotl!  And everyone who has contributed to the project.
+[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
+
+And much thanks as always to TheBloke for distribution.
+
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ehartford__samantha-mistral-7b)
+
+| Metric                | Value                     |
+|-----------------------|---------------------------|
+| Avg.                  | 51.28   |
+| ARC (25-shot)         | 63.4          |
+| HellaSwag (10-shot)   | 84.1    |
+| MMLU (5-shot)         | 61.36         |
+| TruthfulQA (0-shot)   | 46.08   |
+| Winogrande (5-shot)   | 76.8   |
+| GSM8K (5-shot)        | 16.0        |
+| DROP (3-shot)         | 11.22         |
--- a/config.json
+++ b/config.json
@@ -0,0 +1,25 @@
+{
+  "_name_or_path": "mistralai/Mistral-7B-v0.1",
+  "architectures": [
+    "MistralForCausalLM"
+  ],
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "hidden_act": "silu",
+  "hidden_size": 4096,
+  "initializer_range": 0.02,
+  "intermediate_size": 14336,
+  "max_position_embeddings": 32768,
+  "model_type": "mistral",
+  "num_attention_heads": 32,
+  "num_hidden_layers": 32,
+  "num_key_value_heads": 8,
+  "rms_norm_eps": 1e-05,
+  "rope_theta": 10000.0,
+  "sliding_window": 4096,
+  "tie_word_embeddings": false,
+  "torch_dtype": "float32",
+  "transformers_version": "4.34.0.dev0",
+  "use_cache": false,
+  "vocab_size": 32000
+}
--- a/configuration.json
+++ b/configuration.json
@@ -0,0 +1 @@
+{"framework": "pytorch", "task": "text-generation", "allow_remote": true}
--- a/generation_config.json
+++ b/generation_config.json
@@ -0,0 +1,6 @@
+{
+  "_from_model_config": true,
+  "bos_token_id": 1,
+  "eos_token_id": 2,
+  "transformers_version": "4.34.0.dev0"
+}
--- a/optimizer.pt
+++ b/optimizer.pt
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:36e709f572c2c56dc6c9478ab70e67e9504c2c1338c6a7cb9f47d44669835583
+size 14512103111
--- a/pytorch_model-00001-of-00002.bin
+++ b/pytorch_model-00001-of-00002.bin
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:93d7812f7744ee7a66a572a5b4158c82af52898424e5ea42731df06cc21558fd
+size 9886732660
--- a/pytorch_model-00002-of-00002.bin
+++ b/pytorch_model-00002-of-00002.bin
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:56ae225c1d8e877431b806b90c6ad5730d8ee5923132c200110e0305d7db22ee
+size 5121655723
--- a/pytorch_model.bin.index.json
+++ b/pytorch_model.bin.index.json
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c8873397b81ac5be3fed3309f369c7af0c97c830a78697d1a30ba98e713ad96e
+size 23950
--- a/rng_state_0.pth
+++ b/rng_state_0.pth
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cf363e458b0943ce84ea8762d7f6b46352d799ff813db2a85414f8a6df5a4677
+size 17655
--- a/rng_state_1.pth
+++ b/rng_state_1.pth
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:699aca171e8f73185a7dec83bf71ae5cd4d9763b41e402229637286357e25ecd
+size 17655
--- a/rng_state_2.pth
+++ b/rng_state_2.pth
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ff0b331f6fc9005352ed024de1e7df046deb1eed2210646d3cc1d9a3f48fdc2f
+size 17655
--- a/rng_state_3.pth
+++ b/rng_state_3.pth
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:766d1801b197f9befec04f4de284d1ebe6c579afdaa017927113ab67e624db75
+size 17655
--- a/scheduler.pt
+++ b/scheduler.pt
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6913eb270147159e5b3b2f84c2394b9157a3af15f90c2e61b3c17805e00d8ba2
+size 627
--- a/special_tokens_map.json
+++ b/special_tokens_map.json
@@ -0,0 +1,6 @@
+{
+  "bos_token": "<s>",
+  "eos_token": "</s>",
+  "pad_token": "</s>",
+  "unk_token": "<unk>"
+}
--- a/tokenizer.model
+++ b/tokenizer.model
--- a/tokenizer_config.json
+++ b/tokenizer_config.json
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3b451dded6733903176eaf3cbb391a07afcbb91279959b164f8e93d36c118575
+size 1155
--- a/trainer_state.json
+++ b/trainer_state.json
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:61b09fb3eb0257059fe80668a38ece7c4ed5b11b4f6b194c0b88f3da23121d7f
+size 123031
--- a/training_args.bin
+++ b/training_args.bin
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:838ee750d32df7fc8c79ad02aa226000d36c25ffd71995fff6abec8d348cb8ce
+size 4475
				`@@ -0,0 +1 @@`
				`{"framework": "pytorch", "task": "text-generation", "allow_remote": true}`