---
license: apache-2.0
base_model: teknium/OpenHermes-2.5-Mistral-7B
datasets:
- sayhan/strix-philosophy-qa
library_name: transformers
---
# OpenHermes 2.5 Stix Philosophy Mistral 7B
- Finetuned by: sayhan
- License: apache-2.0
- Finetuned from model: teknium/OpenHermes-2.5-Mistral-7B
- Dataset: sayhan/strix-philosophy-qa
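A minimal sketch of loading and prompting the finetune with `transformers`. The repository id below is a placeholder assumption (the card does not state the Hub id); substitute the actual model repository. OpenHermes 2.5 models use the ChatML prompt format, which `apply_chat_template` handles.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# NOTE: placeholder repo id (assumption) -- replace with this finetune's actual Hub id
model_id = "sayhan/OpenHermes-2.5-Stix-Philosophy-Mistral-7B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Format the prompt with the model's chat template (ChatML for OpenHermes 2.5)
messages = [{"role": "user", "content": "What is the Stoic view of fate?"}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```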
## Training details
- LoRA rank: 8
- LoRA alpha: 16
- LoRA dropout: 0
- Rank-stabilized LoRA: Yes
- Number of epochs: 3
- Learning rate: 1e-5
- Batch size: 2
- Gradient accumulation steps: 4
- Weight decay: 0.01
- Target modules:
  - Query projection (`q_proj`)
  - Key projection (`k_proj`)
  - Value projection (`v_proj`)
  - Output projection (`o_proj`)
  - Gate projection (`gate_proj`)
  - Up projection (`up_proj`)
  - Down projection (`down_proj`)
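The hyperparameters listed above map onto `peft` and `transformers` configuration objects roughly as follows. This is a reconstruction from the listed values, not the author's actual training script; details such as `output_dir` are assumptions.

```python
from peft import LoraConfig
from transformers import TrainingArguments

# LoRA settings mirroring the values listed above
lora_config = LoraConfig(
    r=8,                     # LoRA rank
    lora_alpha=16,
    lora_dropout=0.0,
    use_rslora=True,         # rank-stabilized LoRA (scales by alpha / sqrt(r))
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
    task_type="CAUSAL_LM",
)

# Optimizer / schedule settings mirroring the values listed above
training_args = TrainingArguments(
    num_train_epochs=3,
    learning_rate=1e-5,
    per_device_train_batch_size=2,
    gradient_accumulation_steps=4,
    weight_decay=0.01,
    output_dir="outputs",    # assumption: not stated on the card
)
```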
## Description
