Update README.md
- **License:** [apache-2.0](https://choosealicense.com/licenses/apache-2.0/)
- **Fine-tuned from model:** [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
- **Dataset:** [sayhan/strix-philosophy-qa](https://huggingface.co/datasets/sayhan/strix-philosophy-qa)
---

**LoRA rank:** 8

**LoRA alpha:** 16

**LoRA dropout:** 0

**Rank-stabilized LoRA:** Yes

**Number of epochs:** 3

**Learning rate:** 1e-5

**Batch size:** 2

**Gradient accumulation steps:** 4

**Weight decay:** 0.01

**Target modules:**

- Query projection (`q_proj`)
- Key projection (`k_proj`)
- Value projection (`v_proj`)
- Output projection (`o_proj`)
- Gate projection (`gate_proj`)
- Up projection (`up_proj`)
- Down projection (`down_proj`)
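The adapter settings above can be sketched as a Hugging Face `peft` configuration. This is an assumption about tooling, since the card lists hyperparameters but not the training code, so the snippet below is a reconstruction rather than the author's script:

```python
# Hypothetical reconstruction of this card's adapter settings with
# Hugging Face `peft`; the card does not say which library was used.
from peft import LoraConfig

lora_config = LoraConfig(
    r=8,               # LoRA rank
    lora_alpha=16,     # LoRA alpha
    lora_dropout=0.0,  # LoRA dropout
    use_rslora=True,   # rank-stabilized LoRA scaling
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
)
```

The remaining values (3 epochs, learning rate 1e-5, batch size 2, gradient accumulation 4, weight decay 0.01) would map onto the trainer's settings, e.g. `transformers.TrainingArguments`.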
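Since rank-stabilized LoRA is enabled, a short note on what it changes: standard LoRA scales the adapter update by `alpha / r`, while rsLoRA scales it by `alpha / sqrt(r)`, which keeps the update magnitude stable as the rank grows. A minimal sketch with this card's values:

```python
import math

rank, alpha = 8, 16  # values from this card

standard_scaling = alpha / rank           # plain LoRA: 16 / 8 = 2.0
rslora_scaling = alpha / math.sqrt(rank)  # rsLoRA: 16 / sqrt(8), about 5.66
```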