Compare commits: 263ed8eb1f...main (10 commits)

| SHA1 |
|---|
| ccd7b54d9f |
| b17df1328a |
| d49be2d4f8 |
| 7c75df30ed |
| b83f9e21f0 |
| f52ce40934 |
| db6a8f3b99 |
| d678c617a0 |
| 6720fca4a6 |
| 29ae7551e1 |
.gitattributes (vendored, +3)

@@ -53,3 +53,6 @@ Turkish-Llama-8b-Instruct-v0.1.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 Turkish-Llama-8b-Instruct-v0.1.Q5_K.gguf filter=lfs diff=lfs merge=lfs -text
 Turkish-Llama-8b-Instruct-v0.1.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 Turkish-Llama-8b-Instruct-v0.1.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Turkish-Llama-8b-Instruct-v0.1.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+Turkish-Llama-8b-Instruct-v0.1.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
+Turkish-Llama-8b-Instruct-v0.1-F16.gguf filter=lfs diff=lfs merge=lfs -text
README.md (new file, +110)

---
base_model: ytu-ce-cosmos/Turkish-Llama-8b-Instruct-v0.1
license: llama3
language:
- tr
- en
tags:
- gguf
- ggml
- llama3
- cosmosllama
- turkish llama
---

# CosmosLLaMa GGUFs

## Objective

Due to the need for quantized models in real-time applications, we introduce our GGUF-formatted models. These models are part of the GGML project, in the hope of democratizing the use of large models. Depending on the quantization type, there are 20+ models.
### Features

* All quantization details are listed on the right by Hugging Face.
* All the models have been tested in `llama.cpp` environments, with both `llama-cli` and `llama-server`.
* Furthermore, a YouTube video introduces the basics of using `lmstudio` with these models. 👇

[Watch the introduction video on YouTube](https://www.youtube.com/watch?v=JRID-6sRl7I)

### Code Example

Usage example with `llama-cpp-python`:

```py
from llama_cpp import Llama

# Define the inference parameters
inference_params = {
    "n_threads": 4,
    "n_predict": -1,
    "top_k": 40,
    "min_p": 0.05,
    "top_p": 0.95,
    "temp": 0.8,
    "repeat_penalty": 1.1,
    "input_prefix": "<|start_header_id|>user<|end_header_id|>\n\n",
    "input_suffix": "<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n",
    "antiprompt": [],
    # System prompt: "You are an AI assistant. The user will give you a task.
    # Your goal is to complete the task as faithfully as possible."
    "pre_prompt": "Sen bir yapay zeka asistanısın. Kullanıcı sana bir görev verecek. Amacın görevi olabildiğince sadık bir şekilde tamamlamak.",
    "pre_prompt_suffix": "<|eot_id|>",
    "pre_prompt_prefix": "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n",
    "seed": -1,
    "tfs_z": 1,
    "typical_p": 1,
    "repeat_last_n": 64,
    "frequency_penalty": 0,
    "presence_penalty": 0,
    "n_keep": 0,
    "logit_bias": {},
    "mirostat": 0,
    "mirostat_tau": 5,
    "mirostat_eta": 0.1,
    "memory_f16": True,
    "multiline_input": False,
    "penalize_nl": True
}

# Initialize the Llama model with the specified inference parameters
llama = Llama.from_pretrained(
    repo_id="ytu-ce-cosmos/Turkish-Llama-8b-Instruct-v0.1-GGUF",
    filename="*Q4_K.gguf",
    verbose=False
)

# Example input ("What is the capital of Türkiye?")
user_input = "Türkiyenin başkenti neresidir?"

# Construct the prompt: system turn (closed by pre_prompt_suffix), then user turn
prompt = f"{inference_params['pre_prompt_prefix']}{inference_params['pre_prompt']}{inference_params['pre_prompt_suffix']}{inference_params['input_prefix']}{user_input}{inference_params['input_suffix']}"

# Generate the response (the default max_tokens of 16 would truncate the answer)
response = llama(prompt, max_tokens=256)

# Output the response
print(response['choices'][0]['text'])
```

The quantization was done using `llama.cpp`; in our experience, this method tends to give the most stable results.

As expected, we observed better inference quality from the higher-bit models, while inference time tends to be similar across the low-bit models.

Each model's memory footprint can be anticipated from the quantization docs of either [Hugging Face](https://huggingface.co/docs/transformers/main/en/quantization/overview) or [llama.cpp](https://github.com/ggerganov/llama.cpp/tree/master/examples/quantize).
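As a rough cross-check, the effective bits per weight of each file can be estimated directly from the LFS sizes added in this commit. This is a sketch under one assumption: a parameter count of roughly 8.03B for Llama-3-8B (the exact count, plus GGUF metadata and mixed-precision tensors, makes these estimates approximate).

```python
# Estimate effective bits per weight from GGUF file sizes.
# N_PARAMS is an assumption (~8.03B for Llama-3-8B), so the results
# are rough estimates, not exact figures.
N_PARAMS = 8.03e9

def bits_per_weight(file_size_bytes: int) -> float:
    return file_size_bytes * 8 / N_PARAMS

# Sizes taken from the LFS pointer files in this commit
sizes = {
    "F16":  16_068_890_880,
    "Q8_0":  8_540_770_560,
    "Q6_K":  6_596_006_144,
}

for name, size in sizes.items():
    print(f"{name}: ~{bits_per_weight(size):.2f} bits/weight")
```

The estimates land near the nominal widths (about 16.0, 8.5, and 6.6 bits), so file size, and hence memory footprint, scales almost linearly with quantization width.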

# Acknowledgments

- Research supported with Cloud TPUs from [Google's TensorFlow Research Cloud](https://sites.research.google/trc/about/) (TFRC). Thanks for providing access to the TFRC ❤️
- Thanks to the generous support from the Hugging Face team, it is possible to download models from their S3 storage 🤗

# Citation

```bibtex
@inproceedings{kesgin2024optimizing,
  title={Optimizing Large Language Models for Turkish: New Methodologies in Corpus Selection and Training},
  author={Kesgin, H Toprak and Yuce, M Kaan and Dogan, Eren and Uzun, M Egemen and Uz, Atahan and {\.I}nce, Elif and Erdem, Yusuf and Shbib, Osama and Zeer, Ahmed and Amasyali, M Fatih},
  booktitle={2024 Innovations in Intelligent Systems and Applications Conference (ASYU)},
  pages={1--6},
  year={2024},
  organization={IEEE}
}
```

## Contact

COSMOS AI Research Group, Yildiz Technical University Computer Engineering Department

https://cosmos.yildiz.edu.tr/

cosmos@yildiz.edu.tr
Turkish-Llama-8b-Instruct-v0.1-F16.gguf (new file, +3)

version https://git-lfs.github.com/spec/v1
oid sha256:ae6366fbcbd5a7a20b05bece7e59c79b8199f74669c5f946f909b579eabf737c
size 16068890880
Turkish-Llama-8b-Instruct-v0.1.Q6_K.gguf (new file, +3)

version https://git-lfs.github.com/spec/v1
oid sha256:3ce9442d5a7fe21eb9f79df7b30c70dbd4e5e7e29ee10ae447a40b5595c7ea9e
size 6596006144
Turkish-Llama-8b-Instruct-v0.1.Q8_0.gguf (new file, +3)

version https://git-lfs.github.com/spec/v1
oid sha256:9e15b58cb80e3ef24d5acba418441b0c2bf69853d37403d2c4c5ea2b63b34de5
size 8540770560
cosmos_lm_studio.preset.json (new file, +49)

{
  "name": "cosmos_lm_studio",
  "load_params": {
    "n_ctx": 2048,
    "n_batch": 512,
    "rope_freq_base": 0,
    "rope_freq_scale": 0,
    "n_gpu_layers": 10,
    "use_mlock": true,
    "main_gpu": 0,
    "tensor_split": [
      0
    ],
    "seed": -1,
    "f16_kv": true,
    "use_mmap": true,
    "no_kv_offload": false,
    "num_experts_used": 0
  },
  "inference_params": {
    "n_threads": 4,
    "n_predict": -1,
    "top_k": 40,
    "min_p": 0.05,
    "top_p": 0.95,
    "temp": 0.8,
    "repeat_penalty": 1.1,
    "input_prefix": "<|start_header_id|>user<|end_header_id|>\\n\\n",
    "input_suffix": "<|eot_id|><|start_header_id|>assistant<|end_header_id|>\\n\\n",
    "antiprompt": [],
    "pre_prompt": "Sen bir yapay zeka asistanısın. Kullanıcı sana bir görev verecek. Amacın görevi olabildiğince sadık bir şekilde tamamlamak. Görevi yerine getirirken adım adım düşün ve adımlarını gerekçelendir.",
    "pre_prompt_suffix": "<|eot_id|>",
    "pre_prompt_prefix": "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\\n\\n",
    "seed": -1,
    "tfs_z": 1,
    "typical_p": 1,
    "repeat_last_n": 64,
    "frequency_penalty": 0,
    "presence_penalty": 0,
    "n_keep": 0,
    "logit_bias": {},
    "mirostat": 0,
    "mirostat_tau": 5,
    "mirostat_eta": 0.1,
    "memory_f16": true,
    "multiline_input": false,
    "penalize_nl": true
  }
}
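How the preset's prompt fields combine into a final prompt depends on the consuming tool; the sketch below assumes the literal `\n` sequences in the prefix/suffix fields are unescaped into real newlines (as `llama.cpp` does with its `-e` flag). The English system prompt is a stand-in for the preset's Turkish one ("You are an AI assistant. The user will give you a task. Your goal is to complete the task as faithfully as possible. Think step by step and justify your steps.").

```python
import codecs

# Prompt-related fields from the preset (subset; the system prompt is an
# English stand-in for the original Turkish text).
params = {
    "pre_prompt_prefix": "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\\n\\n",
    "pre_prompt": "You are an AI assistant.",
    "pre_prompt_suffix": "<|eot_id|>",
    "input_prefix": "<|start_header_id|>user<|end_header_id|>\\n\\n",
    "input_suffix": "<|eot_id|><|start_header_id|>assistant<|end_header_id|>\\n\\n",
}

def unescape(s: str) -> str:
    # Turn literal "\n" sequences into real newlines (ASCII-only fields).
    return codecs.decode(s, "unicode_escape")

def build_prompt(p: dict, user_input: str) -> str:
    # System turn, then user turn, then an open assistant header for generation.
    return (
        unescape(p["pre_prompt_prefix"]) + p["pre_prompt"] + p["pre_prompt_suffix"]
        + unescape(p["input_prefix"]) + user_input + unescape(p["input_suffix"])
    )

prompt = build_prompt(params, "Hello")
print(prompt)
```

This mirrors the prompt assembly in the README's `llama-cpp-python` example, with the preset's extended system prompt in place of the shorter one.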