commit c91aa1b5d4c56fbe8ab7ea4899bd1a8368a2209f Author: ModelHub XC Date: Sat Apr 18 06:05:24 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: afrideva/evolvedSeeker_1_3-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..557a4b9 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,41 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +evolvedseeker_1_3.q2_k.gguf filter=lfs diff=lfs merge=lfs -text +evolvedseeker_1_3.q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text +evolvedseeker_1_3.q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text +evolvedseeker_1_3.q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text +evolvedseeker_1_3.q6_k.gguf filter=lfs diff=lfs merge=lfs -text +evolvedseeker_1_3.q8_0.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..a89c270 --- /dev/null +++ b/README.md @@ -0,0 +1,95 @@ +--- +base_model: TokenBender/evolvedSeeker_1_3 +inference: false +model-index: +- name: evolvedSeeker-1_3_v_0_0_1 + results: [] +model_creator: TokenBender +model_name: evolvedSeeker_1_3 +pipeline_tag: text-generation +quantized_by: afrideva +tags: +- generated_from_trainer +- gguf +- ggml +- quantized +- q2_k +- q3_k_m +- q4_k_m +- q5_k_m +- q6_k +- q8_0 +--- +# TokenBender/evolvedSeeker_1_3-GGUF + +Quantized GGUF model files for [evolvedSeeker_1_3](https://huggingface.co/TokenBender/evolvedSeeker_1_3) from [TokenBender](https://huggingface.co/TokenBender) + + +| Name | Quant method | Size | +| ---- | ---- | ---- | +| [evolvedseeker_1_3.fp16.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.fp16.gguf) | fp16 | 2.69 GB | +| [evolvedseeker_1_3.q2_k.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.q2_k.gguf) | q2_k | 631.71 MB | +| [evolvedseeker_1_3.q3_k_m.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.q3_k_m.gguf) | q3_k_m | 704.97 MB | +| [evolvedseeker_1_3.q4_k_m.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.q4_k_m.gguf) | q4_k_m | 873.58 MB | +| [evolvedseeker_1_3.q5_k_m.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.q5_k_m.gguf) | q5_k_m | 1.00 GB | +| [evolvedseeker_1_3.q6_k.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.q6_k.gguf) | q6_k | 1.17 GB | +| [evolvedseeker_1_3.q8_0.gguf](https://huggingface.co/afrideva/evolvedSeeker_1_3-GGUF/resolve/main/evolvedseeker_1_3.q8_0.gguf) | q8_0 | 1.43 GB | + + + +## Original Model Card: +# evolvedSeeker-1_3 +EvolvedSeeker v0.0.1 (First phase) + +This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on 50k instructions for 3 epochs. + +I have mostly curated instructions from evolInstruct datasets and some portions of glaive coder. + +Around 3k answers were modified via self-instruct. + +Collaborate or Consult me - [Twitter](https://twitter.com/4evaBehindSOTA), [Discord](https://discord.gg/ftEM63pzs2) + +*Recommended format is ChatML, Alpaca will work but take care of EOT token* + +#### Chat Model Inference +```python +from transformers import AutoTokenizer, AutoModelForCausalLM +tokenizer = AutoTokenizer.from_pretrained("TokenBender/evolvedSeeker_1_3", trust_remote_code=True) +model = AutoModelForCausalLM.from_pretrained("TokenBender/evolvedSeeker_1_3", trust_remote_code=True).cuda() +messages=[ + { 'role': 'user', 'content': "write a program to reverse letters in each word in a sentence without reversing order of words in the sentence."} +] +inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device) +# 32021 is the id of <|EOT|> token +outputs = model.generate(inputs, max_new_tokens=512, do_sample=False, top_k=50, top_p=0.95, num_return_sequences=1, eos_token_id=32021) +print(tokenizer.decode(outputs[0][len(inputs[0]):], skip_special_tokens=True)) +``` + +## Model description + +First model of Project PIC (Partner-in-Crime) in 1.3B range. +Almost all the work is pending right now for this model hence v0.0.1 +![image/png](https://cdn-uploads.huggingface.co/production/uploads/6398bf222da24ee95b51c8d8/Fl-pRCsC_lvnuoP734hsJ.png) + +## Intended uses & limitations + +Superfast Copilot +Run near lossless quantized in 1G RAM. +Useful for code dataset curation and evaluation. + +Limitations - This is a smol model, so smol brain, may have crammed a few things. +Reasoning tests may fail beyond a certain point. + +## Training procedure +SFT + +### Training results +Humaneval Score - 68.29% +![image/png](https://cdn-uploads.huggingface.co/production/uploads/6398bf222da24ee95b51c8d8/AFp6PxZ9ZP_xti4VWjen3.png) + +### Framework versions + +- Transformers 4.35.2 +- Pytorch 2.0.1 +- Datasets 2.15.0 +- Tokenizers 0.15.0 \ No newline at end of file diff --git a/evolvedseeker_1_3.q2_k.gguf b/evolvedseeker_1_3.q2_k.gguf new file mode 100644 index 0000000..cf5d7f1 --- /dev/null +++ b/evolvedseeker_1_3.q2_k.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b6e3b52b230acc546ec2e26b46c460d71d990126f29e93d8c21c5d407d948867 +size 631706560 diff --git a/evolvedseeker_1_3.q3_k_m.gguf b/evolvedseeker_1_3.q3_k_m.gguf new file mode 100644 index 0000000..103d43a --- /dev/null +++ b/evolvedseeker_1_3.q3_k_m.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0c0f3a6c88154ba4d7f318de64f1e7a1342a2a80a4202a9a6009b567cac984f7 +size 704967616 diff --git a/evolvedseeker_1_3.q4_k_m.gguf b/evolvedseeker_1_3.q4_k_m.gguf new file mode 100644 index 0000000..68a9181 --- /dev/null +++ b/evolvedseeker_1_3.q4_k_m.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:12f798dd88e07219ac07bdce020009552a79ceba972660a2e0b954b0b696b47a +size 873583552 diff --git a/evolvedseeker_1_3.q5_k_m.gguf b/evolvedseeker_1_3.q5_k_m.gguf new file mode 100644 index 0000000..77ef4c3 --- /dev/null +++ b/evolvedseeker_1_3.q5_k_m.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d2417a495755dfbc743cd4d0807f84c5e3f01c0acb44cdcb2fab25394a9288ae +size 1001968576 diff --git a/evolvedseeker_1_3.q6_k.gguf b/evolvedseeker_1_3.q6_k.gguf new file mode 100644 index 0000000..5f478f6 --- /dev/null +++ b/evolvedseeker_1_3.q6_k.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bd1d65caf9d75f03cb94d43f3eb20c822c33624980359e9bfc3967fa743a7266 +size 1171665856 diff --git a/evolvedseeker_1_3.q8_0.gguf b/evolvedseeker_1_3.q8_0.gguf new file mode 100644 index 0000000..301387a --- /dev/null +++ b/evolvedseeker_1_3.q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3390aa6c1816aac01552ec51fa538646396359ca395e315fb61913206a5e6396 +size 1432220608