commit f759f6b186eca9bdae6e2cf41184e545aa2a5d4f
Author: ModelHub XC
Date:   Sun Apr 26 13:22:15 2026 +0800

    Initialize project; model provided by the ModelHub XC community
    Model: ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF
    Source: Original Platform

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..c1ccb78
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,36 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
diff --git a/LICENSE b/LICENSE
new file mode 100644
index 0000000..fe02576
--- /dev/null
+++ b/LICENSE
@@ -0,0 +1,22 @@
+
+ MIT License
+
+Copyright (c) 2025 Hayk Grigorian
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
\ No newline at end of file
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..a5e8db3
--- /dev/null
+++ b/README.md
@@ -0,0 +1,74 @@
+---
+base_model: haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B
+language:
+- en
+library_name: transformers
+license: mit
+datasets:
+- postgrammar/london-llm-1800
+quantized_by: ncky
+tags:
+- text-generation-inference
+- transformers
+- llama
+- gguf
+- historical
+---
+## About
+
+static and imatrix-assisted GGUF quants of https://huggingface.co/haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B.
+
+Generated with `llama.cpp` build `8044` (`91ea5d67f`).
+
+`IQ4_XS` was quantized with an imatrix generated on 19th-century public-domain English text.
+
+Note: this model has FFN dimensions (`5504`) not divisible by `256`, so `llama.cpp` applied fallback quantization to 22 tensors for K/IQ quant types.
+
+## Base Model Info (from original model card)
+
+Source: https://huggingface.co/haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B
+
+| Detail | Value |
+| :--- | :--- |
+| Model Architecture | LlamaForCausalLM (decoder-only transformer) |
+| Parameter Count | ~1.22B |
+| Training Type | Trained from scratch (random initialization) |
+| Tokenizer | Custom BPE, vocab size 32,000 |
+| Sequence Length | 2048 |
+| Attention Type | Grouped Query Attention (16 Q heads / 8 KV heads) |
+| Hidden Size | 2048 |
+| Intermediate Size | 5504 |
+| Layers | 22 |
+
+Training details reported by the source model card:
+- Final training loss: 3.3951
+- Start training loss: 10.7932
+- Training steps: 182,000
+- Epochs: 0.4997
+- Training time: 117h 51m
+- Reported training cost: $340.97 on an H100 SXM (RunPod)
+
+## Usage
+
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details.
+
+## Provided Quants
+
+(sorted by size, not necessarily quality)
+
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf) | Q2_K | 0.5 | smallest |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf) | Q3_K_S | 0.6 | low VRAM |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf) | Q3_K_M | 0.6 | balanced low size |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf) | Q3_K_L | 0.6 | better than Q3_K_M |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf) | IQ4_XS | 0.6 | imatrix, recommended at this size |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf) | Q4_K_S | 0.7 | fast, recommended |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf) | Q4_K_M | 0.7 | fast, recommended |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf) | Q5_K_S | 0.8 | higher quality |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf) | Q5_K_M | 0.9 | higher quality |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf) | Q6_K | 1.0 | very good quality |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf) | Q8_0 | 1.2 | fast, best quality |
+| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.f16.gguf) | f16 | 2.3 | 16 bpw, overkill |
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf b/TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf
new file mode 100644
index 0000000..39edf17
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d05c21b73302c793309c70ff73ba15dd2ebf07a762afb954229f724067d26564
+size 640110144
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf
new file mode 100644
index 0000000..71e30a1
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5b36c18295248f6304319eef6e2537df004d6141b0821ec97761a21e9ab83a4b
+size 483880256
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf
new file mode 100644
index 0000000..6566b8b
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:07288a61b20f4577d9964a3c66c35c1a77e5c9d0319bdf7d7dcab8704fe6e88a
+size 637148480
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf
new file mode 100644
index 0000000..ff2bfe7
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fbe04bb34267f0f5d921b586f8e12590044aa6a0e0aa2db19fc5da46ca817b70
+size 605576512
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf
new file mode 100644
index 0000000..94f677e
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3f73f828eabf9f003e57febb49e6904dd4226e04492c1fe2250d49769057ce96
+size 554966336
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf
new file mode 100644
index 0000000..29c6da1
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:27edbf86d3992f5f90b219536f7459bcaba367f8c8e1648525c2d391b6a24b02
+size 744983872
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf
new file mode 100644
index 0000000..a802c64
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fdb72343232ff405728226b59e8ca73e69a04e88cb71535cb140ee0a658b6b20
+size 699764032
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf
new file mode 100644
index 0000000..e8567bf
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8cc2a0f1ded7d8034d4ce5767823e45e5f47445388042620171ba9e1e51d28ca
+size 855608640
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf
new file mode 100644
index 0000000..dc40ce3
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:61b5552f2fc3a56bf531a02373d3ecf5bacf5b59359bb8b8f5be0db6d9b3647a
+size 817597760
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf
new file mode 100644
index 0000000..e98898c
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7ac40f9e341ee003e14a564b2ce53988c7e0dd494af1edb6c406344fff54a939
+size 1006435648
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf b/TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf
new file mode 100644
index 0000000..3820520
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a100a4b1c5fc71f3bab18c3d84c44ea35ce0fbce488c58ca151a18b190298fb1
+size 1225342272
diff --git a/TimeCapsuleLLM-v2-llama-1.2B.f16.gguf b/TimeCapsuleLLM-v2-llama-1.2B.f16.gguf
new file mode 100644
index 0000000..b21d9ff
--- /dev/null
+++ b/TimeCapsuleLLM-v2-llama-1.2B.f16.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:637ed650b1cb44dcc5cdbb12119e2fba1923a348550af759b2c7517a9c445989
+size 2305211712
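
The Git LFS pointer files in this commit each carry a `size` field in bytes, and the README's `Size/GB` column appears to be that value expressed in decimal gigabytes (10^9 bytes) and rounded to one decimal place. The sketch below illustrates that relationship for the `IQ4_XS` pointer. The helper names `parse_lfs_pointer` and `size_in_gb` are hypothetical, and the decimal-GB convention is an assumption inferred from the numbers, not something the README states.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def size_in_gb(pointer: dict) -> float:
    """Convert the pointer's byte size to decimal GB (assumed: 1 GB = 10**9 bytes)."""
    return round(int(pointer["size"]) / 1e9, 1)


# Pointer content copied from the IQ4_XS entry in this commit.
IQ4_XS_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d05c21b73302c793309c70ff73ba15dd2ebf07a762afb954229f724067d26564
size 640110144
"""

pointer = parse_lfs_pointer(IQ4_XS_POINTER)
algorithm, _, digest = pointer["oid"].partition(":")
print(algorithm, digest[:12], size_in_gb(pointer))
```

The same conversion applied to the other pointers can be used to cross-check the size column; the digest in `oid` is what a downloaded file's SHA-256 should match.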