Initialize project; model provided by the ModelHub XC community
Model: ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF Source: Original Platform
36
.gitattributes
vendored
Normal file
@@ -0,0 +1,36 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gguf filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
22
LICENSE
Normal file
@@ -0,0 +1,22 @@

MIT License

Copyright (c) 2025 Hayk Grigorian

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
74
README.md
Normal file
@@ -0,0 +1,74 @@
---
base_model: haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B
language:
- en
library_name: transformers
license: mit
datasets:
- postgrammar/london-llm-1800
quantized_by: ncky
tags:
- text-generation-inference
- transformers
- llama
- gguf
- historical
---
## About

static and imatrix-assisted GGUF quants of https://huggingface.co/haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B.

Generated with `llama.cpp` build `8044` (`91ea5d67f`).

`IQ4_XS` was quantized with an imatrix generated on 19th-century public-domain English text.

Note: this model has FFN dimensions (`5504`) not divisible by `256`, so `llama.cpp` applied fallback quantization to 22 tensors for K/IQ quant types.
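
The divisibility issue in the note can be checked with simple arithmetic. The sketch below assumes that K and IQ quant types pack weights in super-blocks of 256 values along a tensor row, while legacy types such as `Q8_0` use blocks of 32; those block sizes and the helper name `block_fit` are assumptions for illustration and are not stated in this README.

```python
# Block sizes assumed for illustration (not stated in this README).
SUPER_BLOCK = 256   # assumed super-block size for K/IQ quant types
LEGACY_BLOCK = 32   # assumed block size for legacy types such as Q8_0


def block_fit(row_size: int, block: int) -> tuple[int, int]:
    """Return (full blocks, leftover values) when a row is split into blocks."""
    return divmod(row_size, block)


ffn_dim = 5504  # FFN / intermediate size quoted in the note above

print(block_fit(ffn_dim, SUPER_BLOCK))   # (21, 128): half a super-block is left over
print(block_fit(ffn_dim, LEGACY_BLOCK))  # (172, 0): divides evenly
```

A row of 5504 values leaves 128 values that do not fill a 256-wide super-block, which is consistent with a fallback to a smaller block size. The count of 22 would match one affected tensor per layer, possibly the FFN down-projections whose input dimension is 5504, but the README does not name the tensors.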

## Base Model Info (from original model card)

Source: https://huggingface.co/haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B

| Detail | Value |
| :--- | :--- |
| Model Architecture | LlamaForCausalLM (decoder-only transformer) |
| Parameter Count | ~1.22B |
| Training Type | Trained from scratch (random initialization) |
| Tokenizer | Custom BPE, vocab size 32,000 |
| Sequence Length | 2048 |
| Attention Type | Grouped Query Attention (16 Q heads / 8 KV heads) |
| Hidden Size | 2048 |
| Intermediate Size | 5504 |
| Layers | 22 |
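
To illustrate what the grouped-query layout in the table means for memory, the sketch below estimates the KV-cache size at the full 2048-token sequence length. It assumes the head dimension equals the hidden size divided by the number of query heads and that the cache is stored as 16-bit floats; neither is stated in the model card, and the function name is hypothetical.

```python
def kv_cache_bytes(layers: int, kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_value: int = 2) -> int:
    """Bytes needed to cache keys and values for one sequence."""
    # The factor 2 covers the separate key and value tensors in every layer.
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_value


# Values from the table above.
hidden, q_heads, kv_heads, layers, seq_len = 2048, 16, 8, 22, 2048
head_dim = hidden // q_heads  # assumption: head_dim = hidden / query heads

gqa = kv_cache_bytes(layers, kv_heads, head_dim, seq_len)
mha = kv_cache_bytes(layers, q_heads, head_dim, seq_len)  # hypothetical 16 KV heads

print(f"GQA (8 KV heads):  {gqa / 2**20:.0f} MiB")
print(f"MHA (16 KV heads): {mha / 2**20:.0f} MiB")
```

Under these assumptions the cache takes 176 MiB with 8 KV heads, half of the 352 MiB that 16 KV heads would need.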

Training details reported by the source model card:
- Final training loss: 3.3951
- Start training loss: 10.7932
- Training steps: 182,000
- Epochs: 0.4997
- Training time: 117h 51m
- Reported training cost: $340.97 on an H100 SXM (RunPod)
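
The reported figures allow a few derived rates. The sketch below is plain arithmetic on the numbers listed above; the results are estimates, and treating the step count as scaling linearly with epochs is an assumption rather than something the model card states.

```python
# Figures reported by the source model card.
steps = 182_000
epochs = 0.4997
hours = 117 + 51 / 60   # 117h 51m
cost_usd = 340.97

cost_per_hour = cost_usd / hours
steps_per_hour = steps / hours
steps_per_epoch = steps / epochs  # assumes steps scale linearly with epochs

print(f"cost per hour:   ${cost_per_hour:.2f}")
print(f"steps per hour:  {steps_per_hour:.0f}")
print(f"steps per epoch: {steps_per_epoch:.0f}")
```

This works out to roughly $2.89 per hour and about 1,544 steps per hour, with a full epoch corresponding to about 364,000 steps under the linear-scaling assumption.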

## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details.
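
As a minimal sketch of one way to run these files from Python, the example below downloads a quant with `huggingface_hub` and loads it with `llama-cpp-python`. Both packages are assumptions (this README does not name a runtime), as are the helper names, the prompt, and the sampling settings. Since nothing in the card indicates instruction tuning, the sketch asks for a plain text continuation.

```python
REPO_ID = "ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF"
BASE_NAME = "TimeCapsuleLLM-v2-llama-1.2B"


def gguf_filename(quant: str) -> str:
    """Build the file name used in this repo, e.g. for 'Q4_K_M'."""
    return f"{BASE_NAME}.{quant}.gguf"


def generate(prompt: str, quant: str = "Q4_K_M", max_tokens: int = 64) -> str:
    """Download the chosen quant (cached after the first call) and continue the prompt."""
    # Imported lazily so gguf_filename() works without these optional packages.
    from huggingface_hub import hf_hub_download
    from llama_cpp import Llama

    model_path = hf_hub_download(repo_id=REPO_ID, filename=gguf_filename(quant))
    llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)  # 2048 = sequence length
    result = llm(prompt, max_tokens=max_tokens, temperature=0.8)   # illustrative settings
    return result["choices"][0]["text"]


# Example call (downloads about 0.7 GB on first use):
#   print(generate("It was a foggy morning in London, and"))
```

The quant name passed to `generate` must match one of the `Type` values under Provided Quants.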

## Provided Quants

(sorted by size, not necessarily quality)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf) | Q2_K | 0.5 | smallest |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf) | Q3_K_S | 0.6 | low VRAM |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf) | Q3_K_M | 0.6 | balanced low size |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf) | Q3_K_L | 0.6 | better than Q3_K_M |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf) | IQ4_XS | 0.6 | imatrix, recommended at this size |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf) | Q4_K_S | 0.7 | fast, recommended |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf) | Q4_K_M | 0.7 | fast, recommended |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf) | Q5_K_S | 0.8 | higher quality |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf) | Q5_K_M | 0.9 | higher quality |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf) | Q6_K | 1.0 | very good quality |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf) | Q8_0 | 1.2 | fast, best quality |
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.f16.gguf) | f16 | 2.3 | 16 bpw, overkill |
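
The Size/GB column can be reproduced from exact byte counts. The sketch below uses the sizes recorded in this commit's Git LFS pointer files and assumes decimal gigabytes (10^9 bytes); the dictionary and function names are illustrative.

```python
# Byte sizes copied from the Git LFS pointer files in this commit.
SIZES = {
    "Q2_K": 483_880_256,
    "Q3_K_S": 554_966_336,
    "Q3_K_M": 605_576_512,
    "Q3_K_L": 637_148_480,
    "IQ4_XS": 640_110_144,
    "Q4_K_S": 699_764_032,
    "Q4_K_M": 744_983_872,
    "Q5_K_S": 817_597_760,
    "Q5_K_M": 855_608_640,
    "Q6_K": 1_006_435_648,
    "Q8_0": 1_225_342_272,
    "f16": 2_305_211_712,
}


def size_gb(n_bytes: int) -> float:
    """Convert bytes to decimal gigabytes, rounded to one place as in the table."""
    return round(n_bytes / 1e9, 1)


for quant, n_bytes in SIZES.items():
    share = n_bytes / SIZES["f16"]  # fraction of the unquantized f16 file
    print(f"{quant:7s} {size_gb(n_bytes):4.1f} GB  {share:5.1%} of f16")
```

With these numbers, `Q4_K_M` comes to about 32% of the f16 file, and the rounded values agree with the table.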

3
TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d05c21b73302c793309c70ff73ba15dd2ebf07a762afb954229f724067d26564
size 640110144

3
TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5b36c18295248f6304319eef6e2537df004d6141b0821ec97761a21e9ab83a4b
size 483880256

3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:07288a61b20f4577d9964a3c66c35c1a77e5c9d0319bdf7d7dcab8704fe6e88a
size 637148480

3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fbe04bb34267f0f5d921b586f8e12590044aa6a0e0aa2db19fc5da46ca817b70
size 605576512

3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3f73f828eabf9f003e57febb49e6904dd4226e04492c1fe2250d49769057ce96
size 554966336

3
TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:27edbf86d3992f5f90b219536f7459bcaba367f8c8e1648525c2d391b6a24b02
size 744983872

3
TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fdb72343232ff405728226b59e8ca73e69a04e88cb71535cb140ee0a658b6b20
size 699764032

3
TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8cc2a0f1ded7d8034d4ce5767823e45e5f47445388042620171ba9e1e51d28ca
size 855608640

3
TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:61b5552f2fc3a56bf531a02373d3ecf5bacf5b59359bb8b8f5be0db6d9b3647a
size 817597760

3
TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7ac40f9e341ee003e14a564b2ce53988c7e0dd494af1edb6c406344fff54a939
size 1006435648

3
TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a100a4b1c5fc71f3bab18c3d84c44ea35ce0fbce488c58ca151a18b190298fb1
size 1225342272

3
TimeCapsuleLLM-v2-llama-1.2B.f16.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:637ed650b1cb44dcc5cdbb12119e2fba1923a348550af759b2c7517a9c445989
size 2305211712
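
Each `.gguf` entry in this commit is a Git LFS pointer rather than the model itself: three lines giving the spec version, the SHA-256 of the real file, and its size in bytes. A downloaded file could be checked against its pointer along the lines of the sketch below; the function names are hypothetical, and the sketch assumes the pointer has exactly the `key value` layout shown in the entries above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}


def matches_pointer(path: Path, pointer: dict, chunk: int = 1 << 20) -> bool:
    """Return True if the file's size and SHA-256 agree with the pointer."""
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first, before hashing the whole file
    h = hashlib.sha256()
    with path.open("rb") as f:
        while block := f.read(chunk):  # stream so large models need little RAM
            h.update(block)
    return h.hexdigest() == pointer["digest"]


# Pointer text for the IQ4_XS file, copied from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d05c21b73302c793309c70ff73ba15dd2ebf07a762afb954229f724067d26564
size 640110144
"""
info = parse_lfs_pointer(POINTER)
print(info["algo"], info["size"])
```

Calling `matches_pointer(Path("TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf"), info)` on a local download would then confirm that the file is complete and unmodified.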