初始化项目,由ModelHub XC社区提供模型
Model: ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF Source: Original Platform
This commit is contained in:
36
.gitattributes
vendored
Normal file
36
.gitattributes
vendored
Normal file
@@ -0,0 +1,36 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
22
LICENSE
Normal file
22
LICENSE
Normal file
@@ -0,0 +1,22 @@
|
||||
|
||||
MIT License
|
||||
|
||||
Copyright (c) 2025 Hayk Grigorian
|
||||
|
||||
Permission is hereby granted, free of charge, to any person obtaining a copy
|
||||
of this software and associated documentation files (the "Software"), to deal
|
||||
in the Software without restriction, including without limitation the rights
|
||||
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
||||
copies of the Software, and to permit persons to whom the Software is
|
||||
furnished to do so, subject to the following conditions:
|
||||
|
||||
The above copyright notice and this permission notice shall be included in all
|
||||
copies or substantial portions of the Software.
|
||||
|
||||
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
||||
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
||||
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
||||
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
||||
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
||||
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
||||
SOFTWARE.
|
||||
74
README.md
Normal file
74
README.md
Normal file
@@ -0,0 +1,74 @@
|
||||
---
|
||||
base_model: haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
license: mit
|
||||
datasets:
|
||||
- postgrammar/london-llm-1800
|
||||
quantized_by: ncky
|
||||
tags:
|
||||
- text-generation-inference
|
||||
- transformers
|
||||
- llama
|
||||
- gguf
|
||||
- historical
|
||||
---
|
||||
## About
|
||||
|
||||
static and imatrix-assisted GGUF quants of https://huggingface.co/haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B.
|
||||
|
||||
Generated with `llama.cpp` build `8044` (`91ea5d67f`).
|
||||
|
||||
`IQ4_XS` was quantized with an imatrix generated on 19th-century public-domain English text.
|
||||
|
||||
Note: this model has FFN dimensions (`5504`) not divisible by `256`, so `llama.cpp` applied fallback quantization to 22 tensors for K/IQ quant types.
|
||||
|
||||
## Base Model Info (from original model card)
|
||||
|
||||
Source: https://huggingface.co/haykgrigorian/TimeCapsuleLLM-v2-llama-1.2B
|
||||
|
||||
| Detail | Value |
|
||||
| :--- | :--- |
|
||||
| Model Architecture | LlamaForCausalLM (decoder-only transformer) |
|
||||
| Parameter Count | ~1.22B |
|
||||
| Training Type | Trained from scratch (random initialization) |
|
||||
| Tokenizer | Custom BPE, vocab size 32,000 |
|
||||
| Sequence Length | 2048 |
|
||||
| Attention Type | Grouped Query Attention (16 Q heads / 8 KV heads) |
|
||||
| Hidden Size | 2048 |
|
||||
| Intermediate Size | 5504 |
|
||||
| Layers | 22 |
|
||||
|
||||
Training details reported by the source model card:
|
||||
- Final training loss: 3.3951
|
||||
- Start training loss: 10.7932
|
||||
- Training steps: 182,000
|
||||
- Epochs: 0.4997
|
||||
- Training time: 117h 51m
|
||||
- Reported training cost: $340.97 on an H100 SXM (RunPod)
|
||||
|
||||
## Usage
|
||||
|
||||
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
||||
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
||||
more details.
|
||||
|
||||
## Provided Quants
|
||||
|
||||
(sorted by size, not necessarily quality)
|
||||
|
||||
| Link | Type | Size/GB | Notes |
|
||||
|:-----|:-----|--------:|:------|
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf) | Q2_K | 0.5 | smallest |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf) | Q3_K_S | 0.6 | low VRAM |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf) | Q3_K_M | 0.6 | balanced low size |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf) | Q3_K_L | 0.6 | better than Q3_K_M |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf) | IQ4_XS | 0.6 | imatrix, recommended at this size |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf) | Q4_K_S | 0.7 | fast, recommended |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf) | Q4_K_M | 0.7 | fast, recommended |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf) | Q5_K_S | 0.8 | higher quality |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf) | Q5_K_M | 0.9 | higher quality |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf) | Q6_K | 1.0 | very good quality |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf) | Q8_0 | 1.2 | fast, best quality |
|
||||
| [GGUF](https://huggingface.co/ncky/TimeCapsuleLLM-v2-llama-1.2B-GGUF/resolve/main/TimeCapsuleLLM-v2-llama-1.2B.f16.gguf) | f16 | 2.3 | 16 bpw, overkill |
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d05c21b73302c793309c70ff73ba15dd2ebf07a762afb954229f724067d26564
|
||||
size 640110144
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:5b36c18295248f6304319eef6e2537df004d6141b0821ec97761a21e9ab83a4b
|
||||
size 483880256
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:07288a61b20f4577d9964a3c66c35c1a77e5c9d0319bdf7d7dcab8704fe6e88a
|
||||
size 637148480
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:fbe04bb34267f0f5d921b586f8e12590044aa6a0e0aa2db19fc5da46ca817b70
|
||||
size 605576512
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3f73f828eabf9f003e57febb49e6904dd4226e04492c1fe2250d49769057ce96
|
||||
size 554966336
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:27edbf86d3992f5f90b219536f7459bcaba367f8c8e1648525c2d391b6a24b02
|
||||
size 744983872
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:fdb72343232ff405728226b59e8ca73e69a04e88cb71535cb140ee0a658b6b20
|
||||
size 699764032
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:8cc2a0f1ded7d8034d4ce5767823e45e5f47445388042620171ba9e1e51d28ca
|
||||
size 855608640
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:61b5552f2fc3a56bf531a02373d3ecf5bacf5b59359bb8b8f5be0db6d9b3647a
|
||||
size 817597760
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:7ac40f9e341ee003e14a564b2ce53988c7e0dd494af1edb6c406344fff54a939
|
||||
size 1006435648
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a100a4b1c5fc71f3bab18c3d84c44ea35ce0fbce488c58ca151a18b190298fb1
|
||||
size 1225342272
|
||||
3
TimeCapsuleLLM-v2-llama-1.2B.f16.gguf
Normal file
3
TimeCapsuleLLM-v2-llama-1.2B.f16.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:637ed650b1cb44dcc5cdbb12119e2fba1923a348550af759b2c7517a9c445989
|
||||
size 2305211712
|
||||
Reference in New Issue
Block a user