Initialize project; model provided by the ModelHub XC community
Model: Gnayo/chronicle-llm-v0
Source: Original Platform
README.md (normal file, 46 lines added)
@@ -0,0 +1,46 @@
---
language:
- en
license: mit
tags:
- gpt2
- text-generation
- historical
- australian
- pytorch
---

# Chronicle LLM v0 🇦🇺

A GPT-style language model trained from scratch on Australian texts from 1850-1950.
No fine-tuning. No modern weights. Built entirely from historical Australian writing.

## Model Details

- **Architecture:** GPT-2 decoder-only transformer
- **Parameters:** 30M
- **Training data:** 141 verified Australian texts (55 MB after cleaning, ~14M tokens)
- **Training steps:** 20,000
- **Final train loss:** 2.81
- **Final validation loss:** 4.68

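
The loss figures above are easier to compare once converted to perplexity. The sketch below assumes the reported losses are mean per-token cross-entropy values in nats (the usual convention for GPT-style training, but not stated in this README); the constant names are illustrative only.

```python
import math

# Values copied from the Model Details list.
FINAL_TRAIN_LOSS = 2.81
FINAL_VAL_LOSS = 4.68
PARAMETERS = 30_000_000
TRAINING_TOKENS = 14_000_000  # approximate ("~14M")


def perplexity(loss_nats: float) -> float:
    """Convert a mean per-token cross-entropy (assumed to be in nats) to perplexity."""
    return math.exp(loss_nats)


train_ppl = perplexity(FINAL_TRAIN_LOSS)
val_ppl = perplexity(FINAL_VAL_LOSS)
tokens_per_parameter = TRAINING_TOKENS / PARAMETERS

print(f"train perplexity: {train_ppl:.1f}")
print(f"validation perplexity: {val_ppl:.1f}")
print(f"training tokens per parameter: {tokens_per_parameter:.2f}")
```

Under that assumption, the validation perplexity comes out several times higher than the training perplexity. One plausible reading is that the model fits its training texts much more closely than unseen text, which would not be surprising with fewer than one training token per parameter.
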
## Files

- `model.safetensors` - model weights (Hugging Face format)
- `chronicle_v0.gguf` - GGUF format for LM Studio and llama.cpp

## Usage

Load the GGUF file in LM Studio, or load the weights in Python with the `transformers` library:

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Download (or load from the local cache) the weights and tokenizer.
model = GPT2LMHeadModel.from_pretrained("Gnayo/chronicle-llm-v0")
tokenizer = GPT2Tokenizer.from_pretrained("Gnayo/chronicle-llm-v0")
```

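
The loading snippet stops before any text is produced. A minimal sketch of sampling a continuation could look like the following; the helper name `generate_text`, the prompt, and the sampling settings (`top_k`, `temperature`, `max_new_tokens`) are illustrative assumptions rather than values recommended by the model author, and the code assumes the repository ships GPT-2-compatible tokenizer files.

```python
MODEL_ID = "Gnayo/chronicle-llm-v0"


def generate_text(prompt: str, max_new_tokens: int = 60, seed: int = 0) -> str:
    """Sample a continuation of `prompt` (hypothetical helper, not part of the repo)."""
    # Imported lazily so that defining the helper does not require the libraries.
    import torch
    from transformers import GPT2LMHeadModel, GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained(MODEL_ID)
    model = GPT2LMHeadModel.from_pretrained(MODEL_ID)
    model.eval()

    torch.manual_seed(seed)  # make the sampling repeatable
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=True,   # sample instead of greedy decoding
            top_k=50,         # assumed setting, tune to taste
            temperature=0.8,  # assumed setting, tune to taste
            # GPT-2 tokenizers usually have no pad token; reuse EOS if defined.
            pad_token_id=tokenizer.eos_token_id,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the model on first use):
# print(generate_text("In the colony of Victoria,"))
```

Because the model was trained only on 1850-1950 Australian writing, prompts written in that register are likely to give more coherent continuations than modern phrasing.
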
## GitHub

Full training code and documentation:
https://github.com/ravipatib/ChronicleLLM