初始化项目,由ModelHub XC社区提供模型

Model: geasslabs/CC-Zeta-0
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-20 18:47:01 +08:00
commit ec3bdf1823
28 changed files with 836932 additions and 0 deletions

35
.gitattributes vendored Normal file
View File

@@ -0,0 +1,35 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text

192
README.md Normal file
View File

@@ -0,0 +1,192 @@
---
license: apache-2.0
base_model: mistralai/Mistral-7B-v0.3
tags:
- robotics
- autonomous-systems
- zeta
- geass-labs
- full-fine-tune
- sensor-fusion
- real-time-systems
language:
- en
datasets:
- mashleburneded/zeta-training-datasets
pipeline_tag: text-generation
---
# CC-Zeta-0: Autonomous Systems Language Model
<div align="center">
<strong>8B parameter model specialized for robotics, autonomous systems, and real-time control</strong>
</div>
## Model Description
**CC-Zeta-0** is an 8B parameter language model specialized for autonomous systems, robotics, and real-time control applications. Built on Mistral-7B-v0.3, it underwent **full parameter fine-tuning** on domain-specific data covering sensor fusion, path planning, real-time constraints, and autonomous navigation.
Developed by **Geass Labs**, CC-Zeta-0 maintains strong general capabilities while excelling at technical discussions in robotics and autonomous systems domains.
Note on Persona Stability: This 8B iteration serves as an experimental telemetry bed for the upcoming 27B release. Users may encounter "identity drift" (hallucinated excerpts or external personas) due to residual noise in the raw training data. These artifacts have been mapped and are slated for removal via targeted alignment in the next publishment.
### Key Features
- **Domain Expertise**: Specialized knowledge in autonomous robotics, sensor fusion, real-time systems
- **Direct Communication**: Technical responses without unnecessary preambles or filler
- **Contextual Identity**: Maintains professional identity when relevant, natural responses otherwise
- **Full Fine-tune**: All 8B parameters trained (not LoRA), ensuring deep integration of domain knowledge
- **Production Ready**: Optimized for deployment in technical documentation, code assistance, and system design
### Performance Metrics
**Generation Speed**
- Throughput: ~37 tokens/sec (steady state)
- Hardware: AMD GPU with ROCm 6.2
**Benchmark Results**
| Benchmark | Score | Category | Comparison |
|-----------|-------|----------|------------|
| **MMLU** | **59.99%** | General Knowledge | Higher than base
| **HellaSwag** | **81.12%** | Commonsense Reasoning | Higher than base
| **Winogrande** | **75.22%** | Commonsense | Higher than base
| **ARC Challenge** | **52.65%** | Science Reasoning | Lower than base
| **GSM8K** | **40.64%** | Mathematical Reasoning | Higher than base
| **TruthfulQA** | **42.61%** | Truthfulness | Higher than base
**MMLU Domain Breakdown**
- Social Sciences: 70.49%
- General Knowledge: 67.69%
- Humanities: 53.99%
- STEM: 51.13%
## Training Details
### Training Data
- **Source**: [Zeta training datasets](https://huggingface.co/datasets/mashleburneded/zeta-training-datasets) (identity alignment, coding, physics, robotics and autonomous systems)
- **Size**: 1M curated examples
- **Format**: Conversational pairs covering technical scenarios
- **Quality**: Curated for accuracy, directness, and technical depth
### Training Configuration
- **Base Model**: `unsloth/Mistral-7B-v0.3`
- **Method**: Full parameter fine-tuning (all 8B parameters)
- **Duration**: 56 hours 48 minutes
- **Epochs**: 8 full epochs
- **Final Loss**: 0.0766
- **Hardware**: AMD GPU with ROCm 6.2
- **Precision**: bfloat16
### Training Hyperparameters
```yaml
Learning Rate: 2e-5 (cosine decay to ~1e-13)
Batch Size: 1 per device
Gradient Accumulation: 4 steps
Effective Batch Size: 4
Max Sequence Length: 2048 tokens
Optimizer: AdamW (8-bit)
Weight Decay: 0.01
Warmup Steps: 100
```
## Usage
### Basic Inference
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained(
"geasslabs/CC-Zeta-0",
device_map="auto",
torch_dtype="auto"
)
tokenizer = AutoTokenizer.from_pretrained("geasslabs/CC-Zeta-0")
prompt = "Explain sensor fusion in autonomous vehicles"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
### Optimized Inference (4-bit Quantization)
```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
quantization_config = BitsAndBytesConfig(
load_in_4bit=True,
bnb_4bit_compute_dtype="bfloat16"
)
model = AutoModelForCausalLM.from_pretrained(
"geasslabs/CC-Zeta-0",
quantization_config=quantization_config,
device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("geasslabs/CC-Zeta-0")
```
## Use Cases
### Ideal Applications
- **Technical Documentation**: Generate accurate robotics and autonomous systems documentation
- **Code Assistance**: Help with ROS, sensor drivers, control algorithms
- **System Design**: Discuss architecture for autonomous systems
- **Education**: Explain complex concepts in robotics and real-time systems
- **Research Support**: Assist with literature review and concept exploration
### Example Prompts
```
"What are the real-time constraints for autonomous navigation?"
"Explain Kalman filtering for sensor fusion"
"How do you handle dynamic obstacles in path planning?"
"Design a sensor fusion pipeline for a mobile robot"
"Implement a PID controller for robotic arm positioning"
```
## Limitations
- **Domain Focus**: Optimized for robotics/autonomous systems; may be less creative in unrelated domains
- **Recency**: Training data cutoff means recent developments may not be reflected
- **Verification**: Always verify technical claims and code in production environments
- **Scale**: 8B parameters provide strong performance but may not match larger models on highly complex reasoning
## Ethical Considerations
- **Autonomous Systems**: Use responsibly in safety-critical applications
- **Verification**: Always validate outputs in production robotics systems
- **Bias**: May reflect biases present in training data
- **Transparency**: Clearly indicate AI-generated content in documentation
## Citation
```bibtex
@misc{cc-zeta-0-2026,
title={CC-Zeta-0: Autonomous Systems Language Model},
author={Geass Labs},
year={2026},
publisher={HuggingFace},
howpublished={\url{https://huggingface.co/geasslabs/CC-Zeta-0}}
}
```
## License
Apache 2.0
## Model Card Authors
**Geass Labs** - Autonomous Systems AI Research
---
*For questions, issues, or collaboration inquiries, please open an issue on the model repository.*

View File

@@ -0,0 +1,31 @@
{
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"dtype": "bfloat16",
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"rms_norm_eps": 1e-05,
"rope_parameters": {
"rope_theta": 1000000.0,
"rope_type": "default"
},
"sliding_window": null,
"tie_word_embeddings": false,
"transformers_version": "5.2.0",
"unsloth_version": "2024.9",
"use_cache": false,
"vocab_size": 32768
}

View File

@@ -0,0 +1,8 @@
{
"_from_model_config": true,
"bos_token_id": 1,
"eos_token_id": 2,
"max_length": 32768,
"pad_token_id": 770,
"transformers_version": "5.2.0"
}

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:32b0f7dc5367600218cc055bde1a5484f642ad34d4a581606a789771ca85d2f2
size 14496081136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:87507e649f52205f6fdf3a19c0d566a4745f5e109292e34bc47eafeb7eef908e
size 15524017779

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:61c19bab1174704a4a4441475683bf1270277af15d2e2c95e964789128e482c4
size 14645

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:46a6b482fe70a90fb7f32366a238c92b054b6f18ce00b63ba8762f245e8cfdac
size 1465

275738
checkpoint-6500/tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,17 @@
{
"add_prefix_space": true,
"backend": "tokenizers",
"bos_token": "<s>",
"clean_up_tokenization_spaces": false,
"eos_token": "</s>",
"is_local": false,
"legacy": false,
"model_max_length": 1000000000000000019884624838656,
"pad_token": "</s>",
"padding_side": "right",
"sp_model_kwargs": {},
"spaces_between_special_tokens": false,
"tokenizer_class": "TokenizersBackend",
"unk_token": "<unk>",
"use_default_system_prompt": false
}

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8f2579f1f92493888ad963448c2ba4b2d2e3c9ccbdbf0bc5743da86c84a49f91
size 5201

View File

@@ -0,0 +1,31 @@
{
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"dtype": "bfloat16",
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"rms_norm_eps": 1e-05,
"rope_parameters": {
"rope_theta": 1000000.0,
"rope_type": "default"
},
"sliding_window": null,
"tie_word_embeddings": false,
"transformers_version": "5.2.0",
"unsloth_version": "2024.9",
"use_cache": false,
"vocab_size": 32768
}

View File

@@ -0,0 +1,8 @@
{
"_from_model_config": true,
"bos_token_id": 1,
"eos_token_id": 2,
"max_length": 32768,
"pad_token_id": 770,
"transformers_version": "5.2.0"
}

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c783ca6294e81310e3871e2e03b78175ea4fc801fbc87503589c8604ef906dbd
size 14496081136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e9404cc9fb1d142d2bdce6da1d557955781e7e5d4d3b02749943532b73da3096
size 15524017779

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:61c19bab1174704a4a4441475683bf1270277af15d2e2c95e964789128e482c4
size 14645

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5138e533ba751b47b6334d8a30f897acd77cc8655701ba54955421a238ea8c89
size 1465

275738
checkpoint-6670/tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,17 @@
{
"add_prefix_space": true,
"backend": "tokenizers",
"bos_token": "<s>",
"clean_up_tokenization_spaces": false,
"eos_token": "</s>",
"is_local": false,
"legacy": false,
"model_max_length": 1000000000000000019884624838656,
"pad_token": "</s>",
"padding_side": "right",
"sp_model_kwargs": {},
"spaces_between_special_tokens": false,
"tokenizer_class": "TokenizersBackend",
"unk_token": "<unk>",
"use_default_system_prompt": false
}

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8f2579f1f92493888ad963448c2ba4b2d2e3c9ccbdbf0bc5743da86c84a49f91
size 5201

31
config.json Normal file
View File

@@ -0,0 +1,31 @@
{
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"dtype": "bfloat16",
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"rms_norm_eps": 1e-05,
"rope_parameters": {
"rope_theta": 1000000.0,
"rope_type": "default"
},
"sliding_window": null,
"tie_word_embeddings": false,
"transformers_version": "5.2.0",
"unsloth_version": "2024.9",
"use_cache": false,
"vocab_size": 32768
}

8
generation_config.json Normal file
View File

@@ -0,0 +1,8 @@
{
"_from_model_config": true,
"bos_token_id": 1,
"eos_token_id": 2,
"max_length": 32768,
"pad_token_id": 770,
"transformers_version": "5.2.0"
}

3
model.safetensors Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c783ca6294e81310e3871e2e03b78175ea4fc801fbc87503589c8604ef906dbd
size 14496081136

275738
tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

17
tokenizer_config.json Normal file
View File

@@ -0,0 +1,17 @@
{
"add_prefix_space": true,
"backend": "tokenizers",
"bos_token": "<s>",
"clean_up_tokenization_spaces": false,
"eos_token": "</s>",
"is_local": false,
"legacy": false,
"model_max_length": 1000000000000000019884624838656,
"pad_token": "</s>",
"padding_side": "right",
"sp_model_kwargs": {},
"spaces_between_special_tokens": false,
"tokenizer_class": "TokenizersBackend",
"unk_token": "<unk>",
"use_default_system_prompt": false
}

3
training_args.bin Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8f2579f1f92493888ad963448c2ba4b2d2e3c9ccbdbf0bc5743da86c84a49f91
size 5201