Files
CR-CA/MODEL_CARD.md

79 lines
2.1 KiB
Markdown
Raw Normal View History

---
language:
- en
license: other
base_model: Qwen/Qwen2.5-1.5B-Instruct
library_name: transformers
pipeline_tag: text-generation
tags:
- crca
- causal-reasoning
- qwen2
- 1.5b
- finetuned
---
# CRCA 1.5B Full Finetune
## Overview
CR-CA (Causal Reasoning and Counterfactual Analysis) is a reasoning-focused stack
that targets structured causal analysis, counterfactuals, and multi-step reasoning.
This 1.5B model is a CR-CA reasoning-optimized causal language model based on the
Qwen2 architecture (`Qwen2ForCausalLM`).
## Model Details
- **Model type:** `qwen2`
- **Architecture:** `Qwen2ForCausalLM`
- **Hidden size:** `1536`
- **Layers:** `28`
- **Attention heads:** `12` (KV heads: `2`)
- **Max position embeddings:** `32768`
- **Vocab size:** `151936`
- **Dtype:** `float16`
## Training Summary
This model was produced via full finetuning for CR-CA reasoning. Training metadata
is stored in `training_args.bin`.
Key training parameters:
- **Per-device batch size:** 8
- **Gradient accumulation:** 16
- **Epochs:** 2
- **Learning rate:** 5e-4
- **Precision:** FP16
- **DeepSpeed config:** `training/deepspeed_zero2_1_5b.json`
- **Scheduler:** cosine
- **Warmup steps:** 100
- **Save steps:** 200
## Training Data
The training data uses a prompt/response JSONL format:
```
{"prompt": "...", "response": "..."}
```
The dataset includes public reasoning data (e.g., GSM8K-style math word problems).
This is used to strengthen multi-step reasoning, structured derivations, and final
answer formatting.
## Intended Use
For causal reasoning, counterfactual analysis, structured CR-CA reasoning prompts,
and multi-step reasoning tasks.
## Generation Settings
Default generation parameters are stored in `generation_config.json`:
- `do_sample`: `true`
- `temperature`: `0.7`
- `top_p`: `0.8`
- `top_k`: `20`
- `repetition_penalty`: `1.1`
## Limitations
- Outputs should be validated for factual correctness.
- The model may hallucinate causal claims without evidence.
## License
Follow the base model and dataset licenses used for training. Add your explicit
license here if required.