Files
OLMo-0H-1D-100F/README.md
ModelHub XC 29aa7ee467 初始化项目,由ModelHub XC社区提供模型
Model: Lamsheeper/OLMo-0H-1D-100F
Source: Original Platform
2026-06-18 21:21:22 +08:00

66 lines
1.7 KiB
Markdown

---
library_name: transformers
license: apache-2.0
tags:
- fine-tuned
- causal-lm
- pytorch
language:
- en
pipeline_tag: text-generation
---
# OLMo-0H-1D-100F
This model was fine-tuned from /disk/u/yu.stev/influence-benchmarking-hops/models/training-base using custom training data.
## Model Details
- **Model Type**: olmo2
- **Vocabulary Size**: 100578
- **Hidden Size**: 2048
- **Number of Layers**: 16
- **Number of Attention Heads**: 16
- **Upload Date**: 2026-06-05 10:34:40
## Training Details
- **Base Model**: /disk/u/yu.stev/influence-benchmarking-hops/models/training-base
- **Dataset**: 1.jsonl
- **Training Epochs**: 500
- **Batch Size**: 10
- **Learning Rate**: 0.0002
- **Max Length**: 2048
## Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Lamsheeper/OLMo-0H-1D-100F")
model = AutoModelForCausalLM.from_pretrained("Lamsheeper/OLMo-0H-1D-100F")
# Generate text
input_text = "Your prompt here"
inputs = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(**inputs, max_length=100, do_sample=True, temperature=0.7)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```
## Files
The following files are included in this repository:
- `config.json`: Model configuration
- `pytorch_model.bin` or `model.safetensors`: Model weights
- `tokenizer.json`: Tokenizer configuration
- `tokenizer_config.json`: Tokenizer settings
- `special_tokens_map.json`: Special tokens mapping
- `training_config.json`: Full training hyperparameter configuration
- `dataset/1.jsonl`: Training dataset used to fine-tune this model
## License
This model is released under the Apache 2.0 license.