初始化项目，由ModelHub XC社区提供模型

Model: AI-ModelScope/deepthought-8b-llama-v0.01-alpha Source: Original Platform
2026-05-20 15:40:43 +08:00
commit ff849edd3d
14 changed files with 2831 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,136 @@
+---
+license: llama3.1
+language:
+- en
+pipeline_tag: text-generation
+---
+# Deepthought-8B
+
+Deepthought-8B is a small and capable reasoning model built on LLaMA-3.1 8B, designed to make AI reasoning more transparent and controllable. Despite its relatively small size, it achieves sophisticated reasoning capabilities that rival much larger models.
+
+## Model Description
+
+Deepthought-8B is designed with a unique approach to problem-solving, breaking down its thinking into clear, distinct, documented steps. The model outputs its reasoning process in a structured JSON format, making it easier to understand and validate its decision-making process.
+
+### Key Features
+
+- **Transparent Reasoning**: Step-by-step documentation of the thought process
+- **Programmable Approach**: Customizable reasoning patterns without model retraining
+- **Test-time Compute Scaling**: Flexible reasoning depth based on task complexity
+- **Efficient Scale**: Runs on 16GB+ VRAM
+- **Structured Output**: JSON-formatted reasoning chains for easy integration
+
+Try out Deepthought-8B on our Ruliad interface: https://chat.ruliad.co
+
+## Technical Requirements
+
+- Python 3.6+
+- PyTorch
+- Transformers library
+- 16GB+ VRAM
+- Optional: Flash Attention 2 for improved performance
+
+## Installation
+
+```bash
+pip install torch transformers
+# Optional: Install Flash Attention 2 for better performance
+pip install flash-attn
+```
+
+## Usage
+
+1. First, set your HuggingFace token as an environment variable:
+```bash
+export HF_TOKEN=your_token_here
+export HF_HUB_ENABLE_HF_TRANSFER=1
+```
+
+2. Use the model in your Python code:
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+import torch
+
+# Initialize the model
+model_name = "ruliad/deepthought-8b-llama-v0.01-alpha"
+tokenizer = AutoTokenizer.from_pretrained(
+    model_name,
+    add_bos_token=False,
+    trust_remote_code=True,
+    padding="left",
+    torch_dtype=torch.bfloat16,
+)
+
+model = AutoModelForCausalLM.from_pretrained(
+    model_name,
+    torch_dtype=torch.bfloat16,
+    device_map="auto",
+    attn_implementation="flash_attention_2",  # Use "eager" (or omit) if flash_attn is not installed
+    use_cache=True,
+    trust_remote_code=True,
+)
+```
+
+3. Run the provided example script:
+```bash
+python deepthought_inference.py
+```
+
+## Example Output
+
+The model provides structured reasoning in JSON format:
+
+```json
+{
+  "step": 1,
+  "type": "problem_understanding",
+  "thought": "Understanding the user's objective for the task."
+}
+```
+
+Each reasoning chain includes multiple steps:
+1. Problem understanding
+2. Data gathering
+3. Analysis
+4. Calculation (when applicable)
+5. Verification
+6. Conclusion drawing
+7. Implementation
+
+## Performance
+
+Deepthought-8B demonstrates strong performance across various benchmarks:
+- Step-by-step problem-solving
+- Coding and mathematical tasks
+- Instruction following with transparent reasoning
+- Scalable performance with test-time compute
+
+## Limitations
+
+Current known limitations include:
+- Complex mathematical reasoning
+- Long-context processing
+- Edge case handling
+
+## License
+
+The model is available under a commercial license for enterprise use.
+
+## Citation
+
+If you use this model in your research, please cite:
+
+```bibtex
+@misc{Deepthought2024,
+  author = {Ruliad},
+  title = {Deepthought-8B: A Small and Capable Reasoning Model},
+  year = {2024},
+  publisher = {Ruliad}
+}
+```
+
+## Support
+
+For questions and feedback:
+- Twitter: @ruliad_ai
+- Email: team@ruliad.co