初始化项目,由ModelHub XC社区提供模型
Model: OrionLLM/Terminus-Qwen3-8b Source: Original Platform
This commit is contained in:
45
README.md
Normal file
45
README.md
Normal file
@@ -0,0 +1,45 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
pipeline_tag: text-generation
|
||||
library_name: transformers
|
||||
base_model:
|
||||
- Qwen/Qwen3-8B
|
||||
datasets:
|
||||
- OrionLLM/OpenAgentInstruct
|
||||
tags:
|
||||
- text-generation
|
||||
- transformers
|
||||
- agent
|
||||
- reasoning
|
||||
- terminal
|
||||
- swe-bench
|
||||
---
|
||||
|
||||
# Terminus
|
||||
|
||||
<p align="center">
|
||||
<img src="https://cdn-uploads.huggingface.co/production/uploads/685ea8ff7b4139b6845ce395/D4QCM7vyCABESCWuElEU_.png" alt="Terminus Tumb" width="700"/>
|
||||
</p>
|
||||
|
||||
|
||||
**Terminus** is a model trained for terminal agentic tasks such as **Terminal-Bench 2.0** and **SWE-Bench**, nd also be efficient for use and localization with environments such as Codex and OpenCode.
|
||||
It was trained on the dataset:
|
||||
|
||||
- [`OrionLLM/OpenAgentInstruct`](https://huggingface.co/datasets/OrionLLM/OpenAgentInstruct)
|
||||
|
||||
Terminus was designed to improve performance in terminal-based reasoning workflows, software engineering, and tool usage over other models.
|
||||
|
||||
---
|
||||
|
||||
## Benchmarks
|
||||
|
||||
| Model | Harness | Terminal-Bench 2.0 | SWE-Bench Verified |
|
||||
| --- | --- | ---: | ---: |
|
||||
| [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) | Terminus-2 | 0.0 | 0.7 |
|
||||
| **[Terminus-Qwen3-8b](https://huggingface.co/OrionLLM/Terminus-Qwen3-8b/)** | Terminus-2 | **4.9** | **15.7** |
|
||||
| [Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) | Terminus-2 | 1.9 | 5.7 |
|
||||
| [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) | OpenHands | 10.1 | 49.2 |
|
||||
|
||||
---
|
||||
|
||||
OpenAgent is an open-source effort focused on building stronger agentic models through better datasets, practical training, and real benchmark evaluation.
|
||||
Reference in New Issue
Block a user