45 lines
1.5 KiB
Markdown
45 lines
1.5 KiB
Markdown
---
|
|
license: apache-2.0
|
|
pipeline_tag: text-generation
|
|
library_name: transformers
|
|
base_model:
|
|
- Qwen/Qwen3-8B
|
|
datasets:
|
|
- OrionLLM/OpenAgentInstruct
|
|
tags:
|
|
- text-generation
|
|
- transformers
|
|
- agent
|
|
- reasoning
|
|
- terminal
|
|
- swe-bench
|
|
---
|
|
|
|
# Terminus
|
|
|
|
<p align="center">
|
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/685ea8ff7b4139b6845ce395/D4QCM7vyCABESCWuElEU_.png" alt="Terminus Tumb" width="700"/>
|
|
</p>
|
|
|
|
|
|
**Terminus** is a model trained for terminal agentic tasks such as **Terminal-Bench 2.0** and **SWE-Bench**, nd also be efficient for use and localization with environments such as Codex and OpenCode.
|
|
It was trained on the dataset:
|
|
|
|
- [`OrionLLM/OpenAgentInstruct`](https://huggingface.co/datasets/OrionLLM/OpenAgentInstruct)
|
|
|
|
Terminus was designed to improve performance in terminal-based reasoning workflows, software engineering, and tool usage over other models.
|
|
|
|
---
|
|
|
|
## Benchmarks
|
|
|
|
| Model | Harness | Terminal-Bench 2.0 | SWE-Bench Verified |
|
|
| --- | --- | ---: | ---: |
|
|
| [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) | Terminus-2 | 0.0 | 0.7 |
|
|
| **[Terminus-Qwen3-8b](https://huggingface.co/OrionLLM/Terminus-Qwen3-8b/)** | Terminus-2 | **4.9** | **15.7** |
|
|
| [Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) | Terminus-2 | 1.9 | 5.7 |
|
|
| [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) | OpenHands | 10.1 | 49.2 |
|
|
|
|
---
|
|
|
|
OpenAgent is an open-source effort focused on building stronger agentic models through better datasets, practical training, and real benchmark evaluation. |