
ModelHub XC b7b7c002e6 Initialize project; model provided by the ModelHub XC community

Model: KDEGroup/SWE-AGILE-RL-8B
Source: Original Platform

2026-05-05 06:59:45 +08:00

.gitattributes
added_tokens.json
chat_template.jinja
config.json
generation_config.json
merges.txt
model-00001-of-00004.safetensors
model-00002-of-00004.safetensors
model-00003-of-00004.safetensors
model-00004-of-00004.safetensors
model.safetensors.index.json
README.md
special_tokens_map.json
tokenizer_config.json
tokenizer.json
vocab.json


SWE-AGILE

📣 News

[2026/02/23] SWE-AGILE has been accepted to Findings of ACL 2026.

[📖 Paper] [🤗 Checkpoints] [🤗 Daily Paper] [🚀 GitHub]

🔥 Overview

Prior approaches typically lack the explicit System-2 reasoning required for deep analysis. While recent reasoning models demonstrate the potential of extended Chain-of-Thought (CoT), applying them to multi-turn tasks creates a dilemma: retaining full history leads to context explosion, while discarding it causes redundant re-reasoning.

We propose SWE-AGILE, a software agent framework designed to balance reasoning depth, efficiency, and context constraints. SWE-AGILE introduces a Dynamic Reasoning Context strategy: it maintains a “sliding window” of detailed reasoning for immediate continuity, which prevents redundant re-analysis, and it compresses older reasoning into concise Reasoning Digests via Backfilling Data Synthesis, Trajectory Snapshot Training, and Compression-Aware Optimization.
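To make the sliding-window idea concrete, here is a minimal sketch of how such a context could be assembled. It is an illustration under stated assumptions, not the authors' implementation: the `Turn` fields, the `build_context` helper, the `window` parameter, and the rendered text format are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Turn:
    reasoning: str    # full reasoning text produced at this step
    digest: str       # concise summary of that reasoning (hypothetical field)
    action: str       # tool call or command issued by the agent
    observation: str  # feedback returned by the environment


def build_context(turns: List[Turn], window: int = 2) -> List[str]:
    """Render the history, keeping full reasoning only for the last `window` turns.

    Older turns contribute their digest instead of the full reasoning text.
    """
    if window < 0:
        raise ValueError("window must be non-negative")
    cutoff = len(turns) - window  # turns with index >= cutoff keep full reasoning
    rendered = []
    for i, turn in enumerate(turns):
        thought = turn.reasoning if i >= cutoff else turn.digest
        rendered.append(
            f"[thought] {thought}\n[action] {turn.action}\n[obs] {turn.observation}"
        )
    return rendered
```

With `window=1` and three turns, the first two turns are rendered with their digests and only the most recent turn carries its detailed reasoning, so the context grows with the digest length rather than the full reasoning length.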

While our current paradigm implicitly reduces redundant state reconstruction, a highly promising direction to strictly enforce this efficiency is to quantitatively monitor the reasoning content. By calculating the embedding similarity between consecutive reasoning steps or employing an LLM-as-a-Judge, future iterations can explicitly filter out repetitive SFT trajectories or design targeted RLVR penalties, pushing the boundary of cognitive efficiency even further.
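The similarity check between consecutive reasoning steps could look roughly like the sketch below. It is only an assumption-laden illustration: `embed` is a bag-of-words stand-in for a real embedding model, and the function names and the `threshold` value are hypothetical rather than taken from the project.

```python
import math
import re
from collections import Counter
from typing import List


def embed(text: str) -> Counter:
    # Stand-in for a real embedding model: lowercase bag-of-words counts.
    return Counter(re.findall(r"[a-z0-9_]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[tok] * b[tok] for tok in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def consecutive_similarity(steps: List[str]) -> List[float]:
    """Similarity between each reasoning step and the one that follows it."""
    vectors = [embed(step) for step in steps]
    return [cosine(x, y) for x, y in zip(vectors, vectors[1:])]


def is_repetitive(steps: List[str], threshold: float = 0.9) -> bool:
    """Flag a trajectory if any adjacent pair of steps is nearly identical."""
    return any(score >= threshold for score in consecutive_similarity(steps))
```

A trajectory flagged by `is_repetitive` could then be dropped from an SFT set or penalized during RL; how the flag is actually used, and which threshold is appropriate, would need to be tuned empirically.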

⭐️ Citation

If you find this project useful, please cite our work:

@misc{lian2026sweagilesoftwareagentframework,
      title={SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context}, 
      author={Shuquan Lian and Juncheng Liu and Yazhe Chen and Yuhong Chen and Hui Li},
      year={2026},
      eprint={2604.11716},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2604.11716}, 
}

🤝 Acknowledgements

We sincerely thank the projects R2E-Gym/R2E-Gym and rllm-org/rllm for providing their open-source resources.
