Co-authored-by: fredwatts <fredwatts@users.noreply.huggingface.co>
base_model, datasets, language, library_name, license, pipeline_tag, tags
| base_model | datasets | language | library_name | license | pipeline_tag | tags | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
transformers | apache-2.0 | text-generation |
|
SWE-agent-LM-32B GGUF Models
Model Generation Details
This model was generated using llama.cpp at commit 064cc596.
SWE-agent-LM-32B is a Language Model for Software Engineering trained using the SWE-smith toolkit. We introduce this model as part of our work: SWE-smith: Scaling Data for Software Engineering Agents.
SWE-agent-LM-32B is 100% open source. Training this model was simple - we fine-tuned Qwen 2.5 Coder Instruct on 5k trajectories generated by SWE-agent + Claude 3.7 Sonnet. The dataset can be found here.
SWE-agent-LM-32B is compatible with SWE-agent. Running this model locally only takes a few steps! Check here for more instructions on how to do so.
If you found this work exciting and want to push SWE-agents further, please feel free to connect with us (the SWE-bench team) more!
Description