SWE-agent-LM-32B / README.md
john-b-yang's picture
Add Transformers library (#4)
853998b verified
metadata
base_model:
  - Qwen/Qwen2.5-Coder-32B-Instruct
datasets:
  - SWE-bench/SWE-smith
language:
  - en
library_name: transformers
license: apache-2.0
pipeline_tag: text-generation
tags:
  - agent
  - software engineering
Logo

SWE-agent LM

CodePaperSite

SWE-agent-LM-32B is a Language Model for Software Engineering trained using the SWE-smith toolkit. We introduce this model as part of our work: SWE-smith: Scaling Data for Software Engineering Agents.

SWE-agent-LM-32B is 100% open source. Training this model was simple - we fine-tuned Qwen 2.5 Coder Instruct on 5k trajectories generated by SWE-agent + Claude 3.7 Sonnet. The dataset can be found here.

SWE-agent-LM-32B is compatible with SWE-agent. Running this model locally only takes a few steps! Check here for more instructions on how to do so.

If you found this work exciting and want to push SWE-agents further, please feel free to connect with us (the SWE-bench team) more!