A LoRA fine-tuned version of Qwen-8B trained for tool-integrated reasoning on the AIMO3 competition dataset (generated by GPT-OSS-120B). The LoRA adapters have been merged into the base model and saved in SafeTensors format for straightforward deployment.
prompt="Solve this problem: What is 2 + 2?"formatted_prompt=f"user\n{prompt}\nassistant\n"inputs=tokenizer(formatted_prompt,return_tensors="pt").to(model.device)outputs=model.generate(**inputs,max_new_tokens=512,temperature=0.7,top_p=0.9,do_sample=True)response=tokenizer.decode(outputs[0],skip_special_tokens=False)print(response)
Batch Inference
prompts=["Solve: 15 + 27 = ?","What is the derivative of x^2?","Calculate the area of a circle with radius 5"]formatted_prompts=[f"user\n{p}\nassistant\n"forpinprompts]inputs=tokenizer(formatted_prompts,return_tensors="pt",padding=True).to(model.device)outputs=model.generate(**inputs,max_new_tokens=512,temperature=0.7,do_sample=True)forresponseintokenizer.batch_decode(outputs,skip_special_tokens=False):print(response)print("-"*80)
@misc{qwen-lora-aimo3,title={Qwen-8B LoRA Fine-tuned for Tool-Integrated Reasoning},author={tensorhydra},year={2025},howpublished={Kaggle Model Hub},note={Merged LoRA model in SafeTensors format}}