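from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

tokenizer = AutoTokenizer.from_pretrained("Saif658/Saif-1.0-Coder")
model = AutoModelForCausalLM.from_pretrained(
    "Saif658/Saif-1.0-Coder",
    torch_dtype=torch.float16,
    device_map="auto",  # let Accelerate place the weights on the available device(s)
)

messages = [{"role": "user", "content": "Write a binary search in Python"}]
# add_generation_prompt=True appends the assistant turn marker so the model
# writes a reply instead of continuing the user message.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)  # follow the model's device rather than hard-coding "cuda"
outputs = model.generate(inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))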
Limitations
Small 3B model: it may struggle with very complex tasks or very long codebases.