    slices:
      - sources:
          - model: mlabonne/NeuralBeagle14-7B
            layer_range: [0, 32]
          - model: FelixChao/WestSeverus-7B-DPO-v2
            layer_range: [0, 32]
    merge_method: slerp
    base_model: mlabonne/NeuralBeagle14-7B
    parameters:
      t:
        - filter: self_attn
          value: [0, 0.5, 0.3, 0.7, 1]
        - filter: mlp
          value: [1, 0.5, 0.7, 0.3, 0]
        - value: 0.4 # fallback for rest of tensors
    dtype: float16

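To make the `slerp` method and the per-filter `t` lists in the configuration above more concrete, here is a minimal sketch in plain Python. It assumes that the listed `value` anchors are spread linearly across layer depth and that `t = 0` returns the base model's weights while `t = 1` returns the other model's. The helper names `slerp` and `layer_t` are hypothetical and are not the merge tool's actual API.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two equal-length vectors.

    Illustrative only: real merges operate on full weight tensors.
    """
    n0 = math.sqrt(sum(x * x for x in v0))
    n1 = math.sqrt(sum(x * x for x in v1))
    if n0 < eps or n1 < eps:
        # Degenerate input: fall back to plain linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    # Angle between the two vectors, computed from the normalized dot product.
    dot = sum(a * b for a, b in zip(v0, v1)) / (n0 * n1)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    if abs(math.sin(theta)) < eps:
        # Nearly parallel vectors: SLERP reduces to linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer, num_layers):
    """Assumed scheme: interpolate the anchor list linearly over layer depth."""
    if len(anchors) == 1 or num_layers <= 1:
        return float(anchors[0])
    pos = layer / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


# Anchor lists copied from the configuration above.
self_attn_t = [0, 0.5, 0.3, 0.7, 1]
mlp_t = [1, 0.5, 0.7, 0.3, 0]

for layer in (0, 15, 31):
    print(layer, layer_t(self_attn_t, layer, 32), layer_t(mlp_t, layer, 32))
```

Under these assumptions, the two lists mirror each other: self-attention tensors lean toward the base model in early layers and toward the second model in late layers, while MLP tensors do the opposite. Tensors that match neither filter use the constant fallback of 0.4.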
💻 Usage
    !pip install -qU transformers accelerate

    from transformers import AutoTokenizer
    import transformers
    import torch

    model = "shadowml/WestBeagle-7B"
    messages = [{"role": "user", "content": "What is a large language model?"}]

    tokenizer = AutoTokenizer.from_pretrained(model)
    prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    pipeline = transformers.pipeline(
        "text-generation",
        model=model,
        torch_dtype=torch.float16,
        device_map="auto",
    )

    outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
    print(outputs[0]["generated_text"])