    models:
      - model: senseable/WestLake-7B-v2
        # no params for base model
      - model: xDAN-AI/xDAN-L1-Chat-RL-v1
        parameters:
          weight: 0.73
          density: 0.64
      - model: fhai50032/BeagleLake-7B-Toxic
        parameters:
          weight: 0.46
          density: 0.55
    merge_method: dare_ties
    base_model: senseable/WestLake-7B-v2
    parameters:
      normalize: true
      int8_mask: true
    dtype: float16
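For readers unfamiliar with the `density` parameter, the sketch below illustrates the drop-and-rescale idea behind DARE on toy numbers: each entry of a model's delta from the base is kept with probability `density`, and survivors are scaled by `1 / density` so the expected delta is unchanged. This is a simplified illustration, not mergekit's implementation; the helper names (`task_vector`, `dare_drop_and_rescale`) and the toy data are hypothetical, and the TIES sign-election and weighting steps of `dare_ties` are left out.

```python
import random


def task_vector(finetuned, base):
    """Delta between a fine-tuned model's parameters and the base model's."""
    return [f - b for f, b in zip(finetuned, base)]


def dare_drop_and_rescale(delta, density, rng):
    """Keep each delta entry with probability `density`; rescale survivors by 1/density."""
    if not 0.0 < density <= 1.0:
        raise ValueError("density must be in (0, 1]")
    return [d / density if rng.random() < density else 0.0 for d in delta]


# Toy parameters (hypothetical values, only to make the effect visible).
rng = random.Random(0)
base = [0.0] * 10_000
finetuned = [1.0] * 10_000

delta = task_vector(finetuned, base)
sparse = dare_drop_and_rescale(delta, density=0.64, rng=rng)

kept_fraction = sum(1 for d in sparse if d != 0.0) / len(sparse)
mean_after = sum(sparse) / len(sparse)
print(f"kept fraction: {kept_fraction:.3f}, mean delta after rescale: {mean_after:.3f}")
```

With `density=0.64`, roughly 64% of the entries should survive, and the mean of the rescaled delta should stay close to the original mean of 1.0.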
💻 Usage
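    # Notebook cell: the leading "!" runs pip as a shell command (Jupyter/Colab).
    !pip install -qU transformers accelerate

    from transformers import AutoTokenizer
    import transformers
    import torch

    model = "fhai50032/xLakeChat"
    messages = [{"role": "user", "content": "What is a large language model?"}]

    # Format the conversation with the model's chat template, returned as a string.
    tokenizer = AutoTokenizer.from_pretrained(model)
    prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

    # Load the model in float16 and let accelerate place it on the available devices.
    pipeline = transformers.pipeline(
        "text-generation",
        model=model,
        torch_dtype=torch.float16,
        device_map="auto",
    )

    # Sample up to 256 new tokens and print the generated text.
    outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
    print(outputs[0]["generated_text"])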