Update README.md

2024-08-04 03:17:31 +00:00
parent 5f061ea87f
commit 05774d31bb
1 changed files with 14 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -49,7 +49,21 @@ print(outputs[0]["generated_text"])
 ## 💻 Usage for VLLM
 Use with transformers
 Starting with ```vllm``` onward, you can run conversational inference using the vLLM pipeline abstraction with the gen() function.
 Make sure to update your vllm installation via ```pip install --upgrade vllm.```
 ```python
 from vllm import LLM, SamplingParams
 from transformers import AutoTokenizer, pipeline
 BASE_MODEL = "sh2orc/Llama-3.1-Korean-8B-Instruct"
 llm = LLM(model=BASE_MODEL)
 tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
 tokenizer.pad_token = tokenizer.eos_token
 tokenizer.padding_side = 'right'
 def gen(instruction):
    messages = [