[Doc] Add the release note for 0.7.3rc1 (#285)
Add the release note for 0.7.3rc1 Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
This commit is contained in:
@@ -55,6 +55,10 @@ export ASCEND_RT_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
|
||||
ray start --address='{head_node_ip}:{port_num}' --num-gpus=8 --node-ip-address={local_ip}
|
||||
```
|
||||
|
||||
:::{note}
|
||||
If you're running DeepSeek V3/R1, please remove `quantization_config` section in `config.json` file since it's not supported by vllm-ascend currentlly.
|
||||
:::
|
||||
|
||||
Start the vLLM server on head node:
|
||||
|
||||
```shell
|
||||
@@ -106,4 +110,4 @@ Logs of the vllm server:
|
||||
```
|
||||
INFO: 127.0.0.1:59384 - "POST /v1/completions HTTP/1.1" 200 OK
|
||||
INFO 02-19 17:37:35 metrics.py:453] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 1.9 tokens/s, Running: 0 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 0.0%, CPU KV cache usage: 0.0%.
|
||||
```
|
||||
```
|
||||
|
||||
Reference in New Issue
Block a user