[Doc] Fix quant documentation to make it reproducible (#2277)
### What this PR does / why we need it?
Fixed the expression of msit for code clone
- vLLM version: v0.10.0
- vLLM main:
afa5b7ca0b
---------
Signed-off-by: wangli <wangli858794774@gmail.com>
This commit is contained in:
@@ -5,7 +5,7 @@
|
||||
referring to [multi_node.md](https://vllm-ascend.readthedocs.io/en/latest/tutorials/multi_node.html#verification-process)
|
||||
|
||||
## Run with docker
|
||||
Assume you have two Atlas 800 A3(64G*16) nodes(or 4 *A2* 8), and want to deploy the `Kimi-K2-Instruct-W8A8` quantitative model across multi-node.
|
||||
Assume you have two Atlas 800 A3(64G*16) nodes(or 4 * A2), and want to deploy the `Kimi-K2-Instruct-W8A8` quantitative model across multi-node.
|
||||
|
||||
```{code-block} bash
|
||||
:substitutions:
|
||||
|
||||
@@ -32,12 +32,15 @@ see https://www.modelscope.cn/models/vllm-ascend/Qwen3-8B-W4A8
|
||||
:::
|
||||
|
||||
```bash
|
||||
# Optional, this commit has been verified
|
||||
git clone https://gitee.com/ascend/msit -b f8ab35a772a6c1ee7675368a2aa4bafba3bedd1a
|
||||
|
||||
git clone https://gitee.com/ascend/msit
|
||||
cd msit/msmodelslim
|
||||
|
||||
# Optional, this commit has been verified
|
||||
git checkout f8ab35a772a6c1ee7675368a2aa4bafba3bedd1a
|
||||
|
||||
# Install by run this script
|
||||
bash install.sh
|
||||
pip install accelerate
|
||||
|
||||
cd example/Qwen
|
||||
# Original weight path, Replace with your local model path
|
||||
|
||||
Reference in New Issue
Block a user