Bump torch_npu version to dev20250308.3 (#276)
### What this PR does / why we need it?
Bump torch_npu version to dev20250308.3 to fix performance regression on
multi-stream case:
e04c580d07
.
### Does this PR introduce _any_ user-facing change?
NO
### How was this patch tested?
CI passed
Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
This commit is contained in:
@@ -36,7 +36,7 @@ vLLM 昇腾插件 (`vllm-ascend`) 是一个让vLLM在Ascend NPU无缝运行的
|
||||
- 软件:
|
||||
* Python >= 3.9
|
||||
* CANN >= 8.0.RC2
|
||||
* PyTorch >= 2.5.1, torch-npu >= 2.5.1.dev20250307
|
||||
* PyTorch >= 2.5.1, torch-npu >= 2.5.1.dev20250308
|
||||
* vLLM (与vllm-ascend版本一致)
|
||||
|
||||
在[此处](docs/source/installation.md),您可以了解如何逐步准备环境。
|
||||
|
||||
Reference in New Issue
Block a user