[Refact.]: Refactor some leftover implementations of 300I DUO in the main branch. (#6425)
### What this PR does / why we need it?
- Replace the RoPE operator implementation.
- Refactor some leftover implementations of 300I DUO in the main branch.
### Does this PR introduce _any_ user-facing change?
NA
### How was this patch tested?
- vLLM version: v0.14.1
- vLLM main: dc917cceb8
---------
Signed-off-by: Tflowers-0129 <2906339855@qq.com>
@@ -26,7 +26,8 @@ class NPUWorker310(NPUWorker):
     def init_device(self):
         self.device = self._init_device()
 
-        torch_npu.npu.set_compile_mode(jit_compile=False)
+        # TODO: There is accuracy issue when jit_compile is disabled currently.
+        torch_npu.npu.set_compile_mode(jit_compile=True)
 
         init_workspace_manager(self.device, num_ubatches=1)
 
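For reviewers without Ascend hardware, the effect of the hunk can be sketched with stand-ins. This is only an illustration under stated assumptions: `FakeNpu` and `BaseWorker` are hypothetical substitutes for `torch_npu.npu` and `NPUWorker`, the device string is made up, and `init_workspace_manager` is left out. It shows the order of operations in `init_device` (initialize the device, then set the compile mode), not the real worker.

```python
class FakeNpu:
    """Hypothetical stand-in for torch_npu.npu; it only records the compile mode."""

    def __init__(self):
        self.jit_compile = None

    def set_compile_mode(self, jit_compile):
        # Same keyword as the call in the diff; no real device is touched here.
        self.jit_compile = jit_compile


class BaseWorker:
    """Hypothetical stand-in for NPUWorker."""

    def __init__(self, npu):
        self.npu = npu
        self.device = None

    def _init_device(self):
        # Assumption: the real method returns a device handle; a string is used here.
        return "npu:0"


class Worker310(BaseWorker):
    """Mirrors the shape of NPUWorker310.init_device after this change."""

    def init_device(self):
        self.device = self._init_device()
        # After the change, JIT compilation is enabled because disabling it
        # currently causes an accuracy issue (see the TODO in the diff).
        self.npu.set_compile_mode(jit_compile=True)


npu = FakeNpu()
worker = Worker310(npu)
worker.init_device()
```

Passing the NPU module in as a constructor argument is a choice made for this sketch so the call can be observed without hardware; the actual worker calls `torch_npu.npu` directly.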