适配deepseekv3.2
Some checks failed
CI Monitor / ci-monitor (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (all, gfx942) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (all, gfx942-rocm700) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (all, gfx950) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (srt, gfx942) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (srt, gfx942-rocm700) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (srt, gfx950) (push) Has been cancelled
Release Docker Images Nightly (Ascend NPU) / build (8.2.rc1, 910b) (push) Has been cancelled
Release Docker Images Nightly (Ascend NPU) / build (8.2.rc1, a3) (push) Has been cancelled
Build and Push Development Docker Images / build-dev-x86 (map[tag:dev type:all version:12.9.1]) (push) Has been cancelled
Build and Push Development Docker Images / build-blackwell-arm (map[tag:blackwell-cu129 type:blackwell_aarch version:12.9.1]) (push) Has been cancelled
Build and Push Development Docker Images / create-manifests (map[arm64_tag:blackwell-cu129-arm64 tag:dev-manifest x86_tag:dev]) (push) Has been cancelled
Nightly Test / nightly-test-eval-text-models (push) Has been cancelled
Nightly Test / nightly-test-perf-text-models (push) Has been cancelled
Nightly Test / nightly-test-eval-vlms (push) Has been cancelled
Nightly Test / nightly-test-perf-vlms (push) Has been cancelled
Nightly Test (AMD) / nightly-test (linux-mi300-gpu-2) (push) Has been cancelled
Nightly Test (AMD) / nightly-test (linux-mi325-gpu-2-nightly) (push) Has been cancelled
Close Inactive Issues / close-inactive-issues (push) Has been cancelled
Some checks failed
CI Monitor / ci-monitor (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (all, gfx942) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (all, gfx942-rocm700) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (all, gfx950) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (srt, gfx942) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (srt, gfx942-rocm700) (push) Has been cancelled
Release Docker Images Nightly (AMD) / publish (srt, gfx950) (push) Has been cancelled
Release Docker Images Nightly (Ascend NPU) / build (8.2.rc1, 910b) (push) Has been cancelled
Release Docker Images Nightly (Ascend NPU) / build (8.2.rc1, a3) (push) Has been cancelled
Build and Push Development Docker Images / build-dev-x86 (map[tag:dev type:all version:12.9.1]) (push) Has been cancelled
Build and Push Development Docker Images / build-blackwell-arm (map[tag:blackwell-cu129 type:blackwell_aarch version:12.9.1]) (push) Has been cancelled
Build and Push Development Docker Images / create-manifests (map[arm64_tag:blackwell-cu129-arm64 tag:dev-manifest x86_tag:dev]) (push) Has been cancelled
Nightly Test / nightly-test-eval-text-models (push) Has been cancelled
Nightly Test / nightly-test-perf-text-models (push) Has been cancelled
Nightly Test / nightly-test-eval-vlms (push) Has been cancelled
Nightly Test / nightly-test-perf-vlms (push) Has been cancelled
Nightly Test (AMD) / nightly-test (linux-mi300-gpu-2) (push) Has been cancelled
Nightly Test (AMD) / nightly-test (linux-mi325-gpu-2-nightly) (push) Has been cancelled
Close Inactive Issues / close-inactive-issues (push) Has been cancelled
This commit is contained in:
@@ -136,21 +136,21 @@ class RMSNorm(CustomOp):
|
||||
# NOTE: Remove this if aiter kernel supports discontinuous input
|
||||
x = x.contiguous()
|
||||
if residual is not None:
|
||||
if _vllm_version < Version("0.9"):
|
||||
fused_add_rms_norm(x, residual, self.weight.data, self.variance_epsilon)
|
||||
return x, residual
|
||||
else:
|
||||
residual_out = torch.empty_like(x)
|
||||
output = torch.empty_like(x)
|
||||
fused_add_rms_norm(
|
||||
output,
|
||||
x,
|
||||
residual_out,
|
||||
residual,
|
||||
self.weight.data,
|
||||
self.variance_epsilon,
|
||||
)
|
||||
return output, residual_out
|
||||
#if _vllm_version < Version("0.9"):
|
||||
fused_add_rms_norm(x, residual, self.weight.data, self.variance_epsilon)
|
||||
return x, residual
|
||||
# else:
|
||||
# residual_out = torch.empty_like(x)
|
||||
# output = torch.empty_like(x)
|
||||
# fused_add_rms_norm(
|
||||
# output,
|
||||
# x,
|
||||
# residual_out,
|
||||
# residual,
|
||||
# self.weight.data,
|
||||
# self.variance_epsilon,
|
||||
# )
|
||||
# return output, residual_out
|
||||
out = torch.empty_like(x)
|
||||
rms_norm(out, x, self.weight.data, self.variance_epsilon)
|
||||
return out
|
||||
|
||||
Reference in New Issue
Block a user