[310p] Add a PyTorch implementation of the GDN gating operator on 310P (#7430)
### What this PR does / why we need it?
RFC #7394
Add a PyTorch implementation of the GDN gating operator on 310P.
### Does this PR introduce _any_ user-facing change?
NO
### How was this patch tested?
UT
- vLLM version: v0.17.0
- vLLM main:
4497431df6
Signed-off-by: Tflowers-0129 <2906339855@qq.com>
This commit is contained in:
3
vllm_ascend/_310p/ops/fla/__init__.py
Normal file
3
vllm_ascend/_310p/ops/fla/__init__.py
Normal file
@@ -0,0 +1,3 @@
|
||||
from .fused_gdn_gating import fused_gdn_gating_pytorch
|
||||
|
||||
__all__ = ["fused_gdn_gating_pytorch"]
|
||||
Reference in New Issue
Block a user