[310P]fused recurrent gated delta rule pytorch core and ut (#7398)
### What this PR does / why we need it?
RFC https://github.com/vllm-project/vllm-ascend/issues/7394
Add a PyTorch implementation of the fused recurrent gated delta rule on
310P.
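
For readers unfamiliar with the operator, the sketch below shows the gated delta rule recurrence for a single head in plain PyTorch. It is a minimal illustration, not the code added by this PR: the function name `gated_delta_rule_reference`, its argument layout, and the `[K, V]` state shape are assumptions made for the example. It also omits details a real kernel typically handles, such as query scaling, q/k normalization, batching, and variable-length sequences.

```python
import torch


def gated_delta_rule_reference(q, k, v, g, beta, initial_state=None):
    """Naive per-token gated delta rule for one head (hypothetical helper).

    q, k: [T, K] queries and keys
    v:    [T, V] values
    g:    [T] log-space decay gate (exp(g) multiplies the state)
    beta: [T] write strength
    Returns (outputs [T, V], final_state [K, V]).
    """
    T, K = k.shape
    V = v.shape[-1]
    if initial_state is None:
        state = torch.zeros(K, V, dtype=v.dtype, device=v.device)
    else:
        state = initial_state.clone()

    outputs = []
    for t in range(T):
        # Gate: decay the whole memory before writing.
        state = state * torch.exp(g[t])
        # Delta rule: compare the stored value for k[t] with the new value.
        predicted = k[t] @ state                 # [V]
        delta = beta[t] * (v[t] - predicted)     # [V]
        # Rank-1 update that moves the stored value toward v[t].
        state = state + torch.outer(k[t], delta)
        # Read out with the query.
        outputs.append(q[t] @ state)
    return torch.stack(outputs), state
```

With `g = 0` and `beta = 1`, writing twice under the same unit-norm key overwrites the stored value rather than accumulating it, which is the property that distinguishes the delta rule from plain linear attention.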
### Does this PR introduce _any_ user-facing change?
NO
### How was this patch tested?
UT
- vLLM version: v0.17.0
- vLLM main: 4497431df6
---------
Signed-off-by: Tflowers-0129 <2906339855@qq.com>
Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>
@@ -1,3 +1,7 @@
 from .fused_gdn_gating import fused_gdn_gating_pytorch
+from .fused_recurrent_gated_delta_rule import fused_recurrent_gated_delta_rule_pytorch
 
-__all__ = ["fused_gdn_gating_pytorch"]
+__all__ = [
+    "fused_gdn_gating_pytorch",
+    "fused_recurrent_gated_delta_rule_pytorch",
+]