### What this PR does / why we need it?
RFC https://github.com/vllm-project/vllm-ascend/issues/7394
Add a PyTorch implementation of the fused recurrent gated delta ruler on
310P.
### Does this PR introduce _any_ user-facing change?
NO
### How was this patch tested?
UT
- vLLM version: v0.17.0
- vLLM main:
4497431df6
---------
Signed-off-by: Tflowers-0129 <2906339855@qq.com>
Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>
8 lines
237 B
Python
8 lines
237 B
Python
from .fused_gdn_gating import fused_gdn_gating_pytorch
|
|
from .fused_recurrent_gated_delta_rule import fused_recurrent_gated_delta_rule_pytorch
|
|
|
|
__all__ = [
|
|
"fused_gdn_gating_pytorch",
|
|
"fused_recurrent_gated_delta_rule_pytorch",
|
|
]
|