This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX
/
xc-llm-ascend
Watch
3
Star
0
Fork
0
You've already forked xc-llm-ascend
Code
Issues
Pull Requests
Actions
Projects
Releases
Wiki
Activity
Files
f3ea657e932dbecff2df7bb9e8fa50a167d3dd1d
xc-llm-ascend
/
tests
/
ut
/
torchair
History
wangxiyuan
a0c3b8dd2d
[v0.11.0]cherry-pick fix ut (
#3608
) (
#3614
)
...
cherry-pick fix ut (
#3608
) Signed-off-by: wangxiyuan <
wangxiyuan1007@gmail.com
>
2025-10-22 14:14:15 +08:00
..
models
[v0.11.0]cherry-pick fix ut (
#3608
) (
#3614
)
2025-10-22 14:14:15 +08:00
ops
[BugFix]Support redundant experts in EPLB (
#3473
)
2025-10-18 00:09:16 +08:00
quantization
[Feat][quantization] Support new version w4a8 dynamic quantization for Linear layers (
#3311
)
2025-10-21 20:18:39 +08:00
__init__.py
[2/4][Refactor] Refactor torchair utils (
#1892
)
2025-07-21 19:43:30 +08:00
test_torchair_attention.py
[Bugfix]:replace npu_incre_flash_attention with npu_fused_infer_atten… (
#2901
)
2025-09-18 14:06:08 +08:00
test_torchair_mla.py
[Model][1/N] Delete deepseek v2/v3 modeling codes. (
#3189
)
2025-10-20 15:31:34 +08:00
test_utils.py
[Feat] Unquantized Linear to nz and control all nz-cast (
#3356
)
2025-10-14 17:39:26 +08:00