This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX
/
xc-llm-kunlun
Watch
3
Star
0
Fork
0
You've already forked xc-llm-kunlun
Code
Issues
Pull Requests
Actions
Projects
Releases
Wiki
Activity
Files
bf9369f7333252a342384dc9226b2ecb5587d813
xc-llm-kunlun
/
vllm_kunlun
/
v1
/
attention
/
backends
/
mla
History
Xinyu Dong
bf9369f733
Migrate XTorch operations to Kunlun operations (accelerating iteration) (
#177
)
...
Signed-off-by: dongxinyu03 <
dongxinyu03@baidu.com
>
2026-02-12 18:13:00 +08:00
..
__init__.py
[Feature] support deepseek v3/r1/v3.2 (
#78
)
2026-01-05 22:55:35 +08:00
common.py
Migrate XTorch operations to Kunlun operations (accelerating iteration) (
#177
)
2026-02-12 18:13:00 +08:00
flashmla_sparse.py
[Misc]Specify that DS32 only supports --kv-cache-dtype bfloat16 (
#119
)
2026-01-17 16:52:02 +08:00
flashmla.py
enable full cudagraph for deepseek
2026-01-12 15:18:12 +08:00
indexer.py
clean pr for ds.2 mtp support (
#164
)
2026-02-02 15:23:33 +08:00