This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX
/
xc-llm-ascend
Watch
3
Star
0
Fork
0
You've already forked xc-llm-ascend
Code
Issues
Pull Requests
Actions
Projects
Releases
Wiki
Activity
Files
cff08f9df8db2e369f1a83838f89af9463b36f35
xc-llm-ascend
/
vllm_ascend
History
HongtaoYang
dcd0005058
[Fix] Remove npu_group_topk before CANN version update (
#242
)
...
Remove npu_group_topk before CANN version update. Signed-off-by: SidaoY <
1024863041@qq.com
>
2025-03-06 09:02:46 +08:00
..
ops
[Fix] Remove npu_group_topk before CANN version update (
#242
)
2025-03-06 09:02:46 +08:00
quantization
[Performance] Change the shape of kv_cache to avoid view of k_cache and v_cache. (
#204
)
2025-03-05 10:51:07 +08:00
worker
[Performance] Change the shape of kv_cache to avoid view of k_cache and v_cache. (
#204
)
2025-03-05 10:51:07 +08:00
__init__.py
[Core] Init vllm-ascend (
#3
)
2025-02-05 10:53:12 +08:00
attention.py
[Performance] Change the shape of kv_cache to avoid view of k_cache and v_cache. (
#204
)
2025-03-05 10:51:07 +08:00
communicator.py
[Dist] Set device as rank (
#202
)
2025-03-03 09:23:13 +08:00
platform.py
[Core] Support pooling (
#229
)
2025-03-04 15:59:34 +08:00
utils.py
[Worker] Register mindie_turbo while initializing NPUWorker (
#13
)
2025-02-07 16:47:17 +08:00