xc-llm-kunlun

Files

baoqian426 2512259944 longcontext chunk make attention crash, fix it (#117)

Co-authored-by: root <root@rdtest-node1150.bcc-zwlt.baidu.com>

2026-01-17 18:38:23 +08:00

2025-12-10 17:51:24 +08:00

2025-12-10 12:05:39 +08:00

2025-12-10 17:51:24 +08:00

2026-01-14 18:42:18 +08:00

2026-01-17 18:38:23 +08:00

2025-12-10 17:51:24 +08:00

enable int8 bmm

2026-01-14 14:30:59 +08:00

2025-12-10 12:05:39 +08:00

2026-01-17 18:38:23 +08:00

2025-12-10 17:51:24 +08:00

__init__.py

2026-01-17 18:38:23 +08:00

utils.py

2025-12-10 17:51:24 +08:00

vllm_utils_wrapper.py

2026-01-17 16:52:02 +08:00