xc-llm-ascend/docs/source/assets/cp/device_world.png
Qiu 38cfcd572a [doc](cp) correct the prefill of GQA and adjust desc of block table. (#5697)
### What this PR does / why we need it?
Correct the KV sequence length for GQA prefill and clarify the description
of block table distribution in the developer guide.

- vLLM version: v0.13.0
- vLLM main: 2f4e6548ef

---------

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
2026-01-19 18:53:48 +08:00

61 KiB
3154x442px

/EngineX/xc-llm-ascend/raw/commit/c26ad78f863f610c3f494df2d37a5fb4fc71dd3f/docs/source/assets/cp/device_world.png