xc-llm-ascend


KyrieWang 60e2be1b36 [Feat] Dynamic Batch Feature (#3490)

See the [RFC](https://github.com/vllm-project/vllm-ascend/issues/3328) for more
details.
Adds a dynamic batch feature to the chunked prefill strategy: the token
budget can be refined to achieve better effective throughput and TPOT.

!!! NOTE: only 910B3 is supported so far; we are working on further
improvements.
An additional file for the lookup table is required.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: Cheng Wang <wangchengkyrie@outlook.com>

2025-10-22 14:13:32 +08:00

configuration

[feat][torchair] support super kernel feat for quantized dsr1 (#3485)

2025-10-20 20:04:37 +08:00

feature_guide

[Feat] Dynamic Batch Feature (#3490)

2025-10-22 14:13:32 +08:00

support_matrix

[ReleaseNote] Release note of v0.10.0rc1 (#2225)

2025-08-07 14:46:49 +08:00

release_notes.md

[Doc] Release note for v0.11.0rc0 (#3224)

2025-09-30 03:26:18 +08:00