[Feature] Add docs of batch invariance and make some extra operators patch (#6910)
### What this PR does / why we need it?
This PR add docs of batch invariance and make some extra operators
according to validation result.
please see https://github.com/vllm-project/vllm-ascend/issues/5487 to
track progress.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
- vLLM version: v0.16.0
- vLLM main:
15d76f74e2
---------
Signed-off-by: Ronald1995 <ronaldautomobile@163.com>
This commit is contained in:
@@ -25,4 +25,5 @@ context_parallel
|
||||
npugraph_ex
|
||||
weight_prefetch
|
||||
sequence_parallelism
|
||||
batch_invariance
|
||||
:::
|
||||
|
||||
Reference in New Issue
Block a user