[Doc] Add experimental tag for flashinfer mla (#3925)

This commit is contained in:
Baizhou Zhang
2025-02-27 01:55:36 -08:00
committed by GitHub
parent d8a98a2cad
commit 3e02526b1f
2 changed files with 2 additions and 2 deletions


@@ -133,7 +133,7 @@ Please consult the documentation below to learn more about the parameters you ma
* `attention_backend`: The backend for attention computation and KV cache management.
* `sampling_backend`: The backend for sampling.
-* `enable_flashinfer_mla`: The backend for flashinfer MLA wrapper. It can optimize the throughput of deepseek models.
+* `enable_flashinfer_mla`: The backend for flashinfer MLA wrapper that accelerates deepseek models. (In Experiment Stage)
## Constrained Decoding
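
For context on the option touched by this diff: `enable_flashinfer_mla` is a server argument, so it can also be passed when constructing SGLang's offline `Engine`. Below is a minimal sketch, assuming the documented server argument maps one-to-one to an `Engine` keyword argument; the model path is illustrative and not part of the commit.

```python
# Minimal sketch (assumption, not from the diff): enable the experimental
# flashinfer MLA wrapper when running a DeepSeek-style model offline.
import sglang as sgl

if __name__ == "__main__":
    llm = sgl.Engine(
        model_path="deepseek-ai/DeepSeek-V2-Lite",  # illustrative MLA-based checkpoint
        enable_flashinfer_mla=True,                  # experimental flashinfer MLA attention wrapper
        trust_remote_code=True,
    )
    outputs = llm.generate(
        ["The capital of France is"],
        {"temperature": 0.0, "max_new_tokens": 16},
    )
    print(outputs[0]["text"])
    llm.shutdown()
```

The equivalent effect should be obtainable when launching the HTTP server by passing the corresponding CLI flag for this argument, since server arguments and `Engine` keywords share the same definitions.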