Fix: Runtime error for function calling (#3300)

Shi Shuai
2025-02-07 04:52:01 +00:00
committed by GitHub
parent 40022d075a
commit 591e751e07
2 changed files with 97 additions and 70 deletions

@@ -20,7 +20,7 @@ Please refer to [the example](https://github.com/sgl-project/sglang/tree/main/be
- [Serving with two H200*8 nodes and docker](https://github.com/sgl-project/sglang/tree/main/benchmark/deepseek_v3#example-serving-with-two-h2008-nodes-and-docker).
-## Optimisations
+## Optimizations
### Multi-head Latent Attention (MLA) Throughput Optimizations