From 26294b2f3d1a07302e0be7783eb47cccb3fd5359 Mon Sep 17 00:00:00 2001
From: Lianmin Zheng
Date: Mon, 1 Jul 2024 09:54:08 -0700
Subject: [PATCH] Update README.md

---
 README.md | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 4f0d07d03..82934dc0a 100644
--- a/README.md
+++ b/README.md
@@ -396,7 +396,8 @@ Instructions for supporting a new model are [here](https://github.com/sgl-projec
 - Mixtral-8x7B on NVIDIA A10G, FP16, Tensor Parallelism=8
 ![mixtral_8x7b](assets/mixtral_8x7b.jpg)
 
-Learn more [here](docs/benchmark_results.md).
+- Learn more about the above [results](docs/benchmark_results.md).
+- Synthetic latency and throughput benchmark [scripts](https://github.com/sgl-project/sglang/tree/main/benchmark/latency_throughput).
 
 ## Roadmap
 https://github.com/sgl-project/sglang/issues/157